CN107704529A - The recognition methods of information uniqueness, application server, system and storage medium - Google Patents

The recognition methods of information uniqueness, application server, system and storage medium Download PDF

Info

Publication number
CN107704529A
CN107704529A CN201710850369.5A CN201710850369A CN107704529A CN 107704529 A CN107704529 A CN 107704529A CN 201710850369 A CN201710850369 A CN 201710850369A CN 107704529 A CN107704529 A CN 107704529A
Authority
CN
China
Prior art keywords
customer
identification
corporate customer
class
corporate
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201710850369.5A
Other languages
Chinese (zh)
Other versions
CN107704529B (en
Inventor
王恩贵
项同德
钱慧敏
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Ping An Technology Shenzhen Co Ltd
Original Assignee
Ping An Technology Shenzhen Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Ping An Technology Shenzhen Co Ltd filed Critical Ping An Technology Shenzhen Co Ltd
Priority to CN201710850369.5A priority Critical patent/CN107704529B/en
Publication of CN107704529A publication Critical patent/CN107704529A/en
Priority to PCT/CN2018/084325 priority patent/WO2019056750A1/en
Application granted granted Critical
Publication of CN107704529B publication Critical patent/CN107704529B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/28Databases characterised by their database models, e.g. relational or object models
    • G06F16/284Relational databases
    • G06F16/285Clustering or classification

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)
  • Computer And Data Communications (AREA)

Abstract

The invention discloses the recognition methods of information uniqueness, application server, system and storage medium, wherein, the recognition methods of described information uniqueness includes customer name and identification information by obtaining the essential information of the corporate customer stored in each source database, the essential information;According to the identification information by each corporate customer labeled as precisely identification class or fuzzy diagnosis class;The corporate customer in precisely identification class and fuzzy diagnosis class is precisely identified and fuzzy diagnosis respectively according to default recognition rule, the corporate customer of identification same client each other;The precisely recognition result of identification and fuzzy diagnosis is obtained, and is integrated the essential information of the corporate customer of all clients same each other according to recognition result, draws the corporate customer information of uniqueness identification.Precisely identified respectively according to the type of different corporate customers or fuzzy diagnosis, solve the problems, such as due to customer information it is imperfect caused by can not carry out uniqueness identification and Data Integration.

Description

The recognition methods of information uniqueness, application server, system and storage medium
Technical field
The present invention relates to information discriminating technology field, and in particular to the recognition methods of information uniqueness, application server, system And storage medium.
Background technology
At present, due to the corporate customer substantial amounts of many companies, corporate customer need to be identified and Data Integration, In order to corporate customer management, the identification of traditional information uniqueness is mainly identified by organization's coding of client , this recognition methods is although rigorous, accurate, but because of situations such as customer information is likely occurred excalation, organizes machine Structure coding information saturation degree is not high, carries out accurately identifying the client that partial information can be caused sufficiently complete by organization mechanism code Identification can not be realized and integrated, limit the scope of uniqueness identification corporate customer.
Therefore, prior art has yet to be improved and developed.
The content of the invention
In view of in place of above-mentioned the deficiencies in the prior art, it is an object of the invention to provide a kind of information uniqueness identification side Method, application server, system and storage medium, it can precisely be identified or be obscured respectively according to the type of different corporate customers and be known Not, asking for uniqueness identification and Data Integration can not be carried out by solving the part corporate customer caused by customer information is imperfect Topic.
In order to achieve the above object, this invention takes following technical scheme:
A kind of information uniqueness recognition methods, it comprises the following steps:
The essential information of the corporate customer stored in each source database is obtained, wherein, the essential information includes customer name And identification information;
According to the identification information by each corporate customer labeled as precisely identification class or fuzzy diagnosis class;
The corporate customer in precisely identification class and fuzzy diagnosis class is precisely identified and mould respectively according to default recognition rule Paste identification, identify the corporate customer of same client each other;
The precisely recognition result of identification and fuzzy diagnosis is obtained, and it is whole according to essential information progress of the recognition result to corporate customer Close, draw the corporate customer information of uniqueness identification.
It is described that each corporate customer is labeled as by essence according to the identification information in described information uniqueness recognition methods The step of quasi- identification class or fuzzy diagnosis class, includes:
Parse the identification information of each corporate customer;
Judge whether comprising accurate identification information is preset in the identification information of each corporate customer, if so, then labeled as precisely knowledge Other class;Otherwise it is labeled as fuzzy diagnosis class.
In described information uniqueness recognition methods, the basis presets recognition rule to precisely identification class and fuzzy diagnosis Corporate customer in class is precisely identified respectively and fuzzy diagnosis, identification each other the corporate customer of same client the step of wrap Include:
Text detection is carried out to the customer name of all entities client, obtains the number of words and word content of customer name;
Uniqueness identification is carried out according to the default precisely identification information of each corporate customer in accurate identification class and customer name, known The corporate customer of same client each other in not described precisely identification class;
Uniqueness is carried out according to each identification information of corporate customer, the number of words of customer name and word content in fuzzy diagnosis class Identification, identify in the fuzzy diagnosis class corporate customer of same client each other.
In described information uniqueness recognition methods, the basis precisely identify each corporate customer in class it is default precisely Identification information and customer name carry out uniqueness identification, identify in the precisely identification class corporate customer of same client each other Step includes:
Any corporate customer chosen in precisely identification class, is preset accurate identification information and customer name with precisely knowing Default precisely identification information and the customer name of other corporate customers are contrasted in other class;
Judge whether to preset that accurate identification information is identical and customer name identical corporate customer according to comparing result, if In the presence of will then preset that accurate identification information is identical and customer name identical corporate customer and the corporate customer that is selected identify For same client;If being not present, the corporate customer being selected and other corporate customers in precisely identification class are identified as difference Client;
Continue to choose another corporate customer in precisely identification class and carry out uniqueness identification with other corporate customers, until accurate Identify that corporate customer all in class is identified.
In described information uniqueness recognition methods, the identification letter of each corporate customer in the class according to fuzzy diagnosis Breath, the number of words of customer name and word content carry out uniqueness identification, identify in the fuzzy diagnosis class same client's each other The step of corporate customer, includes:
Judge whether the customer name number of words of each corporate customer in fuzzy diagnosis class is more than or equal to predetermined threshold value;
If being more than or equal to predetermined threshold value, uniqueness knowledge is carried out according to the number of words of the customer name of corporate customer and word content Not;
If being less than predetermined threshold value, carried out according to the identification information of corporate customer, the number of words of customer name and word content unique Property identification.
In described information uniqueness recognition methods, if described be more than or equal to predetermined threshold value, according to the visitor of corporate customer The step of number of words and word content progress uniqueness identification that name in an account book claims, includes:
Any corporate customer chosen customer name number of words in fuzzy diagnosis class and be more than or equal to predetermined threshold value, by its customer name Claim to carry out word content contrast with the customer name of all entities client;
The identical corporate customer of the word content of customer name is judged whether according to comparing result, if in the presence of, by The identical corporate customer of word content is identified as same client with the corporate customer being selected;, will be selected if being not present The corporate customer taken is identified as different clients with other all entities client;
Continue to choose customer name number of words in fuzzy diagnosis class and be more than or equal to another corporate customer progress of predetermined threshold value uniquely Property identification, until in fuzzy diagnosis class all customer name numbers of words be more than or equal to predetermined threshold value corporate customer it is identified.
In described information uniqueness recognition methods, if described be less than predetermined threshold value, believed according to the identification of corporate customer The step of breath, the number of words of customer name and word content progress uniqueness identification, includes:
Any corporate customer chosen customer name number of words in fuzzy diagnosis class and be less than predetermined threshold value, by its customer name with The customer name of all entities client carries out word content contrast;
The identical corporate customer of the word content of customer name is judged whether according to comparing result, if in the presence of, after It is continuous to judge whether the identical corporate customer of word content and the corporate customer being selected have any identical identification information, Word content is identical and be identified as with the corporate customer of any identical identification information with the corporate customer being selected Same client;If being not present, the corporate customer being selected and other all entities client are identified as different clients;
Continue to choose another corporate customer progress uniqueness knowledge that customer name number of words in fuzzy diagnosis class is less than predetermined threshold value Not, until in fuzzy diagnosis class all customer name numbers of words be less than predetermined threshold value corporate customer it is identified.
A kind of application server of information uniqueness identification, it includes:Processor, memory and communication bus;
Being stored with the memory can be by the computer-readable program of the computing device;
The communication bus realizes the connection communication between processor and memory;
The step in information uniqueness recognition methods as described above is realized described in the computing device during computer-readable program Suddenly.
A kind of computer-readable recording medium, the computer-readable recording medium storage have one or more program, One or more of programs can be by one or more computing device, to realize that information uniqueness as described above identifies Step in method.
A kind of information uniqueness identifying system, including several source databases, it is unique that it also includes information as described above Property identification application server;
Each source database, for storing the essential information of corporate customer;
The application server, for obtaining the essential information of the corporate customer stored in each source database, wherein, the base This information includes customer name and identification information;And each corporate customer is identified labeled as accurate according to the identification information Class or fuzzy diagnosis class;The corporate customer in precisely identification class and fuzzy diagnosis class is carried out respectively according to default recognition rule Precisely identification and fuzzy diagnosis, identify the corporate customer of same client each other;And obtain the precisely identification of identification and fuzzy diagnosis As a result, the essential information of the corporate customer of all clients same each other is integrated according to recognition result, show that uniqueness is known Other corporate customer information.
Compared to prior art, the recognition methods of information uniqueness, application server, system and storage provided by the invention are situated between In matter, the recognition methods of described information uniqueness by obtaining the essential information of the corporate customer stored in each source database, its In, the essential information includes customer name and identification information;Each corporate customer is marked according to the identification information afterwards For precisely identification class or fuzzy diagnosis class;Afterwards according to default recognition rule to the group in precisely identification class and fuzzy diagnosis class Body client is precisely identified respectively and fuzzy diagnosis, identifies the corporate customer of same client each other;Precisely identification is obtained afterwards With the recognition result of fuzzy diagnosis, and the essential information of the corporate customer of all clients same each other is carried out according to recognition result Integrate, draw the corporate customer information of uniqueness identification.Can precisely be identified respectively according to the type of different corporate customers or Fuzzy diagnosis, solve because the imperfect caused part corporate customer of customer information can not carry out uniqueness identification and data are whole The problem of conjunction.
Brief description of the drawings
Fig. 1 is the application environment schematic diagram of information uniqueness recognition methods provided by the invention;
Fig. 2 is the flow chart of information uniqueness recognition methods provided by the invention;
Fig. 3 is the flow chart of step S20 in information uniqueness recognition methods provided by the invention;
Fig. 4 is the flow chart of step S30 in information uniqueness recognition methods provided by the invention;
Fig. 5 is the flow chart of step S32 in information uniqueness recognition methods provided by the invention;
Fig. 6 is the flow chart of step S33 in information uniqueness recognition methods provided by the invention;
Fig. 7 is the flow chart of step S332 in information uniqueness recognition methods provided by the invention;
Fig. 8 is the flow chart of step S333 in information uniqueness recognition methods provided by the invention;
Fig. 9 is the running environment schematic diagram of the preferred embodiment of information uniqueness recognizer of the present invention;
Figure 10 is the functional block diagram of the application server preferred embodiment of mount message uniqueness recognizer of the present invention;
Figure 11 is the structured flowchart of information uniqueness identifying system provided by the invention.
Embodiment
In view of the shortcomings of uniqueness identification can not be carried out when customer information is incomplete in the prior art and is integrated, of the invention Purpose is to provide a kind of recognition methods of information uniqueness, application server, system and storage medium, can be according to different groups visitor The type at family is precisely identified respectively or fuzzy diagnosis, solves due to the imperfect caused part corporate customer of customer information The problem of uniqueness identification and Data Integration can not be carried out.
To make the purpose of the present invention, technical scheme and effect clearer, clear and definite, develop simultaneously embodiment pair referring to the drawings The present invention is further described.It should be appreciated that specific embodiment described herein is not used to only to explain the present invention Limit the present invention.
Referring to Fig. 1, it is the application environment schematic diagram of information uniqueness recognition methods provided by the invention., should in figure Related data is handled with one or more application programs can be installed in server.In the present embodiment, application service can be passed through Device receives the essential information of the corporate customer stored in each source database, and by application server according to the basic letter Each corporate customer is identified class or fuzzy diagnosis class by breath labeled as accurate, to the group in precisely identification class and fuzzy diagnosis class Body client is precisely identified respectively and fuzzy diagnosis, identifies the corporate customer of same client each other, according to recognition result to group The essential information of body client is integrated, it is achieved thereby that precisely being identified respectively according to the type of different corporate customers or mould Paste identification, avoid the imperfect situation for leading to not identification of customer information.
Referring to Fig. 2, information uniqueness recognition methods provided by the invention comprises the following steps:
S10, the essential information for obtaining the corporate customer stored in each source database, wherein, the essential information includes client Title and identification information.
In the present embodiment, the corporate customer data that several source databases correspondingly store Different Industries company can be set, by May there is business dealing with the Different Industries company under same house flag in a corporate customer, so as in different source datas The data of the corporate customer are stored with storehouse, it is therefore desirable to which the corporate customer in all source databases is subjected to uniqueness identification And integration, in order to the management and data analysis of corporate customer, specially obtained from each source database of different industry companies The essential information of the corporate customer stored is taken, wherein, the essential information includes customer name and identification information, specific described Identification information is that can recognize that the information of corporate customer identity, such as organization mechanism code, industrial and commercial registration number, the tax registration number, battalion Industry license number etc..
Preferably, the source database is oracle database(Oracle database also known as Oracle RDBMS, or referred to as Oracle, it is a relational database management system of Oracle), MySQL(MySQL is the small-sized of open source code Correlation data base management system)Database or PostgreSQL(PostgreSQL is a free Object-Relation data Storehouse server)Database, the target database are hive(Hive is a Tool for Data Warehouse based on Hadoop)Data Storehouse.These databases are common and easily operated data base management system and instrument, are easy in the present embodiment to data Analyzed and handled.
S20, according to the identification information by each corporate customer labeled as precisely identification class or fuzzy diagnosis class.
After the essential information of all entities client is obtained, because the identification information integrity degree of different corporate customers differs Sample, therefore be marked as according to the identification information of different corporate customers precisely identifying class or fuzzy diagnosis class, to inhomogeneity Other client carries out corresponding uniqueness identification, and no matter client identification information is complete or missing can realize the identification of corporate customer And integrate, the application of information uniqueness identification is widened, referring to Fig. 3, it is identified for information uniqueness provided by the invention Step S20 flow chart in method.
As shown in figure 3, the step S20 includes:
S21, each corporate customer of parsing identification information;
S22, judge whether comprising accurate identification information is preset in the identification information of each corporate customer, if so, then labeled as essence Quasi- identification class;Otherwise it is labeled as fuzzy diagnosis class.
I.e. after the essential information of all entities client is obtained, the identification information of each corporate customer is parsed, is drawn every The content that the identification information of individual corporate customer is included, whether judge successively in the identification information of each corporate customer comprising default Accurate identification information, labeled as precisely identification class if having, otherwise labeled as fuzzy diagnosis class, so as to realize the knowledge of corporate customer Do not classify, data base is provided for follow-up targetedly identification process.In the present embodiment, the default accurate identification information is preferably This unique, thick-and-thin marking code of organization mechanism code, will include the group of organization mechanism code in identification information Body client is labeled as precisely identification class, so as to fast and accurately precisely be identified, and by not comprising organization mechanism code Corporate customer is labeled as fuzzy diagnosis class, is integrated by other customer informations and carries out fuzzy diagnosis, realizes hierarchical identification, enriches The application scenarios of information uniqueness identification.
The default recognition rule of S30, basis is carried out precisely respectively to the corporate customer in precisely identification class and fuzzy diagnosis class Identification and fuzzy diagnosis, identify the corporate customer of same client each other.
In the present embodiment, after being classified to all entities client, enter respectively for different classes of corporate customer Row precisely identification and fuzzy diagnosis, effectively covers all entities customer range, can recognize that same each other in not genbank database The corporate customer of one client, so as to the management and analysis of corporate customer data.Referring to Fig. 4, it is information provided by the invention Step S30 flow chart in uniqueness recognition methods.
As shown in figure 4, the step S30 includes:
S31, the customer name to all entities client carry out text detection, obtain the number of words and word content of customer name;
S32, basis precisely identify the default precisely identification information of each corporate customer and customer name progress uniqueness knowledge in class Not, the corporate customer of same client each other in the precisely identification class is identified;
S33, carried out only according to each identification information of corporate customer in fuzzy diagnosis class, the number of words of customer name and word content One property identifies, identifies in the fuzzy diagnosis class corporate customer of same client each other.
In the present embodiment, after corporate customer classification is carried out, word inspection first is carried out to the customer name of all entities client Survey, obtain the number of words of customer name and word content and preserve, wherein text detection identification can use existing OCR Text regions Technology, it is different according to client's classification afterwards, for the client in precisely identification class, due to including default essence in its identification information Quasi- identification information(It is organization mechanism code in the present embodiment), therefore according to the default essence of each corporate customer in accurate identification class Quasi- identification information and customer name carry out uniqueness identification, and for the client in fuzzy diagnosis class, due in its identification information Not comprising default accurate identification information, it is therefore desirable to the other guide in identification information, and combine the number of words of customer name And in word content and uniqueness identification is carried out, it both ensure that the client in precisely identification class can quickly and accurately be known Not, it also ensure that the client in fuzzy diagnosis class can integrate other every essential informations and be identified, meet different information completelies The client of degree identifies demand.
The process wherein precisely identified is referring to Fig. 5, it is step in information uniqueness recognition methods provided by the invention S32 flow chart, as shown in figure 5, the step S32 includes:
S321, any corporate customer chosen in precisely identification class, preset accurate identification information and customer name with Precisely default precisely identification information and the customer name of other corporate customers are contrasted in identification class;
S322, according to comparing result judge whether to preset that accurate identification information is identical and customer name identical group visitor Family, if in the presence of will preset that accurate identification information is identical and customer name identical corporate customer and the corporate customer that is selected It is identified as same client;If being not present, the corporate customer being selected and other corporate customers in precisely identification class are identified as Different clients;
S323, continue to choose another corporate customer and the progress uniqueness identification of other corporate customers precisely identified in class, directly Into accurate identification class, all corporate customers are identified.
When precisely being identified, any corporate customer chosen in precisely identification class, Institution Code is organized into Contrasted with customer name and other corporate customers in precisely identification class, uniqueness identification is carried out according to comparing result, sentenced Breaking, identical with the presence or absence of organization mechanism code and customer name is identical(Number of words and word content all same)Corporate customer, such as Fruit is present, then by all organization mechanism codes are identical and customer name identical corporate customer and the corporate customer that is selected are known Not Wei same client, no matter identifying that how many individual organization mechanism codes are identical and customer name identical corporate customer, will It is identified as same client with the corporate customer being selected, and all is identified as precisely being identified subsequently for same client Shi Wuxu is repeated to identify, is saved identification and take;If it does not exist, then by the corporate customer being selected with precisely identification class in its His corporate customer is identified as different clients, continues to choose another corporate customer in precisely identification class afterwards and carries out above-mentioned identification Process, until all entities client in precisely identification class is identified.
For example, in the corporate customer that never genbank database is got, have ten labeled as the corporate customer of accurate identification class It is individual, be designated as client 1, client 2 ..., client 10, when precisely being identified, arbitrarily choose one of corporate customer start into Row uniqueness identifies, such as chooses client 1, is organized into Institution Code and customer name and other nine corporate customers and carries out pair Than showing that the organization mechanism code of client 3 and client 4 and customer name are identical with client 1, now by client 1, client 3 and client 4 be identified as same client, the customer data of three can be integrated, and without again when subsequently precisely being identified Client 3 and client 4 are identified;Continue the organization mechanism code of client 2 and customer name and other six groups afterwards Client is contrasted, and show that the organization mechanism code of no corporate customer and customer name are identical with client 2, now will Client 2 is identified as different clients with other nine corporate customers;By above-mentioned identification process successively again to client 5, client 6 ..., visitor Family 10 carries out uniqueness identification, so as to which ten clients in accurate identification class are carried out into uniqueness identification, in favor of corporate customer Data analysis and management.
Wherein the process of fuzzy diagnosis is referring to Fig. 6, it is step in information uniqueness recognition methods provided by the invention S33 flow chart, as shown in fig. 6, the step S33 includes:
S331, judge whether the customer name number of words of each corporate customer in fuzzy diagnosis class is more than or equal to predetermined threshold value;
If S332, being more than or equal to predetermined threshold value, carried out according to the number of words of the customer name of corporate customer and word content unique Property identification;
If S333, being less than predetermined threshold value, carried out according to the identification information of corporate customer, the number of words of customer name and word content Uniqueness identifies
I.e. when carrying out fuzzy diagnosis, different identification process is carried out according to the customer name number of words of corporate customer, first judges mould Whether the customer name number of words of each corporate customer is more than predetermined threshold value in paste identification class, in the present embodiment, the predetermined threshold value Preferably 8, that is, judge whether the customer name of each corporate customer in fuzzy diagnosis class is more than or equal to 8 words, if greater than etc. In 8 words, now because customer name is longer, it is very small the probability of different clients duplication of name occur, therefore according to customer name word Number and word content carry out uniqueness identification, if less than 8 words, then in order to further confirm that the information of corporate customer, not only The number of words and word content of customer name are needed, also needs to carry out uniqueness identification in the lump with reference to its identification information, according to different visitors Name in an account book claims number of words to carry out further Classification and Identification, while ensure that recognition accuracy and recognition efficiency.
Specifically, also referring to Fig. 7, it is step S332 in information uniqueness recognition methods provided by the invention Flow chart, as shown in fig. 7, the step S332 includes:
S3321, any corporate customer chosen customer name number of words in fuzzy diagnosis class and be more than or equal to predetermined threshold value, by it Customer name and the customer name of all entities client carry out word content contrast;
S3322, the identical corporate customer of word content for judging whether according to comparing result customer name, if depositing The identical corporate customer of word content and the corporate customer being selected then are being identified as same client;If being not present, The corporate customer being selected and other all entities client are identified as different clients;
S3323, continue choose fuzzy diagnosis class in customer name number of words be more than or equal to predetermined threshold value another corporate customer enter Row uniqueness identifies, until all customer name numbers of words are known more than or equal to the corporate customer of predetermined threshold value in fuzzy diagnosis class Not.
When carrying out fuzzy diagnosis, it is more than or equal to predetermined threshold value for customer name number of words(It is 8 words in the present embodiment) Situation, the corporate customer that title number of words in fuzzy diagnosis class is more than or equal to 8 words is arbitrarily chosen first, by its customer name Claim and all entities client(Including the corporate customer in precisely identification class and fuzzy diagnosis class)Customer name carry out word in Hold contrast, uniqueness identification is carried out according to comparing result, judges whether the identical group of word content of customer name Body client, if the identical corporate customer of word content and the corporate customer that is selected are identified as into same client in the presence of if, No matter the same word content identical corporate customer for identifying how many individual customer names, by itself and the corporate customer that is selected Be identified as same client, and it is all be identified as same client subsequently carry out fuzzy diagnosis when without repeat identify, save Identification is time-consuming;If the corporate customer being selected and other all entities client are identified as into different clients in the absence of if, afterwards Continue to choose the above-mentioned identification process of corporate customer progress that another customer name in fuzzy diagnosis class is more than or equal to 8 words, Until all customer names are identified more than or equal to the corporate customer of 8 words in fuzzy diagnosis class.
For example, in the corporate customer that never genbank database is got, labeled as fuzzy diagnosis class and customer name number of words Have ten more than or equal to the corporate customer of 8 words, be designated as client 11, client 12 ..., client 20, when carrying out fuzzy diagnosis, appoint Meaning chooses one of corporate customer and proceeds by uniqueness identification, such as chooses client 11, by its customer name and other institutes There is corporate customer to be contrasted, be same client due to client 11 may now occur with the corporate customer in precisely identification class, But cause to be labeled as fuzzy diagnosis class due to having lacked organization mechanism code in its identification information, therefore, carrying out fuzzy knowledge , it is necessary to which the corporate customer being selected and the title of other all entities client are contrasted when other, it is ensured that identification it is comprehensive And accuracy, assume that contrast draws the visitor of the client 2 in precisely identification class, the client 13 in fuzzy diagnosis class and client 14 afterwards Name in an account book claims word content identical with client 11, then client 2, client 11, client 13 and client 14 is identified as into same visitor Family, four customer data can be integrated;Continue to carry out the customer name of client 12 and other six corporate customers afterwards Contrast, show that the customer name of no corporate customer is identical with client 12, now by client 12 and other all entities visitor Family is identified as different clients;By above-mentioned identification process successively again to client 15, client 16 ..., client 20 carry out uniqueness identification, Name Completion number of words is more than the uniqueness identification process of the corporate customer of 8 words.
Further, referring to Fig. 8, it is the flow of step S333 in information uniqueness recognition methods provided by the invention Figure, as shown in figure 8, the step S333 includes:
S3331, any corporate customer chosen customer name number of words in fuzzy diagnosis class and be less than predetermined threshold value, by its client Title and the customer name of all entities client carry out word content contrast;
S3331, the identical corporate customer of word content for judging whether according to comparing result customer name, if depositing Then continuing to judge whether the identical corporate customer of word content there is any identical to know with the corporate customer being selected Other information, by word content it is identical and with any identical identification information corporate customer and be selected group visitor Family is identified as same client;If being not present, the corporate customer being selected is identified as different visitors from other all entities client Family;
S3331, continue choose fuzzy diagnosis class in customer name number of words be less than predetermined threshold value another corporate customer carry out only One property identifies, until all customer name numbers of words are identified less than the corporate customer of predetermined threshold value in fuzzy diagnosis class.
When carrying out fuzzy diagnosis, it is less than predetermined threshold value for customer name number of words(It is 8 words in the present embodiment)Feelings Shape, first any corporate customer chosen title number of words in fuzzy diagnosis class and be less than 8 words, by its customer name and all groups Body client(Including the corporate customer in precisely identification class and fuzzy diagnosis class)Customer name carry out word content contrast, according to Comparing result carries out uniqueness identification, the identical corporate customer of word content of customer name is judged whether, if depositing , because the corporate customer title being selected is shorter, easy situation about bearing the same name, therefore continue to judge the complete phase of word content Whether same corporate customer and the corporate customer being selected have any identical identification information, such as industrial and commercial registration number, the tax At least one in registration number, business license number is identical, and customer name word content is identical, and has any same identification The corporate customer of information is identified as same client with the corporate customer being selected;If in the absence of if by the corporate customer being selected with Other all entities client is identified as different clients, continues to choose another customer name in fuzzy diagnosis class afterwards and is less than 8 The corporate customer of individual word carries out above-mentioned identification process, until all customer names are less than the group visitor of 8 words in fuzzy diagnosis class It is identified per family, completing customer name in fuzzy diagnosis class with reference to identification information and customer name is less than the information of 8 words only One property identifies.
S40, the precisely recognition result of identification and fuzzy diagnosis is obtained, and according to recognition result by all clients same each other The essential information of corporate customer integrated, draw the corporate customer information of uniqueness identification.
In the present embodiment, after accurate identification and fuzzy diagnosis has been carried out respectively for different classes of corporate customer, know Do not go out the corporate customer of all clients same each other, the precisely recognition result of identification and fuzzy diagnosis is now obtained, according to identification As a result the essential information of the corporate customer of same client each other is integrated, specially by the client of all clients same each other Title carries out unification, and respective identification information is carried out into complementary integration, draws the corporate customer information of uniqueness identification, according to The corporate customer information of uniqueness identification can disposably obtain client's number that a certain corporate customer is stored in not genbank database According to, realize and the same customer data in not genbank database is subjected to unified integration analysis, the visitor after being integrated according to identification User data carries out data analysis, with reference to the number of its user data energy comprehensive analysis corporate customer stored in Different Industries company According to life cycle, tendentiousness consumption wish and risk control information etc., be advantageous to the follow-up and management of corporate customer.
As shown in figure 9, being based on above- mentioned information uniqueness recognition methods, the present invention further correspondingly provides a kind of information uniqueness The application server of identification, it includes processor 10, memory 20 and display 30.Fig. 9 illustrate only the identification of information uniqueness Application server members, it should be understood that being not required for implementing all components shown, the reality that can be substituted Apply more or less components.
The memory 20 can be the inside of the application server of described information uniqueness identification in certain embodiments Memory cell, such as the hard disk or internal memory of application server.The memory 20 can also be described in further embodiments On the External memory equipment of the application server of information uniqueness identification, such as the application server of described information uniqueness identification The plug-in type hard disk of outfit, intelligent memory card(Smart Media Card, SMC), secure digital(Secure Digital, SD)Card, flash card(Flash Card)Deng.Further, the memory 20 can also both include institute's information uniqueness identification The internal storage unit of application server also include External memory equipment.The memory 20 is installed on the letter for storage Cease the application software and Various types of data of the application server of uniqueness identification, such as the application of mount message uniqueness identification Program code of server etc..The memory 20 can be also used for temporarily storing the number that has exported or will export According to.In certain embodiments, information uniqueness recognizer 40 is stored with memory 20, the information uniqueness recognizer 40 Can be performed by processor 10, so as to realize the information uniqueness recognition methods of each embodiment of the application.
The processor 10 can be a central processing unit in certain embodiments(Central Processing Unit, CPU), microprocessor or other data processing chips, for running the program code stored in the memory 20 or processing number According to, such as perform described purview certification method etc..
The display 30 can be light-emitting diode display, liquid crystal display, touch-control liquid crystal display in certain embodiments And OLED(Organic Light-Emitting Diode, Organic Light Emitting Diode)Touch device etc..The display 30 is used In the information for the application server for being shown in the identification of described information uniqueness and for showing visual user interface, such as Recognition result interface etc..The part 10-30 of the application server of described information uniqueness identification is in communication with each other by system bus.
In certain embodiments, realized when processor 10 performs information uniqueness recognizer 40 in the memory 20 Following steps:
The essential information of the corporate customer stored in each source database is obtained, wherein, the essential information includes customer name And identification information;
According to the identification information by each corporate customer labeled as precisely identification class or fuzzy diagnosis class;
The corporate customer in precisely identification class and fuzzy diagnosis class is precisely identified and mould respectively according to default recognition rule Paste identification, identify the corporate customer of same client each other;
The precisely recognition result of identification and fuzzy diagnosis is obtained, and it is according to recognition result that the group of all clients same each other is objective The essential information at family is integrated, and draws the corporate customer information of uniqueness identification.
Further, it is described to be incited somebody to action respectively according to the identification information in the application server of described information uniqueness identification Individual corporate customer includes labeled as the step of precisely identification class or fuzzy diagnosis class:
Parse the identification information of each corporate customer;
Judge whether comprising accurate identification information is preset in the identification information of each corporate customer, if so, then labeled as precisely knowledge Other class;Otherwise it is labeled as fuzzy diagnosis class.
The basis presets recognition rule and the corporate customer in precisely identification class and fuzzy diagnosis class is carried out precisely respectively Identification and fuzzy diagnosis, identification each other the corporate customer of same client the step of include:
Text detection is carried out to the customer name of all entities client, obtains the number of words and word content of customer name;
Uniqueness identification is carried out according to the default precisely identification information of each corporate customer in accurate identification class and customer name, known The corporate customer of same client each other in not described precisely identification class;
Uniqueness is carried out according to each identification information of corporate customer, the number of words of customer name and word content in fuzzy diagnosis class Identification, identify in the fuzzy diagnosis class corporate customer of same client each other.
The basis precisely identifies that the default precisely identification information of each corporate customer and customer name are carried out unique in class Property identification, identify it is described precisely identification class in each other the corporate customer of same client the step of include:
Any corporate customer chosen in precisely identification class, is preset accurate identification information and customer name with precisely knowing Default precisely identification information and the customer name of other corporate customers are contrasted in other class;
Judge whether to preset that accurate identification information is identical and customer name identical corporate customer according to comparing result, if In the presence of will then preset that accurate identification information is identical and customer name identical corporate customer and the corporate customer that is selected identify For same client;If being not present, the corporate customer being selected and other corporate customers in precisely identification class are identified as difference Client;
Continue to choose another corporate customer in precisely identification class and carry out uniqueness identification with other corporate customers, until accurate Identify that corporate customer all in class is identified.
The each identification information of corporate customer, the number of words of customer name and word content enter in the class according to fuzzy diagnosis The step of row uniqueness identifies, identifies in the fuzzy diagnosis class corporate customer of same client each other includes:
Judge whether the customer name number of words of each corporate customer in fuzzy diagnosis class is more than or equal to predetermined threshold value;
If being more than or equal to predetermined threshold value, uniqueness knowledge is carried out according to the number of words of the customer name of corporate customer and word content Not;
If being less than predetermined threshold value, carried out according to the identification information of corporate customer, the number of words of customer name and word content unique Property identification.
If described be more than or equal to predetermined threshold value, carried out only according to the number of words of the customer name of corporate customer and word content The step of one property identifies includes:
Any corporate customer chosen customer name number of words in fuzzy diagnosis class and be more than or equal to predetermined threshold value, by its customer name Claim to carry out word content contrast with the customer name of all entities client;
The identical corporate customer of the word content of customer name is judged whether according to comparing result, if in the presence of, by The identical corporate customer of word content is identified as same client with the corporate customer being selected;, will be selected if being not present The corporate customer taken is identified as different clients with other all entities client;
Continue to choose customer name number of words in fuzzy diagnosis class and be more than or equal to another corporate customer progress of predetermined threshold value uniquely Property identification, until in fuzzy diagnosis class all customer name numbers of words be more than or equal to predetermined threshold value corporate customer it is identified.
If described be less than predetermined threshold value, according to the identification information of corporate customer, the number of words of customer name and word content The step of carrying out uniqueness identification includes:
Any corporate customer chosen customer name number of words in fuzzy diagnosis class and be less than predetermined threshold value, by its customer name with The customer name of all entities client carries out word content contrast;
The identical corporate customer of the word content of customer name is judged whether according to comparing result, if in the presence of, after It is continuous to judge whether the identical corporate customer of word content and the corporate customer being selected have any identical identification information, Word content is identical and be identified as with the corporate customer of any identical identification information with the corporate customer being selected Same client;If being not present, the corporate customer being selected and other all entities client are identified as different clients;
Continue to choose another corporate customer progress uniqueness knowledge that customer name number of words in fuzzy diagnosis class is less than predetermined threshold value Not, until in fuzzy diagnosis class all customer name numbers of words be less than predetermined threshold value corporate customer it is identified.
Referring to Fig. 10, its work(for the application server preferred embodiment of mount message uniqueness recognizer of the present invention Can module map.In the present embodiment, the application server of mount message uniqueness recognizer can be divided into one or more Individual module, one or more of modules are stored in the memory 20, and by one or more processors(This implementation Example is the processor 10)It is performed, to complete the present invention.For example, in Fig. 10, mount message uniqueness recognizer is answered It can be divided into acquisition module 21, sort module 22, identification module 23 with server and integrate module 24.Alleged by the present invention Module is the series of computation machine programmed instruction section for referring to complete specific function, than program more suitable for describing the group visitor Implementation procedure of the family uniqueness recognizer in the application server that the corporate customer uniqueness identifies.Description will tool below Body introduces the function of the module 21-24.
Acquisition module 21, for obtaining the essential information of the corporate customer stored in each source database, wherein, the base This information includes customer name and identification information;
Sort module 22, for each corporate customer to be identified into class or fuzzy diagnosis labeled as accurate according to the identification information Class;
Identification module 23, for being entered respectively to the corporate customer in precisely identification class and fuzzy diagnosis class according to default recognition rule Row precisely identification and fuzzy diagnosis, identify the corporate customer of same client each other;
Integrate module 24, for obtain precisely identification and fuzzy diagnosis recognition result, and according to recognition result by it is all each other The essential information of the corporate customer of same client is integrated, and draws the corporate customer information of uniqueness identification.
The sort module 22 includes:
Resolution unit, for parsing the identification information of each corporate customer;
Taxon, whether comprising accurate identification information is preset in the identification information for judging each corporate customer, if so, then Labeled as accurate identification class;Otherwise it is labeled as fuzzy diagnosis class.
The identification module 23 includes:
Detection unit, for carrying out text detection to the customer name of all entities client, obtain the number of words and text of customer name Word content;
Accurate recognition unit, for default precisely identification information and the customer name according to each corporate customer in precisely identification class Uniqueness identification is carried out, identifies the corporate customer of same client each other in the precisely identification class;
Fuzzy diagnosis unit, for according to each identification information of corporate customer in fuzzy diagnosis class, the number of words of customer name and Word content carries out uniqueness identification, identifies in the fuzzy diagnosis class corporate customer of same client each other.
The precisely recognition unit includes:
First comparing subunit, for a corporate customer in any selection precisely identification class, preset precisely identification letter Default precisely identification information and the customer name of breath and customer name and other corporate customers in precisely identification class are contrasted;
First identification subelement, for judging whether to preset according to comparing result, accurate identification information is identical and customer name Claim identical corporate customer, if in the presence of will preset that accurate identification information is identical and customer name identical corporate customer and quilt The corporate customer of selection is identified as same client;If being not present, by the corporate customer being selected and other in precisely identification class Corporate customer is identified as different clients.
The fuzzy diagnosis unit includes:
First judgment sub-unit, for judging whether the customer name number of words of each corporate customer in fuzzy diagnosis class is more than or equal to Predetermined threshold value;
Second identification subelement, if for being more than or equal to predetermined threshold value, according to the number of words and text of the customer name of corporate customer Word content carries out uniqueness identification;
3rd identification subelement, if for being less than predetermined threshold value, according to the identification information of corporate customer, the number of words of customer name Uniqueness identification is carried out with word content.
The second identification subelement includes;
Second comparing subunit, it is more than or equal to one of predetermined threshold value for customer name number of words in any selection fuzzy diagnosis class Corporate customer, its customer name and the customer name of all entities client are subjected to word content contrast;
Second judgment sub-unit, the identical group of word content for judging whether customer name according to comparing result Body client, if in the presence of the identical corporate customer of word content and the corporate customer that is selected are identified as into same client; If being not present, the corporate customer being selected and other all entities client are identified as different clients.
The 3rd identification subelement includes:
3rd comparing subunit, a group of predetermined threshold value is less than for customer name number of words in any selection fuzzy diagnosis class Client, its customer name and the customer name of all entities client are subjected to word content contrast;
3rd judgment sub-unit, the identical group of word content for judging whether customer name according to comparing result Body client, if in the presence of continuing to judge whether the identical corporate customer of word content has with the corporate customer being selected Any identical identification information, by word content is identical and corporate customer and quilt with any identical identification information The corporate customer of selection is identified as same client;If being not present, by the corporate customer being selected and other all entities client It is identified as different clients.
A kind of information uniqueness is correspondingly provided based on the recognition methods of above- mentioned information uniqueness and the application server present invention Identifying system, Figure 11 is referred to, it includes several source databases 110 and the application clothes of information uniqueness as described above identification Business device 120.
Wherein, each source database 110 is used for the essential information for storing corporate customer, and the application server 120 is used for The essential information of the corporate customer stored in each source database is obtained, wherein, the essential information includes customer name and knowledge Other information;And each corporate customer is identified by class or fuzzy diagnosis class labeled as accurate according to the identification information;According to Corporate customer in precisely identification class and fuzzy diagnosis class is precisely identified default recognition rule respectively and fuzzy diagnosis, knows The not corporate customer of same client each other;And the precisely recognition result of identification and fuzzy diagnosis is obtained, according to recognition result by institute The essential information for having the corporate customer of same client each other is integrated, and draws the corporate customer information of uniqueness identification.
The workflow of information uniqueness identifying system is as follows in the present embodiment:
Application server obtains the essential information of the corporate customer stored in each source database, wherein, the essential information bag Include customer name and identification information;
Application server is according to the identification information by each corporate customer labeled as precisely identification class or fuzzy diagnosis class;
Application server carries out essence respectively according to default recognition rule to the corporate customer in precisely identification class and fuzzy diagnosis class Quasi- identification and fuzzy diagnosis, identify the corporate customer of same client each other;
Application server obtains the precisely recognition result of identification and fuzzy diagnosis, and according to recognition result by all visitors same each other The essential information of the corporate customer at family is integrated, and draws the corporate customer information of uniqueness identification.
Further, in described information uniqueness identifying system, the application server will according to the identification information Each corporate customer includes labeled as the precisely flow of identification class or fuzzy diagnosis class:
Application server parses the identification information of each corporate customer;
Whether application server, which judges to include in the identification information of each corporate customer, presets accurate identification information, if so, then marking It is designated as precisely identifying class;Otherwise it is labeled as fuzzy diagnosis class.
The application server is according to default recognition rule to the corporate customer point in precisely identification class and fuzzy diagnosis class Do not identified precisely and fuzzy diagnosis, the flow of the identification corporate customer of same client each other includes:
Application server carries out text detection to the customer name of all entities client, obtains in the number of words and word of customer name Hold;
Application server is carried out only according to the default precisely identification information and customer name of each corporate customer in accurate identification class One property identifies, identifies the corporate customer of same client each other in the precisely identification class;
Application server is according to each identification information of corporate customer, the number of words of customer name and word content in fuzzy diagnosis class Uniqueness identification is carried out, identifies in the fuzzy diagnosis class corporate customer of same client each other.
The application server is according to accurate default precisely identification information and the customer name for identifying each corporate customer in class Claim to carry out uniqueness identification, identify that the flow of the corporate customer of same client includes each other in the precisely identification class:
Application server arbitrarily chooses a corporate customer in precisely identification class, is preset accurate identification information and customer name Default precisely identification information and customer name with other corporate customers in precisely identification class is claimed to be contrasted;
Application server judges whether to preset that accurate identification information is identical and customer name identical group according to comparing result Body client, if in the presence of will preset that accurate identification information is identical and customer name identical corporate customer and the group that is selected Client is identified as same client;If being not present, the corporate customer being selected and other corporate customers in precisely identification class are known Wei not different clients;
Another corporate customer that application server continues to choose in precisely identification class carries out uniqueness knowledge with other corporate customers Not, until corporate customer all in precisely identification class is identified.
The application server according to each identification information of corporate customer in fuzzy diagnosis class, the number of words of customer name and Word content carries out uniqueness identification, identifies that the flow of the corporate customer of same client includes each other in the fuzzy diagnosis class:
Application server judges whether the customer name number of words of each corporate customer in fuzzy diagnosis class is more than or equal to predetermined threshold value;
If being more than or equal to predetermined threshold value, application server is carried out according to the number of words and word content of the customer name of corporate customer Uniqueness identifies;
If being less than predetermined threshold value, application server is according in the identification information of corporate customer, the number of words and word of customer name Hold and carry out uniqueness identification.
If it is more than or equal to predetermined threshold value described, application server is according to the number of words and word of the customer name of corporate customer The flow that content carries out uniqueness identification includes:
Application server arbitrarily chooses the corporate customer that customer name number of words in fuzzy diagnosis class is more than or equal to predetermined threshold value, Its customer name and the customer name of all entities client are subjected to word content contrast;
Application server judges whether the identical corporate customer of the word content of customer name according to comparing result, if In the presence of the identical corporate customer of word content and the corporate customer that is selected then are identified as into same client;If being not present, The corporate customer being selected and other all entities client are then identified as different clients;
Application server continues to choose another group visitor that customer name number of words in fuzzy diagnosis class is more than or equal to predetermined threshold value Family carries out uniqueness identification, until all customer name numbers of words are equal more than or equal to the corporate customer of predetermined threshold value in fuzzy diagnosis class It is identified.
If described be less than predetermined threshold value, application server is according to the identification information of corporate customer, the number of words of customer name The flow of uniqueness identification is carried out with word content to be included:
Application server arbitrarily chooses the corporate customer that customer name number of words in fuzzy diagnosis class is less than predetermined threshold value, by it Customer name and the customer name of all entities client carry out word content contrast;
Application server judges whether the identical corporate customer of the word content of customer name according to comparing result, if In the presence of then continuing to judge whether the identical corporate customer of word content and the corporate customer being selected have any identical Identification information, by the group that word content is identical and has the corporate customer of any identical identification information and is selected Client is identified as same client;If being not present, the corporate customer being selected and other all entities client are identified as difference Client;
Application server continues to choose customer name number of words in fuzzy diagnosis class and entered less than another corporate customer of predetermined threshold value Row uniqueness identifies, until all customer name numbers of words are identified less than the corporate customer of predetermined threshold value in fuzzy diagnosis class.
In summary, in information uniqueness recognition methods provided by the invention, application server, system and storage medium, The recognition methods of described information uniqueness is described basic by obtaining the essential information of the corporate customer stored in each source database Information includes customer name and identification information;Each corporate customer is identified by class labeled as accurate according to the identification information afterwards Or fuzzy diagnosis class;The corporate customer in precisely identification class and fuzzy diagnosis class is entered respectively according to default recognition rule afterwards Row precisely identification and fuzzy diagnosis, identify the corporate customer of same client each other;Precisely identification and fuzzy diagnosis are obtained afterwards Recognition result, and integrated the essential information of the corporate customer of all clients same each other according to recognition result, draw only The corporate customer information of one property identification.Can precisely it be identified respectively according to the type of different corporate customers or fuzzy diagnosis, solution Determine due to the problem of imperfect caused part corporate customer of customer information can not carry out uniqueness identification and Data Integration.
Certainly, one of ordinary skill in the art will appreciate that realizing all or part of flow in above-described embodiment method, It is that related hardware can be instructed by computer program(Such as processor, controller etc.)To complete, described program can store In a computer-readable storage medium, the program may include such as the flow of above-mentioned each method embodiment upon execution.Its Described in storage medium can be memory, magnetic disc, CD etc..
It should be appreciated that the application of the present invention is not limited to above-mentioned citing, for those of ordinary skills, can To be improved or converted according to the above description, all these modifications and variations should all belong to the guarantor of appended claims of the present invention Protect scope.

Claims (10)

1. a kind of information uniqueness recognition methods, it is characterised in that comprise the following steps:
The essential information of the corporate customer stored in each source database is obtained, wherein, the essential information includes customer name And identification information;
According to the identification information by each corporate customer labeled as precisely identification class or fuzzy diagnosis class;
The corporate customer in precisely identification class and fuzzy diagnosis class is precisely identified and mould respectively according to default recognition rule Paste identification, identify the corporate customer of same client each other;
The precisely recognition result of identification and fuzzy diagnosis is obtained, and it is according to recognition result that the group of all clients same each other is objective The essential information at family is integrated, and draws the corporate customer information of uniqueness identification.
2. information uniqueness recognition methods according to claim 1, it is characterised in that described to be incited somebody to action according to the identification information Each corporate customer includes labeled as the step of precisely identification class or fuzzy diagnosis class:
Parse the identification information of each corporate customer;
Judge whether comprising accurate identification information is preset in the identification information of each corporate customer, if so, then labeled as precisely knowledge Other class;Otherwise it is labeled as fuzzy diagnosis class.
3. information uniqueness recognition methods according to claim 2, it is characterised in that the basis presets recognition rule pair Corporate customer in accurate identification class and fuzzy diagnosis class is precisely identified respectively and fuzzy diagnosis, identifies same client each other Corporate customer the step of include:
Text detection is carried out to the customer name of all entities client, obtains the number of words and word content of customer name;
Uniqueness identification is carried out according to the default precisely identification information of each corporate customer in accurate identification class and customer name, known The corporate customer of same client each other in not described precisely identification class;
Uniqueness is carried out according to each identification information of corporate customer, the number of words of customer name and word content in fuzzy diagnosis class Identification, identify in the fuzzy diagnosis class corporate customer of same client each other.
4. information uniqueness recognition methods according to claim 3, it is characterised in that the basis precisely identifies every in class The default precisely identification information and customer name of individual corporate customer carry out uniqueness identification, identify in the precisely identification class each other The step of corporate customer of same client, includes:
Any corporate customer chosen in precisely identification class, is preset accurate identification information and customer name with precisely knowing Default precisely identification information and the customer name of other corporate customers are contrasted in other class;
Judge whether to preset that accurate identification information is identical and customer name identical corporate customer according to comparing result, if In the presence of will then preset that accurate identification information is identical and customer name identical corporate customer and the corporate customer that is selected identify For same client;If being not present, the corporate customer being selected and other corporate customers in precisely identification class are identified as difference Client;
Continue to choose another corporate customer in precisely identification class and carry out uniqueness identification with other corporate customers, until accurate Identify that corporate customer all in class is identified.
5. information uniqueness recognition methods according to claim 3, it is characterised in that every in the class according to fuzzy diagnosis The identification information of individual corporate customer, the number of words of customer name and word content carry out uniqueness identification, identify the fuzzy diagnosis In class each other the corporate customer of same client the step of include:
Judge whether the customer name number of words of each corporate customer in fuzzy diagnosis class is more than or equal to predetermined threshold value;
If being more than or equal to predetermined threshold value, uniqueness knowledge is carried out according to the number of words of the customer name of corporate customer and word content Not;
If being less than predetermined threshold value, carried out according to the identification information of corporate customer, the number of words of customer name and word content unique Property identification.
6. information uniqueness recognition methods according to claim 5, it is characterised in that if described be more than or equal to default threshold Value, then included according to the step of number of words of the customer name of corporate customer and word content progress uniqueness identification:
Any corporate customer chosen customer name number of words in fuzzy diagnosis class and be more than or equal to predetermined threshold value, by its customer name Claim to carry out word content contrast with the customer name of all entities client;
The identical corporate customer of the word content of customer name is judged whether according to comparing result, if in the presence of, by The identical corporate customer of word content is identified as same client with the corporate customer being selected;, will be selected if being not present The corporate customer taken is identified as different clients with other all entities client;
Continue to choose customer name number of words in fuzzy diagnosis class and be more than or equal to another corporate customer progress of predetermined threshold value uniquely Property identification, until in fuzzy diagnosis class all customer name numbers of words be more than or equal to predetermined threshold value corporate customer it is identified.
7. information uniqueness recognition methods according to claim 5, it is characterised in that if described be less than predetermined threshold value, The step of carrying out uniqueness identification according to the identification information of corporate customer, the number of words of customer name and word content includes:
Any corporate customer chosen customer name number of words in fuzzy diagnosis class and be less than predetermined threshold value, by its customer name with The customer name of all entities client carries out word content contrast;
The identical corporate customer of the word content of customer name is judged whether according to comparing result, if in the presence of, after It is continuous to judge whether the identical corporate customer of word content and the corporate customer being selected have any identical identification information, Word content is identical and be identified as with the corporate customer of any identical identification information with the corporate customer being selected Same client;If being not present, the corporate customer being selected and other all entities client are identified as different clients;
Continue to choose another corporate customer progress uniqueness knowledge that customer name number of words in fuzzy diagnosis class is less than predetermined threshold value Not, until in fuzzy diagnosis class all customer name numbers of words be less than predetermined threshold value corporate customer it is identified.
A kind of 8. application server of information uniqueness identification, it is characterised in that including:Processor, memory and communication bus;
Being stored with the memory can be by the computer-readable program of the computing device;
The communication bus realizes the connection communication between processor and memory;
Realize that the information as described in claim 1-7 any one is unique described in the computing device during computer-readable program Step in property recognition methods.
A kind of 9. computer-readable recording medium, it is characterised in that the computer-readable recording medium storage have one or Multiple programs, one or more of programs can be by one or more computing devices, and to realize, such as claim 1-7 appoints The step in information uniqueness recognition methods described in meaning one.
10. a kind of information uniqueness identifying system, including several source databases, it is characterised in that also include such as claim 8 The application server of described information uniqueness identification;
Each source database, for storing the essential information of corporate customer;
The application server, for obtaining the essential information of the corporate customer stored in each source database, wherein, the base This information includes customer name and identification information;And each corporate customer is identified labeled as accurate according to the identification information Class or fuzzy diagnosis class;The corporate customer in precisely identification class and fuzzy diagnosis class is carried out respectively according to default recognition rule Precisely identification and fuzzy diagnosis, identify the corporate customer of same client each other;And obtain the precisely identification of identification and fuzzy diagnosis As a result, the essential information of the corporate customer of all clients same each other is integrated according to recognition result, show that uniqueness is known Other corporate customer information.
CN201710850369.5A 2017-09-20 2017-09-20 Information uniqueness identification method, application server, system and storage medium Active CN107704529B (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
CN201710850369.5A CN107704529B (en) 2017-09-20 2017-09-20 Information uniqueness identification method, application server, system and storage medium
PCT/CN2018/084325 WO2019056750A1 (en) 2017-09-20 2018-04-25 Information uniqueness identification method, application server, system, and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201710850369.5A CN107704529B (en) 2017-09-20 2017-09-20 Information uniqueness identification method, application server, system and storage medium

Publications (2)

Publication Number Publication Date
CN107704529A true CN107704529A (en) 2018-02-16
CN107704529B CN107704529B (en) 2020-04-10

Family

ID=61172973

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201710850369.5A Active CN107704529B (en) 2017-09-20 2017-09-20 Information uniqueness identification method, application server, system and storage medium

Country Status (2)

Country Link
CN (1) CN107704529B (en)
WO (1) WO2019056750A1 (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109064342A (en) * 2018-07-20 2018-12-21 阳光保险集团股份有限公司 Client identity recognition methods and device
WO2019056750A1 (en) * 2017-09-20 2019-03-28 平安科技(深圳)有限公司 Information uniqueness identification method, application server, system, and storage medium
CN109815268A (en) * 2018-12-21 2019-05-28 上海诺悦智能科技有限公司 A kind of transaction sanction list matching system
CN111126935A (en) * 2019-11-19 2020-05-08 泰康保险集团股份有限公司 Processing method and device of security data, electronic equipment and storage medium

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20020194018A1 (en) * 2000-06-05 2002-12-19 Gene Scott Method for matching complimentary business interests
CN102663008A (en) * 2012-03-20 2012-09-12 山东浪潮齐鲁软件产业股份有限公司 Government integrated business platform business library and construction method of base library
CN103646110A (en) * 2013-12-26 2014-03-19 中国人民银行征信中心 Natural person basic identity information matching method

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101452556A (en) * 2008-12-31 2009-06-10 中国建设银行股份有限公司 Customer information processing system and method
CN106934509A (en) * 2015-12-30 2017-07-07 平安科技(深圳)有限公司 Customer information merging method and system
CN106407245B (en) * 2016-06-23 2021-05-07 平安科技(深圳)有限公司 Information processing method and device
CN106970994B (en) * 2017-04-01 2019-07-12 长沙智擎信息技术有限公司 A kind of online practical demonstration extracting method of automation
CN107704529B (en) * 2017-09-20 2020-04-10 平安科技(深圳)有限公司 Information uniqueness identification method, application server, system and storage medium

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20020194018A1 (en) * 2000-06-05 2002-12-19 Gene Scott Method for matching complimentary business interests
CN102663008A (en) * 2012-03-20 2012-09-12 山东浪潮齐鲁软件产业股份有限公司 Government integrated business platform business library and construction method of base library
CN103646110A (en) * 2013-12-26 2014-03-19 中国人民银行征信中心 Natural person basic identity information matching method

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
刘寅斌 等: "电子政务环境下信用档案共享模式研究", 《图书情报工作》 *

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2019056750A1 (en) * 2017-09-20 2019-03-28 平安科技(深圳)有限公司 Information uniqueness identification method, application server, system, and storage medium
CN109064342A (en) * 2018-07-20 2018-12-21 阳光保险集团股份有限公司 Client identity recognition methods and device
CN109815268A (en) * 2018-12-21 2019-05-28 上海诺悦智能科技有限公司 A kind of transaction sanction list matching system
CN111126935A (en) * 2019-11-19 2020-05-08 泰康保险集团股份有限公司 Processing method and device of security data, electronic equipment and storage medium

Also Published As

Publication number Publication date
WO2019056750A1 (en) 2019-03-28
CN107704529B (en) 2020-04-10

Similar Documents

Publication Publication Date Title
CN107704529A (en) The recognition methods of information uniqueness, application server, system and storage medium
CN109446347A (en) A kind of multi-modal Hash search method of fast discrete and system having supervision
CN109165209B (en) Data verification method, device, equipment and medium for object types in database
CN112380870A (en) User intention analysis method and device, electronic equipment and computer storage medium
CN115146865A (en) Task optimization method based on artificial intelligence and related equipment
CN114138784B (en) Information tracing method and device based on storage library, electronic equipment and medium
CN116485220A (en) Staff performance assessment method and device, electronic equipment and storage medium
CN114116811B (en) Log processing method, device, equipment and storage medium
CN113434542B (en) Data relationship identification method and device, electronic equipment and storage medium
CN113626558B (en) Intelligent recommendation-based field standardization method and system
CN114756669A (en) Intelligent analysis method and device for problem intention, electronic equipment and storage medium
CN107590017A (en) The detection method and device of a kind of electronic equipment
CN112732567B (en) Mock data testing method and device based on ip, electronic equipment and storage medium
CN112416992B (en) Industry type identification method, system and equipment based on big data and keywords
CN112069269B (en) Big data and multidimensional feature-based data tracing method and big data cloud server
CN111402068B (en) Premium data analysis method and device based on big data and storage medium
CN109145092A (en) A kind of database update, intelligent answer management method, device and its equipment
CN111382710A (en) Drawing comparison method based on image recognition
CN106445949A (en) Method and device for detecting data change in data table
CN113221888B (en) License plate number management system test method and device, electronic equipment and storage medium
CN110147980A (en) Worksheet method and device
CN104573098A (en) Large-scale object recognition method based on Spark system
CN114780688A (en) Text quality inspection method, device and equipment based on rule matching and storage medium
CN113722302B (en) Data management method and device
CN117875908B (en) Work order processing method and system based on enterprise management software SAAS

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant