CN107153991A - The inconsistent integrated conduct method of title in a kind of financial system - Google Patents

The inconsistent integrated conduct method of title in a kind of financial system Download PDF

Info

Publication number
CN107153991A
CN107153991A CN201710290544.XA CN201710290544A CN107153991A CN 107153991 A CN107153991 A CN 107153991A CN 201710290544 A CN201710290544 A CN 201710290544A CN 107153991 A CN107153991 A CN 107153991A
Authority
CN
China
Prior art keywords
title
standard
inconsistent
industrial
financial system
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201710290544.XA
Other languages
Chinese (zh)
Inventor
周利新
张宏伟
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
State Grid Corp of China SGCC
Materials Branch of State Grid Jibei Electric Power Co Ltd
Original Assignee
State Grid Corp of China SGCC
Materials Branch of State Grid Jibei Electric Power Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by State Grid Corp of China SGCC, Materials Branch of State Grid Jibei Electric Power Co Ltd filed Critical State Grid Corp of China SGCC
Priority to CN201710290544.XA priority Critical patent/CN107153991A/en
Publication of CN107153991A publication Critical patent/CN107153991A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q40/00Finance; Insurance; Tax strategies; Processing of corporate or income taxes
    • G06Q40/12Accounting
    • G06Q40/125Finance or payroll
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/3331Query processing
    • G06F16/334Query execution
    • G06F16/3344Query execution using natural language analysis

Landscapes

  • Engineering & Computer Science (AREA)
  • Business, Economics & Management (AREA)
  • Finance (AREA)
  • Accounting & Taxation (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Physics & Mathematics (AREA)
  • Databases & Information Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Computational Linguistics (AREA)
  • Artificial Intelligence (AREA)
  • Development Economics (AREA)
  • Economics (AREA)
  • Marketing (AREA)
  • Strategic Management (AREA)
  • Technology Law (AREA)
  • General Business, Economics & Management (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The present invention provides the inconsistent integrated conduct method of title in a kind of financial system, solves to gather off-gauge, the inconsistent title come from different business flow when setting up the non-standard title table of comparisons, the problem of manually-operated inefficient and error-prone.Compared with prior art, increase non-keyword vocabulary, character string S is obtained after non-key words in non-standard title is removed, again S is searched in title table, and increase to S substring and or each word query criteria title include situation, artificial reference is supplied by comprising sort result output, raising efficiency is reached, reduces the effect of error rate.

Description

The inconsistent integrated conduct method of title in a kind of financial system
Technical field
It is inconsistent the present invention relates to title, the inconsistent solution of title particularly in financial system.
Background technology
Explanation of nouns
Form is pre-processed:The pretreatment that respective handling step is carried out is used for unified form.Can be that program is completed, Or manual operations instruction.
Title table:The Collection Table of the title determined according to business demand.The discrepancy that such as bank is provided The organization that account information table is used is as title.Note the difference because business, title is not necessarily most complete Title.
Non-standard title:The titles different from the title that business is determined are referred to as non-standard title.Usually from business Each flow.
The non-standard title table of comparisons:Refer to the contrast relationship form set up between non-standard title and title.
Non-keyword vocabulary:According to service feature, be every class name definition its non-key word (such as:Company, it is limited, Responsibility, share etc.), and form form.
The financial system electronization of each unit is relative to be popularized, in real work, more particularly to treasury trade, such as In the business such as guarantee fund's reimbursement, because of working link and the characteristic of flow, financial system is often obtained from different operation flows Information, and integrated treatment.Because the link being related to is more, when links produce information, the problem of title is inconsistent is often encountered. Such as same unit, its organization should be unique in theory, but can often encounter following problem in practice:
1) unit abbreviation or imperfect appellation.Such as industrial and commercial bank, industrial and commercial bank, the Industrial and Commercial Bank of China, Industrial and Commercial Bank of China's share Company, China Industrial and Commercial Bank Co., Ltd., so-and-so subbranch of area of industrial and commercial bank, etc..
2) format issues.Such as:Simplified traditional font, full-shape half-angle, capital and small letter, space
It is above-mentioned most commonly seen the problem of list, but it is not limited only to this.These problems cause the financial system of unit can be upper State is originally same unit, it is believed that is many different units, causes the confusion and mistake of financial process.
In this case, current way is, when finding mistake (such as to not upper account), the information extraction of error to be gone out Carry out artificial judgment processing.Or use improved method, such as increase form pre-treatment step, to problem 2) situation uses program Or row format conversion is entered in manual operations, to problem 1) situation is then one by one each standard name in examination criteria title table Claim, see whether it includes the non-standard title.If comprising this is added into the non-standard title table of comparisons to title (is typically It is many-to-one), when running into this non-standard title again afterwards, the non-standard title table of comparisons is searched, is replaced with corresponding title.
When data volume is big, even if using above-mentioned improved method, the quantity of error is also quite big.Now artificial treatment is built When founding the non-standard title table of comparisons, for most non-esbablished corporation, found out by hand in title table correct Correspondence unit is also quite laborious.Such as foregoing problems 1) in industrial and commercial bank example, if industrial and commercial bank is non-esbablished corporation, in Pang It is not an easy thing manually to find title corresponding with " industrial and commercial bank " in big title table.And such a search Method exist another problem be, when title be referred to as or than non-standard title in short-term, even if this non-standard title exists There is corresponding title in title table, such a search also can not find.
The content of the invention
A kind of title of present invention offer is inconsistent, the inconsistent comprehensive treatment technique side of title particularly in financial system Case, it is therefore intended that maximum automatic business processing title is inconsistent, and when needing manual intervention, provided to the greatest extent for artificial judgment It can accurately recommend, so as to largely reduce the inconsistent caused mistake of title, reduce the workload and error rate of artificial treatment.
The technical solution adopted by the present invention particular content is:
Compared with prior art, " non-keyword vocabulary " is increased, i.e.,:It is that each needs unified name according to traffic performance The name item (field) of title, defines corresponding non-keyword phrase.As in organization field, non-key words can be included: Company, limited, responsibility, share etc.;To department name field, non-key words can be:Place, office, section, room etc.;To place name, Non-key words can be:Province, city, area, county, township, town, village etc..
When handling non-standard title, if also there is no its corresponding title in the non-standard title table of comparisons, by such as Lower step process:
1) listed words in the corresponding non-keyword vocabulary in non-standard title is removed, according to described removed The difference of non-key words position in the former non-standard title, may resolve into S1 to Sn's by the original non-standard title Some substrings, all S1 to Sn are merged and obtain character string S;
2) S is searched, if finding the standard for including the S as overall character string in the title table Title, then by the non-standard title and the title to adding the non-standard title table of comparisons, if do not had The title for including the S is found, then:
3) check the situation that includes to S1 to the Sn one by one in the title table, and will be pressed comprising result from many To few sequence output;
4) situation of each word of each title comprising the S is checked in the title table, and will bag Pressed containing result from more to few sequence output.
The result of " sequence output " supplies artificial reference in above-mentioned steps, come the title of foremost most likely this The corresponding title of non-standard title.Such step can highlight the corresponding title of most probable, reduce and manually exist The difficulty and workload searched for by hand in huge title table.
It is referred to as, so scheme of the present invention is removed not using first in non-standard title because title is entirely possible Necessary words, i.e., the non-key words (such as company, limited, responsibility, share) defined in advance according to traffic performance, is reexamined Comprising, by the title not can not checked Chu Lai in the prior art the 2) step can check to come automatically, improve efficiency.For Still the non-standard title not detected, then by the 3) to the 4) step handle, provides most probable recommendation for artificial determination, Man efficiency can be improved further.
Understood via above-mentioned technical scheme, compared with prior art, the skill of title inconsistence problems disclosed by the invention Art solution improves deficiency of the prior art.
Brief description of the drawings
Fig. 1 is the inconsistent process step flow chart of title disclosed by the invention.
Embodiment
The specific embodiment of the invention is illustrated by taking organization as an example, but should not be construed as limiting in organization, " title " of the present invention can also be (but being not limited only to) name, place name, department's name, entry name, bank of deposit's name etc..
" table " of the present invention (such as title table, the non-standard title table of comparisons), can be " the work in excel " table " in book ", database, or other can realize the module of identical function.
Specific embodiment is following (reference picture 1):
In the automated system of similar financial system, the data collected from different working links and flow run into title not Unanimously very universal, below by taking organization as an example, reference picture 1 describes the particular content of the present invention in detail.
By taking industrial and commercial bank as an example, system is collected the title come and potentially included:Industrial and commercial bank, industrial and commercial bank, the Industrial and Commercial Bank of China, in Joint-stock company of industrial and commercial bank of state, China Industrial and Commercial Bank Co., Ltd., so-and-so so-and-so sub-department of area of industrial and commercial bank, etc..It is determined that During title, not necessarily with " China Industrial and Commercial Bank Co., Ltd. " be standard, according to business the need for, it may be necessary to refer to Fixed " industrial and commercial bank so-and-so branch " is its title, and is recorded in the title table of comparisons, and other titles are considered to Criteria of right and wrong title.
To handle non-standard title, financial system can set up the non-standard title table of comparisons, when system runs into non-standard title Table look-up and find its corresponding title, and continue with.When some non-standard title is not in the non-standard title table of comparisons, Need this new non-standard title finding its corresponding title and add in the non-standard title table of comparisons.
Prior art is with this new non-standard title (by taking " China Industrial and Commercial Bank Co., Ltd. so-and-so branch " as an example) Search criterion title table, but because being " industrial and commercial bank so-and-so branch " in standard scale, and with " the limited public affairs of Industrial and Commercial Bank of China's share Take charge of so-and-so in lines " search, it can not find.Even if now artificial operation, it is also difficult to find " work in huge title table Business bank so-and-so branch " correspond to therewith.
The solution of the present invention is to set up non-keyword vocabulary, and it is corresponding non-key to be that this name column (or domain name) sets up its Words, such as:China, share, limited, responsibility, company.
Handled according to step 101 in Fig. 1, will be in " China Industrial and Commercial Bank Co., Ltd. so-and-so branch " all non-close Key words is removed, and is obtained " industrial and commercial bank " (S1) and " so-and-so branch " (S2), merging S1 and S2, obtain " industrial and commercial bank so-and-so divide OK " (S).Here the example selected is in order to illustrate S1 to Sn decomposition and synthesis S, if removing only one of which after non-key words Character string S1, then S1 is S.
Step 102, " industrial and commercial bank so-and-so branch " (S) is searched in title table, finds, go to step 105, afterwards Terminate.
In other examples, if in step 102, not finding S, then going to step 103;
Step 103, S1 to Sn is searched respectively in title table, exported by comprising how many sequences.It is such as each mark Quasi- title increase counting module (such as domain, cell), when detecting whether it includes S1 to Sn, often comprising one, the standard name Count is incremented for the counting module of title, by from more to major general's count sort, and export and supply artificial reference.
Step 104, each word (or letter) in S is searched in title table, by comprising from more to few sequence Output supplies artificial reference, and terminates process step.This step can be searched S each word by character in realization, or by S Each word split into a character string, then by string searching.
Step 105, this non-standard title and corresponding title are added into the non-standard title table of comparisons.

Claims (1)

1. the inconsistent integrated conduct method of title in a kind of financial system, including title table and the control of non-standard title Table, it is characterized in that also including non-keyword vocabulary, and includes following process step:
Listed words in the corresponding non-keyword vocabulary in non-standard title is removed, according to described removed non-key The difference of words position in the former non-standard title, may resolve into the former non-standard title S1 to Sn some sons Character string, all S1 to Sn are merged and obtain character string S;
The S is searched as overall character string in the title table, if finding the title for including the S, By the non-standard title and the title to adding the non-standard title table of comparisons, included if do not found The title of the S, then;
Check the situation that includes to S1 to the Sn one by one in the title table, and will be pressed comprising result from more to few row Sequence is exported;
The situation of each word of each title comprising the S is checked in the title table, and result will be included Exported by from more to few sequence.
CN201710290544.XA 2017-04-28 2017-04-28 The inconsistent integrated conduct method of title in a kind of financial system Pending CN107153991A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201710290544.XA CN107153991A (en) 2017-04-28 2017-04-28 The inconsistent integrated conduct method of title in a kind of financial system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201710290544.XA CN107153991A (en) 2017-04-28 2017-04-28 The inconsistent integrated conduct method of title in a kind of financial system

Publications (1)

Publication Number Publication Date
CN107153991A true CN107153991A (en) 2017-09-12

Family

ID=59793010

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201710290544.XA Pending CN107153991A (en) 2017-04-28 2017-04-28 The inconsistent integrated conduct method of title in a kind of financial system

Country Status (1)

Country Link
CN (1) CN107153991A (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110298747A (en) * 2019-07-04 2019-10-01 中国工商银行股份有限公司 Remittance message blacklist monitoring system and method
CN110555089A (en) * 2019-09-09 2019-12-10 广东电网有限责任公司 character name matching method and device and computer readable storage medium
CN114880430A (en) * 2022-05-10 2022-08-09 马上消费金融股份有限公司 Name processing method and device

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104424202A (en) * 2013-08-21 2015-03-18 北大方正集团有限公司 Method and system for performing duplication checking on customer information in customer relationship management (CRM) system
CN105184713A (en) * 2015-07-17 2015-12-23 四川久远银海软件股份有限公司 Intelligent matching and sorting system and method capable of benefitting contrast of assigned drugs of medical insurance
CN105404686A (en) * 2015-12-10 2016-03-16 湖南科技大学 Method for matching place name and address in news event based on geographical feature hierarchical segmented words
CN106354871A (en) * 2016-09-18 2017-01-25 长城计算机软件与系统有限公司 Similarity search method of enterprise names

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104424202A (en) * 2013-08-21 2015-03-18 北大方正集团有限公司 Method and system for performing duplication checking on customer information in customer relationship management (CRM) system
CN105184713A (en) * 2015-07-17 2015-12-23 四川久远银海软件股份有限公司 Intelligent matching and sorting system and method capable of benefitting contrast of assigned drugs of medical insurance
CN105404686A (en) * 2015-12-10 2016-03-16 湖南科技大学 Method for matching place name and address in news event based on geographical feature hierarchical segmented words
CN106354871A (en) * 2016-09-18 2017-01-25 长城计算机软件与系统有限公司 Similarity search method of enterprise names

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110298747A (en) * 2019-07-04 2019-10-01 中国工商银行股份有限公司 Remittance message blacklist monitoring system and method
CN110298747B (en) * 2019-07-04 2022-04-12 中国工商银行股份有限公司 Remittance message blacklist monitoring system and method
CN110555089A (en) * 2019-09-09 2019-12-10 广东电网有限责任公司 character name matching method and device and computer readable storage medium
CN114880430A (en) * 2022-05-10 2022-08-09 马上消费金融股份有限公司 Name processing method and device
CN114880430B (en) * 2022-05-10 2023-07-18 马上消费金融股份有限公司 Name processing method and device

Similar Documents

Publication Publication Date Title
US20120102002A1 (en) Automatic data validation and correction
CN107153991A (en) The inconsistent integrated conduct method of title in a kind of financial system
MX2012008714A (en) System and method for aggregation and association of professional affiliation data with commercial data content.
US11151099B2 (en) System and method for data structure migration control
CN110889310B (en) Financial document information intelligent extraction system and method
CN109582787B (en) Entity classification method and device for corpus data in thermal power generation field
CN106649557B (en) Semantic association mining method for defect report and mail list
WO2012080077A1 (en) Cleansing a database system to improve data quality
CN108446391A (en) Processing method, device, electronic equipment and the computer-readable medium of data
EP2558988A1 (en) Ascribing actionable attributes to data that describes a personal identity
JP2019204535A (en) Accounting support system
AU2019200371A1 (en) Utilizing artificial intelligence to integrate data from multiple diverse sources into a data structure
CN104933077B (en) Rule-based multifile information analysis method
CN104424399A (en) Knowledge navigation method, device and system based on virus protein body
CN106775694B (en) A kind of hierarchy classification method of software configuration code product
CN110597796B (en) Big data real-time modeling method and system based on full life cycle
CN112416918A (en) Data management system and working method thereof
CN109063063B (en) Data processing method and device based on multi-source data
CN105389378A (en) System for integrating separate data
CN112068981A (en) Knowledge base-based fault scanning recovery method and system in Linux operating system
US20120254132A1 (en) Enhanced Contact Information
US20190278568A1 (en) Recording medium recording generation program, information processing apparatus, and generation method
CN110046341B (en) Method and system for matching information
CN111597322B (en) Automatic template mining system and method based on frequent item sets
CN105320717B (en) The semi-automatic construction method of individual in body learning

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20170912

WD01 Invention patent application deemed withdrawn after publication