CN107153991A - The inconsistent integrated conduct method of title in a kind of financial system - Google Patents
The inconsistent integrated conduct method of title in a kind of financial system Download PDFInfo
- Publication number
- CN107153991A CN107153991A CN201710290544.XA CN201710290544A CN107153991A CN 107153991 A CN107153991 A CN 107153991A CN 201710290544 A CN201710290544 A CN 201710290544A CN 107153991 A CN107153991 A CN 107153991A
- Authority
- CN
- China
- Prior art keywords
- title
- standard
- inconsistent
- industrial
- financial system
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q40/00—Finance; Insurance; Tax strategies; Processing of corporate or income taxes
- G06Q40/12—Accounting
- G06Q40/125—Finance or payroll
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/33—Querying
- G06F16/3331—Query processing
- G06F16/334—Query execution
- G06F16/3344—Query execution using natural language analysis
Landscapes
- Engineering & Computer Science (AREA)
- Business, Economics & Management (AREA)
- Finance (AREA)
- Accounting & Taxation (AREA)
- Theoretical Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Physics & Mathematics (AREA)
- Databases & Information Systems (AREA)
- General Engineering & Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Computational Linguistics (AREA)
- Artificial Intelligence (AREA)
- Development Economics (AREA)
- Economics (AREA)
- Marketing (AREA)
- Strategic Management (AREA)
- Technology Law (AREA)
- General Business, Economics & Management (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
The present invention provides the inconsistent integrated conduct method of title in a kind of financial system, solves to gather off-gauge, the inconsistent title come from different business flow when setting up the non-standard title table of comparisons, the problem of manually-operated inefficient and error-prone.Compared with prior art, increase non-keyword vocabulary, character string S is obtained after non-key words in non-standard title is removed, again S is searched in title table, and increase to S substring and or each word query criteria title include situation, artificial reference is supplied by comprising sort result output, raising efficiency is reached, reduces the effect of error rate.
Description
Technical field
It is inconsistent the present invention relates to title, the inconsistent solution of title particularly in financial system.
Background technology
Explanation of nouns
Form is pre-processed:The pretreatment that respective handling step is carried out is used for unified form.Can be that program is completed,
Or manual operations instruction.
Title table:The Collection Table of the title determined according to business demand.The discrepancy that such as bank is provided
The organization that account information table is used is as title.Note the difference because business, title is not necessarily most complete
Title.
Non-standard title:The titles different from the title that business is determined are referred to as non-standard title.Usually from business
Each flow.
The non-standard title table of comparisons:Refer to the contrast relationship form set up between non-standard title and title.
Non-keyword vocabulary:According to service feature, be every class name definition its non-key word (such as:Company, it is limited,
Responsibility, share etc.), and form form.
The financial system electronization of each unit is relative to be popularized, in real work, more particularly to treasury trade, such as
In the business such as guarantee fund's reimbursement, because of working link and the characteristic of flow, financial system is often obtained from different operation flows
Information, and integrated treatment.Because the link being related to is more, when links produce information, the problem of title is inconsistent is often encountered.
Such as same unit, its organization should be unique in theory, but can often encounter following problem in practice:
1) unit abbreviation or imperfect appellation.Such as industrial and commercial bank, industrial and commercial bank, the Industrial and Commercial Bank of China, Industrial and Commercial Bank of China's share
Company, China Industrial and Commercial Bank Co., Ltd., so-and-so subbranch of area of industrial and commercial bank, etc..
2) format issues.Such as:Simplified traditional font, full-shape half-angle, capital and small letter, space
It is above-mentioned most commonly seen the problem of list, but it is not limited only to this.These problems cause the financial system of unit can be upper
State is originally same unit, it is believed that is many different units, causes the confusion and mistake of financial process.
In this case, current way is, when finding mistake (such as to not upper account), the information extraction of error to be gone out
Carry out artificial judgment processing.Or use improved method, such as increase form pre-treatment step, to problem 2) situation uses program
Or row format conversion is entered in manual operations, to problem 1) situation is then one by one each standard name in examination criteria title table
Claim, see whether it includes the non-standard title.If comprising this is added into the non-standard title table of comparisons to title (is typically
It is many-to-one), when running into this non-standard title again afterwards, the non-standard title table of comparisons is searched, is replaced with corresponding title.
When data volume is big, even if using above-mentioned improved method, the quantity of error is also quite big.Now artificial treatment is built
When founding the non-standard title table of comparisons, for most non-esbablished corporation, found out by hand in title table correct
Correspondence unit is also quite laborious.Such as foregoing problems 1) in industrial and commercial bank example, if industrial and commercial bank is non-esbablished corporation, in Pang
It is not an easy thing manually to find title corresponding with " industrial and commercial bank " in big title table.And such a search
Method exist another problem be, when title be referred to as or than non-standard title in short-term, even if this non-standard title exists
There is corresponding title in title table, such a search also can not find.
The content of the invention
A kind of title of present invention offer is inconsistent, the inconsistent comprehensive treatment technique side of title particularly in financial system
Case, it is therefore intended that maximum automatic business processing title is inconsistent, and when needing manual intervention, provided to the greatest extent for artificial judgment
It can accurately recommend, so as to largely reduce the inconsistent caused mistake of title, reduce the workload and error rate of artificial treatment.
The technical solution adopted by the present invention particular content is:
Compared with prior art, " non-keyword vocabulary " is increased, i.e.,:It is that each needs unified name according to traffic performance
The name item (field) of title, defines corresponding non-keyword phrase.As in organization field, non-key words can be included:
Company, limited, responsibility, share etc.;To department name field, non-key words can be:Place, office, section, room etc.;To place name,
Non-key words can be:Province, city, area, county, township, town, village etc..
When handling non-standard title, if also there is no its corresponding title in the non-standard title table of comparisons, by such as
Lower step process:
1) listed words in the corresponding non-keyword vocabulary in non-standard title is removed, according to described removed
The difference of non-key words position in the former non-standard title, may resolve into S1 to Sn's by the original non-standard title
Some substrings, all S1 to Sn are merged and obtain character string S;
2) S is searched, if finding the standard for including the S as overall character string in the title table
Title, then by the non-standard title and the title to adding the non-standard title table of comparisons, if do not had
The title for including the S is found, then:
3) check the situation that includes to S1 to the Sn one by one in the title table, and will be pressed comprising result from many
To few sequence output;
4) situation of each word of each title comprising the S is checked in the title table, and will bag
Pressed containing result from more to few sequence output.
The result of " sequence output " supplies artificial reference in above-mentioned steps, come the title of foremost most likely this
The corresponding title of non-standard title.Such step can highlight the corresponding title of most probable, reduce and manually exist
The difficulty and workload searched for by hand in huge title table.
It is referred to as, so scheme of the present invention is removed not using first in non-standard title because title is entirely possible
Necessary words, i.e., the non-key words (such as company, limited, responsibility, share) defined in advance according to traffic performance, is reexamined
Comprising, by the title not can not checked Chu Lai in the prior art the 2) step can check to come automatically, improve efficiency.For
Still the non-standard title not detected, then by the 3) to the 4) step handle, provides most probable recommendation for artificial determination,
Man efficiency can be improved further.
Understood via above-mentioned technical scheme, compared with prior art, the skill of title inconsistence problems disclosed by the invention
Art solution improves deficiency of the prior art.
Brief description of the drawings
Fig. 1 is the inconsistent process step flow chart of title disclosed by the invention.
Embodiment
The specific embodiment of the invention is illustrated by taking organization as an example, but should not be construed as limiting in organization,
" title " of the present invention can also be (but being not limited only to) name, place name, department's name, entry name, bank of deposit's name etc..
" table " of the present invention (such as title table, the non-standard title table of comparisons), can be " the work in excel
" table " in book ", database, or other can realize the module of identical function.
Specific embodiment is following (reference picture 1):
In the automated system of similar financial system, the data collected from different working links and flow run into title not
Unanimously very universal, below by taking organization as an example, reference picture 1 describes the particular content of the present invention in detail.
By taking industrial and commercial bank as an example, system is collected the title come and potentially included:Industrial and commercial bank, industrial and commercial bank, the Industrial and Commercial Bank of China, in
Joint-stock company of industrial and commercial bank of state, China Industrial and Commercial Bank Co., Ltd., so-and-so so-and-so sub-department of area of industrial and commercial bank, etc..It is determined that
During title, not necessarily with " China Industrial and Commercial Bank Co., Ltd. " be standard, according to business the need for, it may be necessary to refer to
Fixed " industrial and commercial bank so-and-so branch " is its title, and is recorded in the title table of comparisons, and other titles are considered to
Criteria of right and wrong title.
To handle non-standard title, financial system can set up the non-standard title table of comparisons, when system runs into non-standard title
Table look-up and find its corresponding title, and continue with.When some non-standard title is not in the non-standard title table of comparisons,
Need this new non-standard title finding its corresponding title and add in the non-standard title table of comparisons.
Prior art is with this new non-standard title (by taking " China Industrial and Commercial Bank Co., Ltd. so-and-so branch " as an example)
Search criterion title table, but because being " industrial and commercial bank so-and-so branch " in standard scale, and with " the limited public affairs of Industrial and Commercial Bank of China's share
Take charge of so-and-so in lines " search, it can not find.Even if now artificial operation, it is also difficult to find " work in huge title table
Business bank so-and-so branch " correspond to therewith.
The solution of the present invention is to set up non-keyword vocabulary, and it is corresponding non-key to be that this name column (or domain name) sets up its
Words, such as:China, share, limited, responsibility, company.
Handled according to step 101 in Fig. 1, will be in " China Industrial and Commercial Bank Co., Ltd. so-and-so branch " all non-close
Key words is removed, and is obtained " industrial and commercial bank " (S1) and " so-and-so branch " (S2), merging S1 and S2, obtain " industrial and commercial bank so-and-so divide
OK " (S).Here the example selected is in order to illustrate S1 to Sn decomposition and synthesis S, if removing only one of which after non-key words
Character string S1, then S1 is S.
Step 102, " industrial and commercial bank so-and-so branch " (S) is searched in title table, finds, go to step 105, afterwards
Terminate.
In other examples, if in step 102, not finding S, then going to step 103;
Step 103, S1 to Sn is searched respectively in title table, exported by comprising how many sequences.It is such as each mark
Quasi- title increase counting module (such as domain, cell), when detecting whether it includes S1 to Sn, often comprising one, the standard name
Count is incremented for the counting module of title, by from more to major general's count sort, and export and supply artificial reference.
Step 104, each word (or letter) in S is searched in title table, by comprising from more to few sequence
Output supplies artificial reference, and terminates process step.This step can be searched S each word by character in realization, or by S
Each word split into a character string, then by string searching.
Step 105, this non-standard title and corresponding title are added into the non-standard title table of comparisons.
Claims (1)
1. the inconsistent integrated conduct method of title in a kind of financial system, including title table and the control of non-standard title
Table, it is characterized in that also including non-keyword vocabulary, and includes following process step:
Listed words in the corresponding non-keyword vocabulary in non-standard title is removed, according to described removed non-key
The difference of words position in the former non-standard title, may resolve into the former non-standard title S1 to Sn some sons
Character string, all S1 to Sn are merged and obtain character string S;
The S is searched as overall character string in the title table, if finding the title for including the S,
By the non-standard title and the title to adding the non-standard title table of comparisons, included if do not found
The title of the S, then;
Check the situation that includes to S1 to the Sn one by one in the title table, and will be pressed comprising result from more to few row
Sequence is exported;
The situation of each word of each title comprising the S is checked in the title table, and result will be included
Exported by from more to few sequence.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710290544.XA CN107153991A (en) | 2017-04-28 | 2017-04-28 | The inconsistent integrated conduct method of title in a kind of financial system |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710290544.XA CN107153991A (en) | 2017-04-28 | 2017-04-28 | The inconsistent integrated conduct method of title in a kind of financial system |
Publications (1)
Publication Number | Publication Date |
---|---|
CN107153991A true CN107153991A (en) | 2017-09-12 |
Family
ID=59793010
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201710290544.XA Pending CN107153991A (en) | 2017-04-28 | 2017-04-28 | The inconsistent integrated conduct method of title in a kind of financial system |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN107153991A (en) |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110298747A (en) * | 2019-07-04 | 2019-10-01 | 中国工商银行股份有限公司 | Remittance message blacklist monitoring system and method |
CN110555089A (en) * | 2019-09-09 | 2019-12-10 | 广东电网有限责任公司 | character name matching method and device and computer readable storage medium |
CN114880430A (en) * | 2022-05-10 | 2022-08-09 | 马上消费金融股份有限公司 | Name processing method and device |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104424202A (en) * | 2013-08-21 | 2015-03-18 | 北大方正集团有限公司 | Method and system for performing duplication checking on customer information in customer relationship management (CRM) system |
CN105184713A (en) * | 2015-07-17 | 2015-12-23 | 四川久远银海软件股份有限公司 | Intelligent matching and sorting system and method capable of benefitting contrast of assigned drugs of medical insurance |
CN105404686A (en) * | 2015-12-10 | 2016-03-16 | 湖南科技大学 | Method for matching place name and address in news event based on geographical feature hierarchical segmented words |
CN106354871A (en) * | 2016-09-18 | 2017-01-25 | 长城计算机软件与系统有限公司 | Similarity search method of enterprise names |
-
2017
- 2017-04-28 CN CN201710290544.XA patent/CN107153991A/en active Pending
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104424202A (en) * | 2013-08-21 | 2015-03-18 | 北大方正集团有限公司 | Method and system for performing duplication checking on customer information in customer relationship management (CRM) system |
CN105184713A (en) * | 2015-07-17 | 2015-12-23 | 四川久远银海软件股份有限公司 | Intelligent matching and sorting system and method capable of benefitting contrast of assigned drugs of medical insurance |
CN105404686A (en) * | 2015-12-10 | 2016-03-16 | 湖南科技大学 | Method for matching place name and address in news event based on geographical feature hierarchical segmented words |
CN106354871A (en) * | 2016-09-18 | 2017-01-25 | 长城计算机软件与系统有限公司 | Similarity search method of enterprise names |
Cited By (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110298747A (en) * | 2019-07-04 | 2019-10-01 | 中国工商银行股份有限公司 | Remittance message blacklist monitoring system and method |
CN110298747B (en) * | 2019-07-04 | 2022-04-12 | 中国工商银行股份有限公司 | Remittance message blacklist monitoring system and method |
CN110555089A (en) * | 2019-09-09 | 2019-12-10 | 广东电网有限责任公司 | character name matching method and device and computer readable storage medium |
CN114880430A (en) * | 2022-05-10 | 2022-08-09 | 马上消费金融股份有限公司 | Name processing method and device |
CN114880430B (en) * | 2022-05-10 | 2023-07-18 | 马上消费金融股份有限公司 | Name processing method and device |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US20120102002A1 (en) | Automatic data validation and correction | |
CN107153991A (en) | The inconsistent integrated conduct method of title in a kind of financial system | |
MX2012008714A (en) | System and method for aggregation and association of professional affiliation data with commercial data content. | |
US11151099B2 (en) | System and method for data structure migration control | |
CN110889310B (en) | Financial document information intelligent extraction system and method | |
CN109582787B (en) | Entity classification method and device for corpus data in thermal power generation field | |
CN106649557B (en) | Semantic association mining method for defect report and mail list | |
WO2012080077A1 (en) | Cleansing a database system to improve data quality | |
CN108446391A (en) | Processing method, device, electronic equipment and the computer-readable medium of data | |
EP2558988A1 (en) | Ascribing actionable attributes to data that describes a personal identity | |
JP2019204535A (en) | Accounting support system | |
AU2019200371A1 (en) | Utilizing artificial intelligence to integrate data from multiple diverse sources into a data structure | |
CN104933077B (en) | Rule-based multifile information analysis method | |
CN104424399A (en) | Knowledge navigation method, device and system based on virus protein body | |
CN106775694B (en) | A kind of hierarchy classification method of software configuration code product | |
CN110597796B (en) | Big data real-time modeling method and system based on full life cycle | |
CN112416918A (en) | Data management system and working method thereof | |
CN109063063B (en) | Data processing method and device based on multi-source data | |
CN105389378A (en) | System for integrating separate data | |
CN112068981A (en) | Knowledge base-based fault scanning recovery method and system in Linux operating system | |
US20120254132A1 (en) | Enhanced Contact Information | |
US20190278568A1 (en) | Recording medium recording generation program, information processing apparatus, and generation method | |
CN110046341B (en) | Method and system for matching information | |
CN111597322B (en) | Automatic template mining system and method based on frequent item sets | |
CN105320717B (en) | The semi-automatic construction method of individual in body learning |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
WD01 | Invention patent application deemed withdrawn after publication |
Application publication date: 20170912 |
|
WD01 | Invention patent application deemed withdrawn after publication |