CN106709805A - Method and system for acquiring user income data - Google Patents

Method and system for acquiring user income data

Info

Publication number
CN106709805A
Authority
CN
China
Prior art keywords
data
user
information
account
data source
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201610493459.9A
Other languages
Chinese (zh)
Other versions
CN106709805B (en)
Inventor
麦金凯
何锐邦
戴云峰
罗谚君
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Tencent Technology Shenzhen Co Ltd
Original Assignee
Tencent Technology Shenzhen Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Tencent Technology Shenzhen Co Ltd
Priority to CN201610493459.9A (patent CN106709805B)
Publication of CN106709805A
Application granted
Publication of CN106709805B
Legal status: Active
Anticipated expiration

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06Q INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q40/00 Finance; Insurance; Tax strategies; Processing of corporate or income taxes
    • G06Q40/06 Asset management; Financial planning or analysis

Abstract

The invention discloses a method and a system for acquiring user income data. The method comprises: acquiring, through a plurality of distributed servers, current data published by a plurality of data sources; if the data published by the plurality of data sources conflict, correcting the data published by the data sources according to the weight of each data source; acquiring account data information of a user; and acquiring all current income data of the user account according to the account data information of the user and the current data published by the plurality of data sources. Because the current data published by the plurality of data sources are acquired through a plurality of distributed servers, mass data can be captured from the data sources quickly, which greatly improves the update speed of the user income data. The data are corrected automatically when the data published by the plurality of data sources conflict, and the overall income data of the user are calculated automatically.

Description

Method and system for acquiring user income data
Technical field
The present invention relates to the field of Internet technology, and in particular to a method and a system for acquiring user income data.
Background
In the prior art, when a user's income is calculated, especially on a terminal such as a mobile phone, the user is usually required to enter income data manually, and the daily income is then updated from the data the user entered. This approach has major drawbacks: entering income data by hand is tedious and error-prone, and the income data may be incorrect or updated late. Moreover, prior-art wealth management product management systems cover a single product type; they can usually obtain only one kind of income information and cannot obtain all income data for all of a user's wealth management products.
Summary of the invention
In view of this, the present invention provides a method for acquiring user income data that can obtain all of a user's current income data in a timely manner. The present invention is implemented as follows:
A method for acquiring user income data, comprising:
acquiring, through a plurality of distributed servers, current data published by a plurality of data sources;
if the data published by the plurality of data sources conflict, correcting the data published by the data sources according to the weight of each data source;
acquiring account data information of a user;
acquiring all current income data of the user account according to the account data information of the user and the current data published by the plurality of data sources.
The present invention also provides a system for acquiring user income data, comprising:
a current data acquisition module, configured to acquire current data published by a plurality of data sources;
the current data acquisition module comprising:
a data acquisition unit, configured to acquire, through a plurality of distributed servers, the current data published by the plurality of data sources;
a conflict early-warning unit, configured to determine whether the data published by the plurality of data sources conflict;
a data correction unit, configured to correct the data published by the data sources according to the weight of each data source when the data published by the plurality of data sources conflict;
a user information management module, configured to acquire account data information of a user;
an income data acquisition module, configured to acquire all current income data of the user account according to the account data information of the user and the current data published by the plurality of data sources.
Implementing the present invention provides the following beneficial effects:
(1) The method for acquiring user income data provided by the present invention first acquires the current data published by a plurality of data sources; these current data include the prices of the various wealth management products published by the data sources. Next, it acquires the account data information of the user, which includes the user's account data related to finance and wealth management, such as the account type and the account amount. Finally, it calculates all current income data of the user's accounts from the account data information and the current data. The present invention thus provides a method for managing multiple wealth management products together and calculates the overall income across different products automatically, which effectively solves the prior-art problem that a management tool covering a single product type yields incomplete, unsystematic user income data. The present invention can also update the user's daily income information automatically and give the user complete, comprehensive income information.
(2) The present invention acquires the current data published by a plurality of data sources through a plurality of distributed servers. Because multiple distributed servers capture the data, mass data can be captured from the data sources quickly, which greatly improves the update speed of the user income data. Using multiple distributed servers also yields comprehensive data, which makes it easier to calculate the user's income data systematically and completely.
(3) Different data sources may publish inconsistent information about the same wealth management product; that is, the data published by the data sources may conflict. When the wealth management data captured from multiple data sources conflict, the present invention applies a multi-source cross-correction algorithm that corrects the data obtained from the data sources according to the weight of each data source, which ensures the correctness of the data captured from multiple sources.
(4) The present invention dynamically adjusts the weight of each data source according to the correction result: it increases the weight of data sources whose data are adopted more often and reduces the weight of data sources whose data are adopted less often, which improves the reliability of the acquired current data.
(5) Because each of the user's wealth management products corresponds to one account, the present invention first acquires the information stream associated with the user's client and obtains the user's account data information from that stream; the account data information includes SMS bills and mail bills. The present invention can obtain the user's wealth management product information automatically from the user's bills, so the user does not need to enter it manually. This simplifies the process by which the user obtains product income and improves the user experience.
(6) The present invention uses an automatic bill parsing method based on same-root backtracking positioning. An extraction expression is set according to the account data information to be extracted. According to the extraction expression, an element matching the expression is searched for in the information stream, and an element that shares a common ancestor with the extraction expression and has an identifying feature is set as the datum point. The nearest ancestor of the datum point is then located in the information stream. Within the scope of that ancestor, information associated with the user's account data information is located using a CSS selector, and the account data information is extracted from the located information using a regular expression. Determining the ancestor scope narrows the search range, the CSS selector narrows it further, and the regular expression finally pinpoints the user's account data information. The method can locate the user's account data information quickly and accurately, which improves the efficiency of extracting it.
Brief description of the drawings
To describe the technical solutions and advantages of the embodiments of the present invention or of the prior art more clearly, the drawings needed for describing the embodiments or the prior art are briefly introduced below. The drawings described below are merely some embodiments of the present invention; a person of ordinary skill in the art can derive other drawings from them without creative effort.
Fig. 1 is a flowchart of the method of Embodiment 1 of the present invention;
Fig. 2 is a flowchart of step S102 of the method of Embodiment 1 of the present invention;
Fig. 3 is a flowchart of step S103 of the method of Embodiment 1 of the present invention;
Fig. 4 is a schematic diagram of the system of Embodiment 2 of the present invention;
Fig. 5 is a schematic diagram of the data acquisition unit of the system of Embodiment 2 of the present invention;
Fig. 6 is a schematic diagram of the current data acquisition module of the system of Embodiment 2 of the present invention;
Fig. 7 is a schematic diagram of the weight adjustment unit of the system of Embodiment 2 of the present invention;
Fig. 8 is a schematic diagram of the information parsing unit of the system of Embodiment 2 of the present invention;
Fig. 9 is another schematic diagram of the system of Embodiment 2 of the present invention;
Fig. 10 is a structural block diagram of the terminal of an embodiment of the present invention.
Detailed description of the embodiments
The technical solutions in the embodiments of the present invention are described clearly and completely below with reference to the drawings. The described embodiments are only some, not all, of the embodiments of the present invention. All other embodiments obtained by a person of ordinary skill in the art from the embodiments of the present invention without creative effort fall within the scope of protection of the present invention.
Embodiment 1:
As shown in Fig. 1, Embodiment 1 of the present invention provides a method for acquiring user income data. It should be noted that the steps illustrated in the flowchart may be executed in a computer system, for example as a set of computer-executable instructions, and that although a logical order is shown in the flowchart, in some cases the steps shown or described may be performed in a different order.
The method of the present invention comprises the following steps:
S101: acquire, through a plurality of distributed servers, the current data published by a plurality of data sources.
In an optional implementation, step S101 includes:
S1011: acquire current raw data from a plurality of preset data sources through the plurality of distributed servers;
A data source here is a source that publishes wealth management product data, for example a website that publishes financial products and wealth management data. Current data in the present invention means current income-related data such as exchange rates and the prices of wealth management products such as funds, bonds, stocks, foreign exchange, futures, and P2P products.
S1012: delete from the raw data the data unrelated to account data information.
The data unrelated to account data information include garbled text, advertisements, and other junk content in the data source (for example, a web page).
The present invention acquires the current data published by a plurality of data sources through a plurality of distributed servers. Because multiple distributed servers capture the data, mass data can be captured from the data sources quickly, which greatly improves the update speed of the wealth management product data. Using multiple distributed servers also yields comprehensive product data, which makes it easier to calculate the user's wealth management income systematically and completely.
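Steps S1011 and S1012 can be illustrated with a minimal sketch. It assumes that a thread pool stands in for the distributed servers and that each data source is represented by a hypothetical fetch function returning text records; the function names, the "AD:" advertisement marker, and the record format are illustrative assumptions, not part of the source.

```python
from concurrent.futures import ThreadPoolExecutor

# Hypothetical stand-ins for crawlers running on separate servers; a real
# deployment would fetch pages over the network instead.
def fetch_source_a():
    return ["000001 unit_nav=1.0680", "AD: open an account today!", ""]

def fetch_source_b():
    return ["000001 unit_nav=1.0681", "\ufffd\ufffd\ufffd"]

def clean(records):
    """S1012 (sketch): drop empty lines, advertisements, and garbled text."""
    kept = []
    for line in records:
        text = line.strip()
        # "AD:" and the U+FFFD replacement character are assumed markers
        # for advertisements and undecodable text respectively.
        if not text or text.startswith("AD:") or "\ufffd" in text:
            continue
        kept.append(text)
    return kept

def acquire(sources):
    """S1011 (sketch): fetch every source in parallel, then clean each result."""
    with ThreadPoolExecutor(max_workers=len(sources)) as pool:
        raw = list(pool.map(lambda fetch: fetch(), sources.values()))
    return {name: clean(records) for name, records in zip(sources, raw)}

current = acquire({"source_a": fetch_source_a, "source_b": fetch_source_b})
```

Each source is fetched independently, so adding a source only adds one more parallel task; the cleaning rule would need to be replaced by whatever filtering the real pages require.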
S102: determine whether the data published by the plurality of data sources conflict; if so, correct the data published by the data sources according to the weight of each data source.
As shown in Fig. 2, in an optional implementation, step S102 includes:
S1021: obtain the preset initial weight of each data source. The initial weights may be set before steps S101 and S102; no specific limitation is imposed.
S1022: group the data sources by the data they published, and add up the weights of the data sources in each group.
Specifically, grouping the data sources by the data they published means placing the data sources that published identical data in the same group.
S1023: set the data of the group with the largest summed weight as the final data.
For example, suppose fund data are obtained mainly from three data sources: Tiantian Fund, Howbuy Fund, and Shumi Fund. The preset data-reliability weights of the three data sources are: Tiantian Fund 40%, Howbuy Fund 30%, Shumi Fund 30%. Referring to Table 1, the net values of the Huaxia Growth (000001) fund captured from the three data sources on a given day are:
Table 1
Data source             Tiantian Fund   Howbuy Fund   Shumi Fund
Unit net value          1.0680          1.0680        1.0681
Accumulated net value   3.3690          3.3691        3.3691
The data sources are grouped by the data they published, and the weights of the data sources in each group are added up.
Specifically, the data sources that published the same unit net value are placed in one group. For example, Tiantian Fund and Howbuy Fund both published a unit net value of 1.0680, so they form one group and their weights are added: 40% + 30%. The only data source with a unit net value of 1.0681 is Shumi Fund, whose weight is 30%.
The data of the group with the largest summed weight are set as the final data.
Unit net value: 1.0680 (40% + 30%) > 1.0681 (30%);
Accumulated net value: 3.3690 (40%) < 3.3691 (30% + 30%).
Therefore, as shown in Table 2, the final unit net value is 1.0680 and the final accumulated net value is 3.3691.
Table 2
Unit net value          1.0680
Accumulated net value   3.3691
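The grouping and weight summation of steps S1022 and S1023 can be sketched as follows, using the figures from the worked example. The three data sources are labeled source_1 to source_3 in the order they are listed in the example; the function name `correct` and the use of strings for the published values are assumptions made for illustration.

```python
from collections import defaultdict
from decimal import Decimal

def correct(published, weights):
    """Return the value whose supporting sources have the largest total weight."""
    totals = defaultdict(Decimal)  # Decimal() starts each group at 0
    for source, value in published.items():
        totals[value] += weights[source]  # S1022: sum weights per group
    # S1023: pick the group with the largest summed weight.
    # On a tie, max() keeps the value that was seen first.
    return max(totals, key=totals.get)

# Preset weights from the example: 40%, 30%, 30%.
weights = {
    "source_1": Decimal("0.40"),
    "source_2": Decimal("0.30"),
    "source_3": Decimal("0.30"),
}
unit_nav = {"source_1": "1.0680", "source_2": "1.0680", "source_3": "1.0681"}
accumulated_nav = {"source_1": "3.3690", "source_2": "3.3691", "source_3": "3.3691"}

final_unit = correct(unit_nav, weights)
final_accumulated = correct(accumulated_nav, weights)
```

Values are kept as strings so that sources are grouped only when they published exactly the same figure, which avoids floating-point comparison issues.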
In an optional implementation, to improve the credibility of the data obtained from the data sources, the method may further include:
S1024: adjust the weight of each data source according to the correction result.
Specifically, if the data obtained from a data source are set as the final data, the weight of that data source is increased; if the data obtained from a data source are not set as the final data, the weight of that data source is reduced.
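Step S1024 can be sketched as below. The source states only that weights are increased or reduced; the fixed step size and the renormalization so that the weights sum to 1 are assumptions added for illustration, as are the source labels source_1 to source_3.

```python
from decimal import Decimal

STEP = Decimal("0.05")  # hypothetical adjustment size; the source gives none

def adjust_weights(weights, published, final_value, step=STEP):
    """Raise the weight of sources whose data was adopted, lower the others."""
    adjusted = {}
    for source, weight in weights.items():
        if published[source] == final_value:
            adjusted[source] = weight + step
        else:
            adjusted[source] = max(weight - step, Decimal("0"))
    # Assumed extra step: renormalize so the weights still sum to 1.
    total = sum(adjusted.values())
    return {source: weight / total for source, weight in adjusted.items()}

old_weights = {
    "source_1": Decimal("0.40"),
    "source_2": Decimal("0.30"),
    "source_3": Decimal("0.30"),
}
published = {"source_1": "1.0680", "source_2": "1.0680", "source_3": "1.0681"}
new_weights = adjust_weights(old_weights, published, final_value="1.0680")
```

With these inputs the two sources that published the adopted value gain weight and the third loses weight, so repeated disagreement gradually reduces a source's influence on later corrections.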
S103: acquire the account data information of the user.
The account data information of the user includes the user's account data related to finance and wealth management, such as the account type and the account amount. The account type covers the names and kinds of the financial or wealth management products the user has purchased, such as funds, bonds, stocks, foreign exchange, futures, P2P products, and various interest-bearing deposits. The account amount covers the amount data for each financial and wealth management product in which the user participates.
The account data information of the user may be information that the user enters manually.
In an optional implementation, step S103 includes:
acquiring the information stream associated with the user's client and obtaining the account data information of the user from the information stream, where the account data information includes SMS bills and mail bills.
The information stream associated with the user's client, as used in the present invention, means all information received or sent by the user's client, including SMS messages, e-mail, and messages received and sent through MSN.
A bill page is usually generated by a system, has a complex structure, and lacks distinguishing features, so extracting bill information by conventional regular-expression matching is very difficult. To solve this problem, as shown in Fig. 3, in an optional implementation, step S103 further includes:
S1031: set an extraction expression according to the account data information of the user to be extracted.
S1032: according to the extraction expression, search the information stream for an element matching the extraction expression, and set as the datum point an element that shares a common ancestor with the extraction expression and has an identifying feature;
S1033: locate the nearest ancestor of the datum point in the information stream;
S1034: within the scope of that ancestor, locate the information associated with the account data information of the user using a CSS selector;
S1035: from the located information, extract the account data information of the user using a regular expression.
For example, a user's bill from a certain bank contains "minimum payment due for the current period: 50.00". The account data information of the user is obtained as follows. The goal is to extract "minimum payment due for the current period: 50.00" from the bill.
The inventors tried to extract the "minimum payment due for the current period" by regular-expression analysis and found that, because the HTML page structure of a mail bill is too complex, the matching regular expression also became more and more complex, so that approach was not feasible. The inventors also tried extraction based on CSS selectors alone; because the HTML pages of mail bills are all laid out with tables, the content is highly similar and lacks identifying features, so that approach was not feasible either. The method of the present invention solves the problem well. Specifically, the method is as follows:
First, determine the extraction expression;
Second, find an element that shares a common ancestor with the "minimum payment due" data and also has an identifying feature, and use it as the datum point. Here the datum point is the text "minimum payment due for the current period", and the ancestor is the enclosing table one level above the "minimum payment due" data.
Third, find the nearest common ancestor of the two; because they are in the same table, the nearest common ancestor is that table;
Fourth, approach the "minimum payment due" data using a CSS (Cascading Style Sheets) selector; the selector may be an adjacent sibling selector or another CSS selector such as a descendant selector;
Finally, extract the "minimum payment due" data using a regular expression.
The extraction expression configured in the present invention may be as follows:
Because each of the user's wealth management products corresponds to one account, the present invention first acquires the information stream associated with the user's client and obtains the user's account data information from that stream; the account data information includes SMS bills and mail bills. The present invention can obtain the user's wealth management product information automatically from the user's bills, so the user does not need to enter it manually. This simplifies the process by which the user obtains product income and improves the user experience.
The present invention uses an automatic bill parsing method based on same-root backtracking positioning. An extraction expression is set according to the account data information to be extracted. According to the extraction expression, an element matching the expression is searched for in the information stream, and an element that shares a common ancestor with the extraction expression and has an identifying feature is set as the datum point. The nearest ancestor of the datum point is then located in the information stream. Within the scope of that ancestor, information associated with the user's account data information is located using a CSS selector, and the account data information is extracted from the located information using a regular expression. Determining the ancestor scope narrows the search range, the CSS selector narrows it further, and the regular expression finally pinpoints the user's account data information. The method can locate the user's account data information quickly and accurately, solves the problem that bill information is hard to extract because bill content is complex, and improves the efficiency of extracting user account data information.
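The same-root backtracking steps S1031 to S1035 can be illustrated with a minimal sketch. It is not the configured expression of the source: the bill fragment, the label text, and the function name `extract_amount` are hypothetical, the fragment is assumed to be well-formed so that the standard-library XML parser can read it, and the CSS adjacent sibling selector is emulated by taking the next cell in the same row.

```python
import re
import xml.etree.ElementTree as ET

# Hypothetical, well-formed bill fragment; real mail bills are messier HTML.
BILL = """
<table>
  <tr><td>Statement date</td><td>2016-06-01</td></tr>
  <tr><td>Minimum payment due</td><td>CNY 50.00</td></tr>
</table>
""".strip()

def extract_amount(html, label):
    root = ET.fromstring(html)
    # ElementTree has no parent pointers, so build a child -> parent map.
    parent_of = {child: parent for parent in root.iter() for child in parent}

    # Datum point: the element carrying the recognizable label text.
    datum = next(
        (el for el in root.iter() if (el.text or "").strip() == label), None
    )
    if datum is None:
        return None

    # Nearest ancestor: climb until the enclosing table is reached.
    ancestor = datum
    while ancestor.tag != "table" and ancestor in parent_of:
        ancestor = parent_of[ancestor]

    # Within the ancestor, take the cell right after the datum point, which
    # is what the CSS adjacent sibling selector "td + td" would match.
    for row in ancestor.iter("tr"):
        cells = list(row)
        for i, cell in enumerate(cells[:-1]):
            if cell is datum:
                # Final step: a regular expression pulls out the number.
                match = re.search(r"\d+(?:\.\d+)?", cells[i + 1].text or "")
                return match.group(0) if match else None
    return None

amount = extract_amount(BILL, "Minimum payment due")
```

Because the search is confined to the table that contains the label, the regular expression can stay very simple even when the full page contains many similar tables.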
S104: acquire all current income data of the user account according to the account data information of the user and the current data published by the plurality of data sources.
Specifically, step S104 includes: according to the account data information of the user, looking up in the current data obtained in step S102 the current data corresponding to the user's account data information, and calculating the current income data of each of the user's accounts. To give the user a more intuitive view, the current income data of the individual accounts may also be combined to obtain the total income of the user's accounts.
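Step S104 can be sketched as a lookup followed by a per-account calculation and a sum. The source does not give an income formula, so the one used here (units held multiplied by the gain per unit since purchase) is an assumption, as are the field names `units` and `cost_price` and the product code FUND_B.

```python
from decimal import Decimal

def account_income(holding, price):
    # Assumed formula: units held times the gain per unit since purchase.
    return holding["units"] * (price - holding["cost_price"])

def total_income(holdings, current_prices):
    """Compute income per account, then the total across all accounts."""
    per_account = {
        code: account_income(holding, current_prices[code])
        for code, holding in holdings.items()
        if code in current_prices  # skip products with no current data
    }
    return per_account, sum(per_account.values(), Decimal("0"))

# Hypothetical account data information for one user.
holdings = {
    "000001": {"units": Decimal("1000"), "cost_price": Decimal("1.0500")},
    "FUND_B": {"units": Decimal("200"), "cost_price": Decimal("2.0000")},
}
# Corrected current data, as produced by step S102.
prices = {"000001": Decimal("1.0680"), "FUND_B": Decimal("1.9500")}

per_account, total = total_income(holdings, prices)
```

Here the first account shows a gain and the second a loss, and the total is the sum of both, which corresponds to the combined income figure described above.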
The method for acquiring user income data provided by the present invention first acquires the current data published by a plurality of data sources; these current data include the prices of the various wealth management products published by the data sources. Next, it acquires the account data information of the user, which includes the user's account data related to finance and wealth management, such as the account type and the account amount. Finally, it calculates all current income data of the user's accounts from the account data information and the current data. The present invention thus provides a method for managing multiple wealth management products together and calculates the overall income across different products automatically, which effectively solves the prior-art problem that a management tool covering a single product type yields incomplete, unsystematic user income data. The present invention can also update the user's daily income information automatically and give the user complete, comprehensive income information.
Different data sources may publish inconsistent information about the same wealth management product; that is, the data published by the data sources may conflict. When the wealth management data captured from multiple data sources conflict, the present invention applies a multi-source cross-correction algorithm that corrects the raw data of the wealth management product according to the weight of each data source, which ensures the correctness of the data captured from multiple sources.
The present invention dynamically adjusts the weight of each data source according to the correction result: it increases the weight of data sources whose data are adopted more often and reduces the weight of data sources whose data are adopted less often, which improves the reliability of the acquired current data.
Embodiment 2:
As shown in Fig. 4, the present invention provides a system for acquiring user income data, comprising:
a current data acquisition module, configured to acquire current data published by a plurality of data sources;
the current data acquisition module comprising:
a data acquisition unit, configured to acquire, through a plurality of distributed servers, the current data published by the plurality of data sources;
a conflict early-warning unit, configured to determine whether the data published by the plurality of data sources conflict;
a data correction unit, configured to correct the data published by the data sources according to the weight of each data source when the data published by the plurality of data sources conflict;
a user information management module, configured to acquire account data information of a user;
an income data acquisition module, configured to acquire all current income data of the user account according to the account data information of the user and the current data published by the plurality of data sources.
In an optional embodiment, Fig. 5 is a schematic diagram of the data acquisition unit of the present invention. As shown in Fig. 5, the data acquisition unit includes:
a raw data acquisition subunit, configured to acquire current raw data from a plurality of preset data sources through a plurality of distributed servers;
a cleaning subunit, configured to delete from the raw data the data unrelated to account data information.
In an optional embodiment, Fig. 6 is another structural schematic diagram of the current data acquisition module of the present invention. As shown in Fig. 6, the current data acquisition module includes:
a data acquisition unit, configured to acquire, through a plurality of distributed servers, the current data published by a plurality of data sources;
a conflict early-warning unit, configured to determine whether the data published by the plurality of data sources conflict;
a data correction unit, configured to correct the data published by the data sources according to the weight of each data source when the data published by the plurality of data sources conflict;
a weight adjustment unit, configured to adjust the weight of each data source according to the correction result.
In an optional embodiment, Fig. 7 is a schematic diagram of the weight adjustment unit of the present invention. As shown in Fig. 7, the weight adjustment unit includes:
an initial value setting subunit, configured to obtain the preset initial weight of each data source;
a grouping calculation subunit, configured to group the data sources by the data they published and to add up the weights of the data sources in each group;
a correction subunit, configured to set the data of the group with the largest summed weight as the final data.
In an optional embodiment, the user information management module includes an information parsing unit, configured to acquire the information stream associated with the user's client and to obtain the account data information of the user from the information stream, where the account data information includes SMS bills and mail bills.
In an optional embodiment, as shown in Fig. 8, the information parsing unit includes:
an expression setting subunit, configured to set an extraction expression according to the account data information of the user to be extracted;
a datum point setting subunit, configured to search the information stream, according to the extraction expression, for an element matching the extraction expression, and to set as the datum point an element that shares a common ancestor with the extraction expression and has an identifying feature;
an ancestor lookup subunit, configured to locate the nearest ancestor of the datum point in the information stream;
an approach subunit, configured to locate, within the scope of the ancestor, the information associated with the account data information of the user using a CSS selector;
an extraction subunit, configured to extract the account data information of the user from the located information using a regular expression.
Fig. 9 is a structural block diagram of the system of the present invention in a specific application scenario.
The system of the present invention can be applied in terminal management software, such as Tencent Mobile Manager, to make it convenient to calculate the income of all of a user's wealth management products.
When the number of client terminals is very large, the kinds of financial and wealth management products in which the users of those clients participate are also highly varied. If the server fetched raw data from the data sources every time it obtained the income data of one client, the server would come under heavy load and become too busy. To solve this problem, the present invention uses a server cluster composed of multiple distributed servers to acquire the current data published by the data sources and stores the acquired data in a database. When the income data of a client are obtained, only the required data need to be extracted from the database. This shares the data across clients, improves the efficiency of the system, and greatly reduces the load on the server.
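The store-once, read-many arrangement described above can be sketched with an in-memory SQLite database standing in for the shared database. The table layout, the function names, and the product code FUND_B are assumptions for illustration; the source does not specify a schema.

```python
import sqlite3

db = sqlite3.connect(":memory:")  # stand-in for the shared database
db.execute("CREATE TABLE price (code TEXT PRIMARY KEY, unit_nav TEXT)")

def store_corrected(rows):
    """Called by the acquisition back end after each crawl and correction."""
    db.executemany(
        "INSERT OR REPLACE INTO price (code, unit_nav) VALUES (?, ?)", rows
    )
    db.commit()

def lookup(codes):
    """Called per client: read only the products that client holds."""
    marks = ",".join("?" for _ in codes)
    query = f"SELECT code, unit_nav FROM price WHERE code IN ({marks})"
    return dict(db.execute(query, list(codes)))

# One write by the server cluster serves any number of client reads.
store_corrected([("000001", "1.0680"), ("FUND_B", "1.9500")])
client_view = lookup(["000001"])
```

Client requests never trigger a crawl in this arrangement, so the load on the data sources and on the crawling servers is independent of the number of clients.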
A data source here is a source that publishes wealth management product data, for example a website that publishes financial products and wealth management data. Current data in the present invention means current income-related data such as exchange rates and the prices of wealth management products such as funds, bonds, stocks, foreign exchange, futures, and P2P products.
The current data acquisition module of the present invention may also be called the financial and wealth management data acquisition back end. It includes a data cleaning module and a data normalization module, which first clean and normalize the acquired data, for example by removing useless data such as advertisements and garbled text to obtain valid data.
The current data acquisition module may also include a database. Data processed by the data cleaning module and the data normalization module can be stored in this database, which may be called the financial and wealth management database.
Current data acquisition module also includes conflict early warning plane, for obtaining data, and the number to obtaining from database Colliding data in carries out contrast verification, corrects and carry out conflict early warning automatically, and the data after correction are stored in into finance and money management Database is medium to be called.Specifically, conflict early warning plane is used to judge that the data of multiple data source issues whether there is Conflict, if so, then sending conflict early warning, finance and money management data acquisition backstage is carried out automatically after being connected to conflict early warning to colliding data Correction.Automatically the process of correction is:According to the weight of each data source, the data to data source issue are corrected.Corrected Journey includes:The initial weight of default each data source is obtained in advance.By multiple data sources according to the packet issued, will be every One group of weight of corresponding data source is added.Specifically, multiple data sources are included according to the packet of issue:Will issue Data identical data source be divided into one group.Will add up one group of maximum data of rear weight and be set to the data after correction.Correction Data afterwards can be deposited into finance and money management database.
To improve the credibility of data obtained from the data sources, the conflict early-warning machine may also include a weight adjustment module that adjusts the weight of each data source according to the correction result. Specifically, if the data obtained from a data source is set as the final data, the weight of that data source is increased; otherwise, its weight is reduced.
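A minimal sketch of the weight adjustment, assuming a fixed step and a lower bound of zero. Both are illustrative assumptions: the patent states only that weights are increased or reduced, not by how much.

```python
def adjust_weights(reported, weights, final_value, step=0.125, floor=0.0):
    """Raise the weight of every source whose value was adopted as the
    final data and lower the weight of every other source."""
    adjusted = {}
    for source, value in reported.items():
        if value == final_value:
            adjusted[source] = weights[source] + step
        else:
            adjusted[source] = max(floor, weights[source] - step)
    return adjusted
```

For example, `adjust_weights({"a": 1.0, "b": 2.0}, {"a": 0.5, "b": 0.5}, 1.0)` raises the weight of `a` and lowers that of `b`, so a source that repeatedly disagrees with the adopted value gradually loses influence in later corrections.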
After completing data correction and the weight adjustment of the data sources, the conflict early-warning machine stores the updated data in the financial management database.
The current data acquisition module can obtain the current data published by multiple data sources at a predetermined time interval, for example once a day. For income data that updates faster, the acquisition frequency can be increased. Because users generally only need to check income data periodically, acquiring data at a predetermined interval provides the income data they need, while the server only needs to fetch data from the data sources periodically, which reduces server overhead.
The user information management module may also be called the user financial management data management system; its hardware is one or more servers. The module can receive account data information entered by the user. For example, the user logs in to the Tencent Mobile Manager account system, enters information on the financial products purchased, and authorizes Tencent Mobile Manager to manage them. In addition, with the user's authorization, the module can obtain the user's email bills and SMS bills and, through an automatic parsing algorithm based on same-root backtracking positioning, automatically import with one tap the information of all purchased financial products, including stocks, funds, and P2P products. The user only needs to log in to Tencent Mobile Manager and authorize it to manage their financial products; Mobile Manager can then update the user's income data regularly through the Tencent financial income calculation system.
Specifically, the user information management module includes a user financial management information database for storing the user's account information. The user's account data information includes account data related to financial management, such as account category and account amount. The account category includes the names and types of the financial or wealth management products the user has purchased, such as funds, bonds, stocks, foreign exchange, futures, P2P products, and various interest-bearing deposits. The account amount includes the amount invested in each financial and wealth management product the user holds.
The user's account data information can be obtained in two ways. The first is user entry: the user information management module provides an interface through which the user enters account data information. The second is automatic acquisition by the system with the user's authorization.
The user information management module includes an information parsing unit which, after the user grants authorization, obtains the information stream associated with the user client and obtains the user's account data information from that stream; the account data information includes SMS bills and email bills.
In the present invention, the information stream associated with the user client refers to all information received or sent by the user client, including SMS messages, emails, messages received and sent through MSN, and so on.
The information parsing unit includes:
An expression setting subunit, for setting an extraction expression according to the user account data information to be extracted;
A datum point setting subunit, for searching the information stream for elements matching the extraction expression, and setting as the datum point an element that shares an ancestor with the extraction expression and has an identifying feature;
An ancestor search subunit, for finding the nearest ancestor of the datum point in the information stream;
An approach subunit, for searching, within the scope of that ancestor, for information associated with the user's account data information by means of CSS selectors;
An extraction subunit, for extracting the user's account data information from the information found, by means of a regular expression.
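The five subunits can be sketched on a small, well-formed HTML bill. This is an illustration under stated assumptions, not the patented implementation: the bill markup, the label text, and the function name are hypothetical, and the standard-library `xml.etree.ElementTree` path syntax stands in for a CSS selector such as `td.value`.

```python
import re
import xml.etree.ElementTree as ET

# Hypothetical email bill; real bills would need a tolerant HTML parser.
BILL = """<html><body>
  <div class="ad">Limited offer</div>
  <table>
    <tr><td class="label">Product</td><td class="value">Fund A</td></tr>
    <tr><td class="label">Amount</td><td class="value">CNY 1,500.00</td></tr>
  </table>
</body></html>"""


def extract_amount(markup, label="Amount"):
    root = ET.fromstring(markup.strip())
    # ElementTree keeps no parent pointers, so build a child -> parent map.
    parent_of = {child: parent for parent in root.iter() for child in parent}

    # Datum point: an element with an identifying feature (its label text).
    datum = next((el for el in root.iter() if (el.text or "").strip() == label), None)
    if datum is None:
        return None

    # Nearest ancestor shared by the datum point and the wanted value.
    ancestor = parent_of.get(datum)
    if ancestor is None:
        return None

    # Approach: search only inside that ancestor (stand-in for "td.value").
    target = ancestor.find("./td[@class='value']")
    if target is None or target.text is None:
        return None

    # Extract: pull the number out with a regular expression.
    match = re.search(r"\d[\d,]*(?:\.\d+)?", target.text)
    return float(match.group().replace(",", "")) if match else None
```

Restricting the search to the nearest ancestor of the datum point keeps unrelated parts of the message, such as the advertisement block, out of the match.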
The income data acquisition module may also be called the financial income calculation engine; its hardware is likewise a server. Based on the user's account data information and the corresponding current data obtained from the data sources, it calculates the current income data for each of the user's accounts. To give the user a more intuitive view, the current income data of the individual accounts can be aggregated into a total income for the user's accounts. Specifically, the module calculates the income of all the user's financial products from the products purchased and a given calculation formula, aggregates the results, and pushes them to the user in a single message, so that the user can clearly see the returns on their investments. The income data can be pushed at a predetermined time interval, which the user may set, thereby improving the user experience.
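A minimal sketch of the per-account and total income calculation. The patent does not give the calculation formula, so the one used here (units held multiplied by the difference between the current price and the purchase price) is an assumption, as are the `Holding` type and the example products.

```python
from dataclasses import dataclass


@dataclass
class Holding:
    product: str       # name of the fund, stock, or other product
    units: float       # quantity held
    cost_price: float  # price paid per unit


def current_income(holding, current_prices):
    """Assumed formula: units * (current price - purchase price)."""
    return holding.units * (current_prices[holding.product] - holding.cost_price)


def total_income(holdings, current_prices):
    """Return the income per product and the sum over all products."""
    per_account = {h.product: current_income(h, current_prices) for h in holdings}
    return per_account, sum(per_account.values())


# Hypothetical holdings and corrected current prices.
holdings = [Holding("Fund A", 100, 1.5), Holding("Stock B", 10, 20.0)]
prices = {"Fund A": 1.75, "Stock B": 19.5}
per_account, total = total_income(holdings, prices)
```

The per-product figures correspond to the current income data of each account, and the sum corresponds to the aggregated income pushed to the user.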
Embodiment 3
Embodiments of the invention also provide a computer terminal, which can be any computer terminal in a group of computer terminals. Optionally, in this embodiment, the computer terminal can also be replaced by a terminal device such as a mobile terminal.
Optionally, in this embodiment, the computer terminal may be located in at least one of multiple network devices of a computer network.
Optionally, Figure 10 is a structural block diagram of a computer terminal according to an embodiment of the invention. As shown in Figure 10, computer terminal A may include one or more processors 101 (only one is shown in the figure), a memory 103, and a transmission device 105.
The memory 103 can be used to store software programs and modules, such as the program instructions/modules corresponding to the short text classification method and apparatus in the embodiments of the invention. The processor 101 runs the software programs and modules stored in the memory 103 to perform various functional applications and data processing, that is, to implement the above short text classification. The memory 103 may include high-speed random access memory and may also include non-volatile memory, such as one or more magnetic storage devices, flash memory, or other non-volatile solid-state memory. In some examples, the memory 103 may further include memory located remotely from the processor 101, which can be connected to terminal A over a network. Examples of such networks include, but are not limited to, the Internet, intranets, local area networks, mobile communication networks, and combinations thereof.
The transmission device 105 is used to receive or send data via a network. Specific examples of the network may include wired and wireless networks. In one example, the transmission device 105 includes a network adapter that can be connected by a network cable to other network devices and a router so as to communicate with the Internet or a local area network. In another example, the transmission device 105 is a radio frequency module used to communicate with the Internet wirelessly.
Specifically, the memory 103 is used to store preset action conditions, information on users with preset permissions, and application programs.
The processor 101 can call the information and application programs stored in the memory 103 through the transmission device to perform the following steps:
Optionally, the processor 101 can also execute program code for the following steps:
Obtaining the current data published by multiple data sources;
Judging whether the data published by the multiple data sources conflict;
When the data published by the multiple data sources conflict, correcting the data published by the data sources according to the weight of each data source;
Obtaining the account data information of the user;
Obtaining all current income data of the user's account according to the user's account data information and the current data published by the multiple data sources.
Optionally, for specific examples in this embodiment, refer to the examples described in Embodiments 1 and 2 above; they are not repeated here.
The above embodiments of the invention are for description only and do not indicate the relative merits of the embodiments.
If the integrated unit in the above embodiments is implemented as a software functional unit and sold or used as an independent product, it can be stored in the above computer-readable storage medium. On this understanding, the technical solution of the invention in essence, or the part that contributes to the prior art, or all or part of the technical solution, can be embodied as a software product. The computer software product is stored in a storage medium and includes instructions that cause one or more computer devices (which may be personal computers, servers, network devices, or the like) to perform all or part of the steps of the methods described in the embodiments of the invention.
In the above embodiments of the invention, the description of each embodiment has its own emphasis. For parts not described in detail in one embodiment, refer to the related descriptions of other embodiments.
In the several embodiments provided in this application, it should be understood that the disclosed client can be implemented in other ways. The device embodiments described above are merely illustrative. For example, the division into units is only a division by logical function; other divisions are possible in an actual implementation, for example multiple units or components may be combined or integrated into another system, or some features may be ignored or not performed. In addition, the mutual coupling, direct coupling, or communication connection shown or discussed may be indirect coupling or communication connection of units or modules through some interfaces, and may be electrical or take other forms.
The units described as separate components may or may not be physically separate, and the components displayed as units may or may not be physical units; that is, they may be located in one place or distributed across multiple network units. Some or all of the units can be selected according to actual needs to achieve the purpose of the solution of this embodiment.
In addition, the functional units in the embodiments of the invention may be integrated into one processing unit, each unit may exist physically on its own, or two or more units may be integrated into one unit. The integrated unit can be implemented in hardware or as a software functional unit.
The above is only the preferred embodiment of the invention. It should be noted that those of ordinary skill in the art can make improvements and modifications without departing from the principles of the invention, and these improvements and modifications should also be regarded as within the protection scope of the invention.

Claims (14)

1. A method for acquiring user income data, characterized by comprising:
Obtaining, through multiple distributed servers, the current data published by multiple data sources;
If the data published by the multiple data sources conflict, correcting the data published by the data sources according to the weight of each data source;
Obtaining the account data information of the user;
Obtaining all current income data of the user's account according to the user's account data information and the current data published by the multiple data sources.
2. The method for acquiring user income data according to claim 1, characterized in that correcting the data published by the data sources according to the weight of each data source comprises:
Obtaining the preset initial weight of each data source;
Grouping the multiple data sources by the data they publish, and summing the weights of the data sources in each group;
Setting the data of the group with the largest summed weight as the final data.
3. The method for acquiring user income data according to claim 2, characterized in that, after correcting the data published by the data sources according to the weight of each data source, the method further comprises:
Adjusting the weight of each data source according to the correction result.
4. The method for acquiring user income data according to claim 3, characterized in that adjusting the weight of each data source according to the correction result comprises:
If the data obtained from a data source is set as the final data, increasing the weight of that data source; if the data obtained from a data source is not set as the final data, reducing the weight of that data source.
5. The method for acquiring user income data according to claim 1, characterized in that obtaining, through multiple distributed servers, the current data published by multiple data sources comprises:
Obtaining current raw data from multiple preset data sources through multiple distributed servers;
Deleting from the raw data the data unrelated to the account data information.
6. The method for acquiring user income data according to claim 1, characterized in that obtaining the account data information of the user comprises:
Obtaining the information stream associated with the user client, and obtaining the user's account data information from the information stream, the account data information including SMS bills and email bills.
7. The method for acquiring user income data according to claim 6, characterized in that obtaining the user's account data information from the information stream comprises:
Setting an extraction expression according to the user account data information to be extracted;
Searching the information stream for elements matching the extraction expression, and setting as the datum point an element that shares an ancestor with the extraction expression and has an identifying feature;
Finding the nearest ancestor of the datum point in the information stream;
Within the scope of that ancestor, searching for information associated with the user's account data information by means of CSS selectors;
Extracting the user's account data information from the information found, by means of a regular expression.
8. A system for acquiring user income data, characterized by comprising:
A current data acquisition module, for obtaining the current data published by multiple data sources;
The current data acquisition module includes:
A data acquisition unit, for obtaining, through multiple distributed servers, the current data published by multiple data sources;
A conflict early-warning unit, for judging whether the data published by the multiple data sources conflict;
A data correction unit, for correcting the data published by the data sources according to the weight of each data source when the data published by the multiple data sources conflict;
A user information management module, for obtaining the account data information of the user;
An income data acquisition module, for obtaining all current income data of the user's account according to the user's account data information and the current data published by the multiple data sources.
9. The system for acquiring user income data according to claim 8, characterized in that the current data acquisition module further includes a weight adjustment unit, the weight adjustment unit being used to adjust the weight of each data source according to the correction result.
10. The system for acquiring user income data according to claim 8, characterized in that the weight adjustment unit includes:
An initial value setting subunit, for obtaining the preset initial weight of each data source;
A grouping calculation subunit, for grouping the multiple data sources by the data they publish and summing the weights of the data sources in each group;
A correction subunit, for setting the data of the group with the largest summed weight as the final data.
11. The system for acquiring user income data according to claim 9, characterized in that the weight adjustment unit is further used to: increase the weight of a data source if the data obtained from that data source is set as the final data; and reduce the weight of a data source if the data obtained from that data source is not set as the final data.
12. The system for acquiring user income data according to claim 8, characterized in that the data acquisition unit includes:
A raw data acquisition subunit, for obtaining current raw data from multiple preset data sources through multiple distributed servers;
A cleansing subunit, for deleting from the raw data the data unrelated to the account data information.
13. The system for acquiring user income data according to claim 8, characterized in that the user information management module includes: an information parsing unit, for obtaining the information stream associated with the user client and obtaining the user's account data information from the information stream, the account data information including SMS bills and email bills.
14. The system for acquiring user income data according to claim 13, characterized in that the information parsing unit includes:
An expression setting subunit, for setting an extraction expression according to the user account data information to be extracted;
A datum point setting subunit, for searching the information stream for elements matching the extraction expression, and setting as the datum point an element that shares an ancestor with the extraction expression and has an identifying feature;
An ancestor search subunit, for finding the nearest ancestor of the datum point in the information stream;
An approach subunit, for searching, within the scope of that ancestor, for information associated with the user's account data information by means of CSS selectors;
An extraction subunit, for extracting the user's account data information from the information found, by means of a regular expression.
CN201610493459.9A 2016-06-29 2016-06-29 User income data acquisition method and system Active CN106709805B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201610493459.9A CN106709805B (en) 2016-06-29 2016-06-29 User income data acquisition method and system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201610493459.9A CN106709805B (en) 2016-06-29 2016-06-29 User income data acquisition method and system

Publications (2)

Publication Number Publication Date
CN106709805A true CN106709805A (en) 2017-05-24
CN106709805B CN106709805B (en) 2020-09-25

Family

ID=58939748

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201610493459.9A Active CN106709805B (en) 2016-06-29 2016-06-29 User income data acquisition method and system

Country Status (1)

Country Link
CN (1) CN106709805B (en)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108764348A (en) * 2018-05-30 2018-11-06 口口相传(北京)网络技术有限公司 Collecting method based on multiple data sources and system
CN110502521A (en) * 2019-08-28 2019-11-26 上海寰创通信科技股份有限公司 A kind of method for building up of file store
CN110517083A (en) * 2019-08-27 2019-11-29 秒针信息技术有限公司 A kind of method and device of determining customer attribute information
CN111563778A (en) * 2020-05-12 2020-08-21 北京口袋财富信息科技有限公司 Information pushing method and device
CN116089907A (en) * 2023-04-13 2023-05-09 民航成都信息技术有限公司 Fusion method and device of aviation multi-source data, electronic equipment and storage medium

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20050171884A1 (en) * 2004-02-04 2005-08-04 Research Affiliates, Llc Non-capitalization weighted indexing system, method and computer program product
CN101576990A (en) * 2008-05-06 2009-11-11 中国建设银行股份有限公司 Banking service processing system
CN103593368A (en) * 2012-08-16 2014-02-19 深圳市世纪光速信息技术有限公司 Method, server, terminal and system for selecting data sources
CN104978688A (en) * 2014-04-02 2015-10-14 陈衡 Unbidden fund value increasing device, unbidden fund value increasing method and financing system
CN105323654A (en) * 2014-08-05 2016-02-10 优视科技有限公司 Method and device for displaying content data from network
CN105427166A (en) * 2015-11-13 2016-03-23 中国建设银行股份有限公司 Bank account type detection method and system

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
孙飞: ""基于DOM节点文本密度的网页核心块抽取算法研究"", 《中国优秀硕士学位论文全文数据库 信息科技辑》 *

Cited By (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108764348A (en) * 2018-05-30 2018-11-06 口口相传(北京)网络技术有限公司 Collecting method based on multiple data sources and system
CN108764348B (en) * 2018-05-30 2020-07-10 口口相传(北京)网络技术有限公司 Data acquisition method and system based on multiple data sources
CN110517083A (en) * 2019-08-27 2019-11-29 秒针信息技术有限公司 A kind of method and device of determining customer attribute information
CN110502521A (en) * 2019-08-28 2019-11-26 上海寰创通信科技股份有限公司 A kind of method for building up of file store
CN110502521B (en) * 2019-08-28 2023-05-09 上海寰创通信科技股份有限公司 Method for establishing archive
CN111563778A (en) * 2020-05-12 2020-08-21 北京口袋财富信息科技有限公司 Information pushing method and device
CN111563778B (en) * 2020-05-12 2021-08-03 北京口袋财富信息科技有限公司 Information pushing method and device
CN116089907A (en) * 2023-04-13 2023-05-09 民航成都信息技术有限公司 Fusion method and device of aviation multi-source data, electronic equipment and storage medium
CN116089907B (en) * 2023-04-13 2023-06-23 民航成都信息技术有限公司 Fusion method and device of aviation multi-source data, electronic equipment and storage medium

Also Published As

Publication number Publication date
CN106709805B (en) 2020-09-25

Similar Documents

Publication Publication Date Title
CN106709805A (en) Method and system for acquiring user income data
CN102857493B (en) Content filtering method and device
CN112307762B (en) Search result sorting method and device, storage medium and electronic device
CN105224606A (en) A kind of disposal route of user ID and device
US11704682B2 (en) Pre-processing financial market data prior to machine learning training
CN111382956A (en) Enterprise group relationship mining method and device
CN110288193A (en) Mission Monitor processing method, device, computer equipment and storage medium
CN111881302A (en) Bank public opinion analysis method and system based on knowledge graph
CN106557558A (en) A kind of data analysing method and device
CN110009416A (en) A kind of system based on big data cleaning and AI precision marketing
CN107832333A (en) Method and system based on distributed treatment and DPI data structure user network data fingerprint
CN107274141A (en) A kind of event-handling method and the network equipment
CN111061837A (en) Topic identification method, device, equipment and medium
CN115423578A (en) Bidding method and system based on micro-service containerization cloud platform
CN110362607A (en) Abnormal number identification method, device, computer equipment and storage medium
CN108648017B (en) User requirement matching method, device, equipment and storage medium easy to expand
CN105681287A (en) Screening rule based user service allocation screening method
CN111831817A (en) Questionnaire generation and analysis method and device, computer equipment and readable storage medium
CN107122464A (en) A kind of aid decision-making system and method
CN107277095A (en) session dividing method and device
CN108171417B (en) Planting task adjusting method, electronic device and storage medium
CN114840183A (en) Micro front end adjusting method and device based on user behaviors
CN106909545A (en) A kind of method and apparatus of the attaching information for determining user
TW202006617A (en) Cloud self-service analysis platform and analysis method thereof
CN108287834A (en) Method, apparatus and computing device for pushed information

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant