CN106709805A - Method and system for acquiring user income data - Google Patents
Method and system for acquiring user income data Download PDFInfo
- Publication number
- CN106709805A CN106709805A CN201610493459.9A CN201610493459A CN106709805A CN 106709805 A CN106709805 A CN 106709805A CN 201610493459 A CN201610493459 A CN 201610493459A CN 106709805 A CN106709805 A CN 106709805A
- Authority
- CN
- China
- Prior art keywords
- data
- user
- information
- account
- data source
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q40/00—Finance; Insurance; Tax strategies; Processing of corporate or income taxes
- G06Q40/06—Asset management; Financial planning or analysis
Abstract
The invention discloses a method and a system for acquiring user income data. The method comprises the steps of acquiring current data released by a plurality of data sources through a plurality of distributed servers; if the data released by the plurality of data resources conflicts, performing correction on the data released by the data source according to the weight of each data source; acquiring account data information of a user; and acquiring current all income data of the user account according to the account data information of the user and the current data released by the plurality of data sources. According to the invention, the current data released by the plurality of data sources is acquired through the plurality of distributed servers, so that mass data can be captured from the plurality of data sources quickly, and the updating speed of the user income data is greatly improved; and automatic data correction is performed when the data released by the plurality of data sources conflicts, and the overall income data of the user is automatically calculated.
Description
Technical field
The present invention relates to Internet technical field, more particularly to a kind of user's avail data acquisition methods and system.
Background technology
In the prior art, when calculating user's income especially by terminals such as mobile phones, user is usually allowed to be manually entered receipts
Beneficial data, then further according to user typing data, update daily income.The major defect of do so has:User is manual
Typing avail data is cumbersome, easily error;Avail data may be incorrect and be updated not in time.And, prior art
In finance product management system it is single, can only often obtain single avail information, it is impossible to obtain all finance products of user
Whole avail datas.
The content of the invention
In view of this, the invention provides a kind of user's avail data acquisition methods, user can in time be obtained current
Whole avail datas.What the present invention was realized in:
A kind of user's avail data acquisition methods, including:
The current data that multiple data sources are issued is obtained by multiple distributed servers;
If the data of multiple data source issues have conflict, according to the weight of each data source, data source is sent out
The data of cloth are corrected;
Obtain the account data information of user;
The current data of account data information and the multiple the data source issue according to the user obtains user account
Current whole avail datas.
System is obtained present invention also offers a kind of user's avail data, including:
Current data acquisition module, the current data for obtaining multiple data source issues;
The Current data acquisition module includes:
Data acquisition unit, for obtaining the current data that multiple data sources are issued by multiple distributed servers;
Conflict prewarning unit, for judging the data of multiple data source issues with the presence or absence of conflict;
Data correction unit, during for the data in multiple data source issues with the presence or absence of conflict, according to every number
According to the weight in source, the data to data source issue are corrected;
Subscriber information management module, the account data information for obtaining user;
Avail data acquisition module, issues for the account data information according to the user and the multiple data source
Current data obtains current whole avail datas of user account.
Implement the present invention, have the advantages that:
(1) user's avail data acquisition methods that the present invention is provided, first, obtain the current number of multiple data source issues
According to;These current datas include the price of various finance products of multiple data source issues;Secondly, the account data of user is obtained
Information;The account data information of user includes user's account data information related to finance and money management, such as account class, account
The amount of money;Finally, according to the user the current of account that account data information and the current data calculates user is all received
Beneficial data.The invention provides the method to various finance product integration managements, different finance product integral benefits are calculated automatically,
Efficiently solve user's avail data that the single varieties of finance product management tool of the prior art management cause not complete
Whole, not system problem.And the present invention can automatically update the daily avail information of user, provide the user with complete, comprehensive
Avail information.
(2) present invention obtains the current data that multiple data sources are issued by multiple distributed servers, due to using many
Individual distributed server captures data, rapidly can capture mass data from multiple data sources, drastically increases user's receipts
The renewal speed of beneficial data, and due to that using multiple distributed servers, can obtain comprehensive data, is easy to system, complete
Ground calculates the avail data of user.
(3) because the information that multiple data sources are issued to same finance product may be inconsistent, or it is multiple numbers
There is conflict in the data issued according to source, the present invention is when the data of multiple data source issues have conflict that is, multiple
When the financing data of data source crawl are clashed, correcting algorithm is intersected by multi-data source, according to the weight of each data source,
The data obtained from data source are corrected, the correctness of data is captured from multi-data source so as to ensure that.
(4) according to the correction result, dynamic adjustment updates the weight of each data source, increases adopted times the present invention
The weight of many data sources, reduces the weight of the few data source of adopted times, so as to improve the reliability of the current data of acquisition
Property.
(5) due to each finance product one account of correspondence of user, the present invention obtains subscription client association first
Information flow, the account data information of user is obtained according to described information stream, and account data information includes short message bill and mail account
It is single.The present invention can automatically obtain the finance product information of user according to the bill of user, and being manually entered financing without user produces
Product information, simplifies the flow that user obtains finance product income, improves Consumer's Experience.
(6) present invention employs the bill automatic analysis method that positioning mode is recalled based on same root, the use extracted as needed
The account data information at family, sets and extracts expression formula;According to expression formula is extracted, searched in information flow and matched with extraction expression formula
Element, by with extract that expression formula has same ancestors and element with identification feature is set to datum mark;Looked into information flow
Look for the ancestors that datum mark is nearest;In the range of ancestors, searched and the account data information association of user by the selector of CSS
Information;In the information for finding, the account data information of user is extracted by regular expression.By the way that ancestors' model is determined
Enclose, reduce seeking scope;Seeking scope is further reduced by the selector of CSS, finally using regular expression, accurately
Search the account data information of user.The method of the present invention can quickly and accurately search the account data information of user, improve
The efficiency that user account data message is extracted.
Brief description of the drawings
In order to illustrate more clearly about the embodiment of the present invention or technical scheme of the prior art and advantage, below will be to implementing
Example or the accompanying drawing to be used needed for description of the prior art are briefly described, it should be apparent that, drawings in the following description are only
Only it is some embodiments of the present invention, for those of ordinary skill in the art, on the premise of not paying creative work,
Other accompanying drawings can also be obtained according to these accompanying drawings.
Fig. 1 is the flow chart of the method for the embodiment of the present invention 1;
The flow chart of the step of Fig. 2 is the method for the embodiment of the present invention 1 S102;
The flow chart of the step of Fig. 3 is the method for the embodiment of the present invention 1 S103;
Fig. 4 is the schematic diagram of the system of the embodiment of the present invention 2;
Fig. 5 is the schematic diagram of the data acquisition unit of the system of the embodiment of the present invention 2;
Fig. 6 is the schematic diagram of the Current data acquisition module of the system of the embodiment of the present invention 2;
Fig. 7 is the schematic diagram of the weight adjustment unit of the system of the embodiment of the present invention 2;
Fig. 8 is the schematic diagram of the information analysis unit of the system of the embodiment of the present invention 2;
Fig. 9 is another schematic diagram of the system of the embodiment of the present invention 2;
Figure 10 is the structured flowchart of the terminal of the embodiment of the present invention.
Specific embodiment
Below in conjunction with the accompanying drawing in the embodiment of the present invention, the technical scheme in the embodiment of the present invention is carried out clear, complete
Site preparation is described, it is clear that described embodiment is only a part of embodiment of the invention, rather than whole embodiments.It is based on
Embodiment in the present invention, those of ordinary skill in the art obtained on the premise of creative work is not made it is all its
His embodiment, belongs to the scope of protection of the invention.
Embodiment 1:
As shown in figure 1, the embodiment of the present invention 1 provides a kind of user's avail data acquisition methods, it is necessary to illustrate,
The step of flow of accompanying drawing is illustrated can perform in the computer system of such as one or more groups of computer executable instructions,
And, although logical order is shown in flow charts, but in some cases, can be performed with different from order herein
Shown or described step.
The method of the present invention is comprised the following steps:
S101, the current data that multiple data sources issues are obtained by multiple distributed servers.
Used as a kind of optional implementation method, step S101 includes:
S1011, obtain current initial data from multiple default data sources by multiple distributed servers;
Data source is to issue the data source of finance product data, for example, issue financial product, the website of finance data;This hair
Current data in bright refers to that the price of the finance products such as fund, bond, stock, foreign exchange, futures, P2P, the exchange rate etc. are related to income
Current data.
The data unrelated with account data information in S1012, the deletion initial data.
Wherein, the data unrelated with account data information include mess code, advertisement, the rubbish in data source (such as webpage)
Deng.
The present invention obtains the current data that multiple data sources are issued by multiple distributed servers, due to using multiple points
Cloth server captures data, rapidly can capture mass data from multiple data sources, drastically increases finance product number
According to renewal speed, and due to using multiple distributed servers, comprehensive finance product data can be obtained, be easy to system,
Intactly calculate user's financing income.
S102, the data of multiple data sources issues are judged with the presence or absence of conflict, if then according to each data source
Weight, the data to data source issue are corrected.
As shown in Fig. 2 used as a kind of optional implementation method, step S102 includes:
S1021, the initial weight for obtaining default each data source.The initial weight of each data source can be in step
Just set before S101 and S102, be not specifically limited.
S1022, by multiple data sources according to the packet of issue, the weight of each group of corresponding data source is added.
Specifically, multiple data sources are included according to the packet of issue:The data identical data source that will be issued
It is divided into one group.
S1023, will add up one group of maximum data of rear weight and be set to final data.
As an example it is assumed that fund data is main obtained from three data sources, three data sources are respectively funds, good everyday
Buy fund, number meter Ji Jin.Presetting the data reliability weight of three data sources is respectively:Fund 40%, buys fund well everyday
30%, number meter Ji Jin 30%.Referring to table one, three data source days grab the net value difference of China's growth (000001) fund
It is:
Table one
Unit net value | 1.0680 | 1.0680 | 1.0681 |
Accumulative net value | 3.3690 | 3.3691 | 3.3691 |
Data source | Fund everyday | Buy fund well | Number meter Ji Jin |
By multiple data sources according to the packet of issue, each group of weight of corresponding data source is added.
Specifically, the numerical value identical data source of unit net value data is divided into one group, for example, being by unit net value
1.0680 data source everyday become reconciled and buy fund and be divided into one group by fund, and corresponding weight is added:40%+30%;Unit net value is
1.0681 data source is number meter Ji Jin, and weight is 30%.
Will add up one group of maximum data of rear weight and be set to final data.
Unit net value:1.0680 (40%+30%)>1.0681 (30%);
Accumulative net value:3.3690 (40%)<3.3691 (30%+30%).
So, as shown in Table 2, the final data of unit net value is 1.0680, and the final data of accumulative net value is
3.3691。
Table two
As a kind of optional implementation method, the confidence level of data is obtained from data source in order to improve, can also included:
S1024, according to the correction result, adjust the weight of each data source.
Specifically, if the data that data source is obtained are arranged to final data, the weight of the data source is increased;If number
The data obtained according to source are not set to final data, then reduce the weight of the data source.
S103, the account data information for obtaining user.
The account data information of user includes user's account data information related to finance and money management, such as account class,
Account etc..Account class includes the fund of user's purchase, bond, stock, foreign exchange, futures, P2P, various can obtain depositing for interest
The title and species of the finance such as money or finance product.Account amount of money includes the gold of each financial and finance product that user participates in
Specified number evidence.
The account data information of user can be the information that user is manually entered.
Used as a kind of optional implementation method, step S103 includes:
The information flow of subscription client association is obtained, the account data information of user is obtained according to described information stream, it is described
Account data information includes short message bill and mail bill.
The information flow of heretofore described subscription client association, refers to subscription client reception or all letters for sending
Breath, including short message, mail, the message for receiving and sending by MSN etc..
The bill page is typically system generation, and complex structure is analyzed by conventional matching regular expressions and carried without feature
Take bill information very difficult.In order to solve the problem, as shown in figure 3, in a kind of optional implementation method, step S103 enters
One step includes:
S1031, the account data information of the user for extracting as needed, set and extract expression formula.
S1032, according to the extraction expression formula, the unit matched with the extraction expression formula is searched in described information stream
Element, will have same ancestors and element with identification feature is set to datum mark with the extraction expression formula;
S1033, the nearest ancestors of the datum mark are searched in described information stream;
S1034, in the range of the ancestors, searched and the account data information association of user by the selector of CSS
Information;
S1035, in the information for finding, by regular expression extract user account data information.
For example, user is in the bill of certain bank, there is current period minimum amount to pay:50.00.Obtain the account data letter of user
The method of breath is as follows:Setting is needed from the bill, extracts current period minimum amount to pay:50.00.
The present inventor is attempted being analyzed by regular expression and extracts " current period minimum amount to pay ", is found due to mail
The html page structures of bill are too complicated, cause the regular expression of matching also to become increasingly complex, and the program is not feasible.Separately
Outward, inventor is attempted by way of based on CSS selector, and analysis is extracted, because the html pages of mail bill are all to use
Table is laid out, and content similarity is high, and without identity, the program is not also feasible.Using the method for the present invention, energy
It is enough to solve the problem well.Specifically, method is as follows:
First, it is determined that extracting expression formula;
Secondly, find out and " minimum amount to pay " has germanus, while there is the element of identification feature, as benchmark
Point;Datum mark is " current period minimum amount to pay ".Here ancestors are the upper level form where " minimum amount to pay ".
3rd, find out both nearest identical ancestors;Because in same form, nearest identical ancestors are the table
Lattice;
4th, by CSS (Cascading Style Sheets Chineses:CSS) selector approach
" minimum amount to pay " data;Wherein, the selector of CSS can be neighboring selectors, can also be other CSS such as progeny selection device
Selector;
Finally, by regular expression, " minimum amount to pay " data are extracted.
Configuration expression formula of the invention can be as follows:
Due to each finance product one account of correspondence of user, the present invention obtains the information of subscription client association first
Stream, the account data information of user is obtained according to described information stream, and account data information includes short message bill and mail bill.This
Invention can automatically obtain the finance product information of user according to the bill of user, and finance product letter is manually entered without user
Breath, simplifies the flow that user obtains finance product income, improves Consumer's Experience.
Present invention employs the bill automatic analysis method that positioning mode is recalled based on same root, the user's for extracting as needed
Account data information, sets and extracts expression formula;According to expression formula is extracted, searched in information flow and extract the unit that expression formula is matched
Element, by with extract that expression formula has same ancestors and element with identification feature is set to datum mark;Base is searched in information flow
On schedule nearest ancestors;In the range of ancestors, the letter with the account data information association of user is searched by the selector of CSS
Breath;In the information for finding, the account data information of user is extracted by regular expression.By the way that ancestors' scope is determined,
Reduce seeking scope;Seeking scope is further reduced by the selector of CSS, finally using regular expression, is accurately looked into
Look for the account data information of user.The method of the present invention can quickly and accurately search the account data information of user, well
Solve because bill content is complicated, bill information hardly possible extraction problem;Improve the efficiency of user account data message extraction.
The current data of S104, the account data information according to the user and the multiple data source issue obtains user
Current whole avail datas of account.
Specifically, step S104 includes:What the account data information according to the user was obtained from step S102 works as
Current data corresponding with the account data information of user is searched in preceding data, each account of calculating user is corresponding current respectively
Avail data.Certainly, in order to provide the user with more intuitive avail data, can be by the current avail data of each account of user
Integrate, obtain the income summation of user account.
User's avail data acquisition methods that the present invention is provided, first, obtain the current data of multiple data source issues;This
A little current datas include the price of various finance products of multiple data source issues;Secondly, the account data information of user is obtained;
The account data information of user includes user's account data information related to finance and money management, such as account class, account amount of money;
Finally, according to the user account data information and the current data calculates current whole income numbers of the account of user
According to.The invention provides the method to various finance product integration managements, different finance product integral benefits are calculated automatically, effectively
To solve user's avail data that the single varieties of finance product management tool of the prior art management cause imperfect, no
The problem of system.And the present invention can automatically update the daily avail information of user, complete, comprehensive income letter is provided the user with
Breath.
Because the information that multiple data sources are issued to same finance product may be inconsistent, or it is multiple data sources
There is conflict in the data of issue, the present invention is when the data of multiple data source issues have conflict, that is, multiple data
When the financing data of source crawl are clashed, correcting algorithm is intersected by multi-data source, according to the weight of each data source, correction
The initial data of the finance product, the correctness of data is captured so as to ensure that from multi-data source.
The present invention updates the weight of each data source, increases adopted times many according to the correction result, dynamic adjustment
Data source weight, reduce the weight of the few data source of adopted times, so as to improve the reliability of the current data of acquisition.
Embodiment 2:
As shown in figure 4, system is obtained the invention provides a kind of user's avail data, including:
Current data acquisition module, the current data for obtaining multiple data source issues;
Current data acquisition module includes:
Data acquisition unit, for obtaining the current data that multiple data sources are issued by multiple distributed servers;
Conflict prewarning unit, for judging the data of multiple data source issues with the presence or absence of conflict;
Data correction unit, during for the data in multiple data source issues with the presence or absence of conflict, according to every number
According to the weight in source, the data to data source issue are corrected;
Subscriber information management module, the account data information for obtaining user;
Avail data acquisition module, issues for the account data information according to the user and the multiple data source
Current data obtains current whole avail datas of user account.
Used as a kind of optional embodiment, Fig. 5 is the schematic diagram of data acquisition unit of the invention, as shown in figure 5, described
Data acquisition unit includes:
Initial data obtains subelement, for obtaining current from multiple default data sources by multiple distributed servers
Initial data;
Cleaning subelement, for deleting unrelated with account data information data in the initial data.
Used as a kind of optional embodiment, Fig. 6 is another structural representation of Current data acquisition module of the invention, such as
Shown in Fig. 6, Current data acquisition module includes:
Data acquisition unit, for obtaining the current data that multiple data sources are issued by multiple distributed servers;
Conflict prewarning unit, for judging the data of multiple data source issues with the presence or absence of conflict;
Data correction unit, during for the data in multiple data source issues with the presence or absence of conflict, according to every number
According to the weight in source, the data to data source issue are corrected.
Weight adjustment unit, for according to the correction result, adjusting the weight of each data source.
Used as a kind of optional embodiment, Fig. 7 is the schematic diagram of weight adjustment unit of the invention, as shown in fig. 7, described
Weight adjustment unit includes:
Initial value sets subelement, the initial weight for obtaining default each data source;
Packet computation subunit, for by multiple data sources according to issue packet, by each group of corresponding data
The weight in source is added;
Correction subelement, final data is set to for will add up one group of maximum data of rear weight.
Used as a kind of optional embodiment, the subscriber information management module includes:Information analysis unit, uses for obtaining
The information flow of family client associate, the account data information of user, the account data packet are obtained according to described information stream
Include short message bill and mail bill.
As a kind of optional embodiment, as shown in figure 8, described information resolution unit includes:
Expression formula sets subelement, and the account data information of the user for extracting as needed sets and extracts expression formula;
Datum mark sets subelement, for according to the extraction expression formula, being searched in described information stream and the extraction
The element of expression formula matching, will have same ancestors and element with identification feature is set to benchmark with the extraction expression formula
Point;
Ancestors search subelement, for searching the nearest ancestors of the datum mark in described information stream;
Subelement is approached, in the range of the ancestors, being searched by the selector of CSS and being believed with the account data of user
Cease the information of association;
Subelement is extracted, in the information for finding, the account data information of user being extracted by regular expression.
Fig. 9 is the structured flowchart of system of the invention in a specific application scenarios.
System of the invention can apply to terminal management software, such as convenient to calculate user's in Tencent mobile phone manager
The income of all finance products.
When the number of client terminal is very huge, finance, finance product that correspondence is participated in the user of each client
Species be also diversified.If often during the avail data of one client of acquisition, server all goes to be obtained from data source
Initial data, it will huge pressure is caused to server, causes server excessively busy.The present invention is in order to solve this
Problem, the server cluster constituted using multiple distributed servers obtains the current data that multiple data sources are issued, and will obtain
The data for taking are put into database.So, when obtaining the avail data of each client, it is only necessary to extracted from database
The data of needs, realize data sharing, improve the efficiency of system, dramatically reduce the burden of server.
Data source is to issue the data source of finance product data, for example, issue financial product, the website of finance data;This hair
Current data in bright refers to that the price of the finance products such as fund, bond, stock, foreign exchange, futures, P2P, the exchange rate etc. are related to income
Current data.
Current data acquisition module of the invention is referred to as finance and money management data acquisition backstage, including data cleansing mould
Block and the regular module of data, carry out data cleansing and data are regular, for example first for the data to acquisition:Removal advertisement, unrest
The useless data such as code, obtain effective data.
Current data acquisition module can also include database, by after data cleansing module and the regular resume module of data
Data can be stored in database, database can be called finance and money management database.
Current data acquisition module also includes conflict early warning plane, for obtaining data, and the number to obtaining from database
Colliding data in carries out contrast verification, corrects and carry out conflict early warning automatically, and the data after correction are stored in into finance and money management
Database is medium to be called.Specifically, conflict early warning plane is used to judge that the data of multiple data source issues whether there is
Conflict, if so, then sending conflict early warning, finance and money management data acquisition backstage is carried out automatically after being connected to conflict early warning to colliding data
Correction.Automatically the process of correction is:According to the weight of each data source, the data to data source issue are corrected.Corrected
Journey includes:The initial weight of default each data source is obtained in advance.By multiple data sources according to the packet issued, will be every
One group of weight of corresponding data source is added.Specifically, multiple data sources are included according to the packet of issue:Will issue
Data identical data source be divided into one group.Will add up one group of maximum data of rear weight and be set to the data after correction.Correction
Data afterwards can be deposited into finance and money management database.
The confidence level of data is obtained from data source in order to improve, conflict early warning plane can also include weight adjusting module,
For according to the correction result, adjusting the weight of each data source.Specifically, if the data that data source is obtained are arranged to
Final data, then increase the weight of the data source;If the data that data source is obtained are not set to final data, reducing should
The weight of data source.
Data after renewal are stored in financial reason by conflict early warning plane after the weight adjustment for completing Data correction and data source
In wealth database.
Current data acquisition module can at predetermined intervals obtain the current data of multiple data source issues, for example
Once a day, certainly, for renewal speed avail data faster, it is also possible to improve data acquisition frequency.Because user is general
Only need to periodically check avail data, data are obtained at predetermined intervals can provide the user the avail data of needs,
Meanwhile, server only needs to periodically obtain data from data source, can save the spending of server.
Subscriber information management module is referred to as user's financing data management system, and hardware is serviced for one or more
Device.Subscriber information management module can receive the account data information of user's typing, for example, User logs in Tencent mobile phone manager account
Number system, the then finance product information of its purchase of typing, and authorize Tencent mobile phone manager to manage.In addition, authorized in user
Under the conditions of, subscriber information management module can also be used by the automatic parsing algorithm based on same root backtracking positioning mode by obtaining
The mail bill at family, short message bill, help the key of user one to import all finance products such as including stock, fund, p2p of purchase automatically
Information.User need to only log in Tencent mobile phone manager, and authorize Tencent mobile phone manager to manage its finance product, and mobile phone house keeper is just
The avail data of user can be regularly updated by Tengxun's financing income calculation system.
Specifically, subscriber information management module includes user's financing information database, and the account for storing user is believed
Breath.The account data information of user includes user's account data information related to finance and money management, such as account class, account
Deng.Etc. fund that account class is bought including user, bond, stock, foreign exchange, futures, P2P, the various deposits that can obtain interest
The title and species of finance or finance product.Account amount of money includes the amount of money number of each financial and finance product that user participates in
According to.
The account data information of user can be obtained by two ways, and the first is user's typing, subscriber information management
Module provides the interface of user's typing information, and user is input into account data information by the interface.Second is to be authorized through user
System obtain automatically.
Subscriber information management module includes information analysis unit, for after user authorizes, obtaining subscription client association
Information flow, according to described information stream obtain user account data information, the account data information include short message bill and
Mail bill.
The information flow of heretofore described subscription client association, refers to subscription client reception or all letters for sending
Breath, including short message, mail, the message for receiving and sending by MSN etc..
Described information resolution unit includes:
Expression formula sets subelement, and the account data information of the user for extracting as needed sets and extracts expression formula;
Datum mark sets subelement, for according to the extraction expression formula, being searched in described information stream and the extraction
The element of expression formula matching, will have same ancestors and element with identification feature is set to benchmark with the extraction expression formula
Point;
Ancestors search subelement, for searching the nearest ancestors of the datum mark in described information stream;
Subelement is approached, in the range of the ancestors, being searched by the selector of CSS and being believed with the account data of user
Cease the information of association;
Subelement is extracted, in the information for finding, the account data information of user being extracted by regular expression.
Avail data acquisition module, it is also possible to which income calculation engine of referred to as managing money matters, its hardware is also server.According to described
The account data information of user and the current data corresponding with the account data information of user obtained from data source, calculate user
The corresponding current avail data of each account.Certainly, in order to provide the user with more intuitive avail data, can by user each
The current avail data of account is integrated, and forms the income summation of user account.Specifically, avail data acquisition module according to
The finance product of family purchase, with reference to certain computing formula, calculates the income of all finance products of user, and integrate all receipts
Beneficial situation, unification is pushed to user, user is apparent that being born interest for its financing.Can be at predetermined intervals
User is pushed to, the time interval that avail data acquisition module can set according to user pushes avail data to user, so that
Enhancing Consumer's Experience.
Embodiment 3
Embodiments of the invention also provide a kind of terminal, the terminal can be terminal group in
Any one computer terminal.Alternatively, in the present embodiment, above computer terminal can also replace with mobile terminal
Deng terminal device.
Alternatively, in the present embodiment, during above computer terminal may be located at multiple network equipments of computer network
At least one network equipment.
Alternatively, Figure 10 is the structured flowchart of terminal according to embodiments of the present invention.As shown in Figure 10, the calculating
Machine terminal A can include:One or more (one is only shown in figure) processor 101, memory 103 and transmitting devices
105。
Wherein, memory 103 can be used to store software program and module, the short text classification such as in the embodiment of the present invention
The corresponding programmed instruction/module of method and apparatus, processor 101 is by running software program of the storage in memory 103
And module, so as to perform various function application and data processing, that is, realize above-mentioned short text classification.Memory 103 can
Including high speed random access memory, can also include nonvolatile memory, such as one or more magnetic storage device, flash memory,
Or other non-volatile solid state memories.In some instances, memory 103 can be further included relative to processor 101
Remotely located memory, these remote memories can be by network connection to terminal A.The example bag of above-mentioned network
Include but be not limited to internet, intranet, LAN, mobile radio communication and combinations thereof.
Above-mentioned transmitting device 105 is used to that data to be received or sent via a network.Above-mentioned network instantiation
May include cable network and wireless network.In an example, transmitting device 105 includes a network adapter, and it can pass through
Netting twine is connected so as to be communicated with internet or LAN with other network equipments with router.In an example, pass
Defeated device 105 is radio-frequency module, and it is used to wirelessly be communicated with internet.
Wherein, specifically, memory 103 is used to store information, the Yi Jiying of deliberate action condition and default access user
Use program.
Processor 101 can call the information and application program of the storage of memory 103 by transmitting device, following to perform
Step:
Optionally, above-mentioned processor 101 can also carry out the program code of following steps:
Obtain the current data of multiple data source issues;
Judge the data of multiple data source issues with the presence or absence of conflict;
When the data of multiple data source issues are with the presence or absence of conflict, according to the weight of each data source, to data
The data of source issue are corrected;
Obtain the account data information of user;
The current data of account data information and the multiple the data source issue according to the user obtains user account
Current whole avail datas.
Alternatively, the specific example in the present embodiment may be referred to above-described embodiment 1 to showing described in embodiment 2
Example, the present embodiment will not be repeated here.
The embodiments of the present invention are for illustration only, and the quality of embodiment is not represented.
If integrated unit in above-described embodiment is to realize in the form of SFU software functional unit and as independent product
When selling or using, can store in the storage medium that above computer can read.Based on such understanding, skill of the invention
The part or all or part of the technical scheme that art scheme substantially contributes to prior art in other words can be with soft
The form of part product is embodied, and the computer software product is stored in storage medium, including some instructions are used to so that one
Platform or multiple stage computers equipment (can be personal computer, server or network equipment etc.) perform each embodiment institute of the invention
State all or part of step of method.
In the above embodiment of the present invention, the description to each embodiment all emphasizes particularly on different fields, and does not have in certain embodiment
The part of detailed description, may refer to the associated description of other embodiment.
In several embodiments provided herein, it should be understood that disclosed client, can be by other sides
Formula is realized.Wherein, device embodiment described above is only schematical, such as division of described unit, only one
Kind of division of logic function, can there is other dividing mode when actually realizing, such as multiple units or component can combine or
Another system is desirably integrated into, or some features can be ignored, or do not perform.It is another, it is shown or discussed it is mutual it
Between coupling or direct-coupling or communication connection can be the INDIRECT COUPLING or communication link of unit or module by some interfaces
Connect, can be electrical or other forms.
The unit that is illustrated as separating component can be or may not be it is physically separate, it is aobvious as unit
The part for showing can be or may not be physical location, you can with positioned at a place, or can also be distributed to multiple
On NE.Some or all of unit therein can be according to the actual needs selected to realize the mesh of this embodiment scheme
's.
In addition, during each functional unit in each embodiment of the invention can be integrated in a processing unit, it is also possible to
It is that unit is individually physically present, it is also possible to which two or more units are integrated in a unit.Above-mentioned integrated list
Unit can both be realized in the form of hardware, it would however also be possible to employ the form of SFU software functional unit is realized.
The above is only the preferred embodiment of the present invention, it is noted that for the ordinary skill people of the art
For member, under the premise without departing from the principles of the invention, some improvements and modifications can also be made, these improvements and modifications also should
It is considered as protection scope of the present invention.
The above is the preferred embodiment of the present invention, it is noted that for those skilled in the art
For, under the premise without departing from the principles of the invention, some improvements and modifications can also be made, these improvements and modifications are also considered as
Protection scope of the present invention.
Claims (14)
1. a kind of user's avail data acquisition methods, it is characterised in that including:
The current data that multiple data sources are issued is obtained by multiple distributed servers;
If there is conflict in the data of multiple data source issues, according to the weight of each data source, to data source issue
Data are corrected;
Obtain the account data information of user;
The current data of account data information and the multiple the data source issue according to the user obtains working as user account
Preceding whole avail datas.
2. user's avail data acquisition methods according to claim 1, it is characterised in that described according to each data source
Weight, the data to data source issue are corrected, including:
Obtain the initial weight of default each data source;
By multiple data sources according to the packet of issue, each group of weight of corresponding data source is added;
Will add up one group of maximum data of rear weight and be set to final data.
3. user's avail data acquisition methods according to claim 2, it is characterised in that described according to each data source
Weight, after being corrected to the data that data source is issued, also includes:
According to the correction result, the weight of each data source is adjusted.
4. user's avail data acquisition methods according to claim 3, it is characterised in that described according to the correction knot
Really, the weight of each data source is adjusted, including:
If the data that data source is obtained are arranged to final data, increase the weight of the data source;If the number that data source is obtained
According to final data is not set to, then reduce the weight of the data source.
5. user's avail data acquisition methods according to claim 1, it is characterised in that described distributed to be taken by multiple
Business device obtains the current data of multiple data source issues, including:
By multiple distributed servers current initial data is obtained from multiple default data sources;
Delete unrelated with account data information data in the initial data.
6. user's avail data acquisition methods according to claim 1, it is characterised in that the account number of the acquisition user
It is believed that breath, including:
The information flow of subscription client association is obtained, the account data information of user, the account are obtained according to described information stream
Data message includes short message bill and mail bill.
7. user's avail data acquisition methods according to claim 6, it is characterised in that described to be obtained according to described information stream
The account data information at family is taken, including:
The account data information of the user for extracting as needed, sets and extracts expression formula;
According to the extraction expression formula, the element matched with the extraction expression formula is searched in described information stream, will with it is described
Extraction expression formula has same ancestors and the element with identification feature is set to datum mark;
The nearest ancestors of the datum mark are searched in described information stream;
In the range of the ancestors, the information with the account data information association of user is searched by the selector of CSS;
In the information for finding, the account data information of user is extracted by regular expression.
8. a kind of user's avail data obtains system, it is characterised in that including:
Current data acquisition module, the current data for obtaining multiple data source issues;
The Current data acquisition module includes:
Data acquisition unit, for obtaining the current data that multiple data sources are issued by multiple distributed servers;
Conflict prewarning unit, for judging the data of multiple data source issues with the presence or absence of conflict;
Data correction unit, during for the data in multiple data source issues with the presence or absence of conflict, according to each data source
Weight, to data source issue data be corrected;
Subscriber information management module, the account data information for obtaining user;
Avail data acquisition module, for the account data information according to the user and the multiple data source issue it is current
Data obtain current whole avail datas of user account.
9. user's avail data according to claim 8 obtains system, it is characterised in that the Current data acquisition module
Also include weight adjustment unit, the weight adjustment unit is used for according to the correction result, adjusts the weight of each data source.
10. user's avail data according to claim 8 obtains system, it is characterised in that the weight adjustment unit bag
Include:
Initial value sets subelement, the initial weight for obtaining default each data source;
Packet computation subunit, for by multiple data sources according to the packet of issue, by each group of corresponding data source
Weight is added;
Correction subelement, final data is set to for will add up one group of maximum data of rear weight.
11. user's avail datas according to claim 9 obtain system, it is characterised in that the weight adjustment unit is entered
One step is used for:If the data that data source is obtained are arranged to final data, increase the weight of the data source;If data source is obtained
Data be not set to final data, then reduce the weight of the data source.
12. user's avail datas according to claim 8 obtain system, it is characterised in that the data acquisition unit bag
Include:
Initial data obtains subelement, for obtaining current original from multiple default data sources by multiple distributed servers
Beginning data;
Cleaning subelement, for deleting unrelated with account data information data in the initial data.
13. user's avail datas according to claim 8 obtain system, it is characterised in that the subscriber information management mould
Block includes:Information analysis unit, the information flow for obtaining subscription client association, the account of user is obtained according to described information stream
User data information, the account data information includes short message bill and mail bill.
14. user's avail datas according to claim 13 obtain system, it is characterised in that described information resolution unit bag
Include:
Expression formula sets subelement, and the account data information of the user for extracting as needed sets and extracts expression formula;
Datum mark sets subelement, and expression is extracted with described for according to the extraction expression formula, being searched in described information stream
The element of formula matching, will have same ancestors and element with identification feature is set to datum mark with the extraction expression formula;
Ancestors search subelement, for searching the nearest ancestors of the datum mark in described information stream;
Subelement is approached, in the range of the ancestors, being searched by the selector of CSS and being closed with the account data information of user
The information of connection;
Subelement is extracted, in the information for finding, the account data information of user being extracted by regular expression.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201610493459.9A CN106709805B (en) | 2016-06-29 | 2016-06-29 | User income data acquisition method and system |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201610493459.9A CN106709805B (en) | 2016-06-29 | 2016-06-29 | User income data acquisition method and system |
Publications (2)
Publication Number | Publication Date |
---|---|
CN106709805A true CN106709805A (en) | 2017-05-24 |
CN106709805B CN106709805B (en) | 2020-09-25 |
Family
ID=58939748
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201610493459.9A Active CN106709805B (en) | 2016-06-29 | 2016-06-29 | User income data acquisition method and system |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN106709805B (en) |
Cited By (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108764348A (en) * | 2018-05-30 | 2018-11-06 | 口口相传(北京)网络技术有限公司 | Collecting method based on multiple data sources and system |
CN110502521A (en) * | 2019-08-28 | 2019-11-26 | 上海寰创通信科技股份有限公司 | A kind of method for building up of file store |
CN110517083A (en) * | 2019-08-27 | 2019-11-29 | 秒针信息技术有限公司 | A kind of method and device of determining customer attribute information |
CN111563778A (en) * | 2020-05-12 | 2020-08-21 | 北京口袋财富信息科技有限公司 | Information pushing method and device |
CN116089907A (en) * | 2023-04-13 | 2023-05-09 | 民航成都信息技术有限公司 | Fusion method and device of aviation multi-source data, electronic equipment and storage medium |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20050171884A1 (en) * | 2004-02-04 | 2005-08-04 | Research Affiliates, Llc | Non-capitalization weighted indexing system, method and computer program product |
CN101576990A (en) * | 2008-05-06 | 2009-11-11 | 中国建设银行股份有限公司 | Banking service processing system |
CN103593368A (en) * | 2012-08-16 | 2014-02-19 | 深圳市世纪光速信息技术有限公司 | Method, server, terminal and system for selecting data sources |
CN104978688A (en) * | 2014-04-02 | 2015-10-14 | 陈衡 | Unbidden fund value increasing device, unbidden fund value increasing method and financing system |
CN105323654A (en) * | 2014-08-05 | 2016-02-10 | 优视科技有限公司 | Method and device for displaying content data from network |
CN105427166A (en) * | 2015-11-13 | 2016-03-23 | 中国建设银行股份有限公司 | Bank account type detection method and system |
-
2016
- 2016-06-29 CN CN201610493459.9A patent/CN106709805B/en active Active
Patent Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20050171884A1 (en) * | 2004-02-04 | 2005-08-04 | Research Affiliates, Llc | Non-capitalization weighted indexing system, method and computer program product |
CN101576990A (en) * | 2008-05-06 | 2009-11-11 | 中国建设银行股份有限公司 | Banking service processing system |
CN103593368A (en) * | 2012-08-16 | 2014-02-19 | 深圳市世纪光速信息技术有限公司 | Method, server, terminal and system for selecting data sources |
CN104978688A (en) * | 2014-04-02 | 2015-10-14 | 陈衡 | Unbidden fund value increasing device, unbidden fund value increasing method and financing system |
CN105323654A (en) * | 2014-08-05 | 2016-02-10 | 优视科技有限公司 | Method and device for displaying content data from network |
CN105427166A (en) * | 2015-11-13 | 2016-03-23 | 中国建设银行股份有限公司 | Bank account type detection method and system |
Non-Patent Citations (1)
Title |
---|
孙飞: ""基于DOM节点文本密度的网页核心块抽取算法研究"", 《中国优秀硕士学位论文全文数据库 信息科技辑》 * |
Cited By (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108764348A (en) * | 2018-05-30 | 2018-11-06 | 口口相传(北京)网络技术有限公司 | Collecting method based on multiple data sources and system |
CN108764348B (en) * | 2018-05-30 | 2020-07-10 | 口口相传(北京)网络技术有限公司 | Data acquisition method and system based on multiple data sources |
CN110517083A (en) * | 2019-08-27 | 2019-11-29 | 秒针信息技术有限公司 | A kind of method and device of determining customer attribute information |
CN110502521A (en) * | 2019-08-28 | 2019-11-26 | 上海寰创通信科技股份有限公司 | A kind of method for building up of file store |
CN110502521B (en) * | 2019-08-28 | 2023-05-09 | 上海寰创通信科技股份有限公司 | Method for establishing archive |
CN111563778A (en) * | 2020-05-12 | 2020-08-21 | 北京口袋财富信息科技有限公司 | Information pushing method and device |
CN111563778B (en) * | 2020-05-12 | 2021-08-03 | 北京口袋财富信息科技有限公司 | Information pushing method and device |
CN116089907A (en) * | 2023-04-13 | 2023-05-09 | 民航成都信息技术有限公司 | Fusion method and device of aviation multi-source data, electronic equipment and storage medium |
CN116089907B (en) * | 2023-04-13 | 2023-06-23 | 民航成都信息技术有限公司 | Fusion method and device of aviation multi-source data, electronic equipment and storage medium |
Also Published As
Publication number | Publication date |
---|---|
CN106709805B (en) | 2020-09-25 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN106709805A (en) | Method and system for acquiring user income data | |
CN102857493B (en) | Content filtering method and device | |
CN112307762B (en) | Search result sorting method and device, storage medium and electronic device | |
CN105224606A (en) | A kind of disposal route of user ID and device | |
US11704682B2 (en) | Pre-processing financial market data prior to machine learning training | |
CN111382956A (en) | Enterprise group relationship mining method and device | |
CN110288193A (en) | Mission Monitor processing method, device, computer equipment and storage medium | |
CN111881302A (en) | Bank public opinion analysis method and system based on knowledge graph | |
CN106557558A (en) | A kind of data analysing method and device | |
CN110009416A (en) | A kind of system based on big data cleaning and AI precision marketing | |
CN107832333A (en) | Method and system based on distributed treatment and DPI data structure user network data fingerprint | |
CN107274141A (en) | A kind of event-handling method and the network equipment | |
CN111061837A (en) | Topic identification method, device, equipment and medium | |
CN115423578A (en) | Bidding method and system based on micro-service containerization cloud platform | |
CN110362607A (en) | Abnormal number identification method, device, computer equipment and storage medium | |
CN108648017B (en) | User requirement matching method, device, equipment and storage medium easy to expand | |
CN105681287A (en) | Screening rule based user service allocation screening method | |
CN111831817A (en) | Questionnaire generation and analysis method and device, computer equipment and readable storage medium | |
CN107122464A (en) | A kind of aid decision-making system and method | |
CN107277095A (en) | session dividing method and device | |
CN108171417B (en) | Planting task adjusting method, electronic device and storage medium | |
CN114840183A (en) | Micro front end adjusting method and device based on user behaviors | |
CN106909545A (en) | A kind of method and apparatus of the attaching information for determining user | |
TW202006617A (en) | Cloud self-service analysis platform and analysis method thereof | |
CN108287834A (en) | Method, apparatus and computing device for pushed information |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |