CN108563706A - A kind of collection big data intelligent service system and its operation method - Google Patents
A kind of collection big data intelligent service system and its operation method Download PDFInfo
- Publication number
- CN108563706A CN108563706A CN201810256557.XA CN201810256557A CN108563706A CN 108563706 A CN108563706 A CN 108563706A CN 201810256557 A CN201810256557 A CN 201810256557A CN 108563706 A CN108563706 A CN 108563706A
- Authority
- CN
- China
- Prior art keywords
- data
- database
- service system
- intelligent service
- server
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Landscapes
- Telephonic Communication Services (AREA)
Abstract
Due to the big data intelligent service system of existing collection platform, the scarce capacity of dynamic data is handled, and data are unable to get cross validation, resulting in the efficiency of collection personnel greatly reduces, very big so as to cause the pressure cost of collection company.In order to solve the problems, such as this, a kind of collection big data intelligent service system of present invention offer and its operation method.The collection big data intelligent service system of the present invention and its purpose of operation method extract through scene and source analysis from a large amount of dynamic interaction data and contain the efficiency for improving collection in potential information therein.
Description
Technical field
The invention belongs to big data digging technology fields, more specifically, to be related to a kind of acquisition of multi-user data shared
The system and method for polymerization are more specifically related to collection big data intelligent service system and its operation method.
Background technology
Data mining, also known as Knowledge Discovery in Database (Khowledge Discovery from Datebase, letter
Claim KDD), it is one and extracts the complex process for excavating the knowledge such as unknown, valuable pattern or rule from mass data.
The big data intelligent service system of existing collection platform, has the following defects:(1) static data can only be handled,
Handling the scarce capacity and source and Scene Blur of dynamic data so that user must settle at one go when carrying out data mining,
Precision is not high;(2) data are unable to get cross validation so that user cannot accurately learn source.Due to this 2 points of defect, just
So that the efficiency of collection personnel greatly reduces, it is very big so as to cause the pressure cost of collection company.
Invention content
In view of this, a kind of collection big data intelligent service system of present invention offer and its operation method.The present invention's urges
The purpose of receipts big data intelligent service system and its operation method is from a large amount of dynamic interaction data, through scene and source point
Analysis extracts the efficiency contained and improve collection in potential information therein.
A kind of collection big data intelligent service system comprising:
Data acquisition interface:The data acquisition interface can be connected on telephone traffic system or other database interfaces,
In database during these initial data are uploaded onto the server;
Data inquiry module:Request data and subscriber identity information are submitted in server together by request event,
Server returns to requesting data information by search index database, while the data of server record request and identity information arrive
Cache database;
Data cache module:Request data for preserving request time and identity information;
Data disaply moudle:When returning to requesting data information, the data information inquired will be indexed and carry out needing to mark
And splicing, form the more complete meaning of one's words;
Data analysis module:The initial data and data inquiry module rope of server are periodically transferred to data acquisition interface
The data for drawing inquiry are cleaned and apply mechanically corresponding model evaluation module and analyzed;
Model evaluation module:Personnel query identity information and inquiry data are unfolded into information, extract keyword, structure is special
Levy marking algorithm;
Data Identification module:Spy of the data by data analysis module analysis, model evaluation module in cache database
After levying algorithm calculating, data characteristics is identified;
Core database:By the data storage after mark to core database.
Wherein, data acquisition interface can be used for acquiring initial data, and initial data includes the voice communication of telephone traffic system
Content, the beginning and ending time of voice, the object of dialing, the identity card of dialing object, phone number, address, other contact persons, whether
It connects, dial number, operator message, the amount of the loan, payment period, related party loans mechanism, bank etc..
The invention further relates to the operation methods of collection big data intelligent service system comprising following steps:
S1:The gathered data of report and backstage reptile is actively put forward by personnel query query event, personnel query, and will
Data are deposited into the database of server, and collected data information includes the voice communication content of telephone traffic system, voice
Whether the beginning and ending time object of dialing, the identity card of dialing object, phone number, address, other contact persons, connects, dials time
Number, operator message, the amount of the loan, payment period, related party loans mechanism, bank etc.;
S2:It is for statistical analysis to collected data in S1 by python language, to user's exhibition in a manner of label
The basic description information of existing data;
S3:Pass through the data screenings models such as python language encapsulation classification, cluster, association and time series;Setting is provided
Corresponding model analysis parameter;
S4:The result of the data in database in a manner of list etc. is presented to user by python language, by looking into
Interface is ask data result can be shown with json formats;
S5:A variety of model evaluation methods such as accurate rate, error rate and confusion matrix are provided by using python language, and
The parameter of policy engine module is provided.
Further, step S1 includes step in detail below:
S11 is identified classification by pre-set rule and policy, is formed altogether when specific people's query event
Property data be written to data buffer storage to row, by analyzing and Supplementing Data eventually enters into the database of data storage server;
S12, when personnel query actively puies forward count off evidence, the common denominator data that is formed by actively proposing count off according to classification is identified
It is written in the database of data storage server;
S13 is written to when the gathered data of backstage reptile by carrying out Data acquisition and issuance to specific data source
In the database of data storage server.
Further, in step s 2, the statistical analysis of data includes following procedure:Database in server is carried out
The processing of shortage of data item, duplicate data processing, noise data processing and dealing of abnormal data etc..
More specifically, dealing of abnormal data includes:To ambiguous information, illegal ID card No. and cell-phone number
Code, the meaningless data submitted, the data not timely updated are cleaned.
Further, in step S1~S5, it is desirable to provide customize the work(of Mining Platform in a manner of editing configuration file
The user interface of energy.
The collection big data intelligent service system that the present invention is built is a kind of inquiring, can identify, can save based on Web
The elastic data communal space.Advantage of the invention is that for data mining it is continuous repeat, constantly modification, continuous iteration
Complexity provides a kind of elastic data excavation shared platform of facing multiple users cooperation, greatly improves accurate data degree, number
According to the process for being a continuous inquiry, constantly sharing, constantly identifying.
Specific implementation mode
Case 1 is embodied:
A kind of collection big data intelligent service system comprising:
Data acquisition interface:The data acquisition interface can be connected on telephone traffic system or other database interfaces,
In database during these initial data are uploaded onto the server;
Data inquiry module:Request data and subscriber identity information are submitted in server together by request event,
Server returns to requesting data information by search index database, while the data of server record request and identity information arrive
Cache database;
Data cache module:Request data for preserving request time and identity information;
Data disaply moudle:When returning to requesting data information, the data information inquired will be indexed and carry out needing to mark
And splicing, form the more complete meaning of one's words;
Data analysis module:The initial data and data inquiry module rope of server are periodically transferred to data acquisition interface
The data for drawing inquiry are cleaned and apply mechanically corresponding model evaluation module and analyzed;
Model evaluation module:Personnel query identity information and inquiry data are unfolded into information, extract keyword, structure is special
Levy marking algorithm;
Data Identification module:Spy of the data by data analysis module analysis, model evaluation module in cache database
After levying algorithm calculating, data characteristics is identified;
Core database:By the data storage after mark to core database.
Wherein, data acquisition interface can be used for acquiring initial data, and initial data includes the voice communication of telephone traffic system
Content, the beginning and ending time of voice, the object of dialing, the identity card of dialing object, phone number, address, other contact persons, whether
It connects, dial number, operator message, the amount of the loan, payment period, related party loans mechanism, bank etc..
The invention further relates to the operation methods of collection big data intelligent service system comprising following steps:
S1:The gathered data of report and backstage reptile is actively put forward by personnel query query event, personnel query, and will
Data are deposited into the database of server, and collected data information includes the voice communication content of telephone traffic system, voice
Whether the beginning and ending time object of dialing, the identity card of dialing object, phone number, address, other contact persons, connects, dials time
Number, operator message, the amount of the loan, payment period, related party loans mechanism, bank etc.;
S2:It is for statistical analysis to collected data in S1 by python language, to user's exhibition in a manner of label
The basic description information of existing data;
S3:Pass through the data screenings models such as python language encapsulation classification, cluster, association and time series;Setting is provided
Corresponding model analysis parameter;
S4:The result of the data in database in a manner of list etc. is presented to user by python language, by looking into
Interface is ask data result can be shown with json formats;
S5:A variety of model evaluation methods such as accurate rate, error rate and confusion matrix are provided by using python language, and
The parameter of policy engine module is provided.
Further, step S1 includes step in detail below:
S11 is identified classification by pre-set rule and policy, is formed altogether when specific people's query event
Property data be written to data buffer storage to row, by analyzing and Supplementing Data eventually enters into the database of data storage server;
S12, when personnel query actively puies forward count off evidence, the common denominator data that is formed by actively proposing count off according to classification is identified
It is written in the database of data storage server;
S13 is written to when the gathered data of backstage reptile by carrying out Data acquisition and issuance to specific data source
In the database of data storage server.
Further, in step s 2, the statistical analysis of data further includes following procedure:To the database in server into
The processing of row shortage of data item, duplicate data processing, noise data processing and dealing of abnormal data etc..
More specifically, dealing of abnormal data includes:To ambiguous information, illegal ID card No. and cell-phone number
Code, the meaningless data submitted, the data not timely updated are cleaned.
Further, in step S1~S5, it is desirable to provide customize the work(of Mining Platform in a manner of editing configuration file
The user interface of energy.
Several embodiments of the invention above described embodiment only expresses, the description thereof is more specific and detailed, but simultaneously
Cannot the limitation to the scope of the claims of the present invention therefore be interpreted as.It should be pointed out that for those of ordinary skill in the art
For, without departing from the inventive concept of the premise, various modifications and improvements can be made, these belong to the guarantor of the present invention
Protect range.Therefore, the protection domain of patent of the present invention should be determined by the appended claims.
Claims (6)
1. a kind of collection big data intelligent service system comprising:
Data acquisition interface:The data acquisition interface can be connected on telephone traffic system or other database interfaces, by this
In database during initial data are uploaded onto the server a bit;
Data inquiry module:Request data and subscriber identity information are submitted in server together by request event, serviced
Device returns to requesting data information by search index database, while server records the data asked and identity information to caching
Database;
Data cache module:Request data for preserving request time and identity information;
Data disaply moudle:When returning to requesting data information, the data information inquired will be indexed and carry out needing label and spelling
It connects, forms the more complete meaning of one's words;
Data analysis module:The initial data of server periodically is transferred to data acquisition interface and data query module index is looked into
The data of inquiry are cleaned and apply mechanically corresponding model evaluation module and analyzed;
Model evaluation module:Personnel query identity information and inquiry data are unfolded into information, extract keyword, construction feature mark
Know algorithm;
Data Identification module:Feature of the data by data analysis module analysis, model evaluation module in cache database is calculated
After method calculates, data characteristics is identified;
Core database:By the data storage after mark to core database.
2. a kind of operation method of collection big data intelligent service system comprising following steps:
S1:The gathered data of report and backstage reptile is actively put forward by personnel query query event, personnel query, and by data
It is deposited into the database of server, collected data information includes the start-stop of the voice communication content, voice of telephone traffic system
Time, the object of dialing, the identity card of dialing object, phone number, address, other contact persons, whether connect, the number that dials,
Operator message, the amount of the loan, payment period, related party loans mechanism, bank;
S2:It is for statistical analysis to collected data in S1 by python language, show number to user in a manner of label
According to basic description information;
S3:Pass through the data screenings models such as python language encapsulation classification, cluster, association and time series;It is corresponding to provide setting
Model analysis parameter;
S4:The result of the data in database in a manner of list etc. is presented to user by python language, is connect by inquiry
Mouth can be shown data result with json formats;
S5:A variety of model evaluation methods such as accurate rate, error rate and confusion matrix are provided by using python language, and are provided
The parameter of policy engine module.
3. the operation method of collection big data intelligent service system as claimed in claim 2, it is characterised in that:The step
Step S1 includes step in detail below:
S11 is identified classification by pre-set rule and policy, forms general character number when specific people's query event
According to being written to data buffer storage to row, by analyzing and Supplementing Data eventually enters into the database of data storage server;
S12, when personnel query actively puies forward count off evidence, the common denominator data that classification formation is identified by actively putting forward count off evidence is written
Into the database of data storage server;
S13 is written to data when the gathered data of backstage reptile by carrying out Data acquisition and issuance to specific data source
In the database of storage server.
4. the operation method of collection big data intelligent service system as claimed in claim 2, it is characterised in that:In step S2
In, the statistical analysis of data further includes carrying out the processing of shortage of data item, duplicate data processing, noise to the database in server
Data processing and dealing of abnormal data.
5. the operation method of collection big data intelligent service system as claimed in claim 4, it is characterised in that:At abnormal data
Reason includes:To the meaningless data of ambiguous information, illegal ID card No. and phone number, submission, in time more
New data are cleaned.
6. the operation method of the collection big data intelligent service system as described in any one of claim 2~5, feature exist
In:In step S1~S5, it is desirable to provide customize the user interface of the function of Mining Platform in a manner of editing configuration file.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201810256557.XA CN108563706A (en) | 2018-03-27 | 2018-03-27 | A kind of collection big data intelligent service system and its operation method |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201810256557.XA CN108563706A (en) | 2018-03-27 | 2018-03-27 | A kind of collection big data intelligent service system and its operation method |
Publications (1)
Publication Number | Publication Date |
---|---|
CN108563706A true CN108563706A (en) | 2018-09-21 |
Family
ID=63533314
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201810256557.XA Pending CN108563706A (en) | 2018-03-27 | 2018-03-27 | A kind of collection big data intelligent service system and its operation method |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN108563706A (en) |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109711984A (en) * | 2019-01-23 | 2019-05-03 | 北京市天元网络技术股份有限公司 | Risk monitoring and control method and device before a kind of loan based on collection |
CN111291042A (en) * | 2019-12-23 | 2020-06-16 | 创意信息技术股份有限公司 | Power data processing system and method for power supply service |
CN115982503A (en) * | 2023-02-07 | 2023-04-18 | 梁礼津 | Website information acquisition method and system based on cloud platform |
Citations (12)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2006190060A (en) * | 2005-01-06 | 2006-07-20 | Kyocera Mita Corp | Database retieval method, database retieval program, and original processor |
CN1877583A (en) * | 2006-07-12 | 2006-12-13 | 百度在线网络技术(北京)有限公司 | Accessing identification index system and accessing identification index library generation method |
CN1877582A (en) * | 2006-07-12 | 2006-12-13 | 百度在线网络技术(北京)有限公司 | Advertisement information retrieval system and method therefor |
CN101227687A (en) * | 2007-01-16 | 2008-07-23 | 天津中环腾梁技术有限公司 | Processing method of TEMS mobile network testing data |
CN102254265A (en) * | 2010-05-18 | 2011-11-23 | 北京首家通信技术有限公司 | Rich media internet advertisement content matching and effect evaluation method |
CN103853821A (en) * | 2014-02-21 | 2014-06-11 | 河海大学 | Method for constructing multiuser collaboration oriented data mining platform |
CN104462340A (en) * | 2014-12-04 | 2015-03-25 | 华为技术有限公司 | Target object information search method and device |
CN105808605A (en) * | 2014-12-31 | 2016-07-27 | 北京奇虎科技有限公司 | Search log combination method and system |
CN105897667A (en) * | 2015-10-22 | 2016-08-24 | 乐视致新电子科技(天津)有限公司 | Device access history tracking method, apparatus, server and system |
CN106682925A (en) * | 2015-11-06 | 2017-05-17 | 北京奇虎科技有限公司 | Method and device for recommending advertisement content |
CN107016528A (en) * | 2017-04-11 | 2017-08-04 | 连云港大锤网络科技有限公司 | A kind of internet collection big data intelligent service system and its operation method |
CN107609019A (en) * | 2017-08-07 | 2018-01-19 | 国网辽宁省电力有限公司 | A kind of method that corporate information based on internet public information obtains |
-
2018
- 2018-03-27 CN CN201810256557.XA patent/CN108563706A/en active Pending
Patent Citations (12)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2006190060A (en) * | 2005-01-06 | 2006-07-20 | Kyocera Mita Corp | Database retieval method, database retieval program, and original processor |
CN1877583A (en) * | 2006-07-12 | 2006-12-13 | 百度在线网络技术(北京)有限公司 | Accessing identification index system and accessing identification index library generation method |
CN1877582A (en) * | 2006-07-12 | 2006-12-13 | 百度在线网络技术(北京)有限公司 | Advertisement information retrieval system and method therefor |
CN101227687A (en) * | 2007-01-16 | 2008-07-23 | 天津中环腾梁技术有限公司 | Processing method of TEMS mobile network testing data |
CN102254265A (en) * | 2010-05-18 | 2011-11-23 | 北京首家通信技术有限公司 | Rich media internet advertisement content matching and effect evaluation method |
CN103853821A (en) * | 2014-02-21 | 2014-06-11 | 河海大学 | Method for constructing multiuser collaboration oriented data mining platform |
CN104462340A (en) * | 2014-12-04 | 2015-03-25 | 华为技术有限公司 | Target object information search method and device |
CN105808605A (en) * | 2014-12-31 | 2016-07-27 | 北京奇虎科技有限公司 | Search log combination method and system |
CN105897667A (en) * | 2015-10-22 | 2016-08-24 | 乐视致新电子科技(天津)有限公司 | Device access history tracking method, apparatus, server and system |
CN106682925A (en) * | 2015-11-06 | 2017-05-17 | 北京奇虎科技有限公司 | Method and device for recommending advertisement content |
CN107016528A (en) * | 2017-04-11 | 2017-08-04 | 连云港大锤网络科技有限公司 | A kind of internet collection big data intelligent service system and its operation method |
CN107609019A (en) * | 2017-08-07 | 2018-01-19 | 国网辽宁省电力有限公司 | A kind of method that corporate information based on internet public information obtains |
Non-Patent Citations (1)
Title |
---|
崔航 等: "基于用户日志的查询扩展统计模型", 《软件学报》 * |
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109711984A (en) * | 2019-01-23 | 2019-05-03 | 北京市天元网络技术股份有限公司 | Risk monitoring and control method and device before a kind of loan based on collection |
CN111291042A (en) * | 2019-12-23 | 2020-06-16 | 创意信息技术股份有限公司 | Power data processing system and method for power supply service |
CN115982503A (en) * | 2023-02-07 | 2023-04-18 | 梁礼津 | Website information acquisition method and system based on cloud platform |
CN115982503B (en) * | 2023-02-07 | 2023-10-13 | 深圳慧梧科技有限公司 | Website information acquisition method and system based on cloud platform |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US6535728B1 (en) | Event manager for use in fraud detection | |
CN112053221A (en) | Knowledge graph-based internet financial group fraud detection method | |
US7770221B2 (en) | Method and apparatus for combining traffic analysis and monitoring center in lawful interception | |
CN111104521B (en) | Anti-fraud detection method and detection system based on graph analysis | |
WO2019041774A1 (en) | Customer information screening method and apparatus, electronic device, and medium | |
CN101453358B (en) | Sql sentence audit method and system for oracle database binding variable | |
CN108563706A (en) | A kind of collection big data intelligent service system and its operation method | |
CN106022708A (en) | Method for predicting employee resignation | |
CN112053222A (en) | Knowledge graph-based internet financial group fraud detection method | |
CN109145050B (en) | Computing device | |
CN108961082A (en) | A kind of vehicle insurance loss assessment system and method based on AI image recognition | |
CN109150894A (en) | A kind of method and system for identifying malicious user | |
CN110246033B (en) | Credit risk monitoring method, device, equipment and storage medium | |
CN112445870A (en) | Knowledge graph string parallel case analysis method based on mobile phone evidence obtaining electronic data | |
CN115038083A (en) | Telecom fraud early warning identification method and system applied to AI operator industry | |
CN111861733B (en) | Fraud prevention and control system and method based on address fuzzy matching | |
CN102256255A (en) | Detection method for parallel-used-card proof based on time and geographic location collisions | |
CN109410035A (en) | A kind of method and tool for assisting anti-fraud analysis cluster structure | |
CN113407734B (en) | Method for constructing knowledge graph system based on real-time big data | |
CN111739180B (en) | Path segmentation method based on ETC portal frame | |
CN109377391A (en) | A kind of tracking of information method, storage medium and server | |
CN109919811B (en) | Insurance agent culture scheme generation method based on big data and related equipment | |
CN114677144A (en) | Vehicle insurance claim settlement fraud risk identification method and system based on geographic big data | |
CN112185083A (en) | Repeated alarm judging method | |
CN116844171A (en) | Large data identification model method and device for virtual open value-added tax invoice |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20180921 |