CN108563706A - A kind of collection big data intelligent service system and its operation method - Google Patents

A kind of collection big data intelligent service system and its operation method Download PDF

Info

Publication number
CN108563706A
CN108563706A CN201810256557.XA CN201810256557A CN108563706A CN 108563706 A CN108563706 A CN 108563706A CN 201810256557 A CN201810256557 A CN 201810256557A CN 108563706 A CN108563706 A CN 108563706A
Authority
CN
China
Prior art keywords
data
database
service system
intelligent service
server
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201810256557.XA
Other languages
Chinese (zh)
Inventor
康彦昌
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Hefei And Jun Longitudinal Data Technology Co Ltd
Shizuishan And Jun Longitudinal Data Technology Co Ltd
Suzhou And Jun Longitudinal Data Technology Co Ltd
Kunshan And Jun Chang Da Data Technology Co Ltd
Original Assignee
Hefei And Jun Longitudinal Data Technology Co Ltd
Shizuishan And Jun Longitudinal Data Technology Co Ltd
Suzhou And Jun Longitudinal Data Technology Co Ltd
Kunshan And Jun Chang Da Data Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Hefei And Jun Longitudinal Data Technology Co Ltd, Shizuishan And Jun Longitudinal Data Technology Co Ltd, Suzhou And Jun Longitudinal Data Technology Co Ltd, Kunshan And Jun Chang Da Data Technology Co Ltd filed Critical Hefei And Jun Longitudinal Data Technology Co Ltd
Priority to CN201810256557.XA priority Critical patent/CN108563706A/en
Publication of CN108563706A publication Critical patent/CN108563706A/en
Pending legal-status Critical Current

Links

Landscapes

  • Telephonic Communication Services (AREA)

Abstract

Due to the big data intelligent service system of existing collection platform, the scarce capacity of dynamic data is handled, and data are unable to get cross validation, resulting in the efficiency of collection personnel greatly reduces, very big so as to cause the pressure cost of collection company.In order to solve the problems, such as this, a kind of collection big data intelligent service system of present invention offer and its operation method.The collection big data intelligent service system of the present invention and its purpose of operation method extract through scene and source analysis from a large amount of dynamic interaction data and contain the efficiency for improving collection in potential information therein.

Description

A kind of collection big data intelligent service system and its operation method
Technical field
The invention belongs to big data digging technology fields, more specifically, to be related to a kind of acquisition of multi-user data shared The system and method for polymerization are more specifically related to collection big data intelligent service system and its operation method.
Background technology
Data mining, also known as Knowledge Discovery in Database (Khowledge Discovery from Datebase, letter Claim KDD), it is one and extracts the complex process for excavating the knowledge such as unknown, valuable pattern or rule from mass data.
The big data intelligent service system of existing collection platform, has the following defects:(1) static data can only be handled, Handling the scarce capacity and source and Scene Blur of dynamic data so that user must settle at one go when carrying out data mining, Precision is not high;(2) data are unable to get cross validation so that user cannot accurately learn source.Due to this 2 points of defect, just So that the efficiency of collection personnel greatly reduces, it is very big so as to cause the pressure cost of collection company.
Invention content
In view of this, a kind of collection big data intelligent service system of present invention offer and its operation method.The present invention's urges The purpose of receipts big data intelligent service system and its operation method is from a large amount of dynamic interaction data, through scene and source point Analysis extracts the efficiency contained and improve collection in potential information therein.
A kind of collection big data intelligent service system comprising:
Data acquisition interface:The data acquisition interface can be connected on telephone traffic system or other database interfaces, In database during these initial data are uploaded onto the server;
Data inquiry module:Request data and subscriber identity information are submitted in server together by request event, Server returns to requesting data information by search index database, while the data of server record request and identity information arrive Cache database;
Data cache module:Request data for preserving request time and identity information;
Data disaply moudle:When returning to requesting data information, the data information inquired will be indexed and carry out needing to mark And splicing, form the more complete meaning of one's words;
Data analysis module:The initial data and data inquiry module rope of server are periodically transferred to data acquisition interface The data for drawing inquiry are cleaned and apply mechanically corresponding model evaluation module and analyzed;
Model evaluation module:Personnel query identity information and inquiry data are unfolded into information, extract keyword, structure is special Levy marking algorithm;
Data Identification module:Spy of the data by data analysis module analysis, model evaluation module in cache database After levying algorithm calculating, data characteristics is identified;
Core database:By the data storage after mark to core database.
Wherein, data acquisition interface can be used for acquiring initial data, and initial data includes the voice communication of telephone traffic system Content, the beginning and ending time of voice, the object of dialing, the identity card of dialing object, phone number, address, other contact persons, whether It connects, dial number, operator message, the amount of the loan, payment period, related party loans mechanism, bank etc..
The invention further relates to the operation methods of collection big data intelligent service system comprising following steps:
S1:The gathered data of report and backstage reptile is actively put forward by personnel query query event, personnel query, and will Data are deposited into the database of server, and collected data information includes the voice communication content of telephone traffic system, voice Whether the beginning and ending time object of dialing, the identity card of dialing object, phone number, address, other contact persons, connects, dials time Number, operator message, the amount of the loan, payment period, related party loans mechanism, bank etc.;
S2:It is for statistical analysis to collected data in S1 by python language, to user's exhibition in a manner of label The basic description information of existing data;
S3:Pass through the data screenings models such as python language encapsulation classification, cluster, association and time series;Setting is provided Corresponding model analysis parameter;
S4:The result of the data in database in a manner of list etc. is presented to user by python language, by looking into Interface is ask data result can be shown with json formats;
S5:A variety of model evaluation methods such as accurate rate, error rate and confusion matrix are provided by using python language, and The parameter of policy engine module is provided.
Further, step S1 includes step in detail below:
S11 is identified classification by pre-set rule and policy, is formed altogether when specific people's query event Property data be written to data buffer storage to row, by analyzing and Supplementing Data eventually enters into the database of data storage server;
S12, when personnel query actively puies forward count off evidence, the common denominator data that is formed by actively proposing count off according to classification is identified It is written in the database of data storage server;
S13 is written to when the gathered data of backstage reptile by carrying out Data acquisition and issuance to specific data source In the database of data storage server.
Further, in step s 2, the statistical analysis of data includes following procedure:Database in server is carried out The processing of shortage of data item, duplicate data processing, noise data processing and dealing of abnormal data etc..
More specifically, dealing of abnormal data includes:To ambiguous information, illegal ID card No. and cell-phone number Code, the meaningless data submitted, the data not timely updated are cleaned.
Further, in step S1~S5, it is desirable to provide customize the work(of Mining Platform in a manner of editing configuration file The user interface of energy.
The collection big data intelligent service system that the present invention is built is a kind of inquiring, can identify, can save based on Web The elastic data communal space.Advantage of the invention is that for data mining it is continuous repeat, constantly modification, continuous iteration Complexity provides a kind of elastic data excavation shared platform of facing multiple users cooperation, greatly improves accurate data degree, number According to the process for being a continuous inquiry, constantly sharing, constantly identifying.
Specific implementation mode
Case 1 is embodied:
A kind of collection big data intelligent service system comprising:
Data acquisition interface:The data acquisition interface can be connected on telephone traffic system or other database interfaces, In database during these initial data are uploaded onto the server;
Data inquiry module:Request data and subscriber identity information are submitted in server together by request event, Server returns to requesting data information by search index database, while the data of server record request and identity information arrive Cache database;
Data cache module:Request data for preserving request time and identity information;
Data disaply moudle:When returning to requesting data information, the data information inquired will be indexed and carry out needing to mark And splicing, form the more complete meaning of one's words;
Data analysis module:The initial data and data inquiry module rope of server are periodically transferred to data acquisition interface The data for drawing inquiry are cleaned and apply mechanically corresponding model evaluation module and analyzed;
Model evaluation module:Personnel query identity information and inquiry data are unfolded into information, extract keyword, structure is special Levy marking algorithm;
Data Identification module:Spy of the data by data analysis module analysis, model evaluation module in cache database After levying algorithm calculating, data characteristics is identified;
Core database:By the data storage after mark to core database.
Wherein, data acquisition interface can be used for acquiring initial data, and initial data includes the voice communication of telephone traffic system Content, the beginning and ending time of voice, the object of dialing, the identity card of dialing object, phone number, address, other contact persons, whether It connects, dial number, operator message, the amount of the loan, payment period, related party loans mechanism, bank etc..
The invention further relates to the operation methods of collection big data intelligent service system comprising following steps:
S1:The gathered data of report and backstage reptile is actively put forward by personnel query query event, personnel query, and will Data are deposited into the database of server, and collected data information includes the voice communication content of telephone traffic system, voice Whether the beginning and ending time object of dialing, the identity card of dialing object, phone number, address, other contact persons, connects, dials time Number, operator message, the amount of the loan, payment period, related party loans mechanism, bank etc.;
S2:It is for statistical analysis to collected data in S1 by python language, to user's exhibition in a manner of label The basic description information of existing data;
S3:Pass through the data screenings models such as python language encapsulation classification, cluster, association and time series;Setting is provided Corresponding model analysis parameter;
S4:The result of the data in database in a manner of list etc. is presented to user by python language, by looking into Interface is ask data result can be shown with json formats;
S5:A variety of model evaluation methods such as accurate rate, error rate and confusion matrix are provided by using python language, and The parameter of policy engine module is provided.
Further, step S1 includes step in detail below:
S11 is identified classification by pre-set rule and policy, is formed altogether when specific people's query event Property data be written to data buffer storage to row, by analyzing and Supplementing Data eventually enters into the database of data storage server;
S12, when personnel query actively puies forward count off evidence, the common denominator data that is formed by actively proposing count off according to classification is identified It is written in the database of data storage server;
S13 is written to when the gathered data of backstage reptile by carrying out Data acquisition and issuance to specific data source In the database of data storage server.
Further, in step s 2, the statistical analysis of data further includes following procedure:To the database in server into The processing of row shortage of data item, duplicate data processing, noise data processing and dealing of abnormal data etc..
More specifically, dealing of abnormal data includes:To ambiguous information, illegal ID card No. and cell-phone number Code, the meaningless data submitted, the data not timely updated are cleaned.
Further, in step S1~S5, it is desirable to provide customize the work(of Mining Platform in a manner of editing configuration file The user interface of energy.
Several embodiments of the invention above described embodiment only expresses, the description thereof is more specific and detailed, but simultaneously Cannot the limitation to the scope of the claims of the present invention therefore be interpreted as.It should be pointed out that for those of ordinary skill in the art For, without departing from the inventive concept of the premise, various modifications and improvements can be made, these belong to the guarantor of the present invention Protect range.Therefore, the protection domain of patent of the present invention should be determined by the appended claims.

Claims (6)

1. a kind of collection big data intelligent service system comprising:
Data acquisition interface:The data acquisition interface can be connected on telephone traffic system or other database interfaces, by this In database during initial data are uploaded onto the server a bit;
Data inquiry module:Request data and subscriber identity information are submitted in server together by request event, serviced Device returns to requesting data information by search index database, while server records the data asked and identity information to caching Database;
Data cache module:Request data for preserving request time and identity information;
Data disaply moudle:When returning to requesting data information, the data information inquired will be indexed and carry out needing label and spelling It connects, forms the more complete meaning of one's words;
Data analysis module:The initial data of server periodically is transferred to data acquisition interface and data query module index is looked into The data of inquiry are cleaned and apply mechanically corresponding model evaluation module and analyzed;
Model evaluation module:Personnel query identity information and inquiry data are unfolded into information, extract keyword, construction feature mark Know algorithm;
Data Identification module:Feature of the data by data analysis module analysis, model evaluation module in cache database is calculated After method calculates, data characteristics is identified;
Core database:By the data storage after mark to core database.
2. a kind of operation method of collection big data intelligent service system comprising following steps:
S1:The gathered data of report and backstage reptile is actively put forward by personnel query query event, personnel query, and by data It is deposited into the database of server, collected data information includes the start-stop of the voice communication content, voice of telephone traffic system Time, the object of dialing, the identity card of dialing object, phone number, address, other contact persons, whether connect, the number that dials, Operator message, the amount of the loan, payment period, related party loans mechanism, bank;
S2:It is for statistical analysis to collected data in S1 by python language, show number to user in a manner of label According to basic description information;
S3:Pass through the data screenings models such as python language encapsulation classification, cluster, association and time series;It is corresponding to provide setting Model analysis parameter;
S4:The result of the data in database in a manner of list etc. is presented to user by python language, is connect by inquiry Mouth can be shown data result with json formats;
S5:A variety of model evaluation methods such as accurate rate, error rate and confusion matrix are provided by using python language, and are provided The parameter of policy engine module.
3. the operation method of collection big data intelligent service system as claimed in claim 2, it is characterised in that:The step Step S1 includes step in detail below:
S11 is identified classification by pre-set rule and policy, forms general character number when specific people's query event According to being written to data buffer storage to row, by analyzing and Supplementing Data eventually enters into the database of data storage server;
S12, when personnel query actively puies forward count off evidence, the common denominator data that classification formation is identified by actively putting forward count off evidence is written Into the database of data storage server;
S13 is written to data when the gathered data of backstage reptile by carrying out Data acquisition and issuance to specific data source In the database of storage server.
4. the operation method of collection big data intelligent service system as claimed in claim 2, it is characterised in that:In step S2 In, the statistical analysis of data further includes carrying out the processing of shortage of data item, duplicate data processing, noise to the database in server Data processing and dealing of abnormal data.
5. the operation method of collection big data intelligent service system as claimed in claim 4, it is characterised in that:At abnormal data Reason includes:To the meaningless data of ambiguous information, illegal ID card No. and phone number, submission, in time more New data are cleaned.
6. the operation method of the collection big data intelligent service system as described in any one of claim 2~5, feature exist In:In step S1~S5, it is desirable to provide customize the user interface of the function of Mining Platform in a manner of editing configuration file.
CN201810256557.XA 2018-03-27 2018-03-27 A kind of collection big data intelligent service system and its operation method Pending CN108563706A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810256557.XA CN108563706A (en) 2018-03-27 2018-03-27 A kind of collection big data intelligent service system and its operation method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201810256557.XA CN108563706A (en) 2018-03-27 2018-03-27 A kind of collection big data intelligent service system and its operation method

Publications (1)

Publication Number Publication Date
CN108563706A true CN108563706A (en) 2018-09-21

Family

ID=63533314

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810256557.XA Pending CN108563706A (en) 2018-03-27 2018-03-27 A kind of collection big data intelligent service system and its operation method

Country Status (1)

Country Link
CN (1) CN108563706A (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109711984A (en) * 2019-01-23 2019-05-03 北京市天元网络技术股份有限公司 Risk monitoring and control method and device before a kind of loan based on collection
CN111291042A (en) * 2019-12-23 2020-06-16 创意信息技术股份有限公司 Power data processing system and method for power supply service
CN115982503A (en) * 2023-02-07 2023-04-18 梁礼津 Website information acquisition method and system based on cloud platform

Citations (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2006190060A (en) * 2005-01-06 2006-07-20 Kyocera Mita Corp Database retieval method, database retieval program, and original processor
CN1877583A (en) * 2006-07-12 2006-12-13 百度在线网络技术(北京)有限公司 Accessing identification index system and accessing identification index library generation method
CN1877582A (en) * 2006-07-12 2006-12-13 百度在线网络技术(北京)有限公司 Advertisement information retrieval system and method therefor
CN101227687A (en) * 2007-01-16 2008-07-23 天津中环腾梁技术有限公司 Processing method of TEMS mobile network testing data
CN102254265A (en) * 2010-05-18 2011-11-23 北京首家通信技术有限公司 Rich media internet advertisement content matching and effect evaluation method
CN103853821A (en) * 2014-02-21 2014-06-11 河海大学 Method for constructing multiuser collaboration oriented data mining platform
CN104462340A (en) * 2014-12-04 2015-03-25 华为技术有限公司 Target object information search method and device
CN105808605A (en) * 2014-12-31 2016-07-27 北京奇虎科技有限公司 Search log combination method and system
CN105897667A (en) * 2015-10-22 2016-08-24 乐视致新电子科技(天津)有限公司 Device access history tracking method, apparatus, server and system
CN106682925A (en) * 2015-11-06 2017-05-17 北京奇虎科技有限公司 Method and device for recommending advertisement content
CN107016528A (en) * 2017-04-11 2017-08-04 连云港大锤网络科技有限公司 A kind of internet collection big data intelligent service system and its operation method
CN107609019A (en) * 2017-08-07 2018-01-19 国网辽宁省电力有限公司 A kind of method that corporate information based on internet public information obtains

Patent Citations (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2006190060A (en) * 2005-01-06 2006-07-20 Kyocera Mita Corp Database retieval method, database retieval program, and original processor
CN1877583A (en) * 2006-07-12 2006-12-13 百度在线网络技术(北京)有限公司 Accessing identification index system and accessing identification index library generation method
CN1877582A (en) * 2006-07-12 2006-12-13 百度在线网络技术(北京)有限公司 Advertisement information retrieval system and method therefor
CN101227687A (en) * 2007-01-16 2008-07-23 天津中环腾梁技术有限公司 Processing method of TEMS mobile network testing data
CN102254265A (en) * 2010-05-18 2011-11-23 北京首家通信技术有限公司 Rich media internet advertisement content matching and effect evaluation method
CN103853821A (en) * 2014-02-21 2014-06-11 河海大学 Method for constructing multiuser collaboration oriented data mining platform
CN104462340A (en) * 2014-12-04 2015-03-25 华为技术有限公司 Target object information search method and device
CN105808605A (en) * 2014-12-31 2016-07-27 北京奇虎科技有限公司 Search log combination method and system
CN105897667A (en) * 2015-10-22 2016-08-24 乐视致新电子科技(天津)有限公司 Device access history tracking method, apparatus, server and system
CN106682925A (en) * 2015-11-06 2017-05-17 北京奇虎科技有限公司 Method and device for recommending advertisement content
CN107016528A (en) * 2017-04-11 2017-08-04 连云港大锤网络科技有限公司 A kind of internet collection big data intelligent service system and its operation method
CN107609019A (en) * 2017-08-07 2018-01-19 国网辽宁省电力有限公司 A kind of method that corporate information based on internet public information obtains

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
崔航 等: "基于用户日志的查询扩展统计模型", 《软件学报》 *

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109711984A (en) * 2019-01-23 2019-05-03 北京市天元网络技术股份有限公司 Risk monitoring and control method and device before a kind of loan based on collection
CN111291042A (en) * 2019-12-23 2020-06-16 创意信息技术股份有限公司 Power data processing system and method for power supply service
CN115982503A (en) * 2023-02-07 2023-04-18 梁礼津 Website information acquisition method and system based on cloud platform
CN115982503B (en) * 2023-02-07 2023-10-13 深圳慧梧科技有限公司 Website information acquisition method and system based on cloud platform

Similar Documents

Publication Publication Date Title
US6535728B1 (en) Event manager for use in fraud detection
CN112053221A (en) Knowledge graph-based internet financial group fraud detection method
US7770221B2 (en) Method and apparatus for combining traffic analysis and monitoring center in lawful interception
CN111104521B (en) Anti-fraud detection method and detection system based on graph analysis
WO2019041774A1 (en) Customer information screening method and apparatus, electronic device, and medium
CN101453358B (en) Sql sentence audit method and system for oracle database binding variable
CN108563706A (en) A kind of collection big data intelligent service system and its operation method
CN106022708A (en) Method for predicting employee resignation
CN112053222A (en) Knowledge graph-based internet financial group fraud detection method
CN109145050B (en) Computing device
CN108961082A (en) A kind of vehicle insurance loss assessment system and method based on AI image recognition
CN109150894A (en) A kind of method and system for identifying malicious user
CN110246033B (en) Credit risk monitoring method, device, equipment and storage medium
CN112445870A (en) Knowledge graph string parallel case analysis method based on mobile phone evidence obtaining electronic data
CN115038083A (en) Telecom fraud early warning identification method and system applied to AI operator industry
CN111861733B (en) Fraud prevention and control system and method based on address fuzzy matching
CN102256255A (en) Detection method for parallel-used-card proof based on time and geographic location collisions
CN109410035A (en) A kind of method and tool for assisting anti-fraud analysis cluster structure
CN113407734B (en) Method for constructing knowledge graph system based on real-time big data
CN111739180B (en) Path segmentation method based on ETC portal frame
CN109377391A (en) A kind of tracking of information method, storage medium and server
CN109919811B (en) Insurance agent culture scheme generation method based on big data and related equipment
CN114677144A (en) Vehicle insurance claim settlement fraud risk identification method and system based on geographic big data
CN112185083A (en) Repeated alarm judging method
CN116844171A (en) Large data identification model method and device for virtual open value-added tax invoice

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20180921