CN104504445A - Method for realizing reference value in on-line data analysis service - Google Patents

Method for realizing reference value in on-line data analysis service Download PDF

Info

Publication number
CN104504445A
CN104504445A CN201410847872.1A CN201410847872A CN104504445A CN 104504445 A CN104504445 A CN 104504445A CN 201410847872 A CN201410847872 A CN 201410847872A CN 104504445 A CN104504445 A CN 104504445A
Authority
CN
China
Prior art keywords
data
reference value
enterprise
data analysis
user
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201410847872.1A
Other languages
Chinese (zh)
Inventor
何江
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
TIANJIN NANKAI UNIVERSITY GENERAL DATA TECHNOLOGIES Co Ltd
Original Assignee
TIANJIN NANKAI UNIVERSITY GENERAL DATA TECHNOLOGIES Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by TIANJIN NANKAI UNIVERSITY GENERAL DATA TECHNOLOGIES Co Ltd filed Critical TIANJIN NANKAI UNIVERSITY GENERAL DATA TECHNOLOGIES Co Ltd
Priority to CN201410847872.1A priority Critical patent/CN104504445A/en
Publication of CN104504445A publication Critical patent/CN104504445A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F21/00Security arrangements for protecting computers, components thereof, programs or data against unauthorised activity
    • G06F21/60Protecting data
    • G06F21/62Protecting access to data via a platform, e.g. using keys or access control rules
    • G06F21/6218Protecting access to data via a platform, e.g. using keys or access control rules to a system of files or objects, e.g. local or distributed file system or database
    • G06F21/6227Protecting access to data via a platform, e.g. using keys or access control rules to a system of files or objects, e.g. local or distributed file system or database where protection concerns the structure of data, e.g. records, types, queries
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/28Databases characterised by their database models, e.g. relational or object models
    • G06F16/284Relational databases

Abstract

The invention provides a method for realizing a reference value in on-line data analysis service. The method comprises the steps: realizing an enterprise-class statistic data reference value, and realizing a non enterprise-class statistic data reference value. In conventional data analysis, only a user is allowed to access own data, but this is often not enough; in on-line data analysis, data of all users are stored in a cloud system, so that simultaneous analysis on the data of a plurality of users is possible. However, under the requirement on data security, the data of other users cannot be directly supplied to a certain user. The method has the beneficial effects that the data of other users cannot be revealed, and the confidentiality of the user data is enhanced.

Description

The implementation method of reference value in a kind of on-line data analysis service
Technical field
The invention belongs to business intelligence field, particularly relate to the implementation method of reference value in the service of a kind of on-line data analysis.
Background technology
Data analysis changes on-line data analysis service into from conventional business intelligent artifact.Conventional business intelligent artifact user can only from the business datum of oneself obtaining information, and on-line data analysis service not only can from the business datum of oneself obtaining information, reference significance information can also be obtained from mass data.These reference informations can help people to understand oneself position in affiliated environment and situation, better to carry out decision-making.
Such as when carrying out enterprise personnel data analysis, when analyzing this enterprise personnel separation rate, the also separation rate level of Water demand other enterprises of the same trade, mutually compares thus show whether normal this enterprise personnel leaves office; Or when analyzing this enterprise year range of raising the salary again, also whether the annual pay rise situation of other enterprises of Water demand, competitive to understand this Enterprise Salary policy.
But these data analyses at present exist can not ensure the defects such as the privacy of user data.
Summary of the invention
The problem to be solved in the present invention is: in on-line data analysis service, for Users'Data Analysis provides valuable reference value under user data privacy conditions.
The present invention solves the technical scheme that its technical matters takes: the implementation method of reference value in a kind of on-line data analysis service, comprises the following steps:
The first step: classify with reference to value: be divided into enterprise-level statistics reference value and nonbusiness's level statistics reference value two class;
Second step: realize respectively for the reference value of two kinds described in the first step:
(1) implementation method of enterprise-level statistics reference value: first cancel the corporate identify forced filtration to data; Then by particular data scope, data are filtered; Finally specify polymerization methods generating reference value according to user;
(2) implementation method of nonbusiness's level statistics reference value: nonbusiness's level statistics must be polymerizing value, then cancels the corporate identify forced filtration to data; And by particular data scope, filtration generating reference value is carried out to data.
Further, described particular data scope comprises industry data, this area data and total data three kinds of data areas.
The invention has the beneficial effects as follows:
(1) statistics reference value of the present invention realizes the corporate identify forced filtration that must cancel data, enables inquiry have access to all customer data, and not only reads this business data of user, ensures the comprehensive of data;
(2) user can only carry out data selection from three kinds of particular data scopes of system regulation, user's factor data selection range can be avoided too small, cause reference value to affect by individual data items too large, thus conjecture is selected the situation generation of the content of data to a certain extent;
(3) reference value that enterprise-level statistics of the present invention generates is polymerizing value instead of detailed value, thus makes reference point not be detail or the aggregation information of unique user, but the result that multiple user profile condenses together, make result more reliable.
Embodiment
Below technical scheme of the present invention is set forth further:
Embodiment: the implementation method of reference value in a kind of on-line data analysis service, comprises the following steps:
The first step: classify with reference to value: be divided into enterprise-level statistics reference value and nonbusiness's level statistics reference value two class;
Second step: realize respectively for the reference value of two kinds described in the first step:
(1) for Enterprise Data: such as inquire about the average number of industry and enterprise, realize by the following method:
A. data are stored in relational database, distinguish different customer data with corporate identify;
Then human resources personal information table stores as follows:
Profession identity Corporate identify Person number Name Hiring date On-job state
005 0003128996 00001 Zhang San 2010/04/10 On-job
005 0003128996 00002 Li Si 2012/01/09 On-job
005 0003128996 00003 King five 2010/07/10 On-job
005 0001008123 00001 Wei seven 2011/12/09 On-job
……
B. carry out data analysis operation to user at interface, first system generates following SQL statement:
SELECT COUNT (person number) AS ' enterprise number ' the on-job state=of FROM HR_EMPLOYEE_INFOWHERE ' on-job ';
C., before Query Result is sent to relational database execution, system is to above-mentioned SQL statement:
1) add industry to filter:
SELECT COUNT (person number) AS ' enterprise number ' the on-job state=of FROM HR_EMPLOYEE_INFOWHERE ' on-job ' AND profession identity=' 005 ';
2) divide into groups by corporate identify:
SELECT COUNT (person number) AS ' enterprise number ' the on-job state=of FROM HR_EMPLOYEE_INFOWHERE ' on-job ' AND profession identity=' 005 ' GROUP BY corporate identify;
3) user is used to specify polymerization methods to carry out after polymerization:
The average number of SELECT AVG (enterprise's number) AS ' ' FROM (SELECT COUNT (person number) AS ' enterprise number ' the on-job state=of FROM HR_EMPLOYEE_INFO WHERE ' on-job ' AND profession identity=' 005 ' GROUP BY corporate identify);
Use corporate identify Field Sanitization different from inquiry enterprise during data, used herein is profession identity Field Sanitization, by changing data limited range, obtains industry average compensation reference value;
D. Query Result is returned, and present to client with chart.
(2) for nonbusiness's DBMS: such as inquire about tradesman's average compensation, realize by the following method:
A. data are stored in relational database, distinguish different customer data with corporate identify;
Then human resources personnel payroll stores as follows:
Profession identity Corporate identify Person number Name Monthly pay
005 0003128996 00001 Zhang San 8000
005 0003128996 00002 Li Si 6000
005 0003128996 00003 King five 5000
005 0001008123 00001 Wei seven 6500
……
B. carry out data analysis operation to user at interface, first system generates following SQL statement:
SELECT AVG (monthly pay) AS ' average compensation ' FROM HR_EMPLOYEE_SALARY
C., before Query Result is sent to relational database execution, system is added industry to above-mentioned SQL statement and is filtered:
SELECT AVG (monthly pay) AS ' average compensation ' FROM HR_EMPLOYEE_SALARY profession identity=' 005 ';
Use corporate identify Field Sanitization different from inquiry enterprise during data, used herein is profession identity Field Sanitization, by changing data limited range, obtains industry average compensation reference value;
D. Query Result is returned, and present to client with chart.
Above method is all under user data privacy conditions, for user provides the relevant reference information of user data query.
Above a specific embodiment of the present invention has been described in detail, but described content being only preferred embodiment of the present invention, can not being considered to for limiting practical range of the present invention.All equalizations done according to the present patent application scope change and improve, and all should still belong within patent covering scope of the present invention.

Claims (2)

1. the implementation method of reference value in on-line data analysis service, is characterized in that: comprise the following steps:
The first step: classify with reference to value: be divided into enterprise-level statistics reference value and nonbusiness's level statistics reference value two class;
Second step: realize respectively for the reference value of two kinds described in the first step:
(1) implementation method of enterprise-level statistics reference value: first cancel the corporate identify forced filtration to data; Then by particular data scope, data are filtered; Finally specify polymerization methods generating reference value according to user;
(2) implementation method of nonbusiness's level statistics reference value: nonbusiness's level statistics must be polymerizing value, then cancels the corporate identify forced filtration to data; And by particular data scope, filtration generating reference value is carried out to data.
2. the implementation method of reference value in on-line data analysis according to claim 1 service, is characterized in that: described particular data scope comprises industry data, this area data and total data three kinds of data areas.
CN201410847872.1A 2014-12-31 2014-12-31 Method for realizing reference value in on-line data analysis service Pending CN104504445A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201410847872.1A CN104504445A (en) 2014-12-31 2014-12-31 Method for realizing reference value in on-line data analysis service

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201410847872.1A CN104504445A (en) 2014-12-31 2014-12-31 Method for realizing reference value in on-line data analysis service

Publications (1)

Publication Number Publication Date
CN104504445A true CN104504445A (en) 2015-04-08

Family

ID=52945840

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201410847872.1A Pending CN104504445A (en) 2014-12-31 2014-12-31 Method for realizing reference value in on-line data analysis service

Country Status (1)

Country Link
CN (1) CN104504445A (en)

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20050021350A1 (en) * 2003-07-25 2005-01-27 Bryan Scott System and method for managing contract labor data elements
CN1870025A (en) * 2005-10-14 2006-11-29 华为技术有限公司 Generating method and device of user service property
CN101149829A (en) * 2006-09-22 2008-03-26 阿里巴巴公司 Client data centralized processing method and system

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20050021350A1 (en) * 2003-07-25 2005-01-27 Bryan Scott System and method for managing contract labor data elements
CN1870025A (en) * 2005-10-14 2006-11-29 华为技术有限公司 Generating method and device of user service property
CN101149829A (en) * 2006-09-22 2008-03-26 阿里巴巴公司 Client data centralized processing method and system

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
GHGHG009,叶子: "sql如何选出大于行业平均值的公司", 《HTTP://BBS.CSDN.NET/TOPICS/380178149》 *
罗度军: "SQL语言中GROUP BY子句的应用剖析", 《软件技术研究》 *

Similar Documents

Publication Publication Date Title
DE112018004946B4 (en) COGNITIVE DATA ANONYMIZATION
Ghebru et al. Drivers of perceived land tenure (in) security: Empirical evidence from Ghana
US8027993B2 (en) Techniques for establishing and enforcing row level database security
DE202015009875U1 (en) Transparent discovery of a semi-structured data schema
KR20200017512A (en) How to authorize the manipulation of form field values
CN107967313A (en) A kind of method for merging different industries data based on field data and coordinate general character
US20150019303A1 (en) Data quality integration
CN101996218B (en) Method for realizing data access authority control
KR20150009798A (en) System for online monitering individual information and method of online monitering the same
CN108920914B (en) Authority control method and device
Strohmeier et al. Human Resource Intelligence und Analytics
CN105160273A (en) Knowledge management method based on access control and intelligent retrieval
Zhang et al. Shocking the crowd: The effect of censorship shocks on Chinese Wikipedia
CN106649363A (en) Data query method and device
CN104715341A (en) Permission assigning method and device
CN107292143A (en) The System right management method and system that a kind of operating right is combined with data permission
CN110992005B (en) Method and system for realizing data authority control processing in big data application
CN104504445A (en) Method for realizing reference value in on-line data analysis service
CN103207835A (en) Mass data storage method through self-adaptive Range partitions
US11403421B2 (en) Security system for benchmark access
CN110472111A (en) Rights management, user right inquiry and resource information authorization method
CN104504132A (en) Custom sorting achieving method in data analysis query
CN114022188A (en) Target crowd circling method, device, equipment and storage medium
US10803092B1 (en) Metadata driven catalog definition
Wibowo et al. A multicriteria group decision making procedure for selecting cloud based ERP system providers

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20150408

RJ01 Rejection of invention patent application after publication