CN109325027A - One kind is based on the analysis of cloud data, Situation Awareness algorithm - Google Patents
One kind is based on the analysis of cloud data, Situation Awareness algorithm Download PDFInfo
- Publication number
- CN109325027A CN109325027A CN201810956569.3A CN201810956569A CN109325027A CN 109325027 A CN109325027 A CN 109325027A CN 201810956569 A CN201810956569 A CN 201810956569A CN 109325027 A CN109325027 A CN 109325027A
- Authority
- CN
- China
- Prior art keywords
- data
- target
- target data
- screened
- attribute
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Landscapes
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
The present invention provides one kind based on the analysis of cloud data, Situation Awareness algorithm, obtains the target data to be screened such as user oriented;Determine the attribute of target data, the target data to be screened has score data and data category attribute information;The target data to be screened includes first kind target data and Second Type target data;Prescreening processing is carried out to the target data to be screened, data value corresponding to the attribute of upper layer is established and is indexed, B+ tree index structure is just constructed if it is numeric type data, inverted index is just constructed if it is character type data, data screening filters out valid data, it is then based on the calculating that valid data carry out article similarity automatically, enables and accurately carries out Products Show according to the similarity data being calculated.The data analysing method is compatible with multiple business scene, can effectively reduce data production, verification and operation cost.
Description
Technical field
The present invention is a kind of based on the analysis of cloud data, Situation Awareness algorithm, belongs to data processing field.
Background technique
In the prior art, data analysis refer to statistical analysis technique appropriate to collect come mass data divide
Analysis, extract useful information and formed conclusion and to data be subject in detail research and summary process.This process is also matter
Measure the support process of management system.In practical, data analysis can help people to judge, to take appropriate action, number
It has just been established according to the Fundamentals of Mathematics of analysis in early stage in 20th century, but until the appearance of computer just makes practical operation as can
Can, and data analysis is promoted.Data analysis is the product that mathematical and computer sciences combine, traditional data point
Analysis there is a problem of it is relatively complicated, so needing a kind of new method to solve the above problems.
Summary of the invention
In view of the deficienciess of the prior art, it is an object of the present invention to provide one kind based on the analysis of cloud data, Situation Awareness
Algorithm, to solve the problems mentioned in the above background technology.
To achieve the goals above, the present invention is to realize by the following technical solutions: one kind is based on cloud data point
Analysis, Situation Awareness algorithm, include the following steps:
S1: the target data to be screened such as user oriented is obtained;Determine the attribute of target data, the mesh to be screened
Marking data has score data and data category attribute information;The target data to be screened includes first kind target data
With Second Type target data;
S2: prescreening processing is carried out to the target data to be screened, data value corresponding to the attribute of upper layer is established
Index, B+ tree index structure is just constructed if it is numeric type data, just constructs inverted index if it is character type data;
S3: category attribute information based on the data, to the prescreening of different data category attribute treated mesh
Mark data are grouped, and determine data attribute Value Types, if numeric type data, then create B+ tree index for it;If character
Type attribute then establishes inverted index structure for it
S4: it to each group of the prescreening treated target data, is counted according to the score data of target data
According to the normalized of scoring, the normalization grading parameters of the target data are generated;The normalization grading parameters have mesh
Mark the information of the target object ID of data, the User ID of data category ID and the user;
S5: the normalization grading parameters of the target data of multiple users, the position where the normalization grading parameters are obtained
It is set to father node, depth and/or breadth traversal are carried out in the recommending data set based on orderly multiway tree, so as to institute
It states user and exports suitable one or more recommendations number;
S6: according to the data category ID, phase is carried out to the unitized grading parameters of multiple target datas of different user
It is calculated like degree, obtains the value of measuring similarity;
S7: according to the value of the measuring similarity, the phase between the corresponding target object of the multiple target data is determined
Guan Du.
Further, the format of the target object ID of the User ID of the user and the target data;The determining institute
State whether first object data are that invalid data specifically includes: whether the playing duration for determining the first object data is more than to have
Imitate play time threshold value.
Beneficial effects of the present invention: one kind of the invention is based on the analysis of cloud data, Situation Awareness algorithm, data screening filter
Valid data out are then based on the calculating that valid data carry out article similarity automatically, so that according to the similarity being calculated
Data can accurately carry out Products Show.The data analysing method be compatible with multiple business scene, can effectively reduce data production,
Verification and operation cost.
Specific embodiment
To be easy to understand the technical means, the creative features, the aims and the efficiencies achieved by the present invention, below with reference to
Specific embodiment, the present invention is further explained.
The present invention provides a kind of technical solution: one kind is based on the analysis of cloud data, Situation Awareness algorithm, including walks as follows
It is rapid:
S1: the target data to be screened such as user oriented is obtained;Determine the attribute of target data, the mesh to be screened
Marking data has score data and data category attribute information;The target data to be screened includes first kind target data
With Second Type target data;
S2: prescreening processing is carried out to the target data to be screened, data value corresponding to the attribute of upper layer is established
Index, B+ tree index structure is just constructed if it is numeric type data, just constructs inverted index if it is character type data;
S3: category attribute information based on the data, to the prescreening of different data category attribute treated mesh
Mark data are grouped, and determine data attribute Value Types, if numeric type data, then create B+ tree index for it;If character
Type attribute then establishes inverted index structure for it
S4: it to each group of the prescreening treated target data, is counted according to the score data of target data
According to the normalized of scoring, the normalization grading parameters of the target data are generated;The normalization grading parameters have mesh
Mark the information of the target object ID of data, the User ID of data category ID and the user;
S5: the normalization grading parameters of the target data of multiple users, the position where the normalization grading parameters are obtained
It is set to father node, depth and/or breadth traversal are carried out in the recommending data set based on orderly multiway tree, so as to institute
It states user and exports suitable one or more recommendations number;
S6: according to the data category ID, phase is carried out to the unitized grading parameters of multiple target datas of different user
It is calculated like degree, obtains the value of measuring similarity;
S7: according to the value of the measuring similarity, the phase between the corresponding target object of the multiple target data is determined
Guan Du.
The format of the target object ID of the User ID of user and the target data;The determination first object data
Whether be that invalid data specifically includes: whether the playing duration for determining the first object data is more than effective play time threshold
Value.
Embodiment 1: one kind based on cloud data analysis, Situation Awareness algorithm, include the following steps: first obtain towards with
Family etc. target data to be screened;Determine the attribute of target data, the target data to be screened have score data and
Data category attribute information;The target data to be screened includes first kind target data and Second Type target data,
Then prescreening processing is carried out to the target data to be screened, data value corresponding to the attribute of upper layer is established and is indexed, such as
Fruit is that numeric type data just constructs B+ tree index structure, just constructs inverted index if it is character type data, is next based on institute
Data category attribute information is stated, treated that target data is grouped to the prescreening of different data category attribute, really
Fixed number is according to attribute Value Types, if numeric type data, then creates B+ tree index for it;If character type attribute then is established for it
Arrange index structure, then to each group of the prescreening treated target data, according to the score data of target data into
The normalized of row data scoring, generates the normalization grading parameters of the target data;The normalization grading parameters tool
There are the information of the target object ID of target data, the User ID of data category ID and the user, finally obtains multiple users'
The normalization grading parameters of target data, the position where the normalization grading parameters is father node, described based on orderly
Depth and/or breadth traversal are carried out in the recommending data set of multiway tree, to export suitable one or more to the user
A recommendation number carries out the unitized grading parameters of multiple target datas of different user similar according to the data category ID
Degree calculates, and obtains the value of measuring similarity, according to the value of the measuring similarity, determines the corresponding mesh of the multiple target data
Mark the degree of correlation between object.
The format of the target object ID of the User ID of user and the target data;The determination first object data
Whether be that invalid data specifically includes: whether the playing duration for determining the first object data is more than effective play time threshold
Value.
The above shows and describes the basic principles and main features of the present invention and the advantages of the present invention, for this field skill
For art personnel, it is clear that invention is not limited to the details of the above exemplary embodiments, and without departing substantially from spirit of the invention or
In the case where essential characteristic, the present invention can be realized in other specific forms.Therefore, in all respects, should all incite somebody to action
Embodiment regards exemplary as, and is non-limiting, the scope of the present invention by appended claims rather than on state
Bright restriction, it is intended that including all changes that fall within the meaning and scope of the equivalent elements of the claims in the present invention
It is interior.Claim should not be construed as limiting the claims involved.
In addition, it should be understood that although this specification is described in terms of embodiments, but not each embodiment is only wrapped
Containing an independent technical solution, this description of the specification is merely for the sake of clarity, and those skilled in the art should
It considers the specification as a whole, the technical solutions in the various embodiments may also be suitably combined, forms those skilled in the art
The other embodiments being understood that.
Claims (2)
1. one kind is based on the analysis of cloud data, Situation Awareness algorithm, it is characterised in that include the following steps:
S1: the target data to be screened such as user oriented is obtained;Determine the attribute of target data, the number of targets to be screened
According to score data and data category attribute information;The target data to be screened includes first kind target data and
Two type target data;
S2: carrying out prescreening processing to the target data to be screened, establish and index to data value corresponding to the attribute of upper layer,
B+ tree index structure is just constructed if it is numeric type data, just constructs inverted index if it is character type data;
S3: category attribute information based on the data, to the prescreening of different data category attribute treated number of targets
According to being grouped, data attribute Value Types are determined, if numeric type data, then create B+ tree index for it;If character type category
Property then establishes inverted index structure for it
S4: to each group of the prescreening treated target data, data is carried out according to the score data of target data and are commented
The normalized divided, generates the normalization grading parameters of the target data;The normalization grading parameters have number of targets
According to target object ID, data category ID and the user User ID information;
S5: obtaining the normalization grading parameters of the target data of multiple users, and the position where the normalization grading parameters is
Father node carries out depth and/or breadth traversal in the recommending data set based on orderly multiway tree, so as to the use
Family exports suitable one or more recommendations number;
S6: according to the data category ID, similarity is carried out to the unitized grading parameters of multiple target datas of different user
It calculates, obtains the value of measuring similarity;
S7: according to the value of the measuring similarity, the degree of correlation between the corresponding target object of the multiple target data is determined.
2. according to claim 1 a kind of based on the analysis of cloud data, Situation Awareness algorithm, it is characterised in that: the use
The format of the target object ID of the User ID at family and the target data;Whether the determination first object data are invalid
Data specifically include: whether the playing duration for determining the first object data is more than effective play time threshold value.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201810956569.3A CN109325027A (en) | 2018-08-21 | 2018-08-21 | One kind is based on the analysis of cloud data, Situation Awareness algorithm |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201810956569.3A CN109325027A (en) | 2018-08-21 | 2018-08-21 | One kind is based on the analysis of cloud data, Situation Awareness algorithm |
Publications (1)
Publication Number | Publication Date |
---|---|
CN109325027A true CN109325027A (en) | 2019-02-12 |
Family
ID=65264494
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201810956569.3A Pending CN109325027A (en) | 2018-08-21 | 2018-08-21 | One kind is based on the analysis of cloud data, Situation Awareness algorithm |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN109325027A (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113191852A (en) * | 2021-05-19 | 2021-07-30 | 拉扎斯网络科技(上海)有限公司 | Data verification method and device, storage medium and computer equipment |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103838775A (en) * | 2012-11-27 | 2014-06-04 | 中国银联股份有限公司 | Data analysis method and data analysis device |
CN106484813A (en) * | 2016-09-23 | 2017-03-08 | 广东港鑫科技有限公司 | A kind of big data analysis system and method |
CN107220382A (en) * | 2017-06-28 | 2017-09-29 | 环球智达科技(北京)有限公司 | Data analysing method |
-
2018
- 2018-08-21 CN CN201810956569.3A patent/CN109325027A/en active Pending
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103838775A (en) * | 2012-11-27 | 2014-06-04 | 中国银联股份有限公司 | Data analysis method and data analysis device |
CN106484813A (en) * | 2016-09-23 | 2017-03-08 | 广东港鑫科技有限公司 | A kind of big data analysis system and method |
CN107220382A (en) * | 2017-06-28 | 2017-09-29 | 环球智达科技(北京)有限公司 | Data analysing method |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113191852A (en) * | 2021-05-19 | 2021-07-30 | 拉扎斯网络科技(上海)有限公司 | Data verification method and device, storage medium and computer equipment |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN103743486B (en) | A kind of automatic Grading System based on magnanimity tobacco leaf data and method | |
Liu et al. | Research fronts in data envelopment analysis | |
CN105630743B (en) | A kind of system of selection of spectrum wave number | |
Malcolm | Edge effects in central Amazonian forest fragments | |
CN104931430B (en) | A kind of redried natural alcoholization quality evaluation and model building method | |
CN102929942B (en) | The overlapping community discovery method of a kind of community network based on integrated study | |
CN106447388A (en) | Method and system for recommending dishes | |
CN103424542A (en) | Tobacco leaf quality evaluation method based on sensory evaluation | |
CN109344150A (en) | A kind of spatiotemporal data structure analysis method based on FP- tree | |
US20240260885A1 (en) | Street greening quality detection method based on physiological activation recognition | |
Estornell et al. | Estimation of wood volume and height of olive tree plantations using airborne discrete-return LiDAR data | |
CN108804683A (en) | Associate(d) matrix decomposes and the film of collaborative filtering recommends method | |
CN108280124A (en) | Product classification method and device, ranking list generation method and device, electronic equipment | |
CN108132964A (en) | A kind of collaborative filtering method to be scored based on user item class | |
CN110132879A (en) | A kind of flue-cured tobacco note discrimination method based near infrared spectrum | |
CN106644983A (en) | Spectrum wavelength selection method based on PLS-VIP-ACO algorithm | |
CN105092789A (en) | Analysis method for tobacco leaf style characteristic variation trend in lamina aging process | |
CN110287423A (en) | A kind of farm Products Show system and method based on collaborative filtering | |
CN107169424A (en) | Atrial fibrillation detection algorithm based on artificial neural network | |
CN109325027A (en) | One kind is based on the analysis of cloud data, Situation Awareness algorithm | |
CN106645530B (en) | A method of the multi-model based on tobacco leaf aroma component evaluates raw tobacco material similarity | |
CN105468740A (en) | Game player data storage and analysis method and apparatus | |
CN107133317A (en) | A kind of network public-opinion subject extraction method based on neologisms | |
CN104463627B (en) | Data processing method and device | |
CN107169051A (en) | Based on semantic related method for searching three-dimension model and system between body |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20190212 |
|
RJ01 | Rejection of invention patent application after publication |