CN109325027A - Cloud-data-based analysis and situation awareness algorithm - Google Patents

Cloud-data-based analysis and situation awareness algorithm

Info

Publication number
CN109325027A
Authority
CN
China
Prior art keywords
data
target
target data
screened
attribute
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201810956569.3A
Other languages
Chinese (zh)
Inventor
朱常林
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Individual
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Individual
Priority to CN201810956569.3A
Publication of CN109325027A
Legal status: Pending

Landscapes

  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The present invention provides a cloud-data-based analysis and situation awareness algorithm. User-oriented target data to be screened is obtained and its attributes are determined; the target data to be screened has score data and data category attribute information, and comprises first-type target data and second-type target data. The target data to be screened is pre-screened, and an index is built on the data values of the upper-layer attribute: a B+ tree index structure for numeric data, an inverted index for character data. Data screening selects the valid data, and item similarity is then computed automatically from the valid data, so that products can be recommended accurately according to the computed similarity. The data analysis method is compatible with multiple business scenarios and can effectively reduce data production, verification, and operation costs.

Description

Cloud-data-based analysis and situation awareness algorithm
Technical field
The present invention is a cloud-data-based analysis and situation awareness algorithm and belongs to the field of data processing.
Background art
In the prior art, data analysis refers to analyzing large amounts of collected data with appropriate statistical methods, extracting useful information and forming conclusions, and studying and summarizing the data in detail. This process is also a supporting process of a quality management system. In practice, data analysis helps people make judgments so that they can take appropriate action. The mathematical foundations of data analysis were established in the early 20th century, but only the advent of the computer made it practical and brought it into wide use. Data analysis is a product of combining mathematics and computer science. Traditional data analysis is relatively cumbersome, so a new method is needed to solve this problem.
Summary of the invention
In view of the deficiencies of the prior art, the object of the present invention is to provide a cloud-data-based analysis and situation awareness algorithm that solves the problems raised in the background art above.
To achieve the above object, the present invention adopts the following technical solution: a cloud-data-based analysis and situation awareness algorithm, comprising the following steps:
S1: obtain user-oriented target data to be screened; determine the attributes of the target data, the target data to be screened having score data and data category attribute information; the target data to be screened comprises first-type target data and second-type target data;
S2: pre-screen the target data to be screened and build an index on the data values of the upper-layer attribute: a B+ tree index structure if the data is numeric, an inverted index if it is character data;
S3: based on the data category attribute information, group the pre-screened target data by data category attribute and determine the type of the data attribute values; for numeric data, create a B+ tree index; for character-type attributes, build an inverted index structure;
S4: for each group of pre-screened target data, normalize the data scores according to the score data of the target data to generate the normalized scoring parameters of the target data; the normalized scoring parameters carry the target object ID of the target data, the data category ID, and the user ID of the user;
S5: obtain the normalized scoring parameters of the target data of multiple users; taking the position of the normalized scoring parameters as the parent node, perform depth and/or breadth traversal in the recommendation data set based on an ordered multiway tree, so as to output one or more suitable recommendations to the user;
S6: according to the data category ID, compute the similarity of the normalized scoring parameters of multiple target data of different users to obtain the value of a similarity measure;
S7: according to the value of the similarity measure, determine the degree of correlation between the target objects corresponding to the multiple target data.
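The index selection in steps S2 and S3 can be illustrated with a minimal Python sketch. It is not the patent's implementation: a sorted list searched with `bisect` stands in for the B+ tree, the inverted index is built by naive whitespace tokenization, and the names `build_index`, `range_query`, and the sample `records` are assumptions introduced only for illustration.

```python
import bisect
from collections import defaultdict


def build_index(records, attribute):
    """Index one attribute, choosing the structure from the value type."""
    pairs = [(record[attribute], row) for row, record in enumerate(records)]
    numeric = all(
        isinstance(value, (int, float)) and not isinstance(value, bool)
        for value, _ in pairs
    )
    if numeric:
        # Stand-in for a B+ tree: a sorted list of (value, row) pairs.
        return "sorted", sorted(pairs)
    # Character data: inverted index from each token to the rows containing it.
    inverted = defaultdict(set)
    for value, row in pairs:
        for token in str(value).lower().split():
            inverted[token].add(row)
    return "inverted", dict(inverted)


def range_query(sorted_index, low, high):
    """Return the rows whose value lies in the closed interval [low, high]."""
    start = bisect.bisect_left(sorted_index, (low, -1))
    stop = bisect.bisect_right(sorted_index, (high, float("inf")))
    return [row for _, row in sorted_index[start:stop]]


# Hypothetical sample data, not taken from the patent.
records = [
    {"title": "space opera", "score": 4.5},
    {"title": "space western", "score": 3.0},
    {"title": "opera house", "score": 5.0},
]
_, score_index = build_index(records, "score")
_, title_index = build_index(records, "title")
print(range_query(score_index, 4.0, 5.0))  # rows with 4.0 <= score <= 5.0
print(sorted(title_index["opera"]))        # rows whose title contains "opera"
```

The sorted list supports the same ordered range lookups a B+ tree is typically chosen for, but without its block-oriented node layout. A production system would more likely rely on the index structures of a database engine.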
Further, the format of the user ID of the user and of the target object ID of the target data; determining whether the first target data is invalid data specifically includes: determining whether the playing duration of the first target data exceeds the effective playing-time threshold.
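The playing-duration check can be sketched as a simple filter. The record layout (`PlayRecord`), the 30-second value of `EFFECTIVE_PLAY_SECONDS`, and the reading that a record is valid only when its duration exceeds the threshold are all assumptions; the source gives neither a threshold value nor a data format.

```python
from dataclasses import dataclass


@dataclass
class PlayRecord:
    user_id: str
    object_id: str
    play_seconds: float


# Hypothetical threshold; the source does not state a value.
EFFECTIVE_PLAY_SECONDS = 30.0


def is_valid(record, threshold=EFFECTIVE_PLAY_SECONDS):
    """Treat a record as valid only if playback exceeded the threshold."""
    return record.play_seconds > threshold


def screen(records, threshold=EFFECTIVE_PLAY_SECONDS):
    """Drop the records considered invalid under the threshold."""
    return [record for record in records if is_valid(record, threshold)]


# Hypothetical sample records.
plays = [
    PlayRecord("u1", "video-1", 12.0),
    PlayRecord("u1", "video-2", 95.0),
    PlayRecord("u2", "video-1", 30.0),
]
valid = screen(plays)
print([record.object_id for record in valid])
```

With the assumed strict comparison, a record whose duration equals the threshold is treated as invalid. Whether the boundary is inclusive is not specified in the source.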
Beneficial effects of the present invention: in the cloud-data-based analysis and situation awareness algorithm of the invention, data screening selects the valid data, and item similarity is then computed automatically from the valid data, so that products can be recommended accurately according to the computed similarity. The data analysis method is compatible with multiple business scenarios and can effectively reduce data production, verification, and operation costs.
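The score normalization and item-similarity calculation mentioned above can be sketched as follows. The source names neither a normalization formula nor a similarity measure, so min-max scaling within each data category and cosine similarity are assumptions, as are the function names and the `(user_id, category_id, object_id, score)` tuple layout.

```python
import math
from collections import defaultdict


def normalize_ratings(ratings):
    """Min-max normalize scores to [0, 1] within each data category.

    ratings: list of (user_id, category_id, object_id, score) tuples.
    """
    by_category = defaultdict(list)
    for _, category_id, _, score in ratings:
        by_category[category_id].append(score)
    normalized = []
    for user_id, category_id, object_id, score in ratings:
        low, high = min(by_category[category_id]), max(by_category[category_id])
        # A category with a single distinct score carries no ranking signal.
        value = 0.5 if high == low else (score - low) / (high - low)
        normalized.append((user_id, category_id, object_id, value))
    return normalized


def item_similarity(normalized, category_id, a, b):
    """Cosine similarity between objects a and b over users, in one category."""
    vectors = defaultdict(dict)
    for user_id, cat, object_id, value in normalized:
        if cat == category_id:
            vectors[object_id][user_id] = value
    common = set(vectors[a]) & set(vectors[b])
    dot = sum(vectors[a][u] * vectors[b][u] for u in common)
    norm_a = math.sqrt(sum(v * v for v in vectors[a].values()))
    norm_b = math.sqrt(sum(v * v for v in vectors[b].values()))
    return 0.0 if norm_a == 0 or norm_b == 0 else dot / (norm_a * norm_b)


# Hypothetical ratings from two users in one category.
ratings = [
    ("u1", "film", "A", 5), ("u1", "film", "B", 5), ("u1", "film", "C", 1),
    ("u2", "film", "A", 4), ("u2", "film", "B", 4), ("u2", "film", "C", 2),
]
normalized = normalize_ratings(ratings)
print(item_similarity(normalized, "film", "A", "B"))
print(item_similarity(normalized, "film", "A", "C"))
```

Each normalized tuple keeps the user ID, data category ID, and target object ID alongside the scaled score, which mirrors the information the normalized scoring parameters are said to carry.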
Specific embodiment
To make the technical means, inventive features, objectives, and effects of the present invention easy to understand, the invention is further described below with reference to a specific embodiment.
Embodiment 1: a cloud-data-based analysis and situation awareness algorithm comprising the following steps. First, user-oriented target data to be screened is obtained and the attributes of the target data are determined; the target data to be screened has score data and data category attribute information and comprises first-type target data and second-type target data. The target data to be screened is then pre-screened, and an index is built on the data values of the upper-layer attribute: a B+ tree index structure if the data is numeric, an inverted index if it is character data. Next, based on the data category attribute information, the pre-screened target data is grouped by data category attribute and the type of the data attribute values is determined; a B+ tree index is created for numeric data and an inverted index structure for character-type attributes. Then, for each group of pre-screened target data, the data scores are normalized according to the score data of the target data to generate the normalized scoring parameters of the target data; the normalized scoring parameters carry the target object ID of the target data, the data category ID, and the user ID of the user. Finally, the normalized scoring parameters of the target data of multiple users are obtained; taking the position of the normalized scoring parameters as the parent node, depth and/or breadth traversal is performed in the recommendation data set based on an ordered multiway tree, so as to output one or more suitable recommendations to the user; according to the data category ID, the similarity of the normalized scoring parameters of multiple target data of different users is computed to obtain the value of a similarity measure; and according to the value of the similarity measure, the degree of correlation between the target objects corresponding to the multiple target data is determined.
The format of the user ID of the user and of the target object ID of the target data; determining whether the first target data is invalid data specifically includes: determining whether the playing duration of the first target data exceeds the effective playing-time threshold.
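The traversal step of the embodiment, a depth and/or breadth traversal of an ordered multiway tree starting from a parent node, can be sketched as below. The source does not describe the tree layout, so the `Node` class, the use of the normalized score as the ordering key, and the `limit` parameter are hypothetical.

```python
from collections import deque
from dataclasses import dataclass, field


@dataclass
class Node:
    key: float   # assumed ordering key, e.g. a normalized score
    item: str    # recommendation candidate held at this node
    children: list = field(default_factory=list)  # kept ordered by key


def breadth_first(parent, limit):
    """Collect up to `limit` items below `parent`, level by level."""
    found, queue = [], deque(parent.children)
    while queue and len(found) < limit:
        node = queue.popleft()
        found.append(node.item)
        queue.extend(node.children)
    return found


def depth_first(parent, limit):
    """Collect up to `limit` items below `parent`, following each branch first."""
    found, stack = [], list(reversed(parent.children))
    while stack and len(found) < limit:
        node = stack.pop()
        found.append(node.item)
        stack.extend(reversed(node.children))
    return found


# Hypothetical tree: the parent node has two ordered subtrees.
a = Node(0.2, "a", [Node(0.1, "a1"), Node(0.3, "a2")])
b = Node(0.6, "b", [Node(0.5, "b1")])
parent = Node(0.8, "parent", [a, b])
print(breadth_first(parent, 3))
print(depth_first(parent, 3))
```

Breadth-first traversal favors candidates close to the parent node, while depth-first traversal exhausts one branch before moving to the next. Which of the two suits a given recommendation set is left open by the source.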
The basic principles, main features, and advantages of the present invention have been shown and described above. It will be apparent to those skilled in the art that the invention is not limited to the details of the above exemplary embodiment and can be realized in other specific forms without departing from its spirit or essential characteristics. The embodiment should therefore be regarded in all respects as exemplary and non-limiting; the scope of the invention is defined by the appended claims rather than by the above description, and all changes that fall within the meaning and range of equivalents of the claims are intended to be embraced by the invention. Nothing in the claims should be construed as limiting the claim concerned.
In addition, it should be understood that although this specification is described in terms of embodiments, not every embodiment contains only one independent technical solution. The specification is written this way only for clarity; those skilled in the art should treat the specification as a whole, and the technical solutions of the embodiments may be suitably combined to form other embodiments that those skilled in the art can understand.

Claims (2)

1. A cloud-data-based analysis and situation awareness algorithm, characterized by comprising the following steps:
S1: obtain user-oriented target data to be screened; determine the attributes of the target data, the target data to be screened having score data and data category attribute information; the target data to be screened comprises first-type target data and second-type target data;
S2: pre-screen the target data to be screened and build an index on the data values of the upper-layer attribute: a B+ tree index structure if the data is numeric, an inverted index if it is character data;
S3: based on the data category attribute information, group the pre-screened target data by data category attribute and determine the type of the data attribute values; for numeric data, create a B+ tree index; for character-type attributes, build an inverted index structure;
S4: for each group of pre-screened target data, normalize the data scores according to the score data of the target data to generate the normalized scoring parameters of the target data; the normalized scoring parameters carry the target object ID of the target data, the data category ID, and the user ID of the user;
S5: obtain the normalized scoring parameters of the target data of multiple users; taking the position of the normalized scoring parameters as the parent node, perform depth and/or breadth traversal in the recommendation data set based on an ordered multiway tree, so as to output one or more suitable recommendations to the user;
S6: according to the data category ID, compute the similarity of the normalized scoring parameters of multiple target data of different users to obtain the value of a similarity measure;
S7: according to the value of the similarity measure, determine the degree of correlation between the target objects corresponding to the multiple target data.
2. The cloud-data-based analysis and situation awareness algorithm according to claim 1, characterized in that: the format of the user ID of the user and of the target object ID of the target data; determining whether the first target data is invalid data specifically includes: determining whether the playing duration of the first target data exceeds the effective playing-time threshold.

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810956569.3A CN109325027A (en) 2018-08-21 2018-08-21 Cloud-data-based analysis and situation awareness algorithm

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201810956569.3A CN109325027A (en) 2018-08-21 2018-08-21 Cloud-data-based analysis and situation awareness algorithm

Publications (1)

Publication Number Publication Date
CN109325027A (en) 2019-02-12

Family

ID=65264494

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810956569.3A Pending CN109325027A (en) 2018-08-21 2018-08-21 Cloud-data-based analysis and situation awareness algorithm

Country Status (1)

Country Link
CN (1) CN109325027A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113191852A (en) * 2021-05-19 2021-07-30 拉扎斯网络科技(上海)有限公司 Data verification method and device, storage medium and computer equipment

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103838775A (en) * 2012-11-27 2014-06-04 中国银联股份有限公司 Data analysis method and data analysis device
CN106484813A (en) * 2016-09-23 2017-03-08 广东港鑫科技有限公司 A kind of big data analysis system and method
CN107220382A (en) * 2017-06-28 2017-09-29 环球智达科技(北京)有限公司 Data analysing method

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20190212
