CN110378550A - The processing method of the extensive food data of multi-source based on distributed structure/architecture - Google Patents
The processing method of the extensive food data of multi-source based on distributed structure/architecture Download PDFInfo
- Publication number
- CN110378550A CN110378550A CN201910477552.4A CN201910477552A CN110378550A CN 110378550 A CN110378550 A CN 110378550A CN 201910477552 A CN201910477552 A CN 201910477552A CN 110378550 A CN110378550 A CN 110378550A
- Authority
- CN
- China
- Prior art keywords
- data
- period
- distributed
- index
- architecture
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 235000013305 food Nutrition 0.000 title claims abstract description 23
- 238000003672 processing method Methods 0.000 title claims abstract description 12
- 238000001514 detection method Methods 0.000 claims abstract description 14
- 238000000034 method Methods 0.000 claims abstract description 13
- 230000005856 abnormality Effects 0.000 claims abstract description 10
- 238000007405 data analysis Methods 0.000 claims description 9
- 238000012545 processing Methods 0.000 claims description 7
- 238000007689 inspection Methods 0.000 claims description 5
- 238000005498 polishing Methods 0.000 claims description 5
- 230000002123 temporal effect Effects 0.000 claims description 5
- 238000004364 calculation method Methods 0.000 claims description 4
- 238000003064 k means clustering Methods 0.000 claims description 4
- 230000000694 effects Effects 0.000 claims description 3
- 230000005540 biological transmission Effects 0.000 claims description 2
- 238000011156 evaluation Methods 0.000 claims description 2
- 230000008676 import Effects 0.000 claims description 2
- 238000002203 pretreatment Methods 0.000 claims description 2
- 238000004458 analytical method Methods 0.000 abstract description 6
- 230000002159 abnormal effect Effects 0.000 abstract 1
- 238000005516 engineering process Methods 0.000 description 4
- 241000406668 Loxodonta cyclotis Species 0.000 description 2
- 230000008901 benefit Effects 0.000 description 2
- 230000010354 integration Effects 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 238000011160 research Methods 0.000 description 2
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 1
- 238000013473 artificial intelligence Methods 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 239000003814 drug Substances 0.000 description 1
- 230000004927 fusion Effects 0.000 description 1
- 238000010801 machine learning Methods 0.000 description 1
- 238000013450 outlier detection Methods 0.000 description 1
- 230000001105 regulatory effect Effects 0.000 description 1
- 230000004044 response Effects 0.000 description 1
- 238000012552 review Methods 0.000 description 1
- 235000021404 traditional food Nutrition 0.000 description 1
- 238000012800 visualization Methods 0.000 description 1
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/10—File systems; File servers
- G06F16/18—File system types
- G06F16/182—Distributed file systems
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/25—Integrating or interfacing systems involving database management systems
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/28—Databases characterised by their database models, e.g. relational or object models
- G06F16/284—Relational databases
- G06F16/285—Clustering or classification
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/23—Clustering techniques
- G06F18/232—Non-hierarchical techniques
- G06F18/2321—Non-hierarchical techniques using statistics or function optimisation, e.g. modelling of probability density functions
- G06F18/23213—Non-hierarchical techniques using statistics or function optimisation, e.g. modelling of probability density functions with fixed number of clusters, e.g. K-means clustering
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q10/00—Administration; Management
- G06Q10/06—Resources, workflows, human or project management; Enterprise or organisation planning; Enterprise or organisation modelling
- G06Q10/063—Operations research, analysis or management
- G06Q10/0639—Performance analysis of employees; Performance analysis of enterprise or organisation operations
- G06Q10/06393—Score-carding, benchmarking or key performance indicator [KPI] analysis
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q10/00—Administration; Management
- G06Q10/06—Resources, workflows, human or project management; Enterprise or organisation planning; Enterprise or organisation modelling
- G06Q10/067—Enterprise or organisation modelling
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q50/00—Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
- G06Q50/10—Services
- G06Q50/26—Government or public services
Landscapes
- Engineering & Computer Science (AREA)
- Business, Economics & Management (AREA)
- Theoretical Computer Science (AREA)
- Human Resources & Organizations (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Databases & Information Systems (AREA)
- Data Mining & Analysis (AREA)
- Strategic Management (AREA)
- Economics (AREA)
- General Engineering & Computer Science (AREA)
- Educational Administration (AREA)
- Entrepreneurship & Innovation (AREA)
- Development Economics (AREA)
- Tourism & Hospitality (AREA)
- General Business, Economics & Management (AREA)
- Marketing (AREA)
- Operations Research (AREA)
- Quality & Reliability (AREA)
- Game Theory and Decision Science (AREA)
- Health & Medical Sciences (AREA)
- General Health & Medical Sciences (AREA)
- Primary Health Care (AREA)
- Probability & Statistics with Applications (AREA)
- Life Sciences & Earth Sciences (AREA)
- Artificial Intelligence (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Bioinformatics & Computational Biology (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Evolutionary Biology (AREA)
- Evolutionary Computation (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
Abstract
The invention discloses a kind of processing methods of extensive food data of the multi-source based on distributed structure/architecture.Content is as follows: on the basis of distributed structure/architecture, having built Distributed Computing Platform, has carried out distributed storage, operation to data based on Map-Reduce algorithm;For extensive multi-source food data, defines key index and space-time data abnormal problem clusters data using the space-time big data key index analysis method of proposition, divide.Key index is calculated, to realize abnormality detection.Algorithm includes the following steps: that Distributed Computing Platform is built;Data are directed into distributed file system;Data prediction is inputted as algorithm;Data object is clustered, index of the computing object in different dimensions;Visualize index analysis result.The present invention can efficiently solve traditional single object time series data method for detecting abnormality there are the problem of so that detection accuracy and speed are all effectively promoted.
Description
Technical field
The invention belongs to field of computer technology, more particularly to a kind of extensive food of the multi-source based on distributed structure/architecture
The key index parser and system of data.
Background technique
In information-based highly developed today, interested regulatory authorities can effectively integrate the big of food industries Lian Zhong each department
Scale data resource;For example, Food Inspection data and food and medicine complain data etc..On the whole, this kind of data resource presents
Data scale is big, institutional framework is complicated and the diversified feature of form of presentation.Although being passed along with the development of artificial intelligence technology
Advantage of the machine learning algorithm of system in big data processing has been embodied in all kinds of fields of industrialization.But in face of multi-source heterogeneous
Multiple database data fusion and Cooperative Analysis, still suffer from that data redudancy is high, form of presentation disunity (such as retouch by text
State and numerical statistic) etc. difficulties.Towards the classical Space-time Model big data with " multi-source heterogeneous flow data " characteristic, how data are used
Integration carries out research modeling with key technologies such as space-time data analyses to be connected to " each department's information island ", and it is potential sufficiently to excavate its
Risk.
In data research field, space-time big data refers to that spatially mutually independent several objects are produced in different moments
Raw data, and each object then forms a time series data, also known as flow data in the data that different moments generate.
The time series data abnormality detection of traditional single object only considers single object, and the data dimension of consideration is also relatively low, is unable to fully
Excavate hiding effective information.All there are some problems in detection accuracy and speed in the algorithm of standalone.It needs to develop
It is a kind of more accurate, detection algorithm more efficiently.
Summary of the invention
Technical problem: the present invention existing problem and shortage in view of the above technology provides a kind of based on distributed structure/architecture
The extensive food data of multi-source key index parser.Realize to the distributed storage of extensive multi-source data, reading,
Operation, and this concept of key index is utilized, modeling analysis is carried out to data, according to the key index of food data, Lai Jinhang
Data analysis, abnormality detection.
Technical solution: for achieving the above object, a kind of multi-source based on distributed structure/architecture of the invention is eaten on a large scale
The technical solution that the processing method of product data uses includes the following steps:
Step 1: writing code builds Distributed Computing Platform;
Step 2: the extensive food inspection flow data of multi-source is directed into distributed file system from MySQL database
HDFS (Hadoop Distributed File System), is stored with text file, with being for data processing, analyze;
Step 3: the data being stored in HDFS are pre-processed, the data object of each batch, according to temporal characteristics
It is divided, is divided into different time sections, while carrying out dimension polishing, guarantee each object period dimension having the same.
Step 4: to each period of each object, clustering is carried out, using K-Means clustering algorithm, is obtained pair
The cluster centre point and class cluster number answered.
Step 5: calculating separately the index of the n class cluster of each period to each object.Here index definition is
Data departure degree.Each object is calculated in accounting of the data in inhomogeneity cluster of each period, it is different right to finally obtain
The index series of elephant.
Step 6: visualized according to obtained index series, so that the fluctuation situation of each object is intuitively observed,
Realize data analysis and abnormality detection.
Wherein:
It is described step 1: writing code builds Distributed Computing Platform;Based on Hadoop environment, configured in server end
Include a master node, the Distributed Computing Platform of three slave nodes, for realizing Distributed Parallel Computing.
It is described step 2: writing data imports scripted code, by the extensive food inspection flow data of multi-source from original
MySQL database is directed into distributed file system HDFS, is stored with text file, with being for data processing, analyze.
It is described step 3: the data being stored in HDFS are pre-processed, pre-treatment step are as follows: to the number of each batch
It according to object, is divided according to temporal characteristics, is divided into different time sections, while carrying out dimension polishing, guarantee each object tool
There is identical period dimension.
It is described step 4: each period to each object, carries out clustering, clustering method are as follows: use K-Means
Clustering algorithm first initializes cluster centre for each period, iterates to calculate new cluster centre until convergence, obtains pair
The final cluster centre point and class cluster number answered.
It is described step 5: calculate separately the index of the n class cluster of each period to each object, index here is fixed
Justice is data departure degree, calculation method are as follows: each class cluster to each object in each period, to the cluster centre of class cluster
It is weighted summation, the evaluation index as respective class cluster;Each object is calculated again in the data of each period in inhomogeneity
Accounting in cluster, calculation method are as follows: using class number of clusters where object divided by total class number of clusters;Finally obtain the index sequence of different objects
Column.
7. the processing method of the extensive food data of the multi-source according to claim 1 based on distributed structure/architecture,
It is characterized in that: described step 6: being visualized according to obtained index series, method for visualizing are as follows: by obtained index
Data show that display effect is different objects with json format transmission to system webpage web terminal using Echarts after parsing
Corresponding histogram, horizontal axis indicate the corresponding period, and the longitudinal axis indicates corresponding key index value, to intuitively observe each
The fluctuation situation of object realizes data analysis and abnormality detection.
The utility model has the advantages that the invention proposes one kind sides of efficiently solving aiming at the problem that traditional food data analysis technique
Case.Innovatively using servicing based on Distributed Computing Platform, on the basis of Distributed Computing Platform, a kind of base is devised
In the data analysis of key index and Outlier Detection Algorithm.The present invention not only has certain amplitude promotion in detection accuracy, simultaneously
Based on Distributed Computing Platform, standalone algorithm is compared, processing speed of the invention and efficiency also all have a distinct increment.Effectively solution
The analysis and abnormality detection problem of the extensive food data of multi-source of having determined.
Detailed description of the invention
Fig. 1 is the flow chart of the method for the present invention,
Fig. 2 is the integrated stand composition for the risk evaluating system that the method for the present invention is built,
Fig. 3 is the method for the present invention simplified diagram,
Fig. 4 is the effect of visualization figure of the method for the present invention.
Specific embodiment
In the following with reference to the drawings and specific embodiments, the present invention is furture elucidated, it should be understood that these embodiments are merely to illustrate
It the present invention rather than limits the scope of the invention, after the present invention has been read, those skilled in the art are to of the invention each
The modification of kind equivalent form falls within the application range as defined in the appended claims.
Step 1: writing code builds Distributed Computing Platform.
Step 2: data are directed into distributed file system HDFS from MySQL, stored with text file.It is used as
Data processing, analysis.The each field of initial text data with "=" separate, wherein the 2nd, 3,5,6 field be respectively the review time,
Enterprise ID, detected value in a certain respect, standard value.Data about 60,000, respectively 2017 6,8,9,10,11, December it is daily
Detection data.
Step 3: the data being stored in HDFS are pre-processed, the data object of each batch, according to temporal characteristics
It is divided, is divided into different time sections, while carrying out dimension polishing, guarantee each object period dimension having the same.
Data are integrated according to division methods, each interfield with ", " separate, respectively indicate enterprise ID, the time (specific to day), with
And the data of different dimensions.Number of data is 3665 after integration, and every data dimension is 11 dimensions.
Step 4: to each period of each object, clustering is carried out, using K-Means clustering algorithm, is obtained pair
The cluster centre point and class cluster number answered.Enabling period T is one month, uses K-Means to above-mentioned processed data respectively
Algorithm is clustered.New data are obtained, new data packets contain 4 fields, respectively period (specific to the moon), corresponding data institute
The cluster centre of category class cluster, enterprise ID, 11 dimension data of enterprise, these fields are separated by ", ", and number of data remains as 3665
Item.
Step 5: calculating separately the index of the n class cluster of each period to each object.Here index definition is
Data departure degree.Each object is calculated in accounting of the data in inhomogeneity cluster of each period, it is different right to finally obtain
The index series of elephant.Calculate separately 3 class clusters of each period index and each enterprise each period number
According to the accounting in inhomogeneity cluster, different enterprise's bath index series in different time periods are finally obtained, output data is 3 words
Section, respectively enterprise ID, period, response time short index.Totally 231 data.
Step 6: visualized according to obtained index series, so that the fluctuation situation of each object is intuitively observed,
Realize data analysis and abnormality detection.Output data includes two fields, respectively exception enterprise ID and corresponding business indicators
Fluctuation.
The above is only the preferred embodiment of the present invention, protection scope of the present invention is not limited merely to above-described embodiment,
All technical solutions belonged under thinking of the present invention all belong to the scope of protection of the present invention.It should be pointed out that for the art
For those of ordinary skill, several improvements and modifications without departing from the principles of the present invention should be regarded as protection of the invention
Range.
Claims (7)
1. a kind of processing method of the extensive food data of multi-source based on distributed structure/architecture, it is characterised in that: the place of the data
Reason method the following steps are included:
Step 1: writing code builds Distributed Computing Platform;
Step 2: the extensive food inspection flow data of multi-source is directed into distributed file system HDFS from MySQL database,
It is stored with text file, with being for data processing, analyze;
Step 3: the data being stored in HDFS are pre-processed, the data object of each batch is carried out according to temporal characteristics
It divides, is divided into different time sections, while carrying out dimension polishing, guarantees each object period dimension having the same;
Step 4: each period of each object is carried out clustering and is obtained corresponding using K-Means clustering algorithm
Cluster centre point and class cluster number;
Step 5: calculating separately the index of the n class cluster of each period to each object;Here index definition is data
Departure degree.Each object is calculated in accounting of the data in inhomogeneity cluster of each period, finally obtains different objects
Index series;
Step 6: being visualized according to obtained index series, to intuitively observe the fluctuation situation of each object, realize
Data analysis and abnormality detection.
2. the processing method of the extensive food data of the multi-source according to claim 1 based on distributed structure/architecture, feature
It is: described step 1: writing code builds Distributed Computing Platform;Based on Hadoop environment, configures and wrap in server end
Containing a master node, the Distributed Computing Platform of three slave nodes, for realizing Distributed Parallel Computing.
3. the processing method of the extensive food data of the multi-source according to claim 1 based on distributed structure/architecture, feature
It is: described step 2: writing data imports scripted code, by the extensive food inspection flow data of multi-source from original MySQL
Database is directed into distributed file system HDFS, is stored with text file, with being for data processing, analyze.
4. the processing method of the extensive food data of the multi-source according to claim 1 based on distributed structure/architecture, feature
It is: described step 3: the data being stored in HDFS are pre-processed, pre-treatment step are as follows: to the data of each batch
Object is divided according to temporal characteristics, is divided into different time sections, while carrying out dimension polishing, guarantees that each object has
Identical period dimension.
5. the processing method of the extensive food data of the multi-source according to claim 1 based on distributed structure/architecture, feature
It is: described step 4: each period to each object, carries out clustering, clustering method are as follows: poly- using K-Means
Class algorithm first initializes cluster centre for each period, iterates to calculate new cluster centre until convergence, is corresponded to
Final cluster centre point and class cluster number.
6. the processing method of the extensive food data of the multi-source according to claim 1 based on distributed structure/architecture, feature
It is: described step 5: calculating separately the index of the n class cluster of each period to each object, index definition here is
Data departure degree, calculation method are as follows: each class cluster to each object in each period carries out the cluster centre of class cluster
Weighted sum, the evaluation index as respective class cluster;Each object is calculated again in the data of each period in inhomogeneity cluster
Accounting, calculation method are as follows: using class number of clusters where object divided by total class number of clusters;Finally obtain the index series of different objects.
7. the processing method of the extensive food data of the multi-source according to claim 1 based on distributed structure/architecture, feature
It is: described step 6: being visualized according to obtained index series, method for visualizing are as follows: by obtained achievement data
It with json format transmission to system webpage web terminal, is shown after parsing using Echarts, display effect is corresponding for different objects
Histogram, horizontal axis indicates corresponding period, and the longitudinal axis indicates corresponding key index value, to intuitively observe each object
Fluctuation situation, realize data analysis and abnormality detection.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910477552.4A CN110378550A (en) | 2019-06-03 | 2019-06-03 | The processing method of the extensive food data of multi-source based on distributed structure/architecture |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910477552.4A CN110378550A (en) | 2019-06-03 | 2019-06-03 | The processing method of the extensive food data of multi-source based on distributed structure/architecture |
Publications (1)
Publication Number | Publication Date |
---|---|
CN110378550A true CN110378550A (en) | 2019-10-25 |
Family
ID=68249712
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201910477552.4A Pending CN110378550A (en) | 2019-06-03 | 2019-06-03 | The processing method of the extensive food data of multi-source based on distributed structure/architecture |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN110378550A (en) |
Citations (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2013153027A1 (en) * | 2012-04-12 | 2013-10-17 | Telefónica, S.A. | Method and system for streaming processing in a map and reduce distributed computing platform |
CN104142986A (en) * | 2014-07-24 | 2014-11-12 | 中国软件与技术服务股份有限公司 | Big data situation analysis early warning method and system based on clustering |
CN105138661A (en) * | 2015-09-02 | 2015-12-09 | 西北大学 | Hadoop-based k-means clustering analysis system and method of network security log |
CN105678398A (en) * | 2015-12-24 | 2016-06-15 | 国家电网公司 | Power load forecasting method based on big data technology, and research and application system based on method |
CN107193967A (en) * | 2017-05-25 | 2017-09-22 | 南开大学 | A kind of multi-source heterogeneous industry field big data handles full link solution |
CN109101632A (en) * | 2018-08-15 | 2018-12-28 | 中国人民解放军海军航空大学 | Product quality abnormal data retrospective analysis method based on manufacture big data |
CN109271421A (en) * | 2018-09-20 | 2019-01-25 | 重庆第二师范学院 | A kind of large data clustering method based on MapReduce |
CN109657712A (en) * | 2018-12-11 | 2019-04-19 | 浙江工业大学 | A kind of electric business food and drink data analysing method based on the improved K-Means algorithm of Spark |
-
2019
- 2019-06-03 CN CN201910477552.4A patent/CN110378550A/en active Pending
Patent Citations (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2013153027A1 (en) * | 2012-04-12 | 2013-10-17 | Telefónica, S.A. | Method and system for streaming processing in a map and reduce distributed computing platform |
CN104142986A (en) * | 2014-07-24 | 2014-11-12 | 中国软件与技术服务股份有限公司 | Big data situation analysis early warning method and system based on clustering |
CN105138661A (en) * | 2015-09-02 | 2015-12-09 | 西北大学 | Hadoop-based k-means clustering analysis system and method of network security log |
CN105678398A (en) * | 2015-12-24 | 2016-06-15 | 国家电网公司 | Power load forecasting method based on big data technology, and research and application system based on method |
CN107193967A (en) * | 2017-05-25 | 2017-09-22 | 南开大学 | A kind of multi-source heterogeneous industry field big data handles full link solution |
CN109101632A (en) * | 2018-08-15 | 2018-12-28 | 中国人民解放军海军航空大学 | Product quality abnormal data retrospective analysis method based on manufacture big data |
CN109271421A (en) * | 2018-09-20 | 2019-01-25 | 重庆第二师范学院 | A kind of large data clustering method based on MapReduce |
CN109657712A (en) * | 2018-12-11 | 2019-04-19 | 浙江工业大学 | A kind of electric business food and drink data analysing method based on the improved K-Means algorithm of Spark |
Non-Patent Citations (1)
Title |
---|
祖晓晖: "基于数据挖掘的智慧健康服务平台设计与实现", 《中国优秀硕士学位论文全文数据库 信息科技辑》 * |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US20220358528A1 (en) | Methods and apparatus for self-adaptive time series forecasting engine | |
Arbesser et al. | Visplause: Visual data quality assessment of many time series using plausibility checks | |
CN106548381A (en) | Intelligent subscriber tag systems and implementation method | |
CA3042926A1 (en) | Technology incident management platform | |
US20140372175A1 (en) | Method and system for detection, classification and prediction of user behavior trends | |
CN110647512B (en) | Data storage and analysis method, device, equipment and readable medium | |
del Río-Chanona et al. | Trends of the world input and output network of global trade | |
Guerrero-Prado et al. | The power of big data and data analytics for AMI data: A case study | |
Rosário et al. | Industry 4.0 and marketing: towards an integrated future research agenda | |
CN110532152A (en) | A kind of monitoring alarm processing method and system based on Kapacitor computing engines | |
CN104598449A (en) | Preference-based clustering | |
Zhao et al. | Visual analysis system for market sales data of agricultural products | |
CN113032403A (en) | Data insight method, device, electronic equipment and storage medium | |
CN116009495A (en) | Resource model establishment method, device, equipment and medium based on digital twin | |
Ahmed | Applying Big Data analytics for energy efficiency. | |
Martin et al. | Service systems, smart service systems and cyber-physical systems—what’s the difference? towards a unified terminology | |
Sawalha et al. | Towards an efficient big data management schema for IoT | |
Li et al. | Research on the evaluation of real estate inventory management in China | |
CN109285063A (en) | A kind of credit card equity tendency judgement system based on big data analysis | |
CN108154380A (en) | The method for carrying out the online real-time recommendation of commodity to user based on extensive score data | |
Gumz et al. | Using industry 4.0’s big data and IoT to perform feature-based and past data-based energy consumption predictions | |
CN110378550A (en) | The processing method of the extensive food data of multi-source based on distributed structure/architecture | |
CN110489732A (en) | Method for processing report data and equipment | |
Neramballi et al. | Effects of a design support on practitioners designing a Product/Service System–a case study | |
Gumelar | An Anatomy of Machine Learning Data Visualization |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination |