CN104133907A - Cloud computing data automatic classifying and counting method and system - Google Patents

Cloud computing data automatic classifying and counting method and system

Info

Publication number
CN104133907A
Authority
CN
China
Prior art keywords
data
type
classification
classified
message
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201410382816.5A
Other languages
Chinese (zh)
Inventor
康暖
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Opzoon Technology Co Ltd
Original Assignee
Opzoon Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Opzoon Technology Co Ltd
Priority to CN201410382816.5A
Publication of CN104133907A
Legal status: Pending

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30 Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/35 Clustering; Classification
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20 Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/28 Databases characterised by their database models, e.g. relational or object models
    • G06F16/284 Relational databases
    • G06F16/285 Clustering or classification

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention discloses a cloud computing data automatic classifying and counting method. The cloud computing data automatic classifying and counting method comprises the steps that a cloud storage device receives data and analyzes registration information of the data; the cloud storage device sends the data and the registration information of the data to a data counting center; the data counting center receives the data and the registration information and classifies the data according to the registration information; the classified data is sent back to the cloud storage device by the data counting center to be stored in a classified mode. According to the method, the data can be sent, analyzed and counted in real time, when a new data type occurs, a data category can be added automatically, and the situation that after a data counting result is obtained, a user adds the new data type and classification is conducted again is avoided; when the user needs to output data counting results, only the results of the registered data need to be output; the method is used for data classification and counting and is rapid and efficient, user experience is improved, the number of counting errors is reduced, and the data maintenance cost is reduced.

Description

Method and system for automatic classification and statistics of cloud computing data
Technical field
The present invention relates to the field of cloud computing, and in particular to a method and system for automatic classification and statistics of cloud computing data.
Background
Cloud computing storage is currently applied mainly to big data storage, and the chief use of the stored data is big data analysis that supports operational decisions in a cloud computing environment. In the current era of data explosion, extracting the mass data held in the cloud quickly and in real time is very important. Classification techniques are an important research area for data analysis and management in a cloud computing environment. Current data analysis uses a collect-then-sort approach: the data is collected first, and the data types are then divided manually to organize the data. Although widely adopted, this approach has the following problems:
1. Because sorting happens only after collection, too much data accumulates, so computation and analysis take too long and the user experience is poor;
2. Manual division of data types makes the classification insufficiently accurate; only a rough estimate within a fixed range is possible.
There is therefore an urgent need for a data classification and statistics method that sends, analyzes, and counts data in real time, and that automatically adds a data category when a new data type appears, so that the user does not have to wait for the statistics result.
Summary of the invention
In view of the above technical problems, the object of the present invention is to provide a method for automatic classification and statistics of cloud computing data that can send, analyze, and count data in real time, and that automatically adds a data category when a new data type appears, so that the user does not have to wait for the statistics result.
To achieve the above object, the technical solution of the present invention is as follows:
A method for automatic classification and statistics of cloud computing data, the method comprising the following steps:
A cloud storage device receives data and analyzes the registration information of the data;
The cloud storage device sends the data and the registration information of the data to a data statistics center;
The data statistics center receives the data and the registration information, and classifies the data according to the registration information;
The data statistics center sends the classified data back to the cloud storage device for classified storage; wherein the registration information comprises at least a data type, a keyword, and a data size.
Further, the step of classifying the data according to the registration information further comprises classifying the data according to the registration information by a multi-level classification method.
Further, the multi-level classification method comprises: first-level classification by data type; wherein the data types are the text type, picture type, compressed type, audio type, and video type.
Further, the multi-level classification method further comprises: when the first-level class is the text type, second-level classification is by the keyword of the data; when the first-level class is the picture type, second-level classification is by picture size and/or application; when the first-level class is the compressed type, second-level classification first scans the content of the compressed file and then classifies by the data types contained in the compressed file; when the first-level class is the audio type or the video type, second-level classification is by the user count recorded for the audio or video data.
Further, the step of first scanning the content of the compressed file and then classifying by the data types contained in it further comprises: performing third-level classification on the compressed file according to whether its content is of the text type, picture type, audio type, or video type.
Further, the method further comprises: the cloud storage device outputs the classified and stored data by category.
The present invention also provides a system for automatic classification and statistics of cloud computing data, the system comprising:
A cloud storage device, configured to receive data and analyze the registration information of the data, and further configured to send the data and the registration information of the data to a data statistics center;
A data statistics center, configured to receive the data and the registration information sent by the cloud storage device, to classify the data according to the registration information, and to send the classified data back to the cloud storage device for classified storage;
Wherein the registration information comprises at least a data type, a keyword, and a data size.
Further, the data statistics center also performs the following operation: classifying the data according to the registration information by a multi-level classification method.
Further, the classification of the data by the data statistics center according to the registration information by the multi-level classification method comprises: first-level classification by data type; wherein the data types are the text type, picture type, compressed type, audio type, and video type.
Further, the classification of the data by the data statistics center according to the registration information by the multi-level classification method further comprises: when the first-level class is the text type, second-level classification is by the keyword of the data; when the first-level class is the picture type, second-level classification is by picture size and/or application; when the first-level class is the compressed type, second-level classification first scans the content of the compressed file and then classifies by the data types contained in the compressed file; when the first-level class is the audio type or the video type, second-level classification is by the user count recorded for the audio or video data.
Further, the classification of the data by the data statistics center according to the registration information by the multi-level classification method further comprises: performing third-level classification on the compressed file according to whether its content is of the text type, picture type, audio type, or video type.
Further, the cloud storage device outputs the classified and stored data by category.
The beneficial effects of the method of the present invention are as follows: the method can send, analyze, and count data in real time; when a new data type appears, a data category is added automatically, so the user does not have to wait for the statistics result, add the new category, and classify again; when the user needs the statistics output, only the results for the registered data need to be output in real time; classifying and counting data with the method is fast and efficient, improves the user experience, reduces statistical errors, and lowers the cost of data maintenance.
Brief description of the drawings
Fig. 1 is a flow chart of the method for automatic classification and statistics of cloud computing data according to a preferred embodiment of the present invention;
Fig. 2 is a flow chart of the method for automatic classification and statistics of cloud computing data according to another preferred embodiment of the present invention;
Fig. 3 is a structural block diagram of the system for automatic classification and statistics of cloud computing data of the present invention.
Detailed description of the embodiments
To make the objects, technical solutions, and advantages of the present invention clearer, the present invention is described in more detail below with reference to embodiments and the accompanying drawings. It should be understood that these descriptions are exemplary and are not intended to limit the scope of the invention. In addition, descriptions of well-known structures and techniques are omitted below to avoid unnecessarily obscuring the concepts of the present invention.
Fig. 1 is a flow chart of the method for automatic classification and statistics of cloud computing data according to a preferred embodiment of the present invention.
As shown in Fig. 1, the method for automatic classification and statistics of cloud computing data of the present invention comprises the following steps:
Step S11: the cloud storage device receives data and analyzes the registration information of the data;
The cloud storage device can receive data in real time and, at the same time, analyze the registration information of the received data. The registration information includes the data type, keyword, data size, application, user count, and similar information. The cloud storage device partitions its own storage according to the number of types of data received; if data of a new type is received, a region is automatically added to the storage area to hold data of the new type. Within the storage region of each type, the same kind of second-level partitioning is then performed. For example, if the received data types are the text type and the picture type, the cloud storage device divides its storage area into two regions; when data of a type other than text and picture is received, such as the audio type, a region for storing audio data is added automatically, so that dynamic storage is achieved in the cloud storage device.
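The dynamic partitioning described in step S11 can be illustrated with a minimal in-memory sketch. The class name `PartitionedStore`, its methods, and the sample type and sub-region names are all assumptions made for illustration; the patent does not specify a data structure.

```python
class PartitionedStore:
    """Toy in-memory stand-in for the storage area of the cloud storage device."""

    def __init__(self):
        # first-level region (data type) -> second-level sub-region -> stored items
        self.regions = {}

    def put(self, data_type, sub_key, item):
        # A data type that has not been seen before automatically gets a new region.
        region = self.regions.setdefault(data_type, {})
        # Each region is partitioned again in the same way.
        region.setdefault(sub_key, []).append(item)

    def region_names(self):
        return list(self.regions)


store = PartitionedStore()
store.put("text", "invoice", "a.txt")
store.put("picture", "large", "b.png")
store.put("audio", "default", "c.wav")  # new type: a third region is added
print(store.region_names())  # ['text', 'picture', 'audio']
```

Using `dict.setdefault` means no region has to be declared in advance, which mirrors the idea that regions appear only when data of that type first arrives.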
Step S12: the cloud storage device sends the data and the registration information of the data to the data statistics center;
After analyzing the registration information of the data, the cloud storage device sends the data together with its registration information to the data statistics center.
Step S13: the data statistics center receives the data and the registration information, and classifies the data according to the registration information;
After receiving the data and the registration information, the data statistics center records the time at which they were received and classifies the received data, using a multi-level classification method.
The multi-level classification method specifically comprises: first-level classification by data type, wherein the data types are the text type, picture type, compressed type, audio type, and video type. After first-level classification is complete, second-level classification is performed. Specifically, when the first-level class is the text type, second-level classification is by the keyword of the data; when the first-level class is the picture type, second-level classification is by picture size and/or application; when the first-level class is the compressed type, second-level classification first scans the content of the compressed file and then classifies by the data types contained in the compressed file; when the first-level class is the audio type or the video type, second-level classification is by the user count recorded for the audio or video data.
Step S14: the data statistics center sends the classified data back to the cloud storage device for classified storage.
The cloud storage device opens up separate storage regions according to the number of categories and then stores the classified data by category. In addition, the data stored by category on the cloud storage device can be output by category as the user requires.
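The two-level classification of step S13 can be sketched as a single function over the registration information. This is only an illustration under stated assumptions: the `Registration` record, the `classify` function, the category strings, and the `LARGE_PICTURE_BYTES` threshold are hypothetical, and pictures are split by size only, which is one of the options the text allows.

```python
from dataclasses import dataclass


@dataclass
class Registration:
    """Registration information; field names are assumptions for this sketch."""
    data_type: str          # "text", "picture", "compressed", "audio" or "video"
    keyword: str = ""
    size: int = 0           # bytes
    application: str = ""
    user_count: int = 0


LARGE_PICTURE_BYTES = 1_000_000  # hypothetical size threshold, not from the patent


def classify(reg):
    """Return (first_level, second_level) for one piece of data."""
    first = reg.data_type
    if first == "text":
        second = f"keyword:{reg.keyword}"
    elif first == "picture":
        bucket = "large" if reg.size >= LARGE_PICTURE_BYTES else "small"
        second = f"size:{bucket}"
    elif first == "compressed":
        second = "scan-contents"  # the archive contents are examined separately
    elif first in ("audio", "video"):
        second = f"users:{reg.user_count}"
    else:
        second = "unclassified"
    return first, second


print(classify(Registration("text", keyword="invoice")))
print(classify(Registration("picture", size=2_000_000)))
```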
Fig. 2 is a flow chart of the method for automatic classification and statistics of cloud computing data according to another preferred embodiment of the present invention.
In Fig. 2, steps S21-S23 are the same as steps S11-S13 in Fig. 1 and are not described again here; the difference is that steps S24-S28 are added.
Step S24: determine whether the data is of the compressed type; if so, go to step S25 and perform third-level classification on it; if not, go to step S26 and perform second-level classification on it.
Step S25: because data of the compressed type is generally large, it needs to be classified further, and third-level classification can be performed on it: each file in the compressed file is classified at a third level according to the second-level classification of its file type. Specifically, when the data is of the compressed type, the content of the compressed file is scanned first and then classified by the data types it contains, which further comprises: performing third-level classification on the compressed file according to whether its content is of the text type, picture type, audio type, or video type.
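As an illustration of the scan in step S25, the sketch below counts the data types of the members of a ZIP archive using Python's standard `zipfile` module. The patent does not name an archive format or say how a member's type is detected; the ZIP format, the `scan_archive` function, and the extension table `EXTENSION_TYPES` are assumptions.

```python
import io
import zipfile
from collections import Counter
from pathlib import PurePosixPath

# Hypothetical mapping from file extension to first-level data type.
EXTENSION_TYPES = {
    ".txt": "text", ".png": "picture", ".jpg": "picture",
    ".zip": "compressed", ".wav": "audio", ".mp3": "audio", ".mp4": "video",
}


def scan_archive(archive_bytes):
    """Count the data types of the files contained in a ZIP archive."""
    counts = Counter()
    with zipfile.ZipFile(io.BytesIO(archive_bytes)) as zf:
        for info in zf.infolist():
            if info.is_dir():
                continue  # directories carry no data of their own
            suffix = PurePosixPath(info.filename).suffix.lower()
            counts[EXTENSION_TYPES.get(suffix, "unknown")] += 1
    return counts


# Build a small archive in memory so the example is self-contained.
buf = io.BytesIO()
with zipfile.ZipFile(buf, "w") as zf:
    zf.writestr("notes.txt", "hello")
    zf.writestr("photos/cat.png", b"\x89PNG")
    zf.writestr("clip.mp4", b"")

demo_counts = scan_archive(buf.getvalue())
print(dict(demo_counts))
```

Each member found this way could then be passed through the same second-level rule as a standalone file of its type, which is how the text describes the third level.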
Step S26: perform second-level classification on the data.
Step S27: to simplify classification, the data can be numbered by type and level.
For example, in the first-level classification by data type, the text type is labeled 1, the picture type 2, the compressed type 3, the audio type 4, and the video type 5.
In the second-level classification, text classified by keyword is labeled 1.1; pictures classified by size are labeled 2.1 and pictures classified by application are labeled 2.2; for the compressed type, text in a compressed file is labeled 3.1, pictures in a compressed file 3.2, compressed files in a compressed file 3.3, audio in a compressed file 3.4, and video in a compressed file 3.5; audio and video classified by user count are labeled 4.1 and 5.1 respectively.
In the third-level classification, text in a compressed file classified by keyword is labeled 3.1.1; pictures in a compressed file classified by size are labeled 3.2.1 and pictures in a compressed file classified by application are labeled 3.2.2; audio in a compressed file classified by user count is labeled 3.4.1, and video in a compressed file classified by user count is labeled 3.5.1.
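The numbering of step S27 follows a regular pattern, which the sketch below reproduces from two lookup tables. The function `label`, the table names, and the criterion names are assumptions for illustration; only the numbers themselves come from the description.

```python
# First-level numbers as given in the description.
FIRST_LEVEL = {"text": 1, "picture": 2, "compressed": 3, "audio": 4, "video": 5}

# Second-level numbers per type; the criterion names are hypothetical.
SECOND_LEVEL = {
    "text": {"keyword": 1},
    "picture": {"size": 1, "application": 2},
    "audio": {"user_count": 1},
    "video": {"user_count": 1},
}


def label(data_type, criterion=None, in_archive=False):
    """Build a dotted label such as '2.2' or '3.2.1'."""
    # Data found inside a compressed file is prefixed with the compressed type's number.
    parts = [FIRST_LEVEL["compressed"]] if in_archive else []
    parts.append(FIRST_LEVEL[data_type])
    if criterion is not None:
        parts.append(SECOND_LEVEL[data_type][criterion])
    return ".".join(str(p) for p in parts)


print(label("text"))                                  # 1
print(label("picture", "application"))                # 2.2
print(label("picture", "size", in_archive=True))      # 3.2.1
print(label("video", "user_count", in_archive=True))  # 3.5.1
```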
Step S28: the data statistics center sends the data back to the cloud storage device for classified storage.
The cloud storage device opens up separate storage regions according to the number of categories and then stores the classified data by category. In addition, the data stored by category on the cloud storage device can be output by category as the user requires.
Classifying data by type and level simplifies processing and makes operation easier. Because every file type is labeled as described here, the type of any piece of data can be determined from its label. When a file of a new type appears, a new type is added dynamically, so categories can be added or removed automatically according to the types of data present.
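One way to picture categories being added and removed dynamically is a small registry that hands out the next free first-level number when an unknown type appears. `TypeRegistry` and its methods are hypothetical; the patent does not say how numbers are assigned to new types.

```python
class TypeRegistry:
    """Assigns first-level numbers to data types, adding new types on demand."""

    def __init__(self, initial=("text", "picture", "compressed", "audio", "video")):
        self._numbers = {name: i for i, name in enumerate(initial, start=1)}

    def number_for(self, data_type):
        # An unknown type is registered automatically with the next free number.
        if data_type not in self._numbers:
            self._numbers[data_type] = max(self._numbers.values(), default=0) + 1
        return self._numbers[data_type]

    def remove(self, data_type):
        # Drop a category that is no longer needed; unknown names are ignored.
        self._numbers.pop(data_type, None)


registry = TypeRegistry()
print(registry.number_for("video"))        # 5
print(registry.number_for("spreadsheet"))  # 6, assigned on first sight
```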
Fig. 3 is a structural block diagram of the system for automatic classification and statistics of cloud computing data of the present invention. The system comprises:
A cloud storage device, configured to receive data and analyze the registration information of the data; further configured to send the data and the registration information of the data to the data statistics center; and able to output the stored data by category.
A data statistics center, configured to receive the data and the registration information sent by the cloud storage device and to classify the data according to the registration information (the classification may be single-level or multi-level); and configured to send the classified data back to the cloud storage device for classified storage.
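The interaction between the two components can be sketched as two cooperating classes. Everything here is an assumption made for illustration: the class and method names, the in-process call that stands in for network transfer, the extension-based type detection, the use of the file name stem as the keyword, and the single-level classification.

```python
import os
import time

# Hypothetical extension table used to derive the data type.
TYPE_BY_EXTENSION = {".txt": "text", ".png": "picture", ".zip": "compressed",
                     ".mp3": "audio", ".mp4": "video"}


class DataStatisticsCenter:
    def classify(self, data, registration):
        # Record when the data and registration information arrived, then classify.
        return {"category": registration["data_type"],
                "received_at": time.time(),
                "data": data}


class CloudStorageDevice:
    def __init__(self, center):
        self.center = center
        self.areas = {}  # category -> names of stored items

    def analyze(self, name, data):
        stem, ext = os.path.splitext(name)
        return {"data_type": TYPE_BY_EXTENSION.get(ext.lower(), "unknown"),
                "keyword": stem,  # assumption: file name stem used as keyword
                "size": len(data)}

    def receive(self, name, data):
        registration = self.analyze(name, data)                # analyze on receipt
        result = self.center.classify(data, registration)      # send and classify
        self.areas.setdefault(result["category"], []).append(name)  # store by class
        return result["category"]

    def output(self, category):
        return list(self.areas.get(category, []))


device = CloudStorageDevice(DataStatisticsCenter())
device.receive("report.txt", b"hello")
device.receive("logo.png", b"\x89PNG")
print(device.output("text"))  # ['report.txt']
```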
It should be understood that the above embodiments of the present invention are intended only to illustrate or explain the principles of the invention and do not limit it. Therefore, any modification, equivalent replacement, or improvement made without departing from the spirit and scope of the present invention shall fall within its scope of protection. In addition, the appended claims are intended to cover all variations and modifications that fall within the scope and boundaries of the claims, or the equivalents of such scope and boundaries.

Claims (10)

1. A method for automatic classification and statistics of cloud computing data, characterized in that the method comprises the following steps:
A cloud storage device receives data and analyzes the registration information of the data;
The cloud storage device sends the data and the registration information of the data to a data statistics center;
The data statistics center receives the data and the registration information, and classifies the data according to the registration information;
The data statistics center sends the classified data back to the cloud storage device for classified storage; wherein the registration information comprises at least a data type, a keyword, and a data size.
2. The method for automatic classification and statistics of cloud computing data according to claim 1, characterized in that the step of classifying the data according to the registration information further comprises classifying the data according to the registration information by a multi-level classification method.
3. The method for automatic classification and statistics of cloud computing data according to claim 2, characterized in that the multi-level classification method comprises: first-level classification by data type; wherein the data types are the text type, picture type, compressed type, audio type, and video type.
4. The method for automatic classification and statistics of cloud computing data according to claim 3, characterized in that the multi-level classification method further comprises: when the first-level class is the text type, second-level classification is by the keyword of the data; when the first-level class is the picture type, second-level classification is by picture size and/or application; when the first-level class is the compressed type, second-level classification first scans the content of the compressed file and then classifies by the data types contained in the compressed file; when the first-level class is the audio type or the video type, second-level classification is by the user count recorded for the audio or video data.
5. The method for automatic classification and statistics of cloud computing data according to claim 4, characterized in that the step of first scanning the content of the compressed file and then classifying by the data types contained in the compressed file further comprises: performing third-level classification on the compressed file according to whether its content is of the text type, picture type, audio type, or video type.
6. A system for automatic classification and statistics of cloud computing data, characterized in that the system comprises:
A cloud storage device, configured to receive data and analyze the registration information of the data, and further configured to send the data and the registration information of the data to a data statistics center;
A data statistics center, configured to receive the data and the registration information sent by the cloud storage device, to classify the data according to the registration information, and to send the classified data back to the cloud storage device for classified storage;
Wherein the registration information comprises at least a data type, a keyword, and a data size.
7. The system according to claim 6, characterized in that the data statistics center also performs the following operation: classifying the data according to the registration information by a multi-level classification method.
8. The system according to claim 7, characterized in that the classification of the data by the data statistics center according to the registration information by the multi-level classification method comprises: first-level classification by data type; wherein the data types are the text type, picture type, compressed type, audio type, and video type.
9. The system according to claim 8, characterized in that the classification of the data by the data statistics center according to the registration information by the multi-level classification method further comprises: when the first-level class is the text type, second-level classification is by the keyword of the data; when the first-level class is the picture type, second-level classification is by picture size and/or application; when the first-level class is the compressed type, second-level classification first scans the content of the compressed file and then classifies by the data types contained in the compressed file; when the first-level class is the audio type or the video type, second-level classification is by the user count recorded for the audio or video data.
10. The system according to claim 9, characterized in that the classification of the data by the data statistics center according to the registration information by the multi-level classification method further comprises: performing third-level classification on the compressed file according to whether its content is of the text type, picture type, audio type, or video type.
CN201410382816.5A 2014-08-06 2014-08-06 Cloud computing data automatic classifying and counting method and system Pending CN104133907A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201410382816.5A CN104133907A (en) 2014-08-06 2014-08-06 Cloud computing data automatic classifying and counting method and system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201410382816.5A CN104133907A (en) 2014-08-06 2014-08-06 Cloud computing data automatic classifying and counting method and system

Publications (1)

Publication Number Publication Date
CN104133907A true CN104133907A (en) 2014-11-05

Family

ID=51806585

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201410382816.5A Pending CN104133907A (en) 2014-08-06 2014-08-06 Cloud computing data automatic classifying and counting method and system

Country Status (1)

Country Link
CN (1) CN104133907A (en)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20120173642A1 (en) * 2009-02-17 2012-07-05 Tagle Information Technology Inc. Methods and Systems Using Taglets for Management of Data
CN102957720A (en) * 2011-08-23 2013-03-06 大连中软卓越信息技术有限公司 Mobile multimedia training platform based on cloud computing
US20130282330A1 (en) * 2012-04-20 2013-10-24 International Business Machines Corporation Comparing Event Data Sets
CN103458273A (en) * 2013-09-11 2013-12-18 上海美琦浦悦通讯科技有限公司 System and method of digital copyright control applied to multi-media file transmission

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105931155A (en) * 2016-04-26 2016-09-07 北京京师乐学教育科技有限公司 A student knowledge space system based on student growth archives and a management method thereof
CN107590273A (en) * 2017-09-27 2018-01-16 安徽硕威智能科技有限公司 Bank self-help robotic archival arranges save set
CN111399756A (en) * 2019-09-29 2020-07-10 杭州海康威视系统技术有限公司 Data storage method, data downloading method and device
CN111399756B (en) * 2019-09-29 2024-01-02 杭州海康威视系统技术有限公司 Data storage method, data downloading method and device

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20141105
