WO2017195943A1 - Procédé et dispositif d'identification de données d'un serveur de stockage destinés à un service ayant une temporalité élevée avec une relation humaine - Google Patents
Procédé et dispositif d'identification de données d'un serveur de stockage destinés à un service ayant une temporalité élevée avec une relation humaine Download PDFInfo
- Publication number
- WO2017195943A1 WO2017195943A1 PCT/KR2016/010801 KR2016010801W WO2017195943A1 WO 2017195943 A1 WO2017195943 A1 WO 2017195943A1 KR 2016010801 W KR2016010801 W KR 2016010801W WO 2017195943 A1 WO2017195943 A1 WO 2017195943A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- file
- files
- data
- storage server
- popularity
- Prior art date
Links
- 238000000034 method Methods 0.000 title claims abstract description 30
- 238000012545 processing Methods 0.000 description 10
- 238000005516 engineering process Methods 0.000 description 5
- 238000003491 array Methods 0.000 description 1
- 238000004422 calculation algorithm Methods 0.000 description 1
- 238000004364 calculation method Methods 0.000 description 1
- 238000004891 communication Methods 0.000 description 1
- 238000004590 computer program Methods 0.000 description 1
- 238000013135 deep learning Methods 0.000 description 1
- 238000011156 evaluation Methods 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 230000000737 periodic effect Effects 0.000 description 1
- 230000004044 response Effects 0.000 description 1
- 230000000630 rising effect Effects 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q50/00—Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/10—File systems; File servers
- G06F16/11—File system administration, e.g. details of archiving or snapshots
- G06F16/113—Details of archiving
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/10—File systems; File servers
- G06F16/18—File system types
- G06F16/185—Hierarchical storage management [HSM] systems, e.g. file migration or policies thereof
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/22—Indexing; Data structures therefor; Storage structures
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/35—Clustering; Classification
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q50/00—Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
- G06Q50/01—Social networking
Definitions
- the data classification method of the storage server is the step of updating the priority of each of the files based on the time when each of the files are stored in the storage server and the current popularity of each of the files
- the dividing may further include dividing each of the files into hot data and cold data based on the priority of each of the updated files.
- the dividing into the hot data and the cold data may be divided into the hot data and the cold data based on the priority of each of the files based on a storage tiering policy.
- an apparatus for classifying data of a storage server may include: a storage unit configured to store a three-dimensional data structure in consideration of a human relationship and popularity corresponding to the new file when the new file is stored in the storage server; A setting unit for setting a priority of the new file based on the popularity of the new file; And a division unit that divides each of the files into hot data and cold data based on a priority of each of the files stored in the storage server.
- Embodiments of the present invention are to classify a file of a storage server for providing a high level of human relations and time, such as a social network service, into hot data and cold data, and a three-dimensional data structure considering human relations and popularity.
- a storage server for providing a high level of human relations and time, such as a social network service, into hot data and cold data, and a three-dimensional data structure considering human relations and popularity.
- celebrities have a higher frequency of file exposure than the general public. For example, files uploaded by the public are often exposed for a week, while files uploaded by celebrities are often exposed for more than a month.
- the priority of the new file is set based on the popularity of the new file (S120).
- the priority of the new file may be the same as the popularity value at the time when the new file is uploaded.
- the priority of each of the files is not limited to the popularity of the file, but is calculated only by the time (or period) stored in the storage server, the popularity of the person who uploaded the file, the popularity of the file, The frequency of contact between accounts related to the file, the extent of the relationship between the file, the frequency of recent access to the file, the frequency of recent exposure to the file, the age of the file and the attributes of the file may vary. That is, the priority for each of the files may be calculated based on at least one of the above-described parameters.
- the contact frequency between accounts may include the number of times of sending and receiving e-mails and messages
- the attributes of the file may include live time and active time.
- step S140 If it is determined in step S140 that the priority of the file is equal to or higher than the reference rank, the file is classified as hot data, otherwise, the file is classified as cold data (S150 and S160).
- the file uploaded to the storage server with the above-described three-dimensional data structure has a priority of the same value as the popularity of the original file, and this priority can be reduced by a certain ratio over time, and the priority of the file is based on If it falls further than the rank, the file can be classified as cold data.
- the access ratio of the file can be predicted efficiently without complicated calculation, and it is also easy to use in a large storage server.
- the newly created files that is, files 1 to 3 are located at the same level of popularity because they have a priority of the same value as the popularity of the file, and these files decrease with a certain time period or unit at a predetermined rate over time. can do.
- file 1 has a priority decrease of the file over time by a certain rate, and the file 2 has a higher popularity as time goes on. You can see that it rose. In this case, the file 2 also has a higher priority due to an increase in the popularity of the file.
- File 3 is a case in which only the file is popular compared to popularity, and there is a value that can be prioritized higher than popularity in proportion to popularity, and this value is the maximum value that can be increased. It can be set to increase the maximum value.
- File 4 is a file classified as cold data because the value that continues to fall over time becomes smaller than a certain value, that is, smaller than the reference rank.
- File 5 is a file classified as cold data, but has a high priority due to unexpected popularity. Rising file is hot data.
- the method according to the embodiment of the present invention stores a file in a three-dimensional data structure in consideration of human relations and time, and thus reduces the overall cost by reducing the ratio of high performance storage servers in operating services affected by human relations and time. Can be saved.
- FIG. 5 illustrates a configuration of a data classification device of a storage server according to an embodiment of the present invention, and illustrates a configuration of an apparatus for performing the operations of FIGS. 1 to 4.
- the storage unit 510 expresses the human relations of the file in two dimensions, and expresses the popularity of the file in the two-dimensional human relations, thereby storing the three-dimensional data structure when saving a new file. It can be calculated by various methods, such as the number of friends or followers, scores, and the total number of likes of a file.
- the division unit 540 divides each of the files into hot data and cold data based on the priority of each of the files stored in the storage server.
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Business, Economics & Management (AREA)
- Data Mining & Analysis (AREA)
- Databases & Information Systems (AREA)
- General Engineering & Computer Science (AREA)
- Economics (AREA)
- Health & Medical Sciences (AREA)
- General Health & Medical Sciences (AREA)
- Human Resources & Organizations (AREA)
- Marketing (AREA)
- Primary Health Care (AREA)
- Strategic Management (AREA)
- Tourism & Hospitality (AREA)
- General Business, Economics & Management (AREA)
- Computing Systems (AREA)
- Software Systems (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Information Transfer Between Computers (AREA)
Abstract
La présente invention concerne un procédé et un dispositif d'identification de données d'un serveur de stockage destinés à un service ayant une temporalité élevée avec une relation humaine. Selon un mode de réalisation de la présente invention, le procédé d'identification de données d'un serveur de stockage comprend les étapes suivantes consistant : à stocker un nouveau fichier dans une structure de données tridimensionnelle en prenant en compte une relation humaine et une cote de popularité correspondant au nouveau fichier lorsque le nouveau fichier est stocké dans le serveur de stockage ; à régler la priorité du nouveau fichier sur la base de la cote de popularité du nouveau fichier ; et à identifier chaque fichier stocké dans le serveur de stockage en tant que données chaudes ou données froides sur la base de la priorité de chaque fichier.
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
KR1020160056794A KR101825294B1 (ko) | 2016-05-10 | 2016-05-10 | 인간관계와 시간성이 높은 서비스를 위한 스토리지 서버의 데이터 구분 방법 및 장치 |
KR10-2016-0056794 | 2016-05-10 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2017195943A1 true WO2017195943A1 (fr) | 2017-11-16 |
Family
ID=60267468
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/KR2016/010801 WO2017195943A1 (fr) | 2016-05-10 | 2016-09-27 | Procédé et dispositif d'identification de données d'un serveur de stockage destinés à un service ayant une temporalité élevée avec une relation humaine |
Country Status (2)
Country | Link |
---|---|
KR (1) | KR101825294B1 (fr) |
WO (1) | WO2017195943A1 (fr) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110442309A (zh) * | 2019-07-24 | 2019-11-12 | 广东紫晶信息存储技术股份有限公司 | 一种基于光存储的冷热数据交换方法及系统 |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR102175176B1 (ko) * | 2017-12-29 | 2020-11-06 | 한양대학교 산학협력단 | 문자 종류 개수에 기반한 데이터 구분 방법, 데이터 분류기 및 스토리지 시스템 |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20140188947A1 (en) * | 2012-12-31 | 2014-07-03 | Teradata Corporation | Data storage management based on indicated storage levels and other criteria for multilevel storage systems |
US20150052180A1 (en) * | 2012-08-15 | 2015-02-19 | Facebook, Inc. | File storage system based on coordinated exhaustible and non-exhaustible storage |
KR20160014111A (ko) * | 2010-12-30 | 2016-02-05 | 페이스북, 인크. | 그래프 데이터용 분산형 캐시 |
US20160092353A1 (en) * | 2014-09-25 | 2016-03-31 | Robert C. Swanson | Establishing cold storage pools from aging memory |
-
2016
- 2016-05-10 KR KR1020160056794A patent/KR101825294B1/ko active IP Right Grant
- 2016-09-27 WO PCT/KR2016/010801 patent/WO2017195943A1/fr active Application Filing
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR20160014111A (ko) * | 2010-12-30 | 2016-02-05 | 페이스북, 인크. | 그래프 데이터용 분산형 캐시 |
US20150052180A1 (en) * | 2012-08-15 | 2015-02-19 | Facebook, Inc. | File storage system based on coordinated exhaustible and non-exhaustible storage |
US20140188947A1 (en) * | 2012-12-31 | 2014-07-03 | Teradata Corporation | Data storage management based on indicated storage levels and other criteria for multilevel storage systems |
US20160092353A1 (en) * | 2014-09-25 | 2016-03-31 | Robert C. Swanson | Establishing cold storage pools from aging memory |
Non-Patent Citations (1)
Title |
---|
KHIL, KI JEONG ET AL.: "Hot and Cold Data Replacement Method for Hybrid Storage System", JOURNAL OF ADVANCED INFORMATION TECHNOLOGY AND CONVERGENCE, vol. 11, no. 1, January 2013 (2013-01-01), pages 135 - 142 * |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110442309A (zh) * | 2019-07-24 | 2019-11-12 | 广东紫晶信息存储技术股份有限公司 | 一种基于光存储的冷热数据交换方法及系统 |
Also Published As
Publication number | Publication date |
---|---|
KR101825294B1 (ko) | 2018-02-02 |
KR20170126587A (ko) | 2017-11-20 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US10778756B2 (en) | Location of actor resources | |
US10516585B2 (en) | System and method for network information mapping and displaying | |
US11902173B2 (en) | Dynamic allocation of network resources using external inputs | |
US10048996B1 (en) | Predicting infrastructure failures in a data center for hosted service mitigation actions | |
CN111538558B (zh) | 用于自动选择安全虚拟机的系统和方法 | |
JP2022552034A (ja) | クラスタリング方法及び装置、電子機器並びに記憶媒体 | |
WO2012148067A1 (fr) | Procédé et appareil pour distribuer et stocker une pluralité de copies dans un système de stockage en nuage | |
CN110753112A (zh) | 云服务的弹性伸缩方法和装置 | |
US7856626B2 (en) | Method of refactoring methods within an application | |
WO2013122338A1 (fr) | Procédé d'indexation et de recherche distribuées pour analyser efficacement des données de série chronologique dans des systèmes de recherche | |
US20160019090A1 (en) | Data processing control method, computer-readable recording medium, and data processing control device | |
WO2017195943A1 (fr) | Procédé et dispositif d'identification de données d'un serveur de stockage destinés à un service ayant une temporalité élevée avec une relation humaine | |
CN111459650B (zh) | 管理专用处理资源的存储器的方法、设备和介质 | |
WO2018122961A1 (fr) | Système, procédé de gestion de données, et serveur de fichiers | |
WO2019225799A1 (fr) | Procédé et dispositif de suppression d'informations d'utilisateur à l'aide d'un modèle génératif d'apprentissage profond | |
CN110610450A (zh) | 数据处理方法、电子设备和计算机可读存储介质 | |
US20210389877A1 (en) | Identifying host functionalities based on process characterization | |
CN114461369A (zh) | 一种面向复杂应用场景的自适应数据调度系统及方法 | |
US9747451B2 (en) | File system modification | |
CN113934361A (zh) | 用于管理存储系统的方法、设备和计算机程序产品 | |
WO2018070732A1 (fr) | Procédé et système de service pour réseau thématique présentant la structure arborescente d'un hashtag | |
Guo et al. | Learning-based characterizing and modeling performance bottlenecks of big data workloads | |
CN216352283U (zh) | 一种适用于边缘计算环境的请求过滤装置 | |
CN115022094B (zh) | 一种便于了解单位内部计算机使用情况的监控系统 | |
US11783325B1 (en) | Removal probability-based weighting for resource access |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 16901777 Country of ref document: EP Kind code of ref document: A1 |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 16901777 Country of ref document: EP Kind code of ref document: A1 |