WO2017195943A1 - Procédé et dispositif d'identification de données d'un serveur de stockage destinés à un service ayant une temporalité élevée avec une relation humaine - Google Patents

Procédé et dispositif d'identification de données d'un serveur de stockage destinés à un service ayant une temporalité élevée avec une relation humaine Download PDF

Info

Publication number
WO2017195943A1
WO2017195943A1 PCT/KR2016/010801 KR2016010801W WO2017195943A1 WO 2017195943 A1 WO2017195943 A1 WO 2017195943A1 KR 2016010801 W KR2016010801 W KR 2016010801W WO 2017195943 A1 WO2017195943 A1 WO 2017195943A1
Authority
WO
WIPO (PCT)
Prior art keywords
file
files
data
storage server
popularity
Prior art date
Application number
PCT/KR2016/010801
Other languages
English (en)
Korean (ko)
Inventor
이재면
강경태
Original Assignee
한양대학교 에리카산학협력단
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 한양대학교 에리카산학협력단 filed Critical 한양대학교 에리카산학협력단
Publication of WO2017195943A1 publication Critical patent/WO2017195943A1/fr

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/10File systems; File servers
    • G06F16/11File system administration, e.g. details of archiving or snapshots
    • G06F16/113Details of archiving
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/10File systems; File servers
    • G06F16/18File system types
    • G06F16/185Hierarchical storage management [HSM] systems, e.g. file migration or policies thereof
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/22Indexing; Data structures therefor; Storage structures
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/35Clustering; Classification
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
    • G06Q50/01Social networking

Definitions

  • the data classification method of the storage server is the step of updating the priority of each of the files based on the time when each of the files are stored in the storage server and the current popularity of each of the files
  • the dividing may further include dividing each of the files into hot data and cold data based on the priority of each of the updated files.
  • the dividing into the hot data and the cold data may be divided into the hot data and the cold data based on the priority of each of the files based on a storage tiering policy.
  • an apparatus for classifying data of a storage server may include: a storage unit configured to store a three-dimensional data structure in consideration of a human relationship and popularity corresponding to the new file when the new file is stored in the storage server; A setting unit for setting a priority of the new file based on the popularity of the new file; And a division unit that divides each of the files into hot data and cold data based on a priority of each of the files stored in the storage server.
  • Embodiments of the present invention are to classify a file of a storage server for providing a high level of human relations and time, such as a social network service, into hot data and cold data, and a three-dimensional data structure considering human relations and popularity.
  • a storage server for providing a high level of human relations and time, such as a social network service, into hot data and cold data, and a three-dimensional data structure considering human relations and popularity.
  • celebrities have a higher frequency of file exposure than the general public. For example, files uploaded by the public are often exposed for a week, while files uploaded by celebrities are often exposed for more than a month.
  • the priority of the new file is set based on the popularity of the new file (S120).
  • the priority of the new file may be the same as the popularity value at the time when the new file is uploaded.
  • the priority of each of the files is not limited to the popularity of the file, but is calculated only by the time (or period) stored in the storage server, the popularity of the person who uploaded the file, the popularity of the file, The frequency of contact between accounts related to the file, the extent of the relationship between the file, the frequency of recent access to the file, the frequency of recent exposure to the file, the age of the file and the attributes of the file may vary. That is, the priority for each of the files may be calculated based on at least one of the above-described parameters.
  • the contact frequency between accounts may include the number of times of sending and receiving e-mails and messages
  • the attributes of the file may include live time and active time.
  • step S140 If it is determined in step S140 that the priority of the file is equal to or higher than the reference rank, the file is classified as hot data, otherwise, the file is classified as cold data (S150 and S160).
  • the file uploaded to the storage server with the above-described three-dimensional data structure has a priority of the same value as the popularity of the original file, and this priority can be reduced by a certain ratio over time, and the priority of the file is based on If it falls further than the rank, the file can be classified as cold data.
  • the access ratio of the file can be predicted efficiently without complicated calculation, and it is also easy to use in a large storage server.
  • the newly created files that is, files 1 to 3 are located at the same level of popularity because they have a priority of the same value as the popularity of the file, and these files decrease with a certain time period or unit at a predetermined rate over time. can do.
  • file 1 has a priority decrease of the file over time by a certain rate, and the file 2 has a higher popularity as time goes on. You can see that it rose. In this case, the file 2 also has a higher priority due to an increase in the popularity of the file.
  • File 3 is a case in which only the file is popular compared to popularity, and there is a value that can be prioritized higher than popularity in proportion to popularity, and this value is the maximum value that can be increased. It can be set to increase the maximum value.
  • File 4 is a file classified as cold data because the value that continues to fall over time becomes smaller than a certain value, that is, smaller than the reference rank.
  • File 5 is a file classified as cold data, but has a high priority due to unexpected popularity. Rising file is hot data.
  • the method according to the embodiment of the present invention stores a file in a three-dimensional data structure in consideration of human relations and time, and thus reduces the overall cost by reducing the ratio of high performance storage servers in operating services affected by human relations and time. Can be saved.
  • FIG. 5 illustrates a configuration of a data classification device of a storage server according to an embodiment of the present invention, and illustrates a configuration of an apparatus for performing the operations of FIGS. 1 to 4.
  • the storage unit 510 expresses the human relations of the file in two dimensions, and expresses the popularity of the file in the two-dimensional human relations, thereby storing the three-dimensional data structure when saving a new file. It can be calculated by various methods, such as the number of friends or followers, scores, and the total number of likes of a file.
  • the division unit 540 divides each of the files into hot data and cold data based on the priority of each of the files stored in the storage server.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Business, Economics & Management (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • Economics (AREA)
  • Health & Medical Sciences (AREA)
  • General Health & Medical Sciences (AREA)
  • Human Resources & Organizations (AREA)
  • Marketing (AREA)
  • Primary Health Care (AREA)
  • Strategic Management (AREA)
  • Tourism & Hospitality (AREA)
  • General Business, Economics & Management (AREA)
  • Computing Systems (AREA)
  • Software Systems (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Information Transfer Between Computers (AREA)

Abstract

La présente invention concerne un procédé et un dispositif d'identification de données d'un serveur de stockage destinés à un service ayant une temporalité élevée avec une relation humaine. Selon un mode de réalisation de la présente invention, le procédé d'identification de données d'un serveur de stockage comprend les étapes suivantes consistant : à stocker un nouveau fichier dans une structure de données tridimensionnelle en prenant en compte une relation humaine et une cote de popularité correspondant au nouveau fichier lorsque le nouveau fichier est stocké dans le serveur de stockage ; à régler la priorité du nouveau fichier sur la base de la cote de popularité du nouveau fichier ; et à identifier chaque fichier stocké dans le serveur de stockage en tant que données chaudes ou données froides sur la base de la priorité de chaque fichier.
PCT/KR2016/010801 2016-05-10 2016-09-27 Procédé et dispositif d'identification de données d'un serveur de stockage destinés à un service ayant une temporalité élevée avec une relation humaine WO2017195943A1 (fr)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
KR1020160056794A KR101825294B1 (ko) 2016-05-10 2016-05-10 인간관계와 시간성이 높은 서비스를 위한 스토리지 서버의 데이터 구분 방법 및 장치
KR10-2016-0056794 2016-05-10

Publications (1)

Publication Number Publication Date
WO2017195943A1 true WO2017195943A1 (fr) 2017-11-16

Family

ID=60267468

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/KR2016/010801 WO2017195943A1 (fr) 2016-05-10 2016-09-27 Procédé et dispositif d'identification de données d'un serveur de stockage destinés à un service ayant une temporalité élevée avec une relation humaine

Country Status (2)

Country Link
KR (1) KR101825294B1 (fr)
WO (1) WO2017195943A1 (fr)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110442309A (zh) * 2019-07-24 2019-11-12 广东紫晶信息存储技术股份有限公司 一种基于光存储的冷热数据交换方法及系统

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR102175176B1 (ko) * 2017-12-29 2020-11-06 한양대학교 산학협력단 문자 종류 개수에 기반한 데이터 구분 방법, 데이터 분류기 및 스토리지 시스템

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20140188947A1 (en) * 2012-12-31 2014-07-03 Teradata Corporation Data storage management based on indicated storage levels and other criteria for multilevel storage systems
US20150052180A1 (en) * 2012-08-15 2015-02-19 Facebook, Inc. File storage system based on coordinated exhaustible and non-exhaustible storage
KR20160014111A (ko) * 2010-12-30 2016-02-05 페이스북, 인크. 그래프 데이터용 분산형 캐시
US20160092353A1 (en) * 2014-09-25 2016-03-31 Robert C. Swanson Establishing cold storage pools from aging memory

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR20160014111A (ko) * 2010-12-30 2016-02-05 페이스북, 인크. 그래프 데이터용 분산형 캐시
US20150052180A1 (en) * 2012-08-15 2015-02-19 Facebook, Inc. File storage system based on coordinated exhaustible and non-exhaustible storage
US20140188947A1 (en) * 2012-12-31 2014-07-03 Teradata Corporation Data storage management based on indicated storage levels and other criteria for multilevel storage systems
US20160092353A1 (en) * 2014-09-25 2016-03-31 Robert C. Swanson Establishing cold storage pools from aging memory

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
KHIL, KI JEONG ET AL.: "Hot and Cold Data Replacement Method for Hybrid Storage System", JOURNAL OF ADVANCED INFORMATION TECHNOLOGY AND CONVERGENCE, vol. 11, no. 1, January 2013 (2013-01-01), pages 135 - 142 *

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110442309A (zh) * 2019-07-24 2019-11-12 广东紫晶信息存储技术股份有限公司 一种基于光存储的冷热数据交换方法及系统

Also Published As

Publication number Publication date
KR101825294B1 (ko) 2018-02-02
KR20170126587A (ko) 2017-11-20

Similar Documents

Publication Publication Date Title
US10778756B2 (en) Location of actor resources
US10516585B2 (en) System and method for network information mapping and displaying
US11902173B2 (en) Dynamic allocation of network resources using external inputs
US10048996B1 (en) Predicting infrastructure failures in a data center for hosted service mitigation actions
CN111538558B (zh) 用于自动选择安全虚拟机的系统和方法
JP2022552034A (ja) クラスタリング方法及び装置、電子機器並びに記憶媒体
WO2012148067A1 (fr) Procédé et appareil pour distribuer et stocker une pluralité de copies dans un système de stockage en nuage
CN110753112A (zh) 云服务的弹性伸缩方法和装置
US7856626B2 (en) Method of refactoring methods within an application
WO2013122338A1 (fr) Procédé d'indexation et de recherche distribuées pour analyser efficacement des données de série chronologique dans des systèmes de recherche
US20160019090A1 (en) Data processing control method, computer-readable recording medium, and data processing control device
WO2017195943A1 (fr) Procédé et dispositif d'identification de données d'un serveur de stockage destinés à un service ayant une temporalité élevée avec une relation humaine
CN111459650B (zh) 管理专用处理资源的存储器的方法、设备和介质
WO2018122961A1 (fr) Système, procédé de gestion de données, et serveur de fichiers
WO2019225799A1 (fr) Procédé et dispositif de suppression d'informations d'utilisateur à l'aide d'un modèle génératif d'apprentissage profond
CN110610450A (zh) 数据处理方法、电子设备和计算机可读存储介质
US20210389877A1 (en) Identifying host functionalities based on process characterization
CN114461369A (zh) 一种面向复杂应用场景的自适应数据调度系统及方法
US9747451B2 (en) File system modification
CN113934361A (zh) 用于管理存储系统的方法、设备和计算机程序产品
WO2018070732A1 (fr) Procédé et système de service pour réseau thématique présentant la structure arborescente d'un hashtag
Guo et al. Learning-based characterizing and modeling performance bottlenecks of big data workloads
CN216352283U (zh) 一种适用于边缘计算环境的请求过滤装置
CN115022094B (zh) 一种便于了解单位内部计算机使用情况的监控系统
US11783325B1 (en) Removal probability-based weighting for resource access

Legal Events

Date Code Title Description
NENP Non-entry into the national phase

Ref country code: DE

121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 16901777

Country of ref document: EP

Kind code of ref document: A1

122 Ep: pct application non-entry in european phase

Ref document number: 16901777

Country of ref document: EP

Kind code of ref document: A1