WO2024113382A1 - Image data dna storage method and system, and electronic device and storage medium - Google Patents

Image data dna storage method and system, and electronic device and storage medium Download PDF

Info

Publication number
WO2024113382A1
WO2024113382A1 PCT/CN2022/136367 CN2022136367W WO2024113382A1 WO 2024113382 A1 WO2024113382 A1 WO 2024113382A1 CN 2022136367 W CN2022136367 W CN 2022136367W WO 2024113382 A1 WO2024113382 A1 WO 2024113382A1
Authority
WO
WIPO (PCT)
Prior art keywords
base
data
conversion
image
base sequence
Prior art date
Application number
PCT/CN2022/136367
Other languages
French (fr)
Chinese (zh)
Inventor
洪经纬
罗昌国
韩萍
姜青山
陈会
Original Assignee
中国科学院深圳先进技术研究院
Filing date
Publication date
Application filed by 中国科学院深圳先进技术研究院 filed Critical 中国科学院深圳先进技术研究院
Publication of WO2024113382A1 publication Critical patent/WO2024113382A1/en

Links

Images

Abstract

The embodiments of the present application relate to the technical field of data storage. Provided are an image data DNA storage method and system, and an electronic device and a storage medium. The method comprises: acquiring an image, which needs to be encoded, and splitting the acquired image to obtain several image blocks; performing binary conversion on the several obtained image blocks to obtain corresponding binary data strings; using a quasi-quaternary bit-base mapping rule to perform base conversion on the binary data strings to obtain a corresponding first base sequence, wherein the quasi-quaternary bit-base mapping rule comprises several quasi-quaternary bit-base mapping tables; using numbers after conversion by means of a specific base conversion rule to number the first base sequence, so as to obtain a second base sequence, which comprises block numbers, wherein the block numbers are located at a header of the second base sequence; and adding a new primer to the second base sequence and performing DNA synthesis, so as to obtain DNA storage data. By means of the embodiments of the present application, the problems in the related art of image data DNA storage methods not being diversified and storage efficiency being low are solved.

Description

图像数据DNA存储方法、系统、电子设备及存储介质Image data DNA storage method, system, electronic device and storage medium 技术领域Technical Field
本申请涉及DNA数据存储数据领域,具体而言,本申请涉及一种图像数据DNA存储方法、系统、电子设备及存储介质。The present application relates to the field of DNA data storage, and in particular, to an image data DNA storage method, system, electronic device and storage medium.
背景技术Background technique
由于数据存储需求的不断增加,现代数据存储系统因高昂的基础设施成本和运行功耗而不堪重负,急需一种耐久、可扩展和经济的替代存储介质。作为遗传信息的载体,DNA本身就是一种天然的优良存储介质不仅存储了从微生物到人类的亿万生命的遗传信息,并且保证了生命现象的稳定遗传。DNA存储技术是由DNA合成与测序技术和计算机存储相融合的新领域,通过编码算法将计算机中的0、1二进制数据转换成A、T、C、G四种碱基组成的DNA序列,进而通过合成含有指定碱基序列的DNA实现数据信息的存储。As the demand for data storage continues to increase, modern data storage systems are overwhelmed by high infrastructure costs and operating power consumption, and a durable, scalable and economical alternative storage medium is urgently needed. As a carrier of genetic information, DNA itself is a natural and excellent storage medium that not only stores the genetic information of billions of lives from microorganisms to humans, but also ensures the stable inheritance of life phenomena. DNA storage technology is a new field that integrates DNA synthesis and sequencing technology with computer storage. It converts the 0 and 1 binary data in the computer into a DNA sequence composed of four bases, A, T, C, and G, through a coding algorithm, and then realizes the storage of data information by synthesizing DNA containing a specified base sequence.
然而,现有的DNA数据存储技术方式单一,仅仅把文件转换成二进制串进行编码,解码时一旦丢失部分序列,便会导致灾难性的错误传播,而且无法适配特殊的存储应用场景。不仅如此,现有技术中很少有针对分块读取的机制,在对数据进行解码时只能解码全部序列,耗时高,存储效率低。However, the existing DNA data storage technology is single-minded, only converting files into binary strings for encoding. Once part of the sequence is lost during decoding, it will lead to catastrophic error propagation, and it cannot be adapted to special storage application scenarios. In addition, there are few mechanisms for block reading in the existing technology, and only the entire sequence can be decoded when decoding data, which is time-consuming and has low storage efficiency.
由上可知,如何实现分块读取提高存储效率的问题仍有待解决。From the above, we can see that the problem of how to achieve block reading to improve storage efficiency still needs to be solved.
发明内容Summary of the invention
本申请各实施例提供了一种图像数据DNA存储方法、系统、电子设备及存储介质,可以解决相关技术中存在的存储方式单一,效率低的问题。所述技术方案如下:The embodiments of the present application provide a method, system, electronic device and storage medium for storing image data in DNA, which can solve the problem of single storage method and low efficiency in related technologies. The technical solution is as follows:
根据本身请实施例的一个方面,一种图像数据DNA存储方法,所述方法包括:获取所需编码的图像,对所述获取到的图像进行拆分,得到若干图像块;对所述得到的若干图像块进行二进制转换得到对应的二进制数据串; 使用类四进制比特-碱基映射规则对所述二进制数据串进行碱基转换,得到对应的第一碱基序列;所述类四进制比特-碱基映射规则包括若干类四进制比特-碱基映射表;使用特定碱基转换规则转换后的编号对第一碱基序列进行编号,得到包含块编号的第二碱基序列;所述块编号位于所述第二碱基序列的头部;对所述第二碱基碱基序列添加新的引物并进行DNA合成,得到DNA存储数据。According to one aspect of the embodiments of the present invention, a method for DNA storage of image data includes: obtaining an image to be encoded, splitting the obtained image to obtain a number of image blocks; performing binary conversion on the obtained number of image blocks to obtain a corresponding binary data string; performing base conversion on the binary data string using a quaternary bit-base mapping rule to obtain a corresponding first base sequence; the quaternary bit-base mapping rule includes a number of quaternary bit-base mapping tables; numbering the first base sequence using the number converted using a specific base conversion rule to obtain a second base sequence including a block number; the block number is located at the head of the second base sequence; adding a new primer to the second base sequence and performing DNA synthesis to obtain DNA storage data.
根据本申请实施例的一个方面,一种图像数据DNA存储方法,所述方法还包括:对所述得到的DNA存储数据进行测序得到碱基序列数据,根据得到的所述碱基序列数据中头部碱基序列筛选所需进行分块解码的数据。According to one aspect of an embodiment of the present application, a method for DNA storage of image data further includes: sequencing the obtained DNA storage data to obtain base sequence data, and screening the data required for block decoding according to the head base sequence in the obtained base sequence data.
根据本申请实施例的一个方面,一种图像数据DNA存储编解码系统,所述系统包括:图像获取模块,用于获取所需编码的图像,对所述获取到的图像进行拆分,得到若干图像块;数据转换模块,用于对所述得到的若干图像块进行二进制转换得到对应的二进制数据串;数据编码模块,用于使用类四进制比特-碱基映射规则对所述二进制数据串进行碱基转换,得到对应的第一碱基序列;所述类四进制比特-碱基映射规则包括若干类四进制比特-碱基映射表;序列编号模块,用于使用特定碱基转换规则转换后的编号对第一碱基序列进行编号,得到包含块编号的第二碱基序列;所述块编号位于所述第二碱基序列的头部;数据合成模块,用于对所述第二碱基碱基序列添加新的引物并进行DNA合成,得到DNA存储数据;数据解码模块,用于对所述得到的DNA存储数据进行测序得到碱基序列数据,根据得到的所述碱基序列数据中头部碱基序列筛选所需进行分块解码的数据。According to one aspect of an embodiment of the present application, a DNA storage encoding and decoding system for image data includes: an image acquisition module, which is used to acquire an image to be encoded, and split the acquired image to obtain a number of image blocks; a data conversion module, which is used to perform binary conversion on the obtained number of image blocks to obtain corresponding binary data strings; a data encoding module, which is used to perform base conversion on the binary data string using a quaternary bit-base mapping rule to obtain a corresponding first base sequence; the quaternary bit-base mapping rule includes a number of quaternary bit-base mapping tables; a sequence numbering module, which is used to number the first base sequence using the number converted by a specific base conversion rule to obtain a second base sequence including a block number; the block number is located at the head of the second base sequence; a data synthesis module, which is used to add a new primer to the second base sequence and perform DNA synthesis to obtain DNA storage data; a data decoding module, which is used to sequence the obtained DNA storage data to obtain base sequence data, and filter the data required for block decoding according to the head base sequence in the obtained base sequence data.
根据本申请实施例的一个方面,一种电子设备,包括:至少一个处理器、至少一个存储器、以及至少一条通信总线,其中,存储器上存储有计算机程序,处理器通过通信总线读取存储器中的计算机程序;计算机程序被处理器执行时实现如上所述的图像数据DNA存储方法。According to one aspect of an embodiment of the present application, an electronic device includes: at least one processor, at least one memory, and at least one communication bus, wherein a computer program is stored in the memory, and the processor reads the computer program in the memory through the communication bus; when the computer program is executed by the processor, the image data DNA storage method as described above is implemented.
根据本申请实施例的一个方面,一种存储介质,其上存储有计算机程序,计算机程序被处理器执行时实现如上所述的图像数据DNA存储方法。According to one aspect of an embodiment of the present application, a storage medium stores a computer program thereon, and when the computer program is executed by a processor, the image data DNA storage method as described above is implemented.
根据本申请实施例的一个方面,一种计算机程序产品,计算机程序产品包括计算机程序,计算机程序存储在存储介质中,计算机设备的处理器从存储介质读取计算机程序,处理器执行计算机程序,使得计算机设备执行时实 现如上所述的图像数据DNA存储方法。According to one aspect of an embodiment of the present application, a computer program product includes a computer program, the computer program is stored in a storage medium, a processor of a computer device reads the computer program from the storage medium, and the processor executes the computer program, so that when the computer device executes the computer program, the image data DNA storage method as described above is implemented.
本申请提供的技术方案带来的有益效果是:The beneficial effects of the technical solution provided by this application are:
在上述技术方案中,利用对图像进行分割的优势,以支持分块读取,从而能够降低读取的测序成本,并且支持并行编解码,进一步提高存储效率;所提供的类四进制比特-碱基映射规则,具有适配生化约束的快速筛选方式,相较于现有技术能够更快的完成编解码,从而能够有效地解决相关技术中存在的进行解码时只能解码全部序列,耗时高,存储效率低的问题。In the above technical solution, the advantage of image segmentation is utilized to support block reading, thereby reducing the sequencing cost of reading, and supporting parallel encoding and decoding to further improve storage efficiency; the provided quaternary bit-base mapping rule has a fast screening method adapted to biochemical constraints, and can complete encoding and decoding faster than the prior art, thereby effectively solving the problem that only the entire sequence can be decoded during decoding, which is time-consuming and has low storage efficiency in the related art.
附图说明BRIEF DESCRIPTION OF THE DRAWINGS
为了更清楚地说明本申请实施例中的技术方案,下面将对本申请实施例描述中所需要使用的附图作简单地介绍。In order to more clearly illustrate the technical solutions in the embodiments of the present application, the drawings required for use in describing the embodiments of the present application are briefly introduced below.
图1是根据一示例性实施例示出的一种图像数据DNA存储方法的流程图;FIG1 is a flow chart of a method for storing image data in a DNA according to an exemplary embodiment;
图2是根据一示例性实施例示出的步骤330的流程图;FIG2 is a flow chart showing step 330 according to an exemplary embodiment;
图3是根据一示例性实施例示出的另一种图像数据DNA存储方法的流程图;FIG3 is a flow chart of another method for storing image data in a DNA according to an exemplary embodiment;
图4是根据一示例性实施例示出的一种图像数据DNA编解码完整流程图;FIG4 is a complete flow chart of DNA encoding and decoding of image data according to an exemplary embodiment;
图5是根据一示例性实施例示出的一种图像数据DNA存储系统的结构框图;FIG5 is a structural block diagram of an image data DNA storage system according to an exemplary embodiment;
图6是根据一示例性实施例示出的一种电子设备的结构框图。Fig. 6 is a structural block diagram of an electronic device according to an exemplary embodiment.
具体实施方式Detailed ways
下面详细描述本申请的实施例,所述实施例的示例在附图中示出,其中自始至终相同或类似的标号表示相同或类似的元件或具有相同或类似功能的元件。下面通过参考附图描述的实施例是示例性的,仅用于解释本申请,而不能解释为对本申请的限制。The embodiments of the present application are described in detail below, and examples of the embodiments are shown in the accompanying drawings, wherein the same or similar reference numerals throughout represent the same or similar elements or elements having the same or similar functions. The embodiments described below with reference to the accompanying drawings are exemplary and are only used to explain the present application, and cannot be interpreted as limiting the present application.
本技术领域技术人员可以理解,除非特意声明,这里使用的单数形式“一”、“一个”、“所述”和“该”也可包括复数形式。应该进一步理解的是,本申请的说明书中使用的措辞“包括”是指存在所述特征、整数、步骤、操 作、元件和/或组件,但是并不排除存在或添加一个或多个其他特征、整数、步骤、操作、元件、组件和/或它们的组。应该理解,当我们称元件被“连接”或“耦接”到另一元件时,它可以直接连接或耦接到其他元件,或者也可以存在中间元件。此外,这里使用的“连接”或“耦接”可以包括无线连接或无线耦接。这里使用的措辞“和/或”包括一个或更多个相关联的列出项的全部或任一单元和全部组合。It will be understood by those skilled in the art that, unless expressly stated, the singular forms "a", "an", "said" and "the" used herein may also include plural forms. It should be further understood that the term "comprising" used in the specification of the present application refers to the presence of the features, integers, steps, operations, elements and/or components, but does not exclude the presence or addition of one or more other features, integers, steps, operations, elements, components and/or groups thereof. It should be understood that when we refer to an element as being "connected" or "coupled" to another element, it may be directly connected or coupled to the other element, or there may be intermediate elements. In addition, the "connection" or "coupling" used herein may include wireless connection or wireless coupling. The term "and/or" used herein includes all or any unit and all combinations of one or more associated listed items.
如前所述,由于数据存储需求的不断增加,急需一种耐久、可扩展和经济的替代存储介质。作为遗传信息的载体,DNA本身就是一种天然的优良存储介质,通过DNA存储数据的方法具有存储密度高、保存寿命长、维护成本低以及数据易备份等优点。As mentioned above, due to the increasing demand for data storage, there is an urgent need for a durable, scalable and economical alternative storage medium. As a carrier of genetic information, DNA itself is a natural and excellent storage medium. The method of storing data through DNA has the advantages of high storage density, long shelf life, low maintenance cost and easy data backup.
最常见的图像DNA编解码方法主要有两种:一、直接文件编解码,此种方法无需考虑图像文件的格式,直接将文件转换成二进制串,进行分段后编码为碱基序列,解码则为逆过程;二、频率系数编解码,此种方法只限于JPEG格式的图像文件,根据JPEG中频率系数的误差敏感性,将DC/AC频率系数分别编码成不同的DNA序列进行存储,并设计适当的内部索引将这些系数与其对应的图像相关联,解码时具有较好的容错性。There are two main methods for the most common image DNA encoding and decoding: 1. Direct file encoding and decoding. This method does not need to consider the format of the image file. It directly converts the file into a binary string, segments it, and encodes it into a base sequence. Decoding is the inverse process. 2. Frequency coefficient encoding and decoding. This method is limited to image files in JPEG format. According to the error sensitivity of the frequency coefficients in JPEG, the DC/AC frequency coefficients are encoded into different DNA sequences for storage, and appropriate internal indexes are designed to associate these coefficients with their corresponding images, which has better fault tolerance during decoding.
但上述图像DNA编解码方法也有不足之处,包括:1、编码方式较为单一,仅仅将文件转换成二进制串进行编码,解码时一旦丢失部分序列便会导致灾难性的错误传播,而且无法适配特殊的存储应用场景,例如只需要看图像的某个位置,而现有技术只能解码全部序列,较为耗时;2、可被编码的图像格式受限,且为了追求高质量序列大多引入了复杂的序列筛选机制,编码耗时长,编码效率低。However, the above-mentioned image DNA encoding and decoding method also has shortcomings, including: 1. The encoding method is relatively simple, and the file is only converted into a binary string for encoding. Once part of the sequence is lost during decoding, it will lead to catastrophic error propagation, and it cannot be adapted to special storage application scenarios. For example, if you only need to look at a certain position of the image, the existing technology can only decode the entire sequence, which is time-consuming; 2. The image format that can be encoded is limited, and in order to pursue high-quality sequences, most of them introduce complex sequence screening mechanisms, which takes a long time to encode and has low encoding efficiency.
可见,相关技术中仍存在存储方式单一且效率低的问题。It can be seen that the related technology still has the problem of single storage method and low efficiency.
为此,本申请提供的图像数据DNA存储方法,实现了对图像数据的分块读取,在高效编码的同时追求低成本读取,缩短编解码时间,提高编解码效率。To this end, the image data DNA storage method provided by the present application realizes the block reading of image data, pursues low-cost reading while efficiently encoding, shortens the encoding and decoding time, and improves the encoding and decoding efficiency.
为使本申请的目的、技术方案和优点更加清楚,下面将结合附图对本 申请实施方式作进一步地详细描述。In order to make the objectives, technical solutions and advantages of the present application clearer, the implementation methods of the present application will be further described in detail below with reference to the accompanying drawings.
图1为一种图像数据DNA存储方法,该方法可以包括以下步骤:FIG1 is a method for storing image data in DNA, which may include the following steps:
步骤310,获取所需编码的图像,对获取到的图像进行拆分,得到若干图像块。Step 310: obtain an image to be encoded, and split the obtained image into several image blocks.
其中,所需编码的图像为需要通过DNA数据存储技术进行保存的图像。The image to be encoded is an image that needs to be stored through DNA data storage technology.
在一种可能的实现方式,获取所需编码的图像后,为了将图像尽可能的等分,获取图像的宽、高像素值,将两个像素值除以需要等分的块数,并向下取整,得到若干等分的第一图像块。需要等分的块数可根据实际应用场景进行自定义,在此不做特殊说明。In a possible implementation, after obtaining the image to be encoded, in order to divide the image as equally as possible, the width and height pixel values of the image are obtained, and the two pixel values are divided by the number of blocks to be equally divided, and the two pixel values are rounded down to obtain a number of equally divided first image blocks. The number of blocks to be equally divided can be customized according to the actual application scenario and is not specifically described here.
步骤320,对得到的若干图像块进行二进制转换得到对应的二进制数据串。Step 320: performing binary conversion on the obtained image blocks to obtain corresponding binary data strings.
对于得到的所有图像块,根据图像中各个像素点对应的像素值,将十进制表示下的像素值通过二进制转换成二进制数据串。For all the obtained image blocks, according to the pixel values corresponding to the respective pixel points in the image, the pixel values in the decimal representation are converted into binary data strings through binary.
步骤330,使用类四进制比特-碱基映射规则对二进制数据串进行碱基转换,得到对应的第一碱基序列;类四进制比特-碱基映射规则包括若干类四进制比特-碱基映射表。 Step 330, using a quaternary bit-base mapping rule to perform base conversion on the binary data string to obtain a corresponding first base sequence; the quaternary bit-base mapping rule includes a plurality of quaternary bit-base mapping tables.
在一种可能的实现方式,本申请提供了一种新的比特-碱基转换规则——类四进制比特-碱基映射规则。根据特定的映射规则将二进制数据串转换成包含A、T、C、G四种碱基的DNA序列。In a possible implementation, the present application provides a new bit-base conversion rule, a quaternary bit-base mapping rule, which converts a binary data string into a DNA sequence containing four bases, A, T, C, and G, according to a specific mapping rule.
步骤340,使用特定碱基转换规则转换后的编号对第一碱基序列进行编号,得到包含块编号的第二碱基序列;所述块编号位于所述第二碱基序列的头部。Step 340: Number the first base sequence using the numbers converted according to the specific base conversion rule to obtain a second base sequence including a block number; the block number is located at the head of the second base sequence.
获取用于编号的编号序列,块编号为将原始的十进制编号进行二进制转换后,并以00-A,01-T,10-C,11-G的规则进行碱基转换得到的,从而将碱基转换后的块编号添加至第二碱基序列的头部进行编号。A numbering sequence for numbering is obtained, where the block number is obtained by converting the original decimal number into binary and performing base conversion according to the rules of 00-A, 01-T, 10-C, and 11-G, thereby adding the block number after base conversion to the head of the second base sequence for numbering.
步骤350,对第二碱基碱基序列添加新的引物并进行DNA合成,得到DNA存储数据。 Step 350, adding a new primer to the second base sequence and performing DNA synthesis to obtain DNA storage data.
在一种可能的实现方式,如图2所示,上述步骤330包括以下步骤:In a possible implementation, as shown in FIG2 , the above step 330 includes the following steps:
步骤331,使用类四进制比特-碱基映射表A*对二进制数据串的前4个比特进行碱基转换,得到两位初始碱基。 Step 331, use the quaternary bit-base mapping table A* to perform base conversion on the first 4 bits of the binary data string to obtain two initial bases.
步骤332,若两位初始碱基为重叠形式,则后续1或2个比特先使用类四进制比特-碱基映射表A进行碱基转换,得到第三位碱基。Step 332: If the two initial bases are in overlapping form, the subsequent 1 or 2 bits are first converted into bases using the quaternary bit-base mapping table A to obtain the third base.
步骤333,基于两位初始碱基为重叠形式,若下一次遇到重叠形式,则使用类四进制比特-碱基映射表B进行碱基转换;否则使用类四进制比特-碱基映射表B*进行碱基转换。 Step 333, based on the two initial bases being in overlapping form, if an overlapping form is encountered next time, the quaternary bit-base mapping table B is used to perform base conversion; otherwise, the quaternary bit-base mapping table B* is used to perform base conversion.
步骤334,若上一次重叠形式使用类四进制比特-碱基映射表B进行碱基转换,则下一次重叠形式使用类四进制比特-碱基映射表A进行碱基转换;非重叠形式使用类四进制比特-碱基映射表A*进行碱基转换。 Step 334, if the last overlapping form uses the quaternary bit-base mapping table B for base conversion, the next overlapping form uses the quaternary bit-base mapping table A for base conversion; the non-overlapping form uses the quaternary bit-base mapping table A* for base conversion.
步骤335,若两位初始碱基为非重叠形式,则后续1或2个比特继续使用类四进制比特-碱基映射表A*进行碱基转换,得到第三位碱基。Step 335: If the two initial bases are in a non-overlapping form, the subsequent 1 or 2 bits continue to use the quaternary bit-base mapping table A* to perform base conversion to obtain the third base.
在一示例性实施例中,如表1所示,本申请提供了一种新的比特-碱基映射规则——类四进制比特-碱基映射规则。In an exemplary embodiment, as shown in Table 1, the present application provides a new bit-base mapping rule - a quaternary bit-base mapping rule.
具体的,例如一串二进制数据0010 1101,使用类四进制比特-碱基映射表A*对二进制数据串的前4个比特0010进行碱基转换,得到两位初始碱基为AC,对于非重叠形式的两位初始碱基,后续两位比特11使用使用类四进制比特-碱基映射表A*进行碱基转换为G,同时对于后续需要进行碱基转换的比特来说,新的两位碱基即为CG,基于新的两位碱基为非重叠形式,最后的比特01采用类四进制比特-碱基映射表A*进行碱基转换为T,即对于二进制数据串0010 1101,进行碱基转换后为ACGT。Specifically, for example, a string of binary data 0010 1101, the first 4 bits 0010 of the binary data string are base converted using the quaternary bit-base mapping table A*, and the two initial bases are AC. For the two non-overlapping initial bases, the subsequent two bits 11 are base converted to G using the quaternary bit-base mapping table A*. At the same time, for the subsequent bits that need to be base converted, the new two bases are CG. Based on the fact that the new two bases are in a non-overlapping form, the last bit 01 is base converted to T using the quaternary bit-base mapping table A*, that is, for the binary data string 0010 1101, the base conversion is ACGT.
再比如一串二进制数据串0000 1101,先使用类四进制比特-碱基映射表A*对二进制数据串的前4个比特0000进行碱基转换,得到两位初始碱基为AA,对于重叠形式的两位初始碱基,后续两位比特11使用类四进制比特-碱基映射表A进行碱基转换为G,同时对于后续需要进行碱基转换的比特来说,新的两位碱基即为AG,同时下次遇到重叠形式时使用类四进制比特-碱基映射表B进行碱基转化,非重叠形式使用类四进制比特-碱基映射表B*进行碱基转换。基于新的两位碱基为非重叠形式,最后的比特01采用类四进制比特-碱基映射表B*进行碱基转换为C,即对于二进制数据串0000 1101,进行碱基转换后为AAGC。For another example, a binary data string 0000 1101, first use the quaternary bit-base mapping table A* to perform base conversion on the first 4 bits 0000 of the binary data string, and obtain two initial bases of AA. For the two initial bases in overlapping form, the subsequent two bits 11 use the quaternary bit-base mapping table A to perform base conversion to G. At the same time, for the bits that need to be converted later, the new two bases are AG. At the same time, the quaternary bit-base mapping table B is used for base conversion when the overlapping form is encountered next time, and the quaternary bit-base mapping table B* is used for base conversion in non-overlapping form. Based on the new two bases being in non-overlapping form, the last bit 01 is converted to C using the quaternary bit-base mapping table B*, that is, for the binary data string 0000 1101, the base conversion is AAGC.
在一示例性实施例中,如表二所示,所述类四进制比特-碱基映射规则还包括末端比特处理表。In an exemplary embodiment, as shown in Table 2, the quaternary bit-base mapping rule further includes a terminal bit processing table.
当二进制数据串进行碱基转换中剩下最后的1或2个比特,使用末端比特处理表进行碱基转换。When the last 1 or 2 bits of the binary data string are left for base conversion, the end bit processing table is used to perform base conversion.
具体的,在固定二进制串长度的情况下,可能会遇到最后剩下1个或者2个比特无法找到映射规则的问题,此时便根据末端比特处理表对其进行碱基转换。对于最后的1个或者2个比特,基于最接近的两个碱基的形式,从而选择不同的碱基进行对应。例如,当前两位碱基为AA时,若最后一个比特为0,则其对应的碱基为TC;若最后一个比特为1,则其对应的碱基为TG,从而实现对最后的1或2个比特进行碱基转换。Specifically, in the case of a fixed binary string length, there may be a problem that there are 1 or 2 bits left at the end and no mapping rules can be found. In this case, base conversion is performed on them according to the end bit processing table. For the last 1 or 2 bits, different bases are selected for correspondence based on the form of the two closest bases. For example, when the first two bases are AA, if the last bit is 0, the corresponding base is TC; if the last bit is 1, the corresponding base is TG, thereby achieving base conversion for the last 1 or 2 bits.
表一 类四进制比特-碱基映射表Table 1 Quaternary bit-base mapping table
Figure PCTCN2022136367-appb-000001
Figure PCTCN2022136367-appb-000001
表二 末端比特映射表Table 2 Terminal bit mapping table
Figure PCTCN2022136367-appb-000002
Figure PCTCN2022136367-appb-000002
在一种可能的实现方式,如图3所示,本申请提供的图像数据DNA存储方法还包括以下步骤:In a possible implementation, as shown in FIG3 , the image data DNA storage method provided by the present application further includes the following steps:
步骤410,对类四进制比特-碱基映射表A和表B进行变换,得到若干 新的映射规则。 Step 410, transform the quaternary bit-base mapping table A and table B to obtain several new mapping rules.
步骤430,对得到的若干新的映射规则使用如上编码流程进行编码,并记录碱基转换后的结果序列中GC的含量以及编码密度。Step 430: Encode the obtained several new mapping rules using the above encoding process, and record the GC content and encoding density in the result sequence after base conversion.
步骤450,选择结果序列中GC含量在40%-60%且编码密度最高的对应的映射规则作为输出。 Step 450 , selecting the mapping rule corresponding to the result sequence with a GC content of 40%-60% and the highest coding density as output.
具体的,改变类四进制比特-碱基映射表A和表B中的TT两列,对于两个表各有四种不同的变换形式,从而得到十六种新的映射规则。使用本申请提供的编码流程对十六种新的映射规则进行编码同时记录结果序列中的GC的含量以及编码密度。根据最终的结果序列中GC含量在40%-60%且编码密度最高的对应的映射规则作为最优选的比特-碱基映射规则实现数据编码。Specifically, the two columns TT in the quaternary bit-base mapping table A and table B are changed, and there are four different transformation forms for each of the two tables, thereby obtaining sixteen new mapping rules. The sixteen new mapping rules are encoded using the encoding process provided by the present application, and the GC content and encoding density in the result sequence are recorded at the same time. The corresponding mapping rule with a GC content of 40%-60% and the highest encoding density in the final result sequence is used as the most preferred bit-base mapping rule to implement data encoding.
在一种可能的实现方式,如图4所示,本申请步骤350之后还包括:In a possible implementation, as shown in FIG4 , after step 350 of the present application, the following steps are further included:
步骤360,对得到的DNA存储数据进行测序得到碱基序列数据,根据得到的碱基序列数据中头部碱基序列筛选所需进行分块解码的数据。 Step 360, sequencing the obtained DNA storage data to obtain base sequence data, and filtering the data required for block decoding according to the head base sequence in the obtained base sequence data.
在一示例性实施例中,对于一串碱基序列,根据其头部碱基序列确定对应的块编号,根据块编号选择所需进行解码的碱基序列,接着使用上述编码过程的逆过程对碱基序列进行解码,得到对应的图像数据,从而实现了完整的编解码过程。In an exemplary embodiment, for a string of base sequences, the corresponding block number is determined according to its head base sequence, and the base sequence to be decoded is selected according to the block number. Then, the inverse process of the above encoding process is used to decode the base sequence to obtain the corresponding image data, thereby realizing a complete encoding and decoding process.
下述为本申请系统实施例,可以用于执行本申请所涉及的图像数据DNA存储方法。对于本申请装置实施例中未披露的细节,请参照本申请所涉及的图像数据DNA存储方法的方法实施例。The following is a system embodiment of the present application, which can be used to execute the image data DNA storage method involved in the present application. For details not disclosed in the device embodiment of the present application, please refer to the method embodiment of the image data DNA storage method involved in the present application.
请参阅图5,本申请实施例中提供了一种图像数据DNA存储系统500,包括但不限于图像获取模块510、数据转换模块520、数据编码模块530、序列编号模块540、数据合成模块550、数据解码模块560。Please refer to Figure 5. An image data DNA storage system 500 is provided in an embodiment of the present application, including but not limited to an image acquisition module 510, a data conversion module 520, a data encoding module 530, a sequence numbering module 540, a data synthesis module 550, and a data decoding module 560.
图像获取模块510,用于获取所需编码的图像,对获取到的图像进行拆分,得到若干图像块;An image acquisition module 510 is used to acquire an image to be encoded and split the acquired image into a plurality of image blocks;
数据转换模块520,用于对得到的若干图像块进行二进制转换得到对应的二进制数据串;A data conversion module 520 is used to perform binary conversion on the obtained image blocks to obtain corresponding binary data strings;
数据编码模块530,用于使用类四进制比特-碱基映射规则对二进制数据 串进行碱基转换,得到对应的第一碱基序列;类四进制比特-碱基映射规则包括若干类四进制比特-碱基映射表;The data encoding module 530 is used to perform base conversion on the binary data string using a quaternary bit-base mapping rule to obtain a corresponding first base sequence; the quaternary bit-base mapping rule includes a plurality of quaternary bit-base mapping tables;
序列编号模块540,用于使用特定碱基转换规则转换后的编号对第一碱基序列进行编号,得到包含块编号的第二碱基序列;块编号位于所述第二碱基序列的头部;A sequence numbering module 540 is used to number the first base sequence using the numbers converted according to the specific base conversion rule to obtain a second base sequence including a block number; the block number is located at the head of the second base sequence;
数据合成模块550,用于对第二碱基碱基序列添加新的引物并进行DNA合成,得到DNA存储数据;The data synthesis module 550 is used to add a new primer to the second base sequence and perform DNA synthesis to obtain DNA storage data;
数据解码模块560,用于对得到的DNA存储数据进行测序得到碱基序列数据,根据得到的碱基序列数据中头部碱基序列筛选所需进行分块解码的数据。The data decoding module 560 is used to sequence the obtained DNA storage data to obtain base sequence data, and to filter the data required for block decoding according to the head base sequence in the obtained base sequence data.
需要说明的是,上述实施例所提供的图像数据DNA存储系统在进行图像DNA数据存储时,仅以上述各功能模块的划分进行举例说明,实际应用中,可以根据需要而将上述功能分配由不同的功能模块完成,即图像数据DNA存储系统的内部结构将划分为不同的功能模块,以完成以上描述的全部或者部分功能。It should be noted that the image data DNA storage system provided in the above embodiment only uses the division of the above-mentioned functional modules as an example when storing image DNA data. In actual applications, the above-mentioned functions can be assigned to different functional modules as needed, that is, the internal structure of the image data DNA storage system will be divided into different functional modules to complete all or part of the functions described above.
另外,上述实施例所提供的图像数据DNA存储系统与图像数据DNA存储方法的实施例属于同一构思,其中各个模块执行操作的具体方式已经在方法实施例中进行了详细描述,此处不再赘述。In addition, the image data DNA storage system and the image data DNA storage method provided in the above embodiments belong to the same concept, and the specific manner in which each module performs operations has been described in detail in the method embodiments and will not be repeated here.
请参阅图6,本申请实施例中提供了一种电子设备4000,该电子设备4000包括至少一个处理器4001、至少一条通信总线4002以及至少一个存储器4003。Please refer to FIG. 6 . An electronic device 4000 is provided in an embodiment of the present application. The electronic device 4000 includes at least one processor 4001 , at least one communication bus 4002 , and at least one memory 4003 .
其中,处理器4001和存储器4003相连,如通过通信总线4002相连。可选地,电子设备4000还可以包括收发器4004,收发器4004可以用于该电子设备与其他电子设备之间的数据交互,如数据的发送和/或数据的接收等。需要说明的是,实际应用中收发器4004不限于一个,该电子设备4000的结构并不构成对本申请实施例的限定。Among them, the processor 4001 and the memory 4003 are connected, such as through a communication bus 4002. Optionally, the electronic device 4000 may also include a transceiver 4004, which may be used for data interaction between the electronic device and other electronic devices, such as data transmission and/or data reception. It should be noted that in actual applications, the transceiver 4004 is not limited to one, and the structure of the electronic device 4000 does not constitute a limitation on the embodiments of the present application.
处理器4001可以是CPU(Central Processing Unit,中央处理器),通用处理器,DSP(Digital Signal Processor,数据信号处理器),ASIC(Application Specific Integrated Circuit,专用集成电路),FPGA(Field Programmable Gate Array,现场可编程门阵列)或者其他可编程逻辑器件、 晶体管逻辑器件、硬件部件或者其任意组合。其可以实现或执行结合本申请公开内容所描述的各种示例性的逻辑方框,模块和电路。处理器4001也可以是实现计算功能的组合,例如包含一个或多个微处理器组合,DSP和微处理器的组合等。 Processor 4001 can be a CPU (Central Processing Unit), a general-purpose processor, a DSP (Digital Signal Processor), an ASIC (Application Specific Integrated Circuit), an FPGA (Field Programmable Gate Array) or other programmable logic devices, transistor logic devices, hardware components or any combination thereof. It can implement or execute various exemplary logic blocks, modules and circuits described in conjunction with the contents disclosed in this application. Processor 4001 can also be a combination that implements computing functions, such as a combination of one or more microprocessors, a combination of a DSP and a microprocessor, etc.
通信总线4002可包括一通路,在上述组件之间传送信息。通信总线4002可以是PCI(Peripheral Component Interconnect,外设部件互连标准)总线或EISA(Extended Industry Standard Architecture,扩展工业标准结构)总线等。通信总线4002可以分为地址总线、数据总线、控制总线等。为便于表示,图6中仅用一条粗线表示,但并不表示仅有一根总线或一种类型的总线。The communication bus 4002 may include a path for transmitting information between the above components. The communication bus 4002 may be a PCI (Peripheral Component Interconnect) bus or an EISA (Extended Industry Standard Architecture) bus, etc. The communication bus 4002 may be divided into an address bus, a data bus, a control bus, etc. For ease of representation, only one thick line is used in FIG6 , but it does not mean that there is only one bus or one type of bus.
存储器4003可以是ROM(Read Only Memory,只读存储器)或可存储静态信息和指令的其他类型的静态存储设备,RAM(Random Access Memory,随机存取存储器)或者可存储信息和指令的其他类型的动态存储设备,也可以是EEPROM(Electrically Erasable Programmable Read Only Memory,电可擦可编程只读存储器)、CD-ROM(Compact Disc ReadOnly Memory,只读光盘)或其他光盘存储、光碟存储(包括压缩光碟、激光碟、光碟、数字通用光碟、蓝光光碟等)、磁盘存储介质或者其他磁存储设备、或者能够用于携带或存储具有指令或数据结构形式的期望的程序代码并能够由计算机存取的任何其他介质,但不限于此。The memory 4003 can be a ROM (Read Only Memory) or other types of static storage devices that can store static information and instructions, a RAM (Random Access Memory) or other types of dynamic storage devices that can store information and instructions, or an EEPROM (Electrically Erasable Programmable Read Only Memory), a CD-ROM (Compact Disc Read Only Memory) or other optical disk storage, optical disk storage (including compressed optical disk, laser disk, optical disk, digital versatile disk, Blu-ray disk, etc.), magnetic disk storage medium or other magnetic storage device, or any other medium that can be used to carry or store the desired program code in the form of instructions or data structures and can be accessed by a computer, but is not limited to these.
存储器4003上存储有计算机程序,处理器4001通过通信总线4002读取存储器4003中存储的计算机程序。The memory 4003 stores a computer program, and the processor 4001 reads the computer program stored in the memory 4003 through the communication bus 4002 .
该计算机程序被处理器4001执行时实现上述各实施例中的图像数据DNA存储方法。When the computer program is executed by the processor 4001, the image data DNA storage method in the above-mentioned embodiments is implemented.
此外,本申请实施例中提供了一种存储介质,该存储介质上存储有计算机程序,该计算机程序被处理器执行时实现上述各实施例中的图像数据DNA存储方法。In addition, a storage medium is provided in an embodiment of the present application, on which a computer program is stored. When the computer program is executed by a processor, the image data DNA storage method in the above embodiments is implemented.
本申请实施例中提供了一种计算机程序产品,该计算机程序产品包括计算机程序,该计算机程序存储在存储介质中。计算机设备的处理器从存储介质读取该计算机程序,处理器执行该计算机程序,使得该计算机设备执行上述各实施例中的图像DNA数据存储方法。In an embodiment of the present application, a computer program product is provided, which includes a computer program stored in a storage medium. A processor of a computer device reads the computer program from the storage medium, and the processor executes the computer program, so that the computer device executes the image DNA data storage method in each of the above embodiments.
与相关技术相比,一方面,本申请提供的实施例利用了对图像进行分割的优势,支持分块读取,能够降低读取的测序成本,并支持并行编解码,进一步提高了编解码效率;另一方面,本申请提供的类四进制比特-碱基映射规则,具有适配生化约束的快速筛选方式,相对于现有技术能够更快地完成编解码。有效地解决了现有技术中存在的进行解码时只能解码全部序列,耗时高,编解码效率低的问题。Compared with the related art, on the one hand, the embodiment provided by the present application takes advantage of image segmentation, supports block reading, can reduce the sequencing cost of reading, and supports parallel encoding and decoding, further improving the encoding and decoding efficiency; on the other hand, the quaternary bit-base mapping rule provided by the present application has a fast screening method adapted to biochemical constraints, which can complete encoding and decoding faster than the prior art. It effectively solves the problem that only the entire sequence can be decoded during decoding, which is time-consuming and has low encoding and decoding efficiency in the prior art.
应该理解的是,虽然附图的流程图中的各个步骤按照箭头的指示依次显示,但是这些步骤并不是必然按照箭头指示的顺序依次执行。除非本文中有明确的说明,这些步骤的执行并没有严格的顺序限制,其可以按其他的顺序执行。而且,附图的流程图中的至少一部分步骤可以包括多个子步骤或者多个阶段,这些子步骤或者阶段并不必是在同一时刻执行完成,而是可以在不同的时刻执行,其执行顺序也不必是依次进行,而是可以与其他步骤或者其他步骤的子步骤或者阶段的至少一部分轮流或者交替地执行。It should be understood that, although the steps in the flowchart of the accompanying drawings are displayed in sequence as indicated by the arrows, these steps are not necessarily executed in sequence in the order indicated by the arrows. Unless otherwise specified herein, there is no strict order restriction on the execution of these steps, and they can be executed in other orders. Moreover, at least a portion of the steps in the flowchart of the accompanying drawings may include multiple sub-steps or multiple stages, and these sub-steps or stages do not have to be executed at the same time, but can be executed at different times, and their execution order does not have to be sequential, but can be executed in turn or alternately with other steps or at least a portion of the sub-steps or stages of other steps.
以上所述仅是本申请的部分实施方式,应当指出,对于本技术领域的普通技术人员来说,在不脱离本申请原理的前提下,还可以做出若干改进和润饰,这些改进和润饰也应视为本申请的保护范围。The above is only a partial implementation method of the present application. It should be pointed out that for ordinary technicians in this technical field, several improvements and modifications can be made without departing from the principles of the present application. These improvements and modifications should also be regarded as the scope of protection of the present application.

Claims (10)

  1. 一种图像数据DNA存储方法,其特征在于,所述方法包括:A method for storing image data in DNA, characterized in that the method comprises:
    获取所需编码的图像,对所述获取到的图像进行拆分,得到若干图像块;Acquire an image to be encoded, and split the acquired image to obtain a plurality of image blocks;
    对所述得到的若干图像块进行二进制转换得到对应的二进制数据串;Performing binary conversion on the obtained image blocks to obtain corresponding binary data strings;
    使用类四进制比特-碱基映射规则对所述二进制数据串进行碱基转换,得到对应的第一碱基序列;所述类四进制比特-碱基映射规则包括若干类四进制比特-碱基映射表;Performing base conversion on the binary data string using a quaternary bit-base mapping rule to obtain a corresponding first base sequence; the quaternary bit-base mapping rule includes a plurality of quaternary bit-base mapping tables;
    使用特定碱基转换规则转换后的编号对第一碱基序列进行编号,得到包含块编号的第二碱基序列;所述块编号位于所述第二碱基序列的头部;Numbering the first base sequence using the numbers converted according to the specific base conversion rule to obtain a second base sequence including a block number; the block number is located at the head of the second base sequence;
    对所述第二碱基碱基序列添加新的引物并进行DNA合成,得到DNA存储数据Add new primers to the second base sequence and perform DNA synthesis to obtain DNA storage data
  2. 如权利要求1所述的方法,其特征在于,所述方法还包括:The method according to claim 1, characterized in that the method further comprises:
    对所述得到的DNA存储数据进行测序得到碱基序列数据,根据得到的所述碱基序列数据中头部碱基序列筛选所需进行分块解码的数据。The obtained DNA storage data is sequenced to obtain base sequence data, and the data required for block decoding is screened according to the head base sequence in the obtained base sequence data.
  3. 如权利要求1所述的方法,其特征在于,所述获取所需编码的图像,对所述获取到的图像进行拆分,得到若干图像块,包括:The method according to claim 1, characterized in that the step of obtaining the image to be encoded and splitting the obtained image to obtain a plurality of image blocks comprises:
    获取所需编码的图像的宽、高像素值,将所述像素值除以需要等分的块数并向下取整得到图像块的宽、高像素值。Obtain the width and height pixel values of the image to be encoded, divide the pixel value by the number of blocks to be equally divided and round down to obtain the width and height pixel values of the image block.
  4. 如权利要求1所述的方法,其特征在于,对所述得到的若干图像块进行二进制转换得到对应的二进制数据串,包括:The method according to claim 1, characterized in that performing binary conversion on the obtained plurality of image blocks to obtain corresponding binary data strings comprises:
    获取拆分后的若干图像块,对所述各图像块并行二进制转换得到二进制数据串。A plurality of split image blocks are obtained, and binary data strings are obtained by performing parallel binary conversion on the image blocks.
  5. 如权利要求1所述的方法,其特征在于,所述类四进制比特-碱基映射规则包括:The method of claim 1, wherein the quaternary bit-base mapping rule comprises:
    使用所述类四进制比特-碱基映射表A*对所述二进制数据串的前4个比特进行碱基转换,得到两位初始碱基;Using the quaternary bit-base mapping table A*, base conversion is performed on the first 4 bits of the binary data string to obtain two initial bases;
    若所述两位初始碱基为重叠形式,则后续1或2个比特先使用类四进制比特-碱基映射表A进行碱基转换,得到第三位碱基;If the two initial bases are in overlapping form, the subsequent 1 or 2 bits are first converted using the quaternary bit-base mapping table A to obtain the third base;
    基于所述两位初始碱基为重叠形式,若下一次遇到重叠形式,则使用类 四进制比特-碱基映射表B进行碱基转换;否则使用类四进制比特-碱基映射表B*进行碱基转换;Based on the two initial bases being in an overlapping form, if an overlapping form is encountered next time, the quaternary bit-base mapping table B is used to perform base conversion; otherwise, the quaternary bit-base mapping table B* is used to perform base conversion;
    若上一次类四进制比特-碱基映射表B进行碱基转换,则下一次重叠形式使用类四进制比特-碱基映射表A进行碱基转换;非重叠形式使用类四进制比特-碱基映射表A*进行碱基转换;If the quaternary bit-base mapping table B was used for base conversion in the previous time, the quaternary bit-base mapping table A is used for base conversion in the next overlapping form; the quaternary bit-base mapping table A* is used for base conversion in the non-overlapping form;
    若所述两位初始碱基为非重叠形式,则后续1或2个比特继续使用类四进制比特-碱基映射表A*进行碱基转换,得到第三位碱基。If the two initial bases are in a non-overlapping form, the subsequent 1 or 2 bits continue to use the quaternary bit-base mapping table A* to perform base conversion to obtain the third base.
  6. 如权利要求5所述的方法,其特征在于,所述类四进制比特-碱基映射规则还包括:The method according to claim 5, characterized in that the quaternary bit-base mapping rule further comprises:
    当二进制数据串进行碱基转换中剩下最后的1或2个比特,使用末端比特处理表进行碱基转换。When the last 1 or 2 bits of the binary data string are left for base conversion, the end bit processing table is used to perform base conversion.
  7. 如权利要求1至6所述的方法,其特征在于,所述方法包括:The method according to claims 1 to 6, characterized in that the method comprises:
    对所述类四进制比特-碱基映射表A和表B进行变换,得到若干新的映射规则;Transforming the quaternary bit-base mapping table A and table B to obtain several new mapping rules;
    对得到的若干个新的映射规则使用如上编码流程进行编码,并记录碱基转换后的结果序列中GC的含量以及编码密度;The obtained several new mapping rules are encoded using the above encoding process, and the GC content and encoding density in the result sequence after base conversion are recorded;
    选择所述结果序列中GC含量在40%-60%且编码密度最高的对应的映射规则作为输出。The mapping rules corresponding to the result sequences with GC contents between 40% and 60% and the highest coding density are selected as output.
  8. 一种图像数据DNA存储系统,其特征在于,所述系统包括:An image data DNA storage system, characterized in that the system comprises:
    图像获取模块,用于获取所需编码的图像,对所述获取到的图像进行拆分,得到若干图像块;An image acquisition module is used to acquire an image to be encoded and split the acquired image into several image blocks;
    数据转换模块,用于对所述得到的若干图像块进行二进制转换得到对应的二进制数据串;A data conversion module, used for performing binary conversion on the obtained image blocks to obtain corresponding binary data strings;
    数据编码模块,用于使用类四进制比特-碱基映射规则对所述二进制数据串进行碱基转换,得到对应的第一碱基序列;所述类四进制比特-碱基映射规则包括若干类四进制比特-碱基映射表;A data encoding module, configured to perform base conversion on the binary data string using a quasi-quaternary bit-base mapping rule to obtain a corresponding first base sequence; the quasi-quaternary bit-base mapping rule includes a plurality of quaternary bit-base mapping tables;
    序列编号模块,用于使用特定碱基转换规则转换后的编号对第一碱基序列进行编号,得到包含块编号的第二碱基序列;所述块编号位于所述第二碱基序列的头部;A sequence numbering module, used to number the first base sequence using the numbers converted according to a specific base conversion rule to obtain a second base sequence including a block number; the block number is located at the head of the second base sequence;
    数据合成模块,用于对所述第二碱基碱基序列添加新的引物并进行DNA合成,得到DNA存储数据;A data synthesis module, used for adding a new primer to the second base sequence and performing DNA synthesis to obtain DNA storage data;
    数据解码模块,用于对所述得到的DNA存储数据进行测序得到碱基序列数据,根据得到的所述碱基序列数据中头部碱基序列筛选所需进行分块解码的数据。The data decoding module is used to sequence the obtained DNA storage data to obtain base sequence data, and to filter the data required for block decoding according to the head base sequence in the obtained base sequence data.
  9. 一种电子设备,其特征在于,包括:至少一个处理器、至少一个存储器、以及至少一条通信总线,其中,An electronic device, characterized in that it comprises: at least one processor, at least one memory, and at least one communication bus, wherein:
    所述存储器上存储有计算机程序,所述处理器通过所述通信总线读取所述存储器中的所述计算机程序;The memory stores a computer program, and the processor reads the computer program in the memory through the communication bus;
    所述计算机程序被所述处理器执行时实现权利要求1至7中任一项所述的图像数据DNA存储方法。When the computer program is executed by the processor, the image data DNA storage method according to any one of claims 1 to 7 is implemented.
  10. 一种存储介质,其上存储有计算机程序,其特征在于,所述计算机程序被处理器执行时实现如权利要求1至7中任一项所述的图像数据DNA存储方法。A storage medium having a computer program stored thereon, wherein when the computer program is executed by a processor, the image data DNA storage method according to any one of claims 1 to 7 is implemented.
PCT/CN2022/136367 2022-12-02 Image data dna storage method and system, and electronic device and storage medium WO2024113382A1 (en)

Publications (1)

Publication Number Publication Date
WO2024113382A1 true WO2024113382A1 (en) 2024-06-06

Family

ID=

Similar Documents

Publication Publication Date Title
JP3778087B2 (en) Data encoding apparatus and data decoding apparatus
US6885319B2 (en) System and method for generating optimally compressed data from a plurality of data compression/decompression engines implementing different data compression algorithms
US6819271B2 (en) Parallel compression and decompression system and method having multiple parallel compression and decompression engines
CN111510156A (en) Method for dynamically compressing and decompressing large file based on segmentation
JP3714935B2 (en) Improved Huffman decoding method and apparatus
RU2679960C2 (en) Method and system of database for locating documents
CN102255617A (en) Storage method of Huffman tree and method of decoding data by using arrays
WO2024113382A1 (en) Image data dna storage method and system, and electronic device and storage medium
US11424761B2 (en) Multiple symbol decoder
US9413388B1 (en) Modified huffman decoding
JP5472610B2 (en) Method and apparatus for encoding / decoding numeric data string
CN111061722A (en) Data compression method, data decompression method, device and equipment
JP7139897B2 (en) Encoding device, decoding device, encoding method, decoding method, encoding program, decoding program
CN116258781A (en) Image data DNA storage method, system, electronic device and storage medium
US7777653B2 (en) Decoding variable-length code (VLC) bitstream information
JP5626440B2 (en) Method and apparatus for encoding / decoding numeric data string
TW200920139A (en) Apparatus and method for parallely processing video decoding
JP6885466B2 (en) Coding device, decoding device, coding method, decoding method, coding program, decoding program
CN112699639B (en) Storage method and device of integer data and storage medium
CN111213146A (en) Pseudo data generating device, method thereof, and program
JP7493062B1 (en) MEMORY SYSTEM, DECODING CIRCUIT, AND ENCODED DATA GENERATION METHOD
JP4152491B2 (en) Data alignment device and compression device
CN110875744B (en) Coding method and device
KR101270633B1 (en) Fast Multimedia Huffman Decoding Method and Apparatus for Adapting Plurality of Huffman Tables
JPH0621828A (en) Vector quantizing decoder