GB2397666A - Logical data grouping in digital storage system - Google Patents

Logical data grouping in digital storage system Download PDF

Info

Publication number
GB2397666A
GB2397666A GB0301803A GB0301803A GB2397666A GB 2397666 A GB2397666 A GB 2397666A GB 0301803 A GB0301803 A GB 0301803A GB 0301803 A GB0301803 A GB 0301803A GB 2397666 A GB2397666 A GB 2397666A
Authority
GB
United Kingdom
Prior art keywords
data
tape
bytes
rows
code words
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Withdrawn
Application number
GB0301803A
Other versions
GB0301803D0 (en
Inventor
Andrew Hana
John Patrick Hardwick
Robert Morling
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
HP Inc
Original Assignee
Hewlett Packard Co
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Hewlett Packard Co filed Critical Hewlett Packard Co
Priority to GB0301803A priority Critical patent/GB2397666A/en
Publication of GB0301803D0 publication Critical patent/GB0301803D0/en
Priority to JP2004016517A priority patent/JP2004234829A/en
Publication of GB2397666A publication Critical patent/GB2397666A/en
Priority to US10/986,976 priority patent/US7831869B2/en
Withdrawn legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11BINFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B20/00Signal processing not specific to the method of recording or reproducing; Circuits therefor
    • G11B20/10Digital recording or reproducing
    • G11B20/12Formatting, e.g. arrangement of data block or words on the record carriers
    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11BINFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B20/00Signal processing not specific to the method of recording or reproducing; Circuits therefor
    • G11B20/10Digital recording or reproducing
    • G11B20/18Error detection or correction; Testing, e.g. of drop-outs
    • G11B20/1833Error detection or correction; Testing, e.g. of drop-outs by adding special lists or symbols to the coded information

Abstract

A method of formatting a block of user data comprising: arranging the block of user data into rows and columns of bytes, i.e. array 500, and applying an error correction code to individual rows of bytes, such that the rows each comprise four code words. The code words may be C2 code words and may be interleaved along the length of the row. Each column may comprise two C1 code words which may also be interleaved. Additionally, a metal particle type tape storage device, having a width of 3.81mm plus or minus 0.01mm and a length of 170m plus or minus 5m, is claimed.

Description

À . . . . À . À À À
LOGICAL DATA GROUPING IN DIGITAL STORAGE SYSTEM
Field of the Invention
The present invention relates to formatting of data and particularly although not exclusively to a method of writing data to a data storage medium, and to a data processing apparatus for arranging data into a format.
Background to the Invention
Referring to Fig. 1 herein, a known data storage format for writing data to a tape data storage medium has a logical data sub group 100 known as a 'G4 sub group' comprising an array of bytes of data arranged in rows and columns, there being 96 columns and 124 rows in a data group, each column being referred to as a 'data fragment'. Each column comprises 124 bytes of data numbered 0 to s 123. Each row comprises 96 bytes, numbered 0 to 95.
The data sub group has applied to it redundancy error correction coding (ECC). The applied redundancy error correction coding, which may be for example a Reed-Solomon coding, comprises 192 C1 code words, each column do comprising a pair of interleaved C1 code words; and 336 C2 code words, extending over 112 of the rows. Each coded row comprises 3 interleaved C2 code words, each code word being of 32 bytes. Shown in Fig. 1 is an example of first row 101, comprising an interleaving of 3 C2 code words in row 0, where bytes of 3 code words A, B. C respectively of the first row are interleaved z alternately such that individual bytes are ordered A, B. C, A, B. C, along the row.
C2 code words of another 111 rows are arranged similarly Since each column comprises 2 C1 code words, the G4 data sub-group comprises 96 x 2 = 192 C1 code words. C1 code words run vertically down the o columns, whilst C2 code words run horizontally across rows of the array. Each C1 code word extends along the entire height of its column, and each C2 code word À À. À. À .
À bee À * .e À . Àe e.e extends across the entire width of its row. The C2 code words occupy rows 0 - 1 11, so that there are 112 x 3 = 336 C2 code words in the G4 data sub- group.
Referring to Fig. 2 herein, the entire G4 data sub-group is stored on a magnetic tape data storage medium in a single track 200 extending transversely across a length of the tape as a stripe. In the prior art format, a physical width of the track a is 6.8pm, the track density is 147 tracks/mm (3,735 tracks /inch), and the bit density of bits written along the tracks is 4,800 bits/mm (122,000 bits/inch). This results in a data storage capacity of 20 Gbytes of uncompressed JO data (40Gbytes at 2:1 compression ratio) on a single data storage cartridge, containing a tape data storage medium of length 150 m. A mean time between failure (MTBF) reliability parameter of the prior art system, which is related to tape thickness, is 250,000 hours at a 30% duty cycle.
A plurality of tracks are written sequentially in parallel as the tape moves past a write head, the plurality of tracks abutting each other, so as to store a maximum amount of data on the tape. At a start of each track is provided a first synchronization tone 201 which allows a phase locked loop (PLL) to synchronize with the data. Similarly, at the end of each track is provided a second set of tones 202. The G4 data sub-group is stored between the first and second tone regions in a single track, and extends across a width of the tape Due to increased data storage demands, ongoing objectives in the improvement of tape data storage devices include: Increasing the amount of data which can be stored on a tape data storage medium; Increasing the data rate for writing data to a data storage medium; Improving reliability and byte error rates. À .
À À:: . i. .
À . . . . À . À À À À À À The above 3 parameters are interrelated, and attempts at increasing the amount of data storage capacity can affect the reliability of a tape data storage system.
Summary of the Invention
According to a first aspect of the present invention there is provided a method of formatting a block of user data, said method comprising: arranging said block of user data into an array of bytes, said array To comprising a plurality of rows and a plurality of columns of said bytes; and applying an error correction code to individual said rows of bytes, such that said error correction coded rows each comprise 4 code words.
According to a second aspect of the present invention there is provided a method of writing data to a linear tape data storage medium, said method comprising; arranging a block of user data into a data group of a logical array of data 2 o bytes, comprising a plurality of rows of said data bytes and a plurality of columns of said data bytes; applying an error correction coding to individual ones of said rows, such that individual said rows are each arranged into 4 code words; writing said data group as a data track extending across a width of said tape data storage medium such that all of said data group is contained within said single data track extending across said width of said tape data storage medium, and transverse to a main length of said tape data storage medium.
À . . À _ À À. . e t À À À À e . . À À À À ... . According to a third aspect of the present invention there is provide a data processing apparatus for arranging data into a format for writing to a data storage medium, said apparatus comprising: a memory for storing a data group comprising a plurality of bytes of user data arranged logically in rows; an error correction coding device for applying an error correction code to individual rows of said array, such that each said individual row is coded into 4 o code words; and a write head for writing each said data group across a width of a tape data storage medium, such that each said data group is written along a corresponding single track extending across a width of said tape data storage medium, and s transverse to a main length of said tape data storage medium.
According to a fourth aspect of the present invention there is provided a method of writing data to a linear tape data storage medium, said method comprising; arranging a block of user data into a logical group of data bytes, comprising a plurality of rows of said data bytes and a plurality of columns of said data bytes; z applying an error correction code to said data group such that each said column of said data group is coded with 2 C1 code words; applying an error correction coding to individual said rows, such that said individual rows are each arranged into 4 C2 code words; and À ' . e.e e acc Àee ÀÀ c Àc ec À c ÀÀ Àc e c # À,e. acee c writing said data group in a single data track across a width of said tape data storage medium, such that all of said data group is contained within said single data track extending across said width of said tape data storage medium.
According to a fifth aspect of the present invention there is provided a tape data storage system comprising: at least one write head for writing data to a magnetic tape data storage medium; a transport mechanism for transporting said tape data storage medium past said write head; a logical formatting device for formatting data into a data group comprising an array of bytes of data arranged logically in a plurality of rows and a plurality of columns; a memory device for storing said logical array of data; and an error correction coding device for applying an error correction code to individual said rows of bytes, such that said individual rows are each coded into 4 code words.
The invention includes a metal particle type tape data storage media 2 5 cartridge comprising a band of elongate tape having: width in the range 3.81 mm plus or minus 0.01 mm; and length in the range 1 70m plus or minus 5m.
À C C À À # À C À . À À À À À C À À ÀÀ À À À À À À . . ... # À À S t s e À À À À À ., À À
Brief Description of the Drawings
For a better understanding of the invention and to show how the same may be carried into effect, there will now be described by way of example only, specific embodiments, methods and processes according to the present invention with reference to the accompanying drawings in which: Figure 1 schematically illustrates a prior art G4 data subgroup according to a known tape data storage format; To Figure 2 schematically illustrates a physical layout of a prior art G4 data sub-group written along individual corresponding respective tracks across a tape data storage medium, as is known in the art; Figure 3 schematically illustrates a problem of variation of 'straightness' of a s physical track written in a single stripe across a width of a tape data storage medium; Figure 4 schematically illustrates a set of physical and practical considerations and limitations in achieving an overall increase in data storage o capacity for a novel tape data storage system; Figure 5 schematically illustrates a novel data sub-group according to a specific implementation of the present invention; z Figure 6 schematically illustrates a physical layout of a novel data sub group written to tape along a data track; Figure 7 schematically illustrates a data processing apparatus for creating a novel logical sub-group according to a specific implementation of the present invention; and . he e t T À e À e C. C A e..
Figure 8 schematically illustrates in perspective view, an example of a tape drive unit according to a specific embodiment of the present invention.
Detailed Description of a Specific Mode for Carrying Out the Invention There will now be described by way of example a specific mode contemplated by the inventors for carrying out the invention. In the following description numerous specific details are set forth in order to provide a thorough understanding of the present invention. It will be apparent however, to one skilled in the art, that the present invention may be practiced without limitation to To these specific details. In other instances, well known methods and structures have not been described in detail so as not to unnecessarily obscure the present invention.
As data storage densities on tape data storage media improve, making ls further significant progress in increasing data storage capacity becomes increasingly difficult due to physical limitations of the tape data storage medium, and due to mechanical tolerances of tape transport components and read/write heads. Specific problems which can be encountered include the following: 2 o Firstly, to increase bit density on a tape, means that each bit of data must physically occupy a smaller area of a magnetic coating of a magnetic tape data storage medium. Consequently, a reduction in signal to noise ratio can be expecting in reading data as the bit density on tape increases.
Secondly, in a helical scan system in which individual tracks of data are written as stripes transverse to main length of the tape and, across a width of an elongate tape data storage medium, by a rotating write head, if the width of tracks is decreased, then as well as encountering a reduction in signal to noise ratio due to the reduction in width of the track, there is also a tracking error problem in that so a read head is more difficult to align for reading a track, due to the reduced dimension of the track.
À . .. . e eve À ... . . À À À e À À e À À À À À À eve À6 Àe À e.
Thirdly, as track width decreases, there is an 'integration problem' where a different tape drive unit is used to read a tape data storage medium, compared to the tape drive unit which has been used to write data to the tape medium. Each tape drive unit tends to follow its own characteristic shape of path as its stripes data tracks across the tape. This means that data written by one tape drive unit may not be readable by another, nominally identical, tape drive unit.
Referring to Figure 3 herein there is schematically illustrated in exaggerated form, one example of a characteristic path 300 followed by a particular specific To tape drive unit, compared to an ideal straight path 301, which represents a nominal path which every tape drive unit should follow when reading or writing a track across a width of tape data storage medium. As the track width is reduced, the tracking error due to a rotating read/write head following a path which deviates from the ideal straight path becomes more significant, since the amount of deviation of the actual tape drive specific path relative to the width of the ideal nominal track increases.
A particular tape drive unit may have its own characteristic curve as the track is written across the tape. Since each tape drive unit has a different 2 o respective characteristic curve for tracks which it reads or writes, tracks must be wide enough such that a track written by a first tape drive, having its own characteristic curve can be read by a second tape drive having a different characteristic curve. As the nominal width of tracks decreases, interchangeability problems between different devices increase.
Fourthly, there is problem of random byte errors, which increase as the signal to noise ratio decreases. In the prior art, C1 error correction coding is used to correct for random byte errors.
o Fifthly, there is the problem of damage to a tape surface causing longer and non-random errors. In the prior art, C2 coding is used to alleviate the effect of À . À À ..
À À e e À . ... . À e tee e e e À e e À À e Àe e e extended tape damage, which most commonly takes the form of elongate scratches running along a length of a tape data storage medium.
Referring to Figure 4 herein, there is illustrated schematically an interrelationship between parameters and limitations which need to be taken into consideration in achieving an overall increase in data storage capacity.
A total available increase in data storage capacity 400 is dependent upon the maximum allowable increase in tape length 401, the maximum allowable increase in track density 402, and any increases in capacity which can be achieved by significant re-organization of the logical sub-group, on the assumption that one sub-group will be written across a single width of tape, and the width of tape will not be significantly varied compared to the prior art tape width.
The allowable increase in tape length is limited by physical factors including the minimum tape thickness which can be achieved. The minimum tape thickness which can be achieved depends upon (1) substrate thickness and (2) the thickness of a magnetic material coating the substrate. In turn, the minimum o practical substrate thickness which can be used depends upon reliability factors, and the type of substrate material which is being used.
Limitations on the substrate thickness are imposed by the design and tolerance of tape transport mechanisms. Varying the substrate thickness and substrate material also affects the reliability of the tape, particularly its susceptability to edge defects and damage along the edge of the tape. Varying the thickness of magnetic media on the film affects the signal to noise ratio of a signal which can be read back from the tape, which also has the consequence, that thinner tapes require a redesign of tape drives to improve tolerances to 3 o make them able to read a lower signal to noise ratio signal.
À - . À À À À À eÀ À e À À - . .. - À . c r e À À .. e À A determination of the allowable increase in track density 402 is limited by the problem of decreasing signal to noise ratio as the track density increases, and also the problem of interoperability of tape drive units and tape cassettes as the track width is reduced.
Re-organization of a data sub-group 403 has an implication for changing the bit density 404. In turn, a change in bit density has a corresponding change in signal to noise ratio 405. Changes in signal to noise ratio limit the allowable increase in track density, and the thickness of magnetic media on the film.
Changes to the structure of the logical sub-group therefore have a nonintuitive effect on the total available increase in capacity which is a non-linear relationship, since changing the logical structure of the data sub-group changes the bit density, and hence the signal to noise ratio which varies other parameters, in particular the allowable track density and physical characteristics of the tape, which dictate the allowable tape length. Each of these parameters, allowable track density and allowable increase in tape, also affect the total available increase in capacity 400.
o According to a specific implementation of the present invention, an improved capacity tape data storage system is provided by the following combination of features.
À Tape data storage cartridges are provided, having a longer length of tape s data storage medium, compared to a prior art cartridge. The tape length is increased by approximately 15% - 45% compared to the prior art cartridge.
This involves reducing a thickness of tape substrate, and using a reduced thickness film of magnetic medium. Use of a reduced thickness magnetic film medium may provide a slightly worse signal to noise ratio than the prior art film, in the absence of any other changes in format. Further, reductions in substrate thickness may provide a different stretching characteristic compared À .À À À e À À . . À À À À À . .- . : it: :: Àe À: 77::: À.e to prior art substrate. However, the disadvantages of reduced tape thickness are off-set by provision of a greater active surface area of tape within a cartridge.
À A width of data track is reduced compared to the prior art case. According to a specific embodiment of the present invention, data tracks have a nominal width of 5.4,um, compared to the prior art in which a nominal track width of 6.8'um is employed. Reduction in track width, in isolation of other improvements, allows for an increase in data storage capacity for a same To length of tape, of the order of 12%, since more tracks can be written per unit area of tape. However, reduction in track width carries with it a corresponding reduction in signal to noise ratio in reading from tape, and also tracking errors and interchangeability of a tape between different tape drive units is affected.
Further, because there are a higher number of tracks written per unit length of tape than in the prior art case, physical damage to the tape, for example elongate scratches, for a same length of scratch will cause greater obliteration of data than in the prior art case, since there are more tracks per unit length
than in the prior art case.
À A novel sub group data format.
In view of the above considerations, the inventors have been faced with the need to overcome a variety of problems and to find a technical solution which allows acceptable performance on each of the above itemised parameters of signal to noise ratio, leading to random bit errors; interchangeability between different tape drive units; and protection against the relatively increased effect of tape damage, in particular scratches or obliteration of data, whilst improving the overall data storage capacity.
À . ... . ..
À . . À À À . . . . À a c À À À À . À.
À-. À .. - - e: ee: :: Àe À t À À ... À .
ln order to increase capacity, there were several logical data formatting options available.
Firstly, more C1 code words (i.e. 3 or more) per fragment could have been incorporated, keeping the same number of bytes per C1 code word as in the prior art format. This would have entailed columns of length 186 bytes, 248 bytes, and so on, where each additional code word per fragment contains 62 bytes.
Secondly, the ratio of user data bytes to redundancy bytes per C1 code word To could have been altered.
Thirdly, the ratio of user data bytes to parity bytes in the C2 code words could be varied, in order to give a higher proportion of user data, thereby increasing capacity, at the expense of a lower proportion of C2 parity bytes. However, in a case where all other parameters such as track width, and physical byte length on tape are unchanged, this would result in a lower level of error correction capability for tape damage or tape defects, since the amount of redundancy coding in each C2 code word is relatively reduced.
go For each possible combination of changes of parameters in the sub data group, that is: changes to the ratio of user bytes to parity bytes within a C1 code word; changes to the number of C1 code words within a data fragment; changes to the relative ratio of user bytes to parity bytes within the C2 code word, this results in a different physical bit length, given that the whole data group is to be :s written across a single track diagonally across the width of the tape. Each different bit length produces a corresponding respective signal to noise ratio, longer bit lengths giving a better signal to noise ratio compare to shorter bit lengths.
In order to improve data storage capacity, the density of bits written to tape must increase. However, if the bit density is increased too much, then the corresponding reduction in signal to noise ratio means that the readability of data e....
À À À À . À . ... . À 1 À . . . . À -À . . . e. en, À À ... . À À written to tape is reduced. Therefore, the bit density needs to be increased in a manner in which the effect of reduced signal to noise ratio on the re-readability of data from the tape remains within acceptable limits, such that the tape can be re read by the same tape drive unit which wrote the data in the first place, and can be re-read by other tape drive units, being ones which did not originally write the data to a particularly tape. In other words, interchangeability of tape drive units must be maintained and the signal to noise ratio of the data written to tape must be above a threshold level for achieving interoperability of tape drive units and tape data storage media.
A full and comprehensive analysis of variation in bit length, and consequent signal to noise ratio for each of the variable parameters of C1 code word byte ratio; C1 code word number per fragment; C2 byte ratio; and C2 code word number per row is complex and time consuming. However, as a result of an analysis, a solution as shown with reference to Fig. 5 was achieved, having 4 C2 code words per row.
As a result of experiments and analysis, the inventors have realised that an optimum solution for achieving increased capacity, within significantly degrading the interoperability of tape drive units and tape data storage media is to provide the logical data group format as described with reference to Fig. 5 herein.
The inventors have conceived a data format, after consideration of the above technical problems, which is now described as follows.
Referring to Fig. 5 herein, a novel data storage format according to a specific implementation of the present invention has a logical data group 500 comprising an array bytes of data arranged in rows and columns, there being 128 columns in a sub data group numbered O to 127, each column being referred to so as a 'data fragment', and 124 rows numbered 0 to 123. Each column comprises 124 bytes of data. Each row comprises 128 bytes of data.
. . À À . c. . eve, e a. e e T À e e e e e e The sub data group has applied to it redundancy error correction coding (ECC). The applied redundancy error correction coding may be for example a Reed-Solomon coding. Since each column comprises 2 C1 code words, the data sub-group comprises 128 x 2 = 256 C1 code words. Each column (data fragment) comprising a pair of interleaved C1 code words. Each row comprises 4 interleaved C2 code words, each C2 code word being of 32 bytes.
C1 code words run vertically down the columns, whilst C2 code words run horizontally across rows of the array. Each C1 code word extends along the To entire height of its column, and each C2 code word extends across the entire width of its row. The C2 code words occupy rows 0 - 111, so that there are 112 x 4 = 448 C2 code words in the data sub-group.
Shown in Fig. 5 is an interleaving of 4 C2 code words in a first row (row 0), where bytes of 4 code words A, B. C, D respectively of the first row are interleaved alternately as such that individual bytes are ordered A, B. C, D, A, B. C, D Rows 1 to 111 are similarly interleaved.
Referring to Fig 6 herein, the entire sub-group is stored on a magnetic tape go data storage medium in a single striped track 600 extending transversely across a length of the tape. A physical width a of the track is 5.4,um plus or minus 0.1'um, giving a track density in the range 181 189 tracks/mm, and nominally tracks/mm (4,597 - 4,800.6 tracks/inch, and nominally 4,699 tracks /inch).
Each data fragment is written across a track with a bit density in the range 6,220 : 6,614 bits/mm, and nominally 6,417 bits/mm (158,000 168,000 bits/inch, and nominally 163,000 bits/ inch) A plurality of tracks are written sequentially as the tape moves past a write head, the plurality of tracks abutting each other so as to store a maximum amount of data on the tape.
À a ate a sea a e e e À 8 C e a a a a e a 6 a e a À a.' Rae a sa. a À À . . ÀÀ À : '. .. : : À À . À. . At a start of each track is provided a first synchronization tone 601 which allows a phase locked loop (PLL) to synchronize with the data. Similarly, at the end of each stripe is provided a second set of synchronization tones 602. The data sub-group is stored between the first and second tone regions, and extends across a width of the tape in a straight line diagonally across a width of the tape.
The tape data storage medium itself comprises a single band of tape, having a length 1 70m plus or minus 5m. The tape is of a metal particle (MP) type, having an overall thickness of 5.3,um plus or minus 0.02,um. The tape comprises base film substrate having a thickness of 3.6 Bum plus or minus 0.02'um. A reverse side of the substrate is coated with back coating, and an upper side of the substrate is coated with an undercoat layer, and on top of the undercoat layer, there is formed a magnetic coating. The width of the tape is 3.81 mm plus or minus 0.01 mm.
Initially, solutions which involved only increasing tape length and increasing track density, but without increasing byte density were investigated.
However, moving from a 6.8'um track width to a 5.4'um track width, i.e. a 20% reduction in track width, for example means that the increased mechanical o tolerances of the tape drive required to achieve interoperability of tape drive units and tape cartridges, would make such drives technically difficulty to produce, and uneconomic.
Further, using a longer tape length more than about 200 metres, in a single : cartridge, means having a reduced tape substrate thickness, which increases interoperability problems, and introduces further problems in tape transport, which requires significant changes to tape drive mechanisms, again making tape drive units technically difficult to manufacture, and uneconomic. Using a thinner substrate, makes the tape more prone to damage at each edge of the tape.
Increasing capacity by a combination of increased tape length, and reduced track width, presents all of above problems in combination. Combining the finer À. e À.
À . . .. . : e: : .: À À À . . : À :e À À . - tolerances required for a thinner track width, with a tape having a thinner substrate, and therefore a less durable tape, which is much more subject to mechanical damage, particularly at the edges of the tape, then this dramatically decreases the interoperability of tape drive units and tape cartridges. Further, there is the problem of producing a thin enough film substrate to produce a tape of longer than approximately 200 meters length, to fit into a known format cartridge. As a practical matter, producing such a thin substrate film whilst achieving reliability of the tape is at present not feasible.
To Consequently, achieving significant increases in data storage capacity by varying physical parameters of track length and track width (trackdensity) alone does not give a high enough increase in data storage capacity over prior art systems of comparable size and weight. Increasing tape length and track density necessitates finer tolerances of tape drive units and tape data storage cartridges, which in turn, reduces the interoperability of tape drive units and cartridges, and also results in mechanical tolerances which are too high to produce a practical commercial tape data storage system.
The inventors examined varying the parameter of bit density, as a way of go increasing data storage capacity. Constraints on increases in bit density arise from the logical format of a data sub-group, which is written in a single track striped across a width of tape. In a data subgroup which has applied to it error correction coding (ECC), a number of code words into which the data is sub divided can only be increased in integer units of one code word. Therefore, if the z size of a data subgroup is to be increased by adding bytes, the number of bytes must correspond with an integer number of C1 and C2 code words. Therefore, in Fig. 5, extensions to the sub-group can be made by addition of an extra C2 code word per row, or one or more extra C1 code words per column (fragment). This has a consequence that byte length of bytes physically written to tape is not a o continuously variable parameter, but is only variable in discrete steps, depending upon whether it is the number of C1 code words and/or C2 code words which is to be increased. Since bit density is directly related to signal to noise ratio, signal r It a to noise ratio is not a continuously variable design parameter, but can only be varied in discrete steps.
By selecting a C2 code word length of 32 bytes, the ratio of user data bytes to parity bytes is preserved, without reducing further the capability of the C2 error correction to rectify the same number of missing data bytes. Any reduction in C2 code word length would involve a lower ratio of user data bytes to parity data bytes, therefore reducing the interval in signal to noise ratio by reducing the C2 code word length would involve a corresponding reduction in data capacity within To each C2 code word.
Therefore, the variable parameters in selecting the data group format include variation of C2 code word length, variation of C1 code word length, each of which has an implication for efficiency of correctability of data, and an implication for efficiency of data capacity within the code word; and the number of C1 code words in each data fragment (column), and the number of C2 code words in each row.
The length of the C2 code word cannot be reduced beyond a limit which is o dictated by the minimum length of scratches or other long defects along the tape, which must be correctable by the C2 code words. Each time a new C2 code word length is considered a full analysis of the correcting capability of that C2 code Further, the minimum length of C2 code word is also limited by the proportion of user data to parity data within each C2 code word which is necessary in order to achieve optimum data storage capacity. Reductions in C2 code word length lead to a less efficient ratio of user data, or in other words shorter C2 code words produce proportionately less data storage capacity than o longer C2 code words.
À e.. ..
À . . .. . . . ... . r; r t Also, increasing the code word length involves additional hardware. For example taking an example of a 32 byte C2 code word, to get the same error correcting capability out of a 64 byte C2 code word. A code word which is more difficult to create in hardware or firmware is obtained, requiring hardware or firmware which is more difficult to implement. Therefore, there is a technical disadvantage in increasing the C2 code word length. The longer the code word, the more powerful it becomes, but the harder it becomes to create in hardware or firmware. Therefore, a compromise needs to be made between increasing the length of code word in order to increase the error correction capability of that To code word, and between the additional difficulty in implementing a system for creating that code word in hardware and firmware.
Choosing the optimum values of C1 code word length, C2 code word length, number of C1 code words in a data fragment, and number of C2 code words in a row requires detailed analysis, and a consideration of trade off's and optimizations. Arriving at a combination of C1 and C2 code word number and C1, C2 code word lengths which provides an optimum performance is a non-trivial task.
go In order to achieve a significant increase in data storage capacity, for example to store 40Gbytes of uncompressed data on a single tape cartridge, (80 Gbytes at 2:1 compression ratio) requires an increase in density from the prior art density of 122Kbits per inch. In arriving at the specific implementation described herein, the C1 and C2 parameters including code word length and number of code words was optimized, in order to give the required number of Kbits per inch of user data, in order to achieve the target capacity of a single tape data storage medium. There are various possible implementation options giving for example 36Gbytes, 38Gbytes, 40Gbytes or 45Gbytes uncompressed capacity on a single tape. However, each of these implementation options gives a different bit density, so and hence a different signal to noise ratio. Further, the relationship between signal to noise ratio and capacity is not intuitive, since to achieve each capacity requires a different C1/C2 combination, and a different ratio of parity bytes to user .. e ee. À e.
À À . . À À À À À À À . . e À e e data bytes within the code words. Therefore, for example a data capacity of 36Gbytes on a single length of tape may give a worse signal to noise ratio than using an other combination of C1/C2 code words which gives 38Gbytes of data on a tape, due to the different overhead of the parity bytes within the specific choice of error correction coding used.
In the specific implementation described herein, bearing in mind the constraint that C2 code words can only be added or subtracted from rows in whole units of code words, then having selected a system using 4 code words o per row, this enabled a tape data storage system having only a marginal increase in tape length, and only a relatively marginal decrease in track width to be utilized, to achieve a significant increase in overall data storage capacity to give of the order of 40Gbytes of uncompressed data storage on a single tape data storage medium. The signal to noise ratio resulting from the bit density was within a limit which could be processed on the basis of a practical manufactured tape data storage system. This has the advantage that a slight increase in tape length does not necessitate a complete re-design of the tape substrate and tape film, but rather, that tape re-design is a fine tuning exercise, rather than a fundamental re-design of the tape data storage medium itself. In the specific implementation 2 o adopted, the tape data storage medium is thinner than the prior art tape, however the reduction in thickness is small enough such that it has been achieved by a reduction in magnetic film media thickness rather than a reduction in the thickness of the substrate material itself metal particle (MP) type tape can be used.
If, on the other hand if 5 code words per row had been used, then this would have necessitated using a metal evaporated (ME) type tape in order to achieve the required byte density and signal to noise ratio.
so Referring to Fig. 7 herein, there is schematically illustrated a tape data storage system, according to a specific implementation of the present invention.
The tape data storage system comprises: a host interface 700 for receiving user À e.. À .
À À e À À e À À À . À.
Àe:: Àc I: : À. : : À : data from a host device, for example a host computer; an internal data bus 701; a main memory buffer 702 for storing user data whilst it is being formatted and error correction coded; a logical formatter device 702 which operates to arrange an incoming stream of host data into logical 2 dimensional arrays of data as described with a reference to Fig. 5 herein, the logical formatter comprising an error correction encoder 703 for applying an error correction code to rows and columns of each data sub group as described herein before; a physical formatter 704 for formatting logical data groups into a continuous stream of data fragments for writing as physical data tracks, there being 128 data fragments per physical o data track; a read/write component 704 for writing data to at least one read/write head 705, and conversely, for reading data for accepting data from at least one read head; and one or a plurality of tape data storage cartridges 706, each containing a length of elongate magnetic tape data storage medium.
s The write heads operate to write a plurality of data tracks sequentially along a length of the tape, each physical track being written as a stripe across a width of the tape as described with reference to Fig. 6 herein. The elongate tape data storage medium contained in the cartridge 706 has a length of the order 170m plus or minus 5m, a width of the order 3.81 mm plus or minus 0.01 mm; a 2 o tape thickness of the order 5.3 Sum, of which a substrate thickness is of the order of 3.6pm.
Referring to Figure 8 herein, the tape data storage system as described with reference to Fig.7 herein, may be implemented as a discrete stand alone tape drive unit, having its own casing, power supply, and connections.
In other embodiments, the tape data storage system may be implemented within a host computer or other computer entity within a same casing as the host computer or other computer entity.
À . ... À ..
À À À À À À . ..
À À . ... .

Claims (24)

  1. e. :: : c. .e Àe... . . Claims: 1. A method of formatting a block of user
    data, said method comprising: arranging said block of user data into an array of bytes, said array comprising a plurality of rows and a plurality of columns of said bytes; and applying an error correction code to individual said rows of bytes, such that o said error correction coded rows each comprise 4 code words.
  2. 2. The method as claimed in claim 1, further comprising: in each of said error correction coded rows, interleaving a corresponding said 4 code words, such that each code word extends along substantially an entire length of said row.
  3. 3. The method as claimed in claim 1, further comprising: So in each of said error correction coded rows, interleaving a corresponding said 4 code words, such that each code word extends along substantially an entire length of said row; and such that in each row, a first code word, a second code word, a third code :s word, and a fourth code word are interleaved in an order of a byte of said first code word, a byte of said second code word, a byte of said third code word and a byte of said fourth code word, said interleaved order repeating along said row.
  4. 4. The method as claimed in any one of the preceding claims, wherein o each said row comprises 128 bytes.
    . . À . . , .. e..
    À . . :. À:--:e
  5. 5. The method as claimed in any one of the preceding claims, wherein each said column comprises 124 bytes.
  6. 6. The method as claimed in any one of the preceding claims, wherein each said column comprises a pair of code words.
  7. 7. The method as claimed in any one of the preceding claims, wherein each said column comprises a pair of code words, said pair of code words being interleaved with each other.
  8. 8. The method as claimed in any one of the preceding claims, wherein: each said row comprises 4 C2 code words; and each said column comprises 2 C1 code words.
  9. 9. A method of writing data to a linear tape data storage medium, said method comprising; arranging a block of user data into a data group of a logical array of data bytes, comprising a plurality of rows of said data bytes and a plurality of columns of said data bytes; s applying an error correction coding to individual ones of said rows, such that individual said rows are each arranged into 4 code words; writing said data group as a data track extending across a width of said tape data storage medium such that all of said data group is contained within said o single data track extending across said width of said tape data storage medium.
    e. ..
    . . . . . e.e À:: À: e. e: : À a À..
  10. 10. The method as claimed in claim 9, wherein said data group comprises 448 C2 code words.
  11. 11. The method as claimed in claim 9 or 10 further comprising: applying an error correction coding to columns of said data group such that each said column comprises 2 code words.
  12. 12. The method as claimed in any one of claims 9 to 11, wherein said JO data group comprises 256 C1 code words.
  13. 13. The method as claimed in any one of claims 9 to 12, wherein said data track has a width in the range 5.4pm, plus or minus 0.1pm.
    :s
  14. 14. The method as claimed in any one of claims 9 to 13, wherein said data is written to tape at a bit density in the range 6,220 to 6,614 bits/mm.
  15. 15. Data processing apparatus for arranging data into a format for writing to a data storage medium, said apparatus comprising: a memory for storing a data group comprising a plurality of bytes of user data arranged logically in rows; an error correction coding device for applying an error correction code to :s individual rows of said array, such that each said individual row is coded into 4 code words; and a write head for writing each said data group across a width of a tape data storage medium, such that each said data group is written along a corresponding single track extending across a width of said tape data storage medium, and transverse to a main length of said tape data storage medium.
    c cad *c.
    À c a
    C C C C
    < C C c * a c c se C-. * t.C C e i; '
  16. 16. A data processing device for arranging data into a format of an array of bytes arranged in rows and columns, said device comprising: a memory for storing a logical array comprising a said plurality of bytes of user data arranged logically in a plurality of rows and a plurality of columns; and an error correction coding device for applying an error correction code to individual said rows of bytes, such that said individual rows are each coded into 4 code words.
  17. 17. The data processing device as claimed in claim 16, wherein: said error correction coding device is operable for applying an error correction code to each said column of bytes, such that each said column is coded into 2 code words.
  18. 18. The data processing device as claimed in claim 16, wherein: individual said rows of bytes are coded into 4 code words, said code words being interleaved with each other, such that each said code word of a row extends substantially along a whole length of said row.
  19. 19. A method of writing data to a linear tape data storage medium, said method comprising; arranging a block of user data into a logical group of data bytes, comprising a plurality of rows of said data bytes and a plurality of columns of said data bytes; So applying an error correction code to said data group such that each said column of said data group is coded with 2 C1 code words; e bee e Àe À e e À À À À e e À À e e e e e À e ee : À - À À: * . : applying an error correction coding to individual said rows, such that said individual rows are each arranged into 4 C2 code words; and writing said data group in a single data track across a width of said tape data storage medium, such that all of said data group is contained within said single data track extending across said width of said tape data storage medium.
  20. 20. A tape data storage system comprising: at least one write head for writing data to a magnetic tape data storage medium; a transport mechanism for transporting said tape data storage medium past said write head; a logical formatting device for formatting data into a data group comprising an array of bytes of data arranged logically in a plurality of rows and a plurality of columns; 2 o a memory device for storing said logical array of data; and an error correction coding device for applying an error correction code to individual said rows of bytes, such that said individual rows are each coded into 4 code words.
  21. 21. The data storage system as claimed in claim 20, wherein said at least one write head operates to write a plurality of said arrays of data to a corresponding respective plurality of data tracks, each data track extending diagonally across a width of said tape data storage medium, wherein each said o data track is written with a corresponding respective said array.
    À * .. * bee À À * À À À À.
    . . ... . c: I: À À À À À . - . . :. À
  22. 22. The data storage system as claimed in claim 20 or 21, further comprising a tape data storage media comprising a band of elongate tape of: width in the range 3.81 mm plus or minus 0.01 mm; and length in the range 1 70m plus or minus 5m.
  23. 23. A metal particle type tape data storage media cartridge comprising a band of elongate tape having: width in the range 3.81 mm plus or minus 0. 01 mm; and length in the range 1 70m plus or minus 5m.
  24. 24. The tape data storage media cartridge as claimed in claim 23, having a thickness in the range 5.3,um plus or minus 0.02,um.
    À À . À À . À .
GB0301803A 2003-01-27 2003-01-27 Logical data grouping in digital storage system Withdrawn GB2397666A (en)

Priority Applications (3)

Application Number Priority Date Filing Date Title
GB0301803A GB2397666A (en) 2003-01-27 2003-01-27 Logical data grouping in digital storage system
JP2004016517A JP2004234829A (en) 2003-01-27 2004-01-26 Grouping of logical data of digital storage system
US10/986,976 US7831869B2 (en) 2003-01-27 2004-11-15 DDS logical data grouping

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
GB0301803A GB2397666A (en) 2003-01-27 2003-01-27 Logical data grouping in digital storage system

Publications (2)

Publication Number Publication Date
GB0301803D0 GB0301803D0 (en) 2003-02-26
GB2397666A true GB2397666A (en) 2004-07-28

Family

ID=9951853

Family Applications (1)

Application Number Title Priority Date Filing Date
GB0301803A Withdrawn GB2397666A (en) 2003-01-27 2003-01-27 Logical data grouping in digital storage system

Country Status (2)

Country Link
JP (1) JP2004234829A (en)
GB (1) GB2397666A (en)

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4413340A (en) * 1980-05-21 1983-11-01 Sony Corporation Error correctable data transmission method
US4477903A (en) * 1982-01-21 1984-10-16 U.S. Philips Corporation Error correction method for the transfer of blocks of data bits, a device for preforming such a method, a decoder for use with such a method, and a device comprising such a decoder
EP0551973A2 (en) * 1992-01-15 1993-07-21 E-Systems Inc. Triple orthogonally interleaved error correction system
EP0957484A1 (en) * 1998-05-12 1999-11-17 Hewlett-Packard Company Read while write method in data storage device
US6311305B1 (en) * 1997-03-06 2001-10-30 T.T.R. Technologies Ltd. Method and system for overriding error correction

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6631485B1 (en) * 1998-01-16 2003-10-07 Hewlett-Packard Development Company, L.C. Methods and apparatus for data storage on magnetic media including error correction codes
JP2000067532A (en) * 1998-08-21 2000-03-03 Hitachi Ltd Error-correcting encoding/decoding system and error- correcting encoding/decoding device
JP4518586B2 (en) * 1999-01-20 2010-08-04 ソニー株式会社 Data recording apparatus and rewrite determination method thereof

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4413340A (en) * 1980-05-21 1983-11-01 Sony Corporation Error correctable data transmission method
US4477903A (en) * 1982-01-21 1984-10-16 U.S. Philips Corporation Error correction method for the transfer of blocks of data bits, a device for preforming such a method, a decoder for use with such a method, and a device comprising such a decoder
EP0551973A2 (en) * 1992-01-15 1993-07-21 E-Systems Inc. Triple orthogonally interleaved error correction system
US6311305B1 (en) * 1997-03-06 2001-10-30 T.T.R. Technologies Ltd. Method and system for overriding error correction
EP0957484A1 (en) * 1998-05-12 1999-11-17 Hewlett-Packard Company Read while write method in data storage device

Also Published As

Publication number Publication date
GB0301803D0 (en) 2003-02-26
JP2004234829A (en) 2004-08-19

Similar Documents

Publication Publication Date Title
US5369652A (en) Error detection and correction having one data format recordable on record media using a diverse number of concurrently recorded tracks
US20100232047A1 (en) Data interleaving in tape drives
US6903887B2 (en) Multiple level (ML), integrated sector format (ISF), error correction code (ECC) encoding and decoding processes for data storage or communication devices and systems
US6539514B1 (en) Read while write method in data storage device
US6947247B2 (en) Large angle azimuth recording and head configurations
US6282039B1 (en) Scratch protection in tape data storage system
US7876516B2 (en) Rewrite-efficient ECC/interleaving for multi-track recording on magnetic tape
EP1915758B1 (en) Method and apparatus for providing error correction capability to longitudinal position data
KR101077520B1 (en) Data recording/reproducing system, data recording/reproducing method, program, and recording medium
US6914746B1 (en) High sustained data rate storage devices having microactuator
US20020178421A1 (en) Programmable multi-level track layout method and system for optimizing ECC redundancy in data storage devices
US8645622B2 (en) Method to protect data on a disk drive from uncorrectable media errors
US7787205B2 (en) Detecting and correcting insertion and deletion of bits for bit patterned media storage systems
US7188299B2 (en) Data-recording/reproduction apparatus and data-recording/reproduction method
US7308637B2 (en) Data recording/reproducing device, data recording/reproducing method, program, and recording medium
US8514509B2 (en) Error tolerant or streaming storage device
US6631492B2 (en) Multitrack data recording and read out of recorded multitrack digital data for error correction
US7831869B2 (en) DDS logical data grouping
US11379120B2 (en) Error tolerant or streaming storage device
GB2397666A (en) Logical data grouping in digital storage system
US7774685B2 (en) HDD sector format with reduced vulnerability to defects and burst errors
Williams DLT technology overview: a survey of general technology and design features

Legal Events

Date Code Title Description
WAP Application withdrawn, taken to be withdrawn or refused ** after publication under section 16(1)