WO2014147362A1 - Apparatus for reducing data volumes - Google Patents
Apparatus for reducing data volumes Download PDFInfo
- Publication number
- WO2014147362A1 WO2014147362A1 PCT/GB2014/000089 GB2014000089W WO2014147362A1 WO 2014147362 A1 WO2014147362 A1 WO 2014147362A1 GB 2014000089 W GB2014000089 W GB 2014000089W WO 2014147362 A1 WO2014147362 A1 WO 2014147362A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- data
- stage
- calculator
- meta
- analyser
- Prior art date
Links
Classifications
-
- H—ELECTRICITY
- H03—ELECTRONIC CIRCUITRY
- H03M—CODING; DECODING; CODE CONVERSION IN GENERAL
- H03M7/00—Conversion of a code where information is represented by a given sequence or number of digits to a code where the same, similar or subset of information is represented by a different sequence or number of digits
- H03M7/30—Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction
- H03M7/3059—Digital compression and data reduction techniques where the original information is represented by a subset or similar information, e.g. lossy compression
-
- H—ELECTRICITY
- H03—ELECTRONIC CIRCUITRY
- H03M—CODING; DECODING; CODE CONVERSION IN GENERAL
- H03M7/00—Conversion of a code where information is represented by a given sequence or number of digits to a code where the same, similar or subset of information is represented by a different sequence or number of digits
- H03M7/30—Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction
- H03M7/55—Compression Theory, e.g. compression of random number, repeated compression
Definitions
- This invention relates to apparatus for reducing data volumes.
- apparatus for reducing data volumes which apparatus comprises:
- a source systems data profiler and extractor subsystem comprising: a. a fuzzy logic controller comprising fuzzyfier, inference, and output handling,
- a stage 1 data reduction subsystem comprising: a. a two-step principle component analyser (PCA) having a covariance matrix calculator and an eigenvalues calculator, and
- stage 1 fidelity analyser comprising:
- a classify failure type 1 module (iv) a classify failure type 1 module; (3) a stage 2 data reduction persistent homology machine (PHM), comprising:
- stage 2 fidelity analyser comprising:
- stage 3 data reduction holographic production means comprising:
- a. optimum topological data representation containing the datamorphology and instruction sequence
- RW read write
- TBS topological boundary surface
- the apparatus may be one which includes destination systems which interact with the hologram and meta-keys without ever needing to reconstruct the source data.
- the destination systems may comprise:
- the apparatus of the present invention uses information density holography (IDH).
- IDH information density holography
- the IDH is a data movement, access and storage technology that reduces data volumes with a theoretical limit that can exceed 1 ,000,000 fold whilst maintaining a high level of fidelity and integrity.
- the IDH enables the apparatus of the present invention to be used in many applications, including cybernetic control in spacecraft, autonomous vehicles and in manufacturing plant robotics, that cannot presently be realised without interacting with petabytes of data.
- the apparatus of the present invention is able to reduce a petabyte of data (1x10 15 bytes) down to a gigabyte (1x10 9 ).
- the IDH as used in the apparatus of the present invention permits a most likely one-off batch upload over the currently available broadband upload speeds of 3.125x10 4 bytes/second in just 9 hours. Interactions requiring a terabyte are achievable in around 30 seconds, whilst those requiring from one to a few hundred gigabytes are virtually instantaneous (called asynchronous or real-time).
- Highly parallel processing IDH hardware combines data compression, high-dimensional algebraic topological representation and dimensional reduction algorithms to reproduce and transmit data holographically. These algorithms encode an arbitrarily large dataset on a two-dimensional topological surface (the boundary to the region), just like a hologram. The algorithms can also find and display hidden properties of massive data sets as well as embedded control sequences, enabling the reduced volume of data to be moved at astonishing speeds which are many orders of magnitude greater than current methods.
- the IDH as used in the apparatus of the present invention is based on applying the holographic principle in cosmology and information entropy equivalence to data.
- the description of a volume of space can be thought of as encoded on a boundary to the region, so that our 3D universe, might instead be "written" on a two-dimensional surface like a hologram.
- Analogously an arbitrarily large dataset can be encoded (written) on 'the boundary to the region' as a hofographic-topological surface.
- the application of the holographic principle to big data is believed to be a completely unique insight, as is the application of dimensional reduction techniques to achieve holographic data reproduction to produce the huge data volume reductions achievable by the apparatus of the present invention.
- IDH The algebraic theory of topologies behind the IDH is that a compacted description (an unprecedented 100 - 1 ,000,000 fold bit reduction) of the shape of the data can be created, whilst still enabling deeper insights and patterns to be obtained than is possible with existing techniques. It is called IDH because the first stage of the data reduction process, whilst it reduces the total number of data dimensions, allows all the source data points to be retained. This is directly analogous to a density increase due to a volume reduction.
- Figure 1 shows examples of IDH used to facilitate applications requiring the communications of vast data volumes
- FIG. 1 shows IDH source apparatus sub-systems and an integration assembly
- Figure 3 shows an IDH destination apparatus system assembly
- Figure 4 shows Intel® Xeon® processor high performance computing for IDH sources
- Figure 5 shows Intel® Xeon® processor high performance computing for IDH destinations.
- Figure 1 illustrates examples of where and how IDH is applied. These examples include vehicle autonomous driving 22c, automated spacecraft rendiezvous 22d, information, communications and media applications 22b, and automated manufacturing plants 22a.
- Geographically dispersed, multi-structured data sources from destination systems 21 is reduced in volume by firstly representing it as a complex mathematical topology, secondly reproducing it as a hologram and thirdly compressing.
- the systems 1 are shown in detail in Figure 2.
- the operation of reproducing the data holographically reveals hidden structures in the data so that the hologram is not only a reproduction of the data but also an analysis.
- the hologram can also contain control sequences for actuators.
- the data hologram can be accessed directly by end-user devices with highly parallel processing chip technology enabling interaction with data either in holographic format or by reproducing the original data sets.
- the data hologram can also be moved as a hologram and stored either on traditional disc drives or holographically further reducing storage volumes.
- the uncompressed data is accessed from host computer systems through a high performance data interface 1 shown in Figure 2, the input source to the IDH apparatus shown in Figure 1.
- the uncompressed unreduced data enters the data profiler and extractor subsystem 2 shown in Figure 2.
- the data is read, sampled and then 'fuzzified' in a fuzzy logic controller comprising fuzzyfier, inference, and output handling 2a.
- the purpose of the fuzzy logic circuit is to identify the meta-keys (the indexes) 2b enabling the data to be reproduced with a very high degree of fidelity.
- the inference engine identifies these meta-keys as relationships using the rule base and outputs these as 'fuzzy sets' to the type reducer.
- the type reducer outputs the meta-keys 2b and because the data will contain categories as text, it also outputs the data to a combined data numerifier and data normaliser 2c.
- the extracted and profiled data is now ready for the first stage of the reduction process.
- the stage 1 data reduction subsystem apparatus 3 is essentially a two-step principle component analyser (PCA) 3a.
- the apparatus also contains a stage 1 fidelity analyser 3b.
- the first dimensional reduction This can be up to a 35-fold reduction in the total data volume with no reduction in the number of data points (corresponding to an increase in information density).
- the first dimensional reducer is part of the stage 1 fidelity analyser 3b comprising the residual analysis classifier, comparator and classify failure type 1 module.
- the residual analyser module uncovers hidden structures within the data and also embedded control sequences. These components provide a test loop comprising logic circuits.
- the comparator compares the input data at source with the PCA output and measures the difference. This is the first point that the meta- keys 2b are used to correct for errors. If the error is greater than the threshold set, the comparator feeds the data to th classify failure type 1 module and checks against the meta-keys 2b back in the data profiler and extractor assembly 2. The process is iterative and the objective is to reduce the error to below the threshold value. When the error becomes less than the set threshold, the now reduced but still uncompressed data is output to the stage 2 data reduction apparatus 4 which is in the form of a persistent homology machine 4a (PHM).
- PPM persistent homology machine
- the PHM comprises the rips complex calculator which performs the simplicial approximation, the homology and groups calculator and the persistent homology converter, see 4a in Figure 2.
- the PHM also contains the stage 2 fidelity analyser 4b.
- the reduced data output now in the form of a persistent homology undergoes further volume reduction in the dimensional reducer.
- This second dimensional reducer is contained within the stage 2 fidelity analyser 4b, which also comprises the datamorphology and instruction sequence modules together with similar modules to the stage 1 fidelity analyser 3b.
- the fidelity measurement and improvement process is similar this time with the classify failure type 2 module looking-up the meta-keys 2b and the second comparator in the stage 2 fidelity analyser 4b.
- This second comparator compares the source data with the persistent homology output.
- This stage further reduces the data to a factor of between 50 and 1000 of the source.
- the data in the form of a persistent homology is output to the data reduction stage 3 holographic representation 5 for conversion to a hologram
- the data reduction stage 3 holographic representation 5 comprises optimum topological data representation, containing the datamorphology and instruction sequence which is output to a read write (RW) miniaturised recording system called a topological boundary surface (TBS) recorder.
- the TBS recorder has written on its fluctuating surface the internal projection of the optimum topological data representation. This is akin to including all the information content contained in the original data volume onto a surface which is analogous to creation of an optical hologram. However, the dimensionality can be many times greater than optical systems.
- the data is further reduced compared with the source now by a total factor of between 1,000 and 100,000.
- the data hologram is combined digitally with the meta-keys 2b to reproduce the data with the highest possible fidelity.
- the combined signal is output through a digital multiplexor (mux) in the data reduction stage 3 holographic representation 5 to the source coding compressor 6.
- the design of the source coding compressor 6 is based on high throughput compression of double-precision floating-point data source coding (compression) system developed by Martin Burtscher and Paruj Ratanaworabhan at the School of Electrical and Computer Engineering, Cornell University, Ithaca, NY 14853, United States of America.
- the design of the source coding compressor 6 will not be described further herein, other than to say that a realistic 1 :10 compression is achievable with this technology.
- the total reduction possible with IDH plus source coding is 100 - 1 ,000,000 times of the source original.
- the reduced compressed data is output through a high performance data interface 7.
- Figure 1 shows destination systems 22.
- the destination system electronics modules are shown as modules 9 in Figure 3.
- the destination systems interact with the hologram produced by the source technology of Figure 2 and meta-keys 2b without ever needing to reconstruct the source data.
- Similar electronics to that shown in Figure 2 are required.
- the compressed data input comprising the hologram produced by the source technology of Figure 2 and the meta-keys 2b from the data profiler and extractor 1 in Figure 2 is accessed through a high performance data interface 8 in Figure 3, which connects to destination subsystem assembly 9.
- the decoder module of the source codec 9a uncompresses the signal (essentially decoding it) and the demultiplexer module of the digital mux/demux 9b splits the signal into the meta-keys 9c, instruction sequence 9d and the datamorphology 9e.
- the other components are the feedback elements 9f, the comparator 9g, the controller 9i, the aggregator 9h and the actuator 9j.
- the actuator 9j outputs control demands via line 9m.
- the actuator output is generalised in Figure 3 by a reference to Figure 1. Adds, moves and changes AND/OR perturbations at the output are aggregated by the aggregator 9h and compared via the comparator 9g with the input signal. The differential is fed into the feedback elements 9f.
- the differential comprises AND/OR changes to the meta-keys 9c, instruction sequence 9d and datamorphology 9e.
- the differential is then fed back through the mux/demux 9b, source codec 9a and high performance data interface 8 to update the source data systems, see Figure 1.
- the architecture for the destination systems is based on an Intel® Xeon® processor as shown in Figure 5 but requires fewer processors and memory.
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Holo Graphy (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
Abstract
Description
Claims
Priority Applications (6)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201480017014.2A CN105164590A (en) | 2013-03-19 | 2014-03-11 | Apparatus for reducing data volumes |
US14/392,077 US9496892B2 (en) | 2013-03-19 | 2014-03-11 | Apparatus for reducing data volumes |
KR1020157030155A KR20150131388A (en) | 2013-03-19 | 2014-03-11 | Apparatus for reducing data volumes |
EP14710604.1A EP2976681A1 (en) | 2013-03-19 | 2014-03-11 | Apparatus for reducing data volumes |
JP2016503712A JP2016521024A (en) | 2013-03-19 | 2014-03-11 | Data volume reduction device |
IL241379A IL241379A0 (en) | 2013-03-19 | 2015-09-09 | Apparatus for reducing data volumes |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
GBGB1305070.3A GB201305070D0 (en) | 2013-03-19 | 2013-03-19 | Appatatus for reducing data volumes |
GB1305070.3 | 2013-03-19 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2014147362A1 true WO2014147362A1 (en) | 2014-09-25 |
Family
ID=48226708
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/GB2014/000089 WO2014147362A1 (en) | 2013-03-19 | 2014-03-11 | Apparatus for reducing data volumes |
Country Status (8)
Country | Link |
---|---|
US (1) | US9496892B2 (en) |
EP (1) | EP2976681A1 (en) |
JP (1) | JP2016521024A (en) |
KR (1) | KR20150131388A (en) |
CN (1) | CN105164590A (en) |
GB (1) | GB201305070D0 (en) |
IL (1) | IL241379A0 (en) |
WO (1) | WO2014147362A1 (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US10554761B2 (en) | 2015-12-12 | 2020-02-04 | At&T Intellectual Property I, Lp | Methods and apparatus to improve transmission of a field data set to a network access point via parallel communication sessions |
Families Citing this family (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US10373019B2 (en) * | 2016-01-13 | 2019-08-06 | Ford Global Technologies, Llc | Low- and high-fidelity classifiers applied to road-scene images |
US10333958B2 (en) | 2016-07-19 | 2019-06-25 | Cisco Technology, Inc. | Multi-dimensional system anomaly detection |
CN110110814B (en) * | 2019-05-21 | 2021-05-04 | 浙江大学 | Distributed parallel PCA process monitoring modeling method based on continuous MapReduce |
KR102330171B1 (en) * | 2020-12-07 | 2021-11-23 | 한국과학기술정보연구원 | Controlling apparatus for principal component analyisis, and control method thereof |
CN112954977B (en) * | 2021-02-18 | 2023-04-14 | 财拓云计算(上海)有限公司 | System and method for realizing energy-saving temperature control of data center based on artificial intelligence |
Family Cites Families (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5892503A (en) * | 1994-07-29 | 1999-04-06 | Ast Research, Inc. | Multimedia console keyboard |
ATE484104T1 (en) * | 2003-11-10 | 2010-10-15 | Rose Hulman Inst Of Technology | BINARY DELTA-SIGMA MODULATOR |
US7903008B2 (en) * | 2007-11-08 | 2011-03-08 | National Instruments Corporation | Source-measure unit based on digital control loop |
-
2013
- 2013-03-19 GB GBGB1305070.3A patent/GB201305070D0/en not_active Ceased
-
2014
- 2014-03-11 CN CN201480017014.2A patent/CN105164590A/en active Pending
- 2014-03-11 KR KR1020157030155A patent/KR20150131388A/en not_active Application Discontinuation
- 2014-03-11 JP JP2016503712A patent/JP2016521024A/en active Pending
- 2014-03-11 US US14/392,077 patent/US9496892B2/en not_active Expired - Fee Related
- 2014-03-11 EP EP14710604.1A patent/EP2976681A1/en not_active Withdrawn
- 2014-03-11 WO PCT/GB2014/000089 patent/WO2014147362A1/en active Application Filing
-
2015
- 2015-09-09 IL IL241379A patent/IL241379A0/en unknown
Non-Patent Citations (5)
Title |
---|
ANONYMOUS: "Fuzzy logic - Wikipedia, the free encyclopedia", WIKIPEDIA, 31 January 2013 (2013-01-31), pages 1 - 6, XP055117192, Retrieved from the Internet <URL:http://web.archive.org/web/20130131132015/https://en.wikipedia.org/wiki/Fuzzy_logic> [retrieved on 20140509] * |
ANONYMOUS: "Holographic data storage - Wikipedia, the free encyclopedia", WIKIPEDIA, 13 March 2013 (2013-03-13), pages 1 - 5, XP055117190, Retrieved from the Internet <URL:http://web.archive.org/web/20130313061634/http://en.wikipedia.org/wiki/Holographic_data_storage> [retrieved on 20140509] * |
ANONYMOUS: "Homology (mathematics) - Wikipedia, the free encyclopedia", WIKIPEDIA, 31 January 2013 (2013-01-31), pages 1 - 4, XP055117198, Retrieved from the Internet <URL:http://web.archive.org/web/20130131004352/http://en.wikipedia.org/wiki/Homology_(mathematics)> [retrieved on 20140509] * |
ANONYMOUS: "Principal component analysis - Wikipedia, the free encyclopedia", WIKIPEDIA, 17 March 2013 (2013-03-17), pages 1 - 12, XP055117197, Retrieved from the Internet <URL:http://web.archive.org/web/20130317012102/http://en.wikipedia.org/wiki/Principal_component_analysis> [retrieved on 20140509] * |
ANONYMOUS: "Simplicial approximation theorem - Wikipedia, the free encyclopedia", WIKIPEDIA, 28 March 2006 (2006-03-28), XP055117195, Retrieved from the Internet <URL:http://web.archive.org/web/20060328203614/http://en.wikipedia.org/wiki/Simplicial_approximation_theorem> [retrieved on 20140509] * |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US10554761B2 (en) | 2015-12-12 | 2020-02-04 | At&T Intellectual Property I, Lp | Methods and apparatus to improve transmission of a field data set to a network access point via parallel communication sessions |
Also Published As
Publication number | Publication date |
---|---|
US20160043734A1 (en) | 2016-02-11 |
CN105164590A (en) | 2015-12-16 |
GB201305070D0 (en) | 2013-05-01 |
JP2016521024A (en) | 2016-07-14 |
US9496892B2 (en) | 2016-11-15 |
KR20150131388A (en) | 2015-11-24 |
IL241379A0 (en) | 2015-11-30 |
EP2976681A1 (en) | 2016-01-27 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US9496892B2 (en) | Apparatus for reducing data volumes | |
CN111652367B (en) | Data processing method and related product | |
Bui et al. | Tamper-proofing video with hierarchical attention autoencoder hashing on blockchain | |
KR102199285B1 (en) | Method for compressing deep learning neural networks and apparatus for performing the same | |
Zhao et al. | Improving the i/o throughput for data-intensive scientific applications with efficient compression mechanisms | |
Park et al. | {DeepSketch}: A new machine {Learning-Based} reference search technique for {Post-Deduplication} delta compression | |
Becking et al. | Ecq x: explainability-driven quantization for low-bit and sparse DNNs | |
CA3194802A1 (en) | Quantization for neural network computation | |
CN115660070A (en) | Compression method for confrontation generation neural network | |
KR102645267B1 (en) | Quantization method for transformer encoder layer based on the sensitivity of the parameter and apparatus thereof | |
CN116842153A (en) | Multi-mode emotion analysis method and system based on feedback feature learning | |
CN107612554B (en) | Data compression processing method | |
Kandaswamy et al. | Real-time hyper-dimensional reconfiguration at the edge using hardware accelerators | |
CN104571940A (en) | Data storage device and computing system with the same | |
US11354595B2 (en) | Similarity-based hierarchical data loading for machine learning training | |
Przewlocka-Rus et al. | Energy efficient hardware acceleration of neural networks with power-of-two quantisation | |
JP7140705B2 (en) | Storage system and data recording method | |
CN115409150A (en) | Data compression method, data decompression method and related equipment | |
KR20210113297A (en) | Systems, methods, and apparatus for eliminating duplication and value redundancy in computer memory | |
KR102641500B1 (en) | Apparatus and method for unsupervised domain adaptation | |
Wen et al. | Fast and robust compression of deep convolutional neural networks | |
Bao et al. | EdgeNet: a low-power image recognition model based on small sample information | |
Kamath et al. | Compressing unstructured mesh data using spline fits, compressed sensing, and regression methods | |
US20230237278A1 (en) | Large-scale text data encoding and compression | |
US11979174B1 (en) | Systems and methods for providing simulation data compression, high speed interchange, and storage |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
WWE | Wipo information: entry into national phase |
Ref document number: 201480017014.2 Country of ref document: CN |
|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 14710604 Country of ref document: EP Kind code of ref document: A1 |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2014710604 Country of ref document: EP |
|
WWE | Wipo information: entry into national phase |
Ref document number: 241379 Country of ref document: IL |
|
WWE | Wipo information: entry into national phase |
Ref document number: 14392077 Country of ref document: US |
|
ENP | Entry into the national phase |
Ref document number: 2016503712 Country of ref document: JP Kind code of ref document: A |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
ENP | Entry into the national phase |
Ref document number: 20157030155 Country of ref document: KR Kind code of ref document: A |