CN115269743A - Data collection and processing system for data fusion - Google Patents

Data collection and processing system for data fusion Download PDF

Info

Publication number
CN115269743A
CN115269743A CN202210873392.7A CN202210873392A CN115269743A CN 115269743 A CN115269743 A CN 115269743A CN 202210873392 A CN202210873392 A CN 202210873392A CN 115269743 A CN115269743 A CN 115269743A
Authority
CN
China
Prior art keywords
data
fusion
analysis
module
exchange
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202210873392.7A
Other languages
Chinese (zh)
Inventor
张茂华
童晓雯
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Hubei Changjiang Media Digital Publishing Co ltd
Original Assignee
Hubei Changjiang Media Digital Publishing Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Hubei Changjiang Media Digital Publishing Co ltd filed Critical Hubei Changjiang Media Digital Publishing Co ltd
Priority to CN202210873392.7A priority Critical patent/CN115269743A/en
Publication of CN115269743A publication Critical patent/CN115269743A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/28Databases characterised by their database models, e.g. relational or object models
    • G06F16/283Multi-dimensional databases or data warehouses, e.g. MOLAP or ROLAP
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/10File systems; File servers
    • G06F16/18File system types
    • G06F16/182Distributed file systems
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2458Special types of queries, e.g. statistical queries, fuzzy queries or distributed queries
    • G06F16/2465Query processing support for facilitating data mining operations in structured databases
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/25Integrating or interfacing systems involving database management systems
    • G06F16/258Data format conversion from or to a database
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D10/00Energy efficient computing, e.g. low power processors, power management or thermal management

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Fuzzy Systems (AREA)
  • Mathematical Physics (AREA)
  • Probability & Statistics with Applications (AREA)
  • Software Systems (AREA)
  • Computational Linguistics (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The invention discloses a data collection processing system for data fusion, which comprises a data collection module, an application library and a data exchange and analysis module, wherein the data collection module collects user information data generated by various operation service systems, the application library module meets the retrieval requirements of large data volume and high concurrency, and the data exchange and analysis module uniformly collects, exchanges and analyzes the collected and processed data in a data warehouse, deploys data mining and machine learning algorithms and applications, and integrates offline calculation and online calculation technology to support data analysis based on mass data storage. The data collection and processing system for data fusion comprises data collection, analysis, feature extraction, fusion and the like, a unified data center platform is finally formed, key problems of construction of various user models and demand models, fusion of multi-source heterogeneous data and the like are solved, and data support of operation management is provided for fusion publishing.

Description

Data collection and processing system for data fusion
Technical Field
The invention relates to the technical field related to data collection and processing, in particular to a data collection and processing system for data fusion.
Background
Today of high-speed development of informatization, data concentration has become the mainstream trend of informatization construction, and is a necessary means for realizing innovation management, scientific management and intelligent management, the data center construction has become a necessary product under the trend of data concentration, various organizations such as government departments, enterprises, scientific and educational institutions and the like have all built own data centers and comprehensively manage various information systems of the organizations, the demand of the data centers caused by the increase of the data centers is continuously growing, the data centers have become physical carriers and core resources of the organization information systems, and an important support platform of organization services how to treat huge data becomes one of key problems in the field of big data application.
The big data management aims at collecting and processing various heterogeneous data acquired by each data center, analyzing and processing the collected data for the fusion publishing and knowledge service system, constructing a data center, providing a basis for the fusion publishing and knowledge service system to produce products and serve users, being a starting point and a basis of the fusion publishing and knowledge service activities, and completing the collection and processing of information necessary for various activities of the fusion publishing and knowledge service network big data system through the collection and processing of data in the fusion publishing and knowledge service big data system, wherein the links comprise data collection, cleaning, extraction, fusion, modeling and the like, and key problems of the construction of various user models and demand models, the fusion of multi-source heterogeneous data and the like need to be solved.
Aiming at the problems, the innovative design is carried out on the basis of the original data collecting and processing system.
Disclosure of Invention
The invention aims to provide a data collection and processing system for data fusion, which aims to solve the problems that the data collection and processing system provided by the background technology needs to solve the construction of various user models and demand models, the fusion of multi-source heterogeneous data and the like.
In order to achieve the purpose, the invention provides the following technical scheme: a data collection processing system for data fusion;
the processing system comprises a data collection module, an application library module and a data exchange and analysis module, and a multi-source heterogeneous publishing resource fusion system is built to fuse massive heterogeneous resources so as to realize the management of the fused publishing data;
the data collection module collects user information, video, audio, picture and character data generated by various operation service systems, and the data center can collect and analyze the structured, unstructured and semi-structured data and distributed files;
the application library module supports automation of large-scale deployment, real-time monitoring of running states and load balancing based on the characteristics of application requirements and data capacity scale, and meets the retrieval requirements of large data volume and high concurrency;
the data exchange and analysis module is used for collecting, exchanging and analyzing collected and processed data in a data warehouse in a unified manner, deploying data mining and machine learning algorithms and applications, realizing flexible theme-oriented analysis tasks, integrating offline calculation and online calculation technologies based on mass data storage, realizing various data analysis scenes through different programming models and supporting real-time calculated data analysis.
Preferably, the data collected by the data collection module comprises user data, product information data, various system logs, data generated by system operation and user interaction behavior data.
Preferably, the application library adopts a data storage and calculation technology combining a database of a mass data parallel processing architecture and distributed file storage and calculation to realize unified management and retrieval of structured, semi-structured and unstructured data, and provides flexible data access strategy support for upper-layer application through an open secondary development interface.
Preferably, the application library comprises digital publishing data and printing publishing data, the digital publishing data comprises an information acquisition module for acquiring texts of different books, an integration module for merging, de-duplicating, deleting and arranging the acquired information, and a storage module for storing the information, the printing publishing data comprises book entity data including but not limited to a book kiosk, a library, a bookstore and other publishing carriers for displaying different text abstracts printed on paperboards and two-dimensional code cardboard labels corresponding to the texts, and the book entity data comprises one or more of book names, international standard book numbers and book profiles.
Preferably, the data exchange and analysis module is used for a data exchange function between different resource data, and the multi-source heterogeneous data exchange technology has the following three data exchange modes: the method supports data exchange of various formats, is adaptive to the standardized format of national data exchange, and adopts Unicode encoding; the method comprises the following steps of supporting a loose coupling information exchange system based on SOA technology and providing a Web Service interface; the front-end processor technology is reasonably applied.
Preferably, the exchange fusion analysis of the multi-source heterogeneous data specifically comprises the following steps:
s1, acquiring user data, product information data, various system logs, data generated by system operation and data acquired by user interaction behavior data;
s2, analyzing user data, product information data, various system logs, data generated by system operation and user interaction behavior data by adopting a fusion recognition algorithm, and performing target information positioning, target information recognition and characteristic information extraction on the processed data by adopting a CNN (neural network) to obtain analysis results such as target attributes, target characteristic values and the like;
s3, performing fusion analysis and real-time calculation under big data on the user data, the product information data, various system logs, data generated by system operation and data acquired by user interaction behavior data to obtain a real-time calculation result, and storing the real-time calculation result;
and S4, performing off-line calculation on the historical data to obtain an off-line calculation result, and performing fusion analysis on the real-time calculation result and the off-line calculation result to obtain a data association relation to obtain fused multi-source heterogeneous data.
Compared with the prior art, the invention has the beneficial effects that: the data collection and processing system for data fusion is beneficial to publishing and knowledge service data management to improve the efficiency of data aggregation and content management, the intelligent customization under the personalized information production and consumption becomes an important characteristic of content publishing in the 5G era, in the data flood caused by the perception of everything, the data of different types of resources are processed by adopting a deep learning algorithm while the fragmentization and the systematic processing of knowledge are well done, and the multisource heterogeneous data exchange technology adopts the following three data exchange modes aiming at the data exchange function among different resource data: the method supports data exchange of various formats, is adaptive to the standardized format of national data exchange, and adopts Unicode coding; the method comprises the steps of supporting a loosely-coupled information exchange system based on SOA technology and providing a Web Service interface; the reasonable application of the front-end processor technology comprises data acquisition, analysis, feature extraction, fusion and the like, and finally forms a unified data warehouse, so that the content acquisition and generation efficiency is improved, and the digital asset management tamping foundation is realized.
Drawings
FIG. 1 is a schematic block diagram of the system of the present invention.
Detailed Description
The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
Referring to fig. 1, the present invention provides a technical solution: a data collection and processing system for data fusion is intended to build a multi-source heterogeneous publishing resource fusion system to fuse massive heterogeneous resources and achieve management of fused publishing data, and comprises a data collection and application library and a data exchange and analysis module, as shown in figure 1.
The data collection module collects user data, product information data, various system logs, data generated by system operation and user interaction behavior data including user information, video, audio, pictures and character data from various operation service systems, and the data center can collect and analyze the structured, unstructured and semi-structured data and the distributed files.
Based on the application demand characteristics and the data capacity scale characteristics, the application library can adopt a data storage and calculation technology combining a database of a massive data parallel processing architecture and distributed file storage/calculation to realize unified management and retrieval of structured, semi-structured and unstructured data, and supports automation of large-scale deployment, real-time monitoring of running states and load balancing; and the large data volume and high concurrency retrieval requirements are met. And flexible data access strategy support is provided for upper-layer application through an open secondary development interface.
The application library comprises digital publishing data and basic information, thematic information and analysis indexes of the printing publishing data, wherein the digital publishing data comprises an information acquisition module for acquiring texts of different books, an integrating module for combining, de-weighting, deleting and arranging the acquired information and a storage module for storing the information, the printing publishing data comprises but is not limited to book kiosks, libraries, bookstores and other publishing carriers for displaying abstracts of different texts printed by paperboards and book entity data of two-dimensional code paperboard labels corresponding to the texts, and the book entity data comprises book names, international standard book numbers and book introduction.
The collected and processed data are converged into a unified data warehouse for data exchange and analysis, data mining, machine learning algorithms and application are deployed, flexible theme-oriented analysis tasks are realized, off-line calculation and on-line calculation technologies are integrated based on massive data storage, various data analysis scenes are realized through different programming models, real-time calculation data analysis is supported, and the multi-source heterogeneous data exchange technology has the following three data exchange modes aiming at the data exchange function among different resource data: the method supports data exchange of various formats, is adaptive to the standardized format of national data exchange, and adopts Unicode coding; the method comprises the steps of supporting a loosely-coupled information exchange system based on SOA technology and providing a Web Service interface; the front-end processor technology is reasonably applied.
The exchange fusion analysis of the multi-source heterogeneous data specifically comprises the following steps:
s1, acquiring user data, product information data, various system logs, data generated by system operation and data acquired by user interaction behavior data;
s2, analyzing user data, product information data, various system logs, data generated by system operation and user interaction behavior data by adopting a fusion recognition algorithm, and performing target information positioning, target information recognition and characteristic information extraction on the processed data by adopting a CNN (neural network) to obtain analysis results such as target attributes, target characteristic values and the like;
s3, performing fusion analysis and real-time calculation under big data on the user data, the product information data, various system logs, data generated by system operation and data acquired by user interaction behavior data to obtain a real-time calculation result, and storing the real-time calculation result;
and S4, performing off-line calculation on the historical data to obtain an off-line calculation result, and performing fusion analysis on the real-time calculation result and the off-line calculation result to obtain a data association relation to obtain fused multi-source heterogeneous data.
The process comprises data acquisition, analysis, feature extraction, fusion and the like, a unified data middle platform is finally formed, data support of operation management is provided for fusion publishing, a unified fusion publishing platform is built, a platform type fusion new infrastructure system is built, three subsystems including a base layer, a platform layer, an application layer and the like are included, and corresponding modularized use tools are provided for different user roles such as editing, publishing units, users and the like.
The basic layer provides basic facilities and technical services for other business functions, realizes the wide multiplexing of the services, and simultaneously provides basic services for the application of the access platform. The platform layer comprises a business center platform and a data center platform, and adopts a middle-class platform mode design, wherein the business center platform mainly provides public-oriented and sharable demand aggregation service; the data center mainly meets public data requirements, and the application layer provides front-end applications with different objects and different functions in different scenes.
The base layer mainly comprises: infrastructure and infrastructure services. The infrastructure mainly provides application bottom layer operation environment support and comprises database storage, column storage, cache, message middleware and the like; the basic service mainly provides a universal component of a function module related to the service, is constructed in a micro-service mode, comprises a short message service, a workflow engine, a two-dimension code service, a full-text retrieval and a unique key and the like, and provides basic support for the core service of the platform layer.
The platform layer comprises a business middle platform and a data center, and adopts a middle platform mode design to provide resource, channel and copyright management; providing a workbench for carrying out business process approval; providing resource market and channel market, strengthening the cooperation among publishing and issuing units, and getting through the resource cooperation and channel cooperation on the line; the comprehensive competitiveness of a publishing unit is improved, and resource, copyright and channel management modules are preferentially built.
The resource management module is responsible for managing and storing the resource parts of the works in the digital assets. The resource data subsystem is composed of a data and metadata rule base, metadata, a classified content resource base, a content resource processing center and other modules. The system is responsible for defining and managing data standards of all digital assets and providing storage management services for various types of work resources such as literary works, electronic books, physical books, continuous publications, pictures, music, audio, videos, database products, application software products and the like. And an entry indexing processing tool can be provided, so that a user can conveniently process and produce more content resources.
The copyright management module is responsible for managing the ownership information of the works resources. The system is composed of a flow configuration center, a contract management center, a copyright information management center, a copyright asset management center and the like. The registering work of different types of works can be efficiently completed through flexible self-defined flow configuration. Contract management of standard specifications can ensure that all rights are clear and definite, and structured query can be carried out. The copyright management module is used for correspondingly associating the copyright information with the works resources to form effective digital asset information and can operate assets.
The channel management module is responsible for operating the digital assets after being commercialized through a self-owned platform or a third-party platform so as to obtain the income of the assets. Including packaging for the production of digital assets, external authorization or self-management, revenue settlement, etc.
The method comprises the steps of constructing an application layer, realizing front-end application with different functions under different objects and different scenes, and mainly comprising an operation platform facing terminal user operation and based on a shop mode, an open platform facing third-party technology cooperation, and a docking service facing various channel applications, making a unified development standard and opening a service interface; a third-party developer and an application are introduced, and the continuous development capability of the platform can be greatly improved by the third-party developer and the application; and a continuously updated source is provided for the platform.
The project application scenario has three specific aspects: firstly, providing on-line operation and technical support of the converged publishing business for each publishing unit; secondly, intelligent personalized digital content service is provided for audiences through the unified operation foreground; thirdly, a customized professional platform is quickly built through a micro service architecture provided by the project according to the requirements of content providers.
For publishing units, the project platform provides tool services such as content resource management, digital product release, unified channel management and the like, opens up an operation channel and shares a user pool, and provides technical support of a whole industrial chain from product manufacturing to channel operation.
For the audience, the platform can match and integrate the big data analysis and the result of the database according to the retrieval requirement, and feed back a recommendation list based on content association. The data center station can finely index mass data, improve the resource retrievability and the resource recycling performance, predict data and discover knowledge, provide intelligent personalized knowledge service, improve user experience and improve user viscosity.
The project uses unified development standards and specifications, and the micro service architecture provided by the project can quickly realize project construction, save development time, improve project quality and reduce operation and maintenance cost. In this way, a platform can be customized for content providers with personalized needs.
The method is beneficial to extracting common technologies which are simultaneously suitable for fusing publishing and knowledge service big data application and other field applications for integration and industrialization, is intended to carry out engineering and industrialization application demonstration in the aspects of big data management, knowledge service and intelligent push, block chain technology, intelligent decision and the like, and is popularized by utilizing a market mechanism.
Although embodiments of the present invention have been shown and described, it will be appreciated by those skilled in the art that changes, modifications, substitutions and alterations can be made in these embodiments without departing from the principles and spirit of the invention, the scope of which is defined in the appended claims and their equivalents.

Claims (6)

1. A data collection processing system for data fusion, characterized by:
the processing system comprises a data collection module, an application library module and a data exchange and analysis module, and a multi-source heterogeneous publishing resource fusion system is constructed to fuse massive heterogeneous resources, so that the management of fused publishing data is realized;
the data collection module collects user information, video, audio, picture and character data generated by various operation service systems, and the data center can collect and analyze the structured, unstructured and semi-structured data and distributed files;
the application library module supports automation of large-scale deployment and real-time monitoring and load balancing of an operation state based on the characteristics of application requirements and the characteristics of data capacity scale, and meets the retrieval requirements of large data volume and high concurrency;
the data exchange and analysis module is used for collecting, exchanging and analyzing collected and processed data in a data warehouse in a unified manner, deploying data mining and machine learning algorithms and applications, realizing flexible theme-oriented analysis tasks, integrating offline calculation and online calculation technologies based on mass data storage, realizing various data analysis scenes through different programming models and supporting real-time calculated data analysis.
2. A data collection processing system for data fusion according to claim 1, characterized by: the data collection module collects data including user data, product information data, various system logs, data generated by system operation and user interaction behavior data.
3. A data collection processing system for data fusion, according to claim 1, characterized by: the application library adopts a data storage and calculation technology combining a database of a mass data parallel processing architecture and distributed file storage and calculation to realize unified management and retrieval of structured, semi-structured and unstructured data, and provides flexible data access strategy support for upper-layer application through an open secondary development interface.
4. A data collection processing system for data fusion, according to claim 1, characterized by: the application library comprises digital publishing data and printing publishing data, wherein the digital publishing data comprises an information acquisition module for acquiring different book texts, an integrating module for merging, de-duplicating, deleting and arranging the acquired information and a storage module for storing the information, the printing publishing data comprises but is not limited to book entities data of publication carriers such as book kiosks, libraries and bookstores for displaying different text abstracts printed on paperboards and two-dimensional code paperboard labels corresponding to the texts, and the book entities data comprises one or more of book names, international standard book numbers and book introduction.
5. A data collection processing system for data fusion according to claim 1, characterized by: the data exchange and analysis module aims at the data exchange function among different resource data, and the multi-source heterogeneous data exchange technology has the following three data exchange modes: the method supports data exchange of various formats, is adaptive to the standardized format of national data exchange, and adopts Unicode coding; the method comprises the following steps of supporting a loose coupling information exchange system based on SOA technology and providing a Web Service interface; the front-end processor technology is reasonably applied.
6. A data collection processing system for data fusion, according to claim 1, characterized by: the exchange fusion analysis of the multi-source heterogeneous data specifically comprises the following steps:
s1, acquiring user data, product information data, various system logs, data generated by system operation and data acquired by user interaction behavior data;
s2, analyzing user data, product information data, various system logs, data generated by system operation and user interaction behavior data by adopting a fusion recognition algorithm, and performing target information positioning, target information recognition and characteristic information extraction on the processed data by adopting a CNN (neural network) to obtain analysis results such as target attributes, target characteristic values and the like;
s3, performing fusion analysis and real-time calculation under big data on user data, product information data, various system logs, data generated by system operation and data acquired by user interaction behavior data to obtain a real-time calculation result, and storing the real-time calculation result;
and S4, performing off-line calculation on the historical data to obtain an off-line calculation result, and performing fusion analysis on the real-time calculation result and the off-line calculation result to obtain a data association relation to obtain fused multi-source heterogeneous data.
CN202210873392.7A 2022-07-22 2022-07-22 Data collection and processing system for data fusion Pending CN115269743A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202210873392.7A CN115269743A (en) 2022-07-22 2022-07-22 Data collection and processing system for data fusion

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202210873392.7A CN115269743A (en) 2022-07-22 2022-07-22 Data collection and processing system for data fusion

Publications (1)

Publication Number Publication Date
CN115269743A true CN115269743A (en) 2022-11-01

Family

ID=83769148

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202210873392.7A Pending CN115269743A (en) 2022-07-22 2022-07-22 Data collection and processing system for data fusion

Country Status (1)

Country Link
CN (1) CN115269743A (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116484118A (en) * 2023-03-24 2023-07-25 中国科学院空天信息创新研究院 Unmanned aerial vehicle flight environment data service system and method
CN117009921A (en) * 2023-08-04 2023-11-07 振宁(无锡)智能科技有限公司 Optimized data processing method and system of data fusion engine

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116484118A (en) * 2023-03-24 2023-07-25 中国科学院空天信息创新研究院 Unmanned aerial vehicle flight environment data service system and method
CN116484118B (en) * 2023-03-24 2024-04-09 中国科学院空天信息创新研究院 Unmanned aerial vehicle flight environment data service system and method
CN117009921A (en) * 2023-08-04 2023-11-07 振宁(无锡)智能科技有限公司 Optimized data processing method and system of data fusion engine
CN117009921B (en) * 2023-08-04 2024-02-23 振宁(无锡)智能科技有限公司 Optimized data processing method and system of data fusion engine

Similar Documents

Publication Publication Date Title
Wang et al. Industrial big data analytics: challenges, methodologies, and applications
Yaqoob et al. Big data: From beginning to future
CN107819824B (en) Urban data opening and information service system and service method
Hu et al. Toward scalable systems for big data analytics: A technology tutorial
CN115269743A (en) Data collection and processing system for data fusion
CN106407278B (en) Architecture design system of big data platform
CN103838847A (en) Data organization method oriented to sea-cloud collaboration network computing network
CN101695082B (en) Service organization method based on relation mining and device thereof
CN111931027A (en) Intelligent news public opinion early warning system
US20190050435A1 (en) Object data association index system and methods for the construction and applications thereof
CN102880683A (en) Automatic network generation system for feasibility study report and generation method thereof
CN101827239A (en) Mass mobile streaming media image and video data storage and cross-domain resource access
CN115858513A (en) Data governance method, data governance device, computer equipment and storage medium
CN115858829A (en) Multi-source heterogeneous environment data asset construction method based on computational power network
Zhang et al. The construction of a domain knowledge graph and its application in supply chain risk analysis
Laurent et al. Data lakes
CN115309749A (en) Big data experiment system for scientific and technological service
Ali et al. A state of art survey for big data processing and nosql database architecture
Prajapati et al. A review on big data with data mining
Suresh et al. Cloud-based big data analysis tools and techniques towards sustainable smart city services
CN113111244A (en) Multisource heterogeneous big data fusion system based on traditional Chinese medicine knowledge large-scale popularization
Ma et al. Banking Comprehensive Risk Management System Based on Big Data Architecture of Hybrid Processing Engines and Databases
Almeida et al. Survey on trends in big data: Data management, integration and cloud computing environment
KR20210045172A (en) Big Data Management and System for Livestock Disease Outbreak Analysis
CN117076463B (en) Multi-source data aggregation storage system for smart city

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination