CN110851488A - Multi-source-based multi-modal data fusion analysis processing method and platform - Google Patents

Multi-source-based multi-modal data fusion analysis processing method and platform Download PDF

Info

Publication number
CN110851488A
CN110851488A CN201910920062.7A CN201910920062A CN110851488A CN 110851488 A CN110851488 A CN 110851488A CN 201910920062 A CN201910920062 A CN 201910920062A CN 110851488 A CN110851488 A CN 110851488A
Authority
CN
China
Prior art keywords
data
fusion
internet
acquisition
analysis
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201910920062.7A
Other languages
Chinese (zh)
Inventor
曹娅琪
陈�峰
闫碧莹
陈新国
吴小全
张思卿
曹蓉
韩云杰
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Zhong Kjia Speed Beijing Information Technology Co ltd
Guiyang Academy Of Information Technology (institute Of Software Chinese Academy Of Sciences Guiyang Branch)
Institute of Software of CAS
Original Assignee
Zhong Kjia Speed Beijing Information Technology Co ltd
Guiyang Academy Of Information Technology (institute Of Software Chinese Academy Of Sciences Guiyang Branch)
Institute of Software of CAS
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Zhong Kjia Speed Beijing Information Technology Co ltd, Guiyang Academy Of Information Technology (institute Of Software Chinese Academy Of Sciences Guiyang Branch), Institute of Software of CAS filed Critical Zhong Kjia Speed Beijing Information Technology Co ltd
Priority to CN201910920062.7A priority Critical patent/CN110851488A/en
Publication of CN110851488A publication Critical patent/CN110851488A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2458Special types of queries, e.g. statistical queries, fuzzy queries or distributed queries
    • G06F16/2465Query processing support for facilitating data mining operations in structured databases
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/27Replication, distribution or synchronisation of data between databases or within a distributed database system; Distributed database system architectures therefor
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/28Databases characterised by their database models, e.g. relational or object models
    • G06F16/284Relational databases
    • G06F16/285Clustering or classification
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/28Databases characterised by their database models, e.g. relational or object models
    • G06F16/284Relational databases
    • G06F16/288Entity relationship models
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/25Fusion techniques
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/01Protocols
    • H04L67/10Protocols in which an application is distributed across nodes in the network
    • H04L67/1097Protocols in which an application is distributed across nodes in the network for distributed storage of data in networks, e.g. transport arrangements for network file system [NFS], storage area networks [SAN] or network attached storage [NAS]

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Probability & Statistics with Applications (AREA)
  • Artificial Intelligence (AREA)
  • Computing Systems (AREA)
  • Fuzzy Systems (AREA)
  • Mathematical Physics (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Software Systems (AREA)
  • Computational Linguistics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Signal Processing (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Evolutionary Biology (AREA)
  • Evolutionary Computation (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention discloses a multi-source multi-modal data fusion analysis processing method, which is characterized in that data acquisition is carried out in the modes of interface service acquisition, sensing and reading of the Internet of things, database synchronization, file synchronization, data crawling and the like, and various data cleaning models are established for processing multi-modal data aiming at the problems of data loss, abnormal space-time data, inconsistent data and the like existing in the acquired massive data such as structured database data, semi-structured network data, unstructured text, video and the like; the multi-source multi-modal data fusion analysis module constructs an algorithm library of common data fusion algorithms, and can support text data analysis and image data analysis, and fusion analysis of structured data and unstructured data from multiple sources.

Description

Multi-source-based multi-modal data fusion analysis processing method and platform
Technical Field
The invention relates to the technical field of big data processing, in particular to a multi-source multi-modal data fusion analysis processing method and a multi-source multi-modal data fusion analysis processing platform.
Background
In the multi-source heterogeneous data in the fields of traffic, judicial and the like, the data are various in types, wide in sources and inconsistent in protocol diversification, and the problems of disordered data, low quality, disordered data architecture, inconsistent storage and the like exist.
Since the original data contains a large amount of error and redundant data, and the quality of the data directly affects the reliability of the analysis result of the upper-layer application and the real realization of the application target, the data quality of the original data needs to be evaluated so as to provide richer data information for the upper-layer application. Based on the modeling of the quality of heterogeneous multi-source multi-modal data, a new fusion analysis processing method needs to be designed for solving the problems of data loss, abnormal space-time data, inconsistent data and the like in the multi-source multi-modal data, and the multi-modal data is processed through a plurality of data cleaning models.
Disclosure of Invention
In view of the above, the present invention provides a multi-source-based multi-modal data fusion analysis processing method, and further provides a multi-source-based multi-modal data fusion analysis processing platform, which constructs an algorithm library for cluster analysis, association analysis, classification prediction, and the like, according to different application requirements and data structures, so as to implement fusion analysis on structured data and unstructured data from multiple sources, and solve the problems of data loss, temporal-spatial data abnormality, data inconsistency, and the like in the existing multi-source multi-modal data.
One of the purposes of the invention is realized by the following technical scheme:
the multi-source multi-modal data fusion analysis processing method comprises the following steps:
step S1, data acquisition, including system fusion acquisition, Internet of things data acquisition and Internet data acquisition; the system fusion acquisition acquires data of the open data interface and the accessed database service through a fusion acquisition system; the data of the Internet of things is subjected to real-time data acquisition by adopting a distributed coordination service Zookeeper and a message middleware Kafka in a distributed environment; the internet data acquisition is realized by crawling data through a built distributed crawler system;
step S2: the method comprises the steps of performing fusion analysis on collected data, performing feature extraction and attribute fusion on the data according to different characteristics of the data, and constructing a clustering analysis, association analysis and classification prediction algorithm library according to different application requirements and data structures to realize fusion analysis on structured data and unstructured data from multiple sources.
In particular, in the step S1, in the process of system fusion acquisition and data acquisition of the internet of things, the temporary storage Redis, the permanent storage MySQL, and the distributed storage HDFS are performed on the acquired data according to the situation, and the problems of data redundancy, missing, abnormality, inconsistency, and the like existing in the data are removed, complemented, or modified through a data quality detection model on the structured data, so as to improve the quality of the structured data.
In particular, in the step S1, in the internet data acquisition process, the HDFS is distributively stored for the acquired data according to the situation, the chinese named entity recognition is performed for the massive text data by using Lattice LSTM, the entity extraction is performed, and the relationship mining is performed by using the iterative iteration of bootstrapping.
The second purpose of the invention is realized by the following technical scheme:
the multi-source based multi-modal data fusion analysis processing platform comprises
The data acquisition module comprises a system fusion acquisition unit, an internet data acquisition unit and an internet of things data acquisition unit;
the infrastructure of the hardware is such that,
the virtual facility is used for realizing server virtualization, storage virtualization and network virtualization;
the scheduling center is used for realizing task scheduling, resource scheduling, availability management and load balancing;
the data preprocessing module is used for realizing preprocessing and data extraction of unstructured data, semi-structured data and structured data;
the data mining module is used for realizing the feature extraction, the association analysis, the classification prediction and the clustering analysis of the preprocessed data;
the central module is applied to the application of the central module,
an open center module.
Particularly, the system fusion acquisition unit acquires data of the open data interface and the accessed database service through a fusion acquisition system; the Internet of things data acquisition unit acquires real-time data by adopting a distributed coordination service Zookeeper and a message middleware Kafka in a distributed environment; the internet data acquisition unit crawls data through the built distributed crawler system.
Particularly, in the internet data acquisition process, distributed storage HDFS is carried out on the acquired data according to the situation, Chinese naming entity identification is carried out on massive text data by adopting Lattice LSTM, entity extraction is carried out, and relationship mining is carried out by adopting repeat iteration of bootstrapping.
The invention has the beneficial effects that:
the invention provides a multi-source multi-modal data fusion analysis processing method, which is characterized in that data acquisition is carried out in the modes of interface service acquisition, sensing and reading of the Internet of things, database synchronization, file synchronization, data crawling and the like, and various data cleaning models are established for processing multi-modal data aiming at the problems of data loss, abnormal space-time data, inconsistent data and the like existing in the acquired massive data such as structured database data, semi-structured network data, unstructured text, video and the like; the multi-source multi-modal data fusion analysis module constructs an algorithm library of common data fusion algorithms, and can support text data analysis and image data analysis, and fusion analysis of structured data and unstructured data from multiple sources.
Additional advantages, objects, and features of the invention will be set forth in part in the description which follows and in part will become apparent to those having ordinary skill in the art upon examination of the following or may be learned from practice of the invention. The objectives and other advantages of the invention will be realized and attained by the structure particularly pointed out in the written description and claims hereof.
Drawings
In order to make the objects, technical solutions and advantages of the present invention more apparent, the present invention will be further described in detail with reference to the accompanying drawings, in which:
FIG. 1 is a schematic flow chart of the method of the present invention.
FIG. 2 is a platform architecture diagram according to the present invention.
Detailed Description
Hereinafter, preferred embodiments of the present invention will be described in detail with reference to the accompanying drawings. It should be understood that the preferred embodiments are illustrative of the invention only and are not limiting upon the scope of the invention.
As shown in FIG. 1, the multi-source multi-modal data fusion analysis processing method based on the invention comprises the following steps:
step S1, data acquisition, including system fusion acquisition, Internet of things data acquisition and Internet data acquisition; the system fusion acquisition acquires data of the open data interface and the accessed database service through a fusion acquisition system; the method comprises the following steps that data of the Internet of things are subjected to real-time data acquisition in a distributed environment by adopting a distributed coordination service Zookeeper and a message middleware Kafka; the internet data acquisition is realized by crawling data through a built distributed crawler system, so that the data can be efficiently crawled, and meanwhile, an anti-crawler mechanism of a social network is prevented from being triggered; in the process of system fusion acquisition and data acquisition of the Internet of things, temporary storage Redis, permanent storage MySQL and distributed storage HDFS are carried out on the acquired data according to conditions, and data elimination, completion or modification is carried out on the problems of data redundancy, deficiency, abnormity, inconsistency and the like existing in the data through a data quality detection model aiming at the structured data, so that the quality of the structured data is improved.
In the internet data acquisition process, distributed storage HDFS is carried out on the acquired data according to conditions, Chinese named entity identification is carried out on massive text data by adopting Lattice LSTM, entity extraction is carried out, and relationship mining is carried out by adopting repeat iteration of bootstrapping.
Step S2: the method comprises the steps of performing fusion analysis on collected data, performing feature extraction and attribute fusion on the data according to different characteristics of the data, and constructing a clustering analysis, association analysis and classification prediction algorithm library according to different application requirements and data structures to realize fusion analysis on structured data and unstructured data from multiple sources.
Based on the design idea of the method, the invention also provides a multi-source multi-modal data fusion analysis and processing platform, which comprises the following components:
(1) the data acquisition module comprises a system fusion acquisition unit, an internet data acquisition unit and an internet of things data acquisition unit; the system fusion acquisition unit acquires data of the open data interface and the accessed database service through the fusion acquisition system; the Internet of things data acquisition unit acquires real-time data by adopting a distributed coordination service Zookeeper and a message middleware Kafka in a distributed environment; the Internet data acquisition unit crawls data through a built distributed crawler system;
in the internet data acquisition process, distributed storage HDFS is carried out on the acquired data according to conditions, Chinese named entity identification is carried out on massive text data by adopting Lattice LSTM, entity extraction is carried out, and relationship mining is carried out by adopting repeat iteration of bootstrapping.
(2) An infrastructure hardware facility; including the infrastructure for implementing the various functions; the system comprises server resources, network resources and storage resources;
(3) virtual facilities: the system is used for realizing server virtualization, storage virtualization and network virtualization;
(4) the scheduling center is used for realizing task scheduling, resource scheduling, availability management and load balancing;
(5) the data preprocessing module is used for realizing preprocessing and data extraction of unstructured data, semi-structured data and structured data;
(5) the data mining module is used for realizing the feature extraction, the association analysis, the classification prediction and the clustering analysis of the preprocessed data;
(6) an application center module: for implementing various functional applications;
(7) an open center module: including software development kit SDK, application programming interface API, and other interfaces and kit software for use by developers.
The big data fusion analysis infrastructure platform can provide mobile terminal APP, city management decision support, public service support and the like.
In the implementation process, the multi-source multi-modal data fusion analysis needs to perform cross-network and cross-modal association analysis on the multi-source multi-modal data. Meanwhile, an algorithm library containing common data fusion algorithms needs to be constructed, so that the platform is convenient for fusion analysis of data. The multi-source multi-modal data analysis platform can support text data analysis and image data analysis, and also faces the need of fusion analysis of text data and image data from multiple sources in many application scenes. Massive text data are analyzed, and the text data analysis can be divided into word segmentation, feature extraction, training models and application. The multi-source multi-modal data analysis platform provides a text analysis algorithm library in order to enable a user to quickly use the platform to analyze text data, and the text analysis algorithm comprises a common word segmentation method, a feature extraction method and a common model.
It should be understood by those skilled in the art that the timing sequence of the method steps provided in the above embodiments may be adaptively adjusted according to actual situations, or may be concurrently performed according to actual situations.
All or part of the steps in the methods according to the above embodiments may be implemented by a program instructing related hardware, where the program may be stored in a storage medium readable by a computer device and used to execute all or part of the steps in the methods according to the above embodiments. The computer device, for example: personal computer, server, network equipment, intelligent mobile terminal, intelligent home equipment, wearable intelligent equipment, vehicle-mounted intelligent equipment and the like; the storage medium, for example: RAM, ROM, magnetic disk, magnetic tape, optical disk, flash memory, U disk, removable hard disk, memory card, memory stick, network server storage, network cloud storage, etc.
Finally, it should also be noted that, herein, relational terms such as first and second, and the like may be used solely to distinguish one entity or action from another entity or action without necessarily requiring or implying any actual such relationship or order between such entities or actions. Also, the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus. Without further limitation, an element defined by the phrase "comprising an … …" does not exclude the presence of other like elements in a process, method, article, or apparatus that comprises the element.
Finally, the above embodiments are only intended to illustrate the technical solutions of the present invention and not to limit the present invention, and although the present invention has been described in detail with reference to the preferred embodiments, it will be understood by those skilled in the art that modifications or equivalent substitutions may be made on the technical solutions of the present invention without departing from the spirit and scope of the technical solutions, and all of them should be covered by the claims of the present invention.

Claims (6)

1. The multi-source-based multi-modal data fusion analysis processing method is characterized by comprising the following steps of: the method comprises the following steps:
step S1, data acquisition, including system fusion acquisition, Internet of things data acquisition and Internet data acquisition; the system fusion acquisition acquires data of the open data interface and the accessed database service through a fusion acquisition system; the data of the Internet of things is subjected to real-time data acquisition by adopting a distributed coordination service Zookeeper and a message middleware Kafaka in a distributed environment; the internet data acquisition is realized by crawling data through a built distributed crawler system;
step S2: the method comprises the steps of performing fusion analysis on collected data, performing feature extraction and attribute fusion on the data according to different characteristics of the data, and constructing a clustering analysis, association analysis and classification prediction algorithm library according to different application requirements and data structures to realize fusion analysis on structured data and unstructured data from multiple sources.
2. The multi-source multi-modal data fusion analysis processing method based on claim 1, wherein: in the step S1, in the process of system fusion acquisition and data acquisition of the internet of things, the temporary storage Redis, the permanent storage MySQL and the distributed storage HDFS are performed on the acquired data according to the situation, and data elimination, completion or modification is performed on the problems of data redundancy, deficiency, abnormality, inconsistency and the like existing in the data through a data quality detection model on the structured data, so as to improve the quality of the structured data.
3. The multi-source multi-modal data fusion analysis processing method based on claim 1, wherein: in the step S1, in the internet data collection process, the HDFS is distributively stored for the collected data according to the situation, the lattic LSTM is used to identify the named entity in chinese for the mass text data, the entity extraction is performed, and the relationship mining is performed by using the iterative iteration of bootstrapping.
4. Based on multisource multimode data fusion analysis processing platform, its characterized in that: the platform comprises
The data acquisition module comprises a system fusion acquisition unit, an internet data acquisition unit and an internet of things data acquisition unit;
the infrastructure of the hardware is such that,
the virtual facility is used for realizing server virtualization, storage virtualization and network virtualization;
the scheduling center is used for realizing task scheduling, resource scheduling, availability management and load balancing;
the data preprocessing module is used for realizing preprocessing and data extraction of unstructured data, semi-structured data and structured data;
the data mining module is used for realizing the feature extraction, the association analysis, the classification prediction and the clustering analysis of the preprocessed data;
the central module is applied to the application of the central module,
an open center module.
5. The multi-source multi-modal data fusion analysis processing platform of claim 4, wherein: the system fusion acquisition unit acquires data of the open data interface and the accessed database service through a fusion acquisition system; the Internet of things data acquisition unit acquires real-time data by adopting a distributed coordination service Zookeeper and a message middleware Kafka in a distributed environment; the internet data acquisition unit crawls data through the built distributed crawler system.
6. The multi-source multi-modal data fusion analysis processing platform of claim 4, wherein: in the internet data acquisition process, distributed storage HDFS is carried out on the acquired data according to conditions, Chinese named entity identification is carried out on massive text data by adopting Lattice LSTM, entity extraction is carried out, and relationship mining is carried out by adopting repeat iteration of bootstrapping.
CN201910920062.7A 2019-09-26 2019-09-26 Multi-source-based multi-modal data fusion analysis processing method and platform Pending CN110851488A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910920062.7A CN110851488A (en) 2019-09-26 2019-09-26 Multi-source-based multi-modal data fusion analysis processing method and platform

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910920062.7A CN110851488A (en) 2019-09-26 2019-09-26 Multi-source-based multi-modal data fusion analysis processing method and platform

Publications (1)

Publication Number Publication Date
CN110851488A true CN110851488A (en) 2020-02-28

Family

ID=69596019

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910920062.7A Pending CN110851488A (en) 2019-09-26 2019-09-26 Multi-source-based multi-modal data fusion analysis processing method and platform

Country Status (1)

Country Link
CN (1) CN110851488A (en)

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111859451A (en) * 2020-07-23 2020-10-30 北京尚隐科技有限公司 Processing system of multi-source multi-modal data and method applying same
CN112100179A (en) * 2020-09-11 2020-12-18 北京明略昭辉科技有限公司 HBASE-based data fusion method, HBASE-based data fusion device, HBASE-based data fusion equipment and computer readable medium
CN112214531A (en) * 2020-10-12 2021-01-12 海南大学 Cross-data, information and knowledge multi-modal feature mining method and component
CN112364000A (en) * 2020-10-29 2021-02-12 广西电网有限责任公司南宁供电局 System and method for multi-source heterogeneous data fusion in power industry
CN116881335A (en) * 2023-07-24 2023-10-13 郑州华商科技有限公司 Multi-mode data intelligent analysis system and method
CN118012850A (en) * 2024-04-08 2024-05-10 北京市农林科学院智能装备技术研究中心 Intelligent irrigation multisource information-oriented database construction system, method and equipment

Citations (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105893526A (en) * 2016-03-30 2016-08-24 上海坤士合生信息科技有限公司 Multi-source data fusion system and method
CN106651188A (en) * 2016-12-27 2017-05-10 贵州电网有限责任公司贵阳供电局 Electric transmission and transformation device multi-source state assessment data processing method and application thereof
CN106778033A (en) * 2017-01-10 2017-05-31 南京邮电大学 A kind of Spark Streaming abnormal temperature data alarm methods based on Spark platforms
CN107193967A (en) * 2017-05-25 2017-09-22 南开大学 A kind of multi-source heterogeneous industry field big data handles full link solution
CN107402976A (en) * 2017-07-03 2017-11-28 国网山东省电力公司经济技术研究院 Power grid multi-source data fusion method and system based on multi-element heterogeneous model
CN107545046A (en) * 2017-08-17 2018-01-05 北京奇安信科技有限公司 A kind of fusion method and device of multi-source heterogeneous data
CN107832913A (en) * 2017-10-11 2018-03-23 微梦创科网络科技(中国)有限公司 The Forecasting Methodology and system to monitoring data trend based on deep learning
CN108737543A (en) * 2018-05-21 2018-11-02 天津中兴智联科技有限公司 A kind of distribution Internet of things middleware and working method
CN109033387A (en) * 2018-07-26 2018-12-18 广州大学 A kind of Internet of Things search system, method and storage medium merging multi-source data
CN109509557A (en) * 2018-11-16 2019-03-22 郑州大学第附属医院 A kind of Chinese electronic health record information extraction preprocess method based on big data platform

Patent Citations (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105893526A (en) * 2016-03-30 2016-08-24 上海坤士合生信息科技有限公司 Multi-source data fusion system and method
CN106651188A (en) * 2016-12-27 2017-05-10 贵州电网有限责任公司贵阳供电局 Electric transmission and transformation device multi-source state assessment data processing method and application thereof
CN106778033A (en) * 2017-01-10 2017-05-31 南京邮电大学 A kind of Spark Streaming abnormal temperature data alarm methods based on Spark platforms
CN107193967A (en) * 2017-05-25 2017-09-22 南开大学 A kind of multi-source heterogeneous industry field big data handles full link solution
CN107402976A (en) * 2017-07-03 2017-11-28 国网山东省电力公司经济技术研究院 Power grid multi-source data fusion method and system based on multi-element heterogeneous model
CN107545046A (en) * 2017-08-17 2018-01-05 北京奇安信科技有限公司 A kind of fusion method and device of multi-source heterogeneous data
CN107832913A (en) * 2017-10-11 2018-03-23 微梦创科网络科技(中国)有限公司 The Forecasting Methodology and system to monitoring data trend based on deep learning
CN108737543A (en) * 2018-05-21 2018-11-02 天津中兴智联科技有限公司 A kind of distribution Internet of things middleware and working method
CN109033387A (en) * 2018-07-26 2018-12-18 广州大学 A kind of Internet of Things search system, method and storage medium merging multi-source data
CN109509557A (en) * 2018-11-16 2019-03-22 郑州大学第附属医院 A kind of Chinese electronic health record information extraction preprocess method based on big data platform

Cited By (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111859451A (en) * 2020-07-23 2020-10-30 北京尚隐科技有限公司 Processing system of multi-source multi-modal data and method applying same
CN111859451B (en) * 2020-07-23 2024-02-06 北京尚隐科技有限公司 Multi-source multi-mode data processing system and method for applying same
CN112100179A (en) * 2020-09-11 2020-12-18 北京明略昭辉科技有限公司 HBASE-based data fusion method, HBASE-based data fusion device, HBASE-based data fusion equipment and computer readable medium
CN112214531A (en) * 2020-10-12 2021-01-12 海南大学 Cross-data, information and knowledge multi-modal feature mining method and component
CN112214531B (en) * 2020-10-12 2021-11-05 海南大学 Cross-data, information and knowledge multi-modal feature mining method and component
WO2022078142A1 (en) * 2020-10-12 2022-04-21 海南大学 Multi-modal feature mining method and component across data, information and knowledge
CN112364000A (en) * 2020-10-29 2021-02-12 广西电网有限责任公司南宁供电局 System and method for multi-source heterogeneous data fusion in power industry
CN116881335A (en) * 2023-07-24 2023-10-13 郑州华商科技有限公司 Multi-mode data intelligent analysis system and method
CN116881335B (en) * 2023-07-24 2024-06-04 郑州华商科技有限公司 Multi-mode data intelligent analysis system and method
CN118012850A (en) * 2024-04-08 2024-05-10 北京市农林科学院智能装备技术研究中心 Intelligent irrigation multisource information-oriented database construction system, method and equipment

Similar Documents

Publication Publication Date Title
US11989597B2 (en) Dataset connector and crawler to identify data lineage and segment data
CN110851488A (en) Multi-source-based multi-modal data fusion analysis processing method and platform
Mehmood et al. Implementing big data lake for heterogeneous data sources
Osman A novel big data analytics framework for smart cities
US11288551B2 (en) Edge-based adaptive machine learning for object recognition
US11915123B2 (en) Fusing multimodal data using recurrent neural networks
US10642891B2 (en) Graph matching by sub-graph grouping and indexing
US9626159B2 (en) Automatic generation of task scripts from web browsing interaction history
US11538236B2 (en) Detecting backdoor attacks using exclusionary reclassification
US10832164B2 (en) Generating streaming analytics applications using a glossary
KR20180085756A (en) Order Clustering and Malicious Information Fighting Methods and Devices
JP2022537170A (en) Cognitive video and voice search aggregation
US11487963B2 (en) Automatically determining whether an activation cluster contains poisonous data
CN114357197B (en) Event reasoning method and device
US11836331B2 (en) Mathematical models of graphical user interfaces
CN114244611B (en) Abnormal attack detection method, device, equipment and storage medium
US20200218994A1 (en) Generating a sequence rule
CN114005055A (en) Method, device and equipment for generating algorithm application element and computer readable storage medium
CN111831750A (en) Block chain data analysis method and device, computer equipment and storage medium
CN111126422B (en) Method, device, equipment and medium for establishing industry model and determining industry
Shaukat et al. Cloud based face recognition for google glass
Jagannathan Building cyber physical systems in the context of Smart Cities
CN114254081B (en) Enterprise big data search system, method and electronic equipment
US20240098106A1 (en) Generating models for detection of anomalous patterns
US20210343039A1 (en) Dynamic sampling for object recognition

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20200228

RJ01 Rejection of invention patent application after publication