CN113220826A - Scientific and creative service platform based on natural language processing technology and big data analysis - Google Patents
Scientific and creative service platform based on natural language processing technology and big data analysis Download PDFInfo
- Publication number
- CN113220826A CN113220826A CN202110416821.3A CN202110416821A CN113220826A CN 113220826 A CN113220826 A CN 113220826A CN 202110416821 A CN202110416821 A CN 202110416821A CN 113220826 A CN113220826 A CN 113220826A
- Authority
- CN
- China
- Prior art keywords
- data
- big data
- layer
- platform
- metadata
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000005516 engineering process Methods 0.000 title claims abstract description 30
- 238000007405 data analysis Methods 0.000 title claims abstract description 18
- 238000003058 natural language processing Methods 0.000 title claims abstract description 11
- 230000006870 function Effects 0.000 claims abstract description 18
- 238000012545 processing Methods 0.000 claims abstract description 13
- 238000004458 analytical method Methods 0.000 claims abstract description 7
- 238000004422 calculation algorithm Methods 0.000 claims description 18
- 238000007418 data mining Methods 0.000 claims description 4
- 230000004048 modification Effects 0.000 claims description 4
- 238000012986 modification Methods 0.000 claims description 4
- 238000003909 pattern recognition Methods 0.000 claims description 4
- 238000012217 deletion Methods 0.000 claims description 3
- 230000037430 deletion Effects 0.000 claims description 3
- 238000007619 statistical method Methods 0.000 claims description 3
- 238000011161 development Methods 0.000 abstract description 7
- 230000009466 transformation Effects 0.000 abstract description 2
- 238000007726 management method Methods 0.000 description 13
- 239000008186 active pharmaceutical agent Substances 0.000 description 4
- 238000010586 diagram Methods 0.000 description 4
- 238000001914 filtration Methods 0.000 description 3
- 238000000034 method Methods 0.000 description 3
- 238000012544 monitoring process Methods 0.000 description 3
- 241000233805 Phoenix Species 0.000 description 2
- 230000008901 benefit Effects 0.000 description 2
- RYGMFSIKBFXOCR-UHFFFAOYSA-N Copper Chemical compound [Cu] RYGMFSIKBFXOCR-UHFFFAOYSA-N 0.000 description 1
- 241000282326 Felis catus Species 0.000 description 1
- BQCADISMDOOEFD-UHFFFAOYSA-N Silver Chemical compound [Ag] BQCADISMDOOEFD-UHFFFAOYSA-N 0.000 description 1
- 230000004913 activation Effects 0.000 description 1
- 230000004075 alteration Effects 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 238000004364 calculation method Methods 0.000 description 1
- 238000004891 communication Methods 0.000 description 1
- 229910052802 copper Inorganic materials 0.000 description 1
- 239000010949 copper Substances 0.000 description 1
- 238000013500 data storage Methods 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- PCHJSUWPFVWCPO-UHFFFAOYSA-N gold Chemical compound [Au] PCHJSUWPFVWCPO-UHFFFAOYSA-N 0.000 description 1
- 229910052737 gold Inorganic materials 0.000 description 1
- 239000010931 gold Substances 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 238000012423 maintenance Methods 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- 230000006855 networking Effects 0.000 description 1
- 238000005457 optimization Methods 0.000 description 1
- 238000003672 processing method Methods 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 230000004044 response Effects 0.000 description 1
- 238000012216 screening Methods 0.000 description 1
- 229910052709 silver Inorganic materials 0.000 description 1
- 239000004332 silver Substances 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/33—Querying
- G06F16/3331—Query processing
- G06F16/334—Query execution
- G06F16/3344—Query execution using natural language analysis
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/10—File systems; File servers
- G06F16/18—File system types
- G06F16/182—Distributed file systems
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/951—Indexing; Web crawling techniques
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Databases & Information Systems (AREA)
- Data Mining & Analysis (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Artificial Intelligence (AREA)
- Computational Linguistics (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
The invention relates to the technical field of big data information networks, in particular to a scientific and creative service platform based on natural language processing technology and big data analysis. The system comprises an infrastructure layer, a big data platform layer and a big data application layer which are connected with each other through the Internet; the infrastructure layer is based on a distributed storage architecture, and an infrastructure framework is established, so that a stable, reliable, high-performance, strong-expansibility and easy-to-manage infrastructure layer is provided for a large data platform layer; the big data platform layer is used for providing big data analysis processing functions including data access, analysis, sharing and platform management; the big data application layer is used for providing various big data applications to fully show the data processing capacity and the sharing capacity of the big data platform layer. The invention aims to support the government to realize economic sustainable development on a higher platform and provide experience and inspiration for industrial transformation and upgrading.
Description
Technical Field
The invention relates to the technical field of big data information networks, in particular to a scientific and creative service platform based on natural language processing technology and big data analysis.
Background
Big data is changing the information society and we are moving from the IT era to the DT era. The ecological environment of the big data full industrial chain with the core purposes of a data-driven operation system, innovation and creation ecology, a big data industrial chain, government affair data sharing and exchange and a smart city big data support platform is actively built.
The system takes 'one-point innovation and full-disk activation' as a guide idea and 'one platform and multiple applications' as a framework design idea, and truly achieves the purpose of providing all-round services for governments in the aspects of talent cultivation system establishment, employment post increase, industry development space and output value improvement and the like, thereby forming a benchmarking development pattern for the current industrial development situation and tamping the development foundation of the big data industry. The present invention has been made in view of the above circumstances.
Disclosure of Invention
The invention aims to provide a scientific and creative service platform which supports the government to realize economic sustainable development on a higher platform, provides experience and inspiration for industrial transformation and upgrade and is based on a natural language processing technology and big data analysis.
In order to solve the technical problems, the technical scheme of the invention is as follows: a scientific and creative service platform based on natural language processing technology and big data analysis comprises an infrastructure layer, a big data platform layer and a big data application layer which are connected with each other through the Internet;
the infrastructure layer is based on a distributed storage architecture, and an infrastructure framework is established, so that a stable, reliable, high-performance, strong-expansibility and easy-to-manage infrastructure layer is provided for a large data platform layer;
the big data platform layer is used for providing big data analysis processing functions including data access, analysis, sharing and platform management;
the big data application layer is used for providing various big data applications to fully show the data processing capacity and the sharing capacity of the big data platform layer.
The infrastructure layer comprises an internet distributed acquisition system, an internet portal website information acquisition access system, a network social forum information acquisition system and a microblog information acquisition system;
the internet distributed acquisition system is used for acquiring internet data by adopting a distributed network acquisition framework;
the internet portal website information acquisition access system is used for analyzing related B/S webpages of various news portal websites, industry field professional portal websites and the like in real time to acquire qualified data by utilizing a distributed webpage acquisition technology and a pattern recognition technology based on an industry field word stock;
the network social forum information acquisition system acquires social forum information in the internet through a webpage acquisition technology;
the microblog information acquisition system is used for acquiring the webpage of the Xinlang microblog by using an acquisition tool, acquiring information issued by microblog users in real time, and carrying out basic statistical analysis on microblog information to prepare for deep utilization in the future.
The large data platform layer comprises an SOA frame system, a distributed heterogeneous storage system, an efficient algorithm and a distributed computing frame system;
the SOA framework system adopts an SOA framework based on a government affair service bus GSB and government affair data bus GDB dual-bus architecture; the government affair data bus GDB is used for data access and exchange, and the government affair service bus GSB is used for providing a uniform service interface for internal and external.
The distributed heterogeneous storage system adopts a heterogeneous storage scheme combining HDFS, HBase and a cluster relational database which are deeply optimized, and realizes a high-concurrency heterogeneous storage system by utilizing a cache based on a memory exchange technology and a high-performance data middleware;
the efficient algorithm and the distributed computing framework system realize various complex data mining and analyzing requirements by using an efficient distributed computing framework of MapReduce and Spark.
The big data application layer comprises a metadata management system and a metadata service management system;
the metadata management system is based on a Web browser end, provides a function of importing a metadata file for a background system administrator, provides a function of creating, editing and storing data description metadata for the background system administrator based on a single-edition metadata editor, and provides a metadata management WebService service based on metadata retrieval, metadata storage and metadata modification and deletion functions;
the metadata service management system is based on a Web browser end and provides a function of starting and stopping metadata service for a background system administrator.
Compared with the prior art, the invention has the beneficial effects that:
the intelligent data processing system and the intelligent data processing method have the advantages that a plurality of core business requirements such as smart city bottom layer support, government affair data sharing exchange, internet data storage and data access interfaces are born, the platform needs to be provided with and adopt an advanced deep optimization big data technology, a distributed storage calculation and algorithm model, a high-concurrency and quick-response cache architecture and the like, a big data capacity core platform based on an SOA (service oriented architecture) framework is built, the core platform serves as a data intelligent processing center of the whole project, the collection of various data sources (government affair data, internet data, industry data and the like) needs to be supported, and different data formats (structured data, unstructured data, streaming data and the like) are supported. Under the condition of complicated data access requirements, the data access system based on a bus mechanism is adopted, so that the requirements of data access can be quickly and effectively finished, and rich and flexible expansibility is provided.
Drawings
FIG. 1 is a block diagram of the system architecture of the present invention;
FIG. 2 is a block diagram of the system architecture of the infrastructure layer of the present invention;
FIG. 3 is a block diagram of the system architecture of the big data platform tier of the present invention;
fig. 4 is a system structure block diagram of a big data application layer of the present invention.
Detailed Description
The following further describes embodiments of the present invention with reference to the drawings. It should be noted that the description of the embodiments is provided to help understanding of the present invention, but the present invention is not limited thereto. In addition, the technical features involved in the embodiments of the present invention described below may be combined with each other as long as they do not conflict with each other.
Referring to fig. 1-4, the scientific and creative service platform based on natural language processing technology and big data analysis of the present invention includes an infrastructure layer 1, a big data platform layer 2 and a big data application layer 3, which are connected to each other via internet; the infrastructure layer 1 is based on a distributed storage architecture, and an infrastructure framework is established, so that the infrastructure layer 1 which is stable, reliable, high in performance, strong in expansibility and easy to manage is provided for the large data platform layer 2; the big data platform layer 2 is used for providing big data analysis processing functions including data access, analysis, sharing and platform management; the big data application layer 3 is used to provide various big data applications to fully exhibit the data processing capability and sharing capability of the big data platform layer 2.
The infrastructure layer 1 comprises an internet distributed acquisition system, an internet portal website information acquisition access system, a network social forum information acquisition system and a microblog information acquisition system; the internet distributed acquisition system is used for acquiring internet data by adopting a distributed network acquisition framework; the acquisition of the internet data mainly depends on the network acquisition technology. The innovative and creative big data platform adopts an advanced distributed network acquisition framework, and the framework finishes the unified scheduling, management and maintenance work of acquisition and the unified storage work of acquired data. The acquisition user can complete the grabbing work of a complex page and even a website only by carrying out simple configuration or developing a very small amount of script codes. In addition, the platform properly utilizes anti-defense acquisition technologies such as identifying codes, dynamic IP, dynamic users and the like in a legal range, thereby ensuring effective acquisition of internet data, reducing manual intervention and saving cost. The platform provides collection of partial internet portal websites, social networking forums, microblogs and other systems, and collects corresponding data for the platform to use. For newly added applications later, if data of other websites are needed, a corresponding acquisition tool can be developed by using an SDK suite provided by the platform based on an acquisition framework.
The internet portal website information acquisition access system is used for analyzing related B/S webpages of various news portal websites, industry field professional portal websites and the like in real time to acquire qualified data by utilizing a distributed webpage acquisition technology and a pattern recognition technology based on an industry field word stock; by utilizing a distributed webpage acquisition technology and a pattern recognition technology based on an industry field word stock, relevant B/S webpages of various news portal websites, industry field professional portal websites and the like are analyzed in real time to obtain qualified data. The real-time monitoring of the objects of the internet portal website information acquisition, such as a Xinhua network, a people network, a Chinese news network, a new wave network, a fox searching network, a Tencent network, a Yinyi network, a phoenix network, a provincial news network and other large comprehensive news websites, obtains the information related to the e-government affairs according with the conditions. The method is used for monitoring portal websites in the industry fields of China meteorological networks, China earthquake platform networks, traffic networks, disaster reduction networks and the like in real time and acquiring information of meteorological disasters, earthquakes, traffic disasters and natural disasters. Other governments desire valuable internet information to be obtained in a timely manner, etc.
The network social forum information acquisition system acquires social forum information in the internet through a webpage acquisition technology; similar to the portal website information acquisition mode, the network social forum information acquisition also acquires the social forum information in the internet through the webpage acquisition technology. The method mainly provides real-time monitoring for information of a plurality of mainstream forums such as a sky community, a cat promotion community, a fox search forum, a phoenix forum, a cyber forum, a new wave forum, a kaidi community, a strong country forum, a Chinese network forum, a Xinhua network forum, a world network forum, a red network forum and the like, and information related to electronic government affairs meeting conditions is obtained. The information in public communication platforms such as dog searching and speaking bars and Baidu post bars is monitored, and information content related to electronic government affair reflected by netizens is obtained.
The microblog information acquisition system is used for acquiring the webpage of the Sing microblog by using an acquisition tool, acquiring information issued by microblog users in real time, and performing basic statistical analysis on the microblog information to prepare for deep utilization in the future.
The large data platform layer 2 comprises an SOA frame system, a distributed heterogeneous storage system, an efficient algorithm and a distributed computing frame system;
the SOA frame system adopts an SOA frame based on a government affair service bus GSB and government affair data bus GDB double-bus structure; the government affair data bus GDB is used for data access and exchange, and the government affair service bus GSB is used for providing a uniform service interface for internal and external.
The distributed heterogeneous storage system adopts a heterogeneous storage scheme combining HDFS, HBase and a cluster relational database which are deeply optimized, and realizes a high concurrent heterogeneous storage system by utilizing a cache based on a memory exchange technology and a high-performance data middleware;
the efficient algorithm and the distributed computing framework system realize various complex data mining and analyzing requirements by using an efficient distributed computing framework of MapReduce and Spark.
The big data application layer 3 comprises a metadata management system and a metadata service management system;
the metadata management system is based on a Web browser end, provides a function of importing a metadata file for a background system administrator, provides a function of creating, editing and storing data description metadata for the background system administrator based on a single-edition metadata editor, and provides a metadata management WebService service based on metadata retrieval, metadata storage and metadata modification and deletion functions;
the metadata service management system is based on a Web browser end and provides a function of starting and stopping metadata service for a background system administrator.
Access to internet information will provide richer data resources for big data on the scale of information resources. The system accesses relevant information resources such as Internet portal websites, network social forums, microblog public opinion information and the like.
Data analysis cannot be generalized, and requires support by the underlying algorithm. As the size and complexity of data sets continues to rise, the algorithm requirements are also increasing. The innovation and creation big data platform adopts a processing technology based on a Hadoop technical architecture, and provides dozens of distributed algorithms with independent intellectual property rights for data analysts and developers. The platform provides a corresponding SDK development tool suite and an algorithm calling API, so that various applications can conveniently use platform data and algorithms thereof. The method has the advantages that the basic algorithm is supported, only the first step of data analysis and mining is completed, and for each specific data-based application system, a corresponding analysis model is required to be established according to the characteristics of the application system, so that the application system can be effectively supported. Briefly, an analytical model may be viewed as a combination of one or more basic algorithms that provide a number of intermediate results, with the model processing the number of intermediate results into a final result.
Three distributed computing frameworks provided by a big data platform provide strong distributed computing capability for the platform, so that a basic operating environment is provided for various data-based applications; meanwhile, a large number of distributed algorithms in the basic algorithm library provide bottom-layer tool support for establishing a data analysis model. In order to apply the computing power and the algorithm analysis power of the platform to actual business, an analysis program using the computing power and the algorithm library needs to be developed by a developer.
The data display mainly displays information such as data types, descriptions, samples, historical visit volumes and the like to a client so as to facilitate analysis and use of the user. Specifically, the data classification display data classification can be displayed in a head navigation bar of a page, when a user puts a mouse on a certain large classification, all sub-classifications under the current large classification are popped up, and the user clicks the sub-classifications to enter a detailed list page of related data of the current sub-classifications.
The data filtering is to provide a label filtering function, and a user clicks one of the labels to re-filter the target data according to the label and the previous filtering label. The keywords of the data title matched with the label of the target data are found out and displayed in a list form for the user to select.
The data searching is to provide a searching function, a user searches for data matched with related input keywords through page searching, a title or search details of the searched data can be selected before a search box, the default is the keyword of the title, the keyword of the input data is searched for the keyword matching of the related data according to the processing of a program, and the keyword matching is displayed in a list form for selection.
The data list is to show all data selected by the current user or in a default category, and is to be shown in a form of list paging according to a time default sorting. The current list may show some basic information of the data "title of the data, score of data quality, picture of the data, access amount of the data, and simple description of the data"; if the current logged-in user is a user of an innovation workshop, the user of the innovation factory enters an API list, the grade of gold, silver and copper is added behind the API title, and only the user according with the current grade can call the current API data.
The data detail information is the detail information page for clicking the title of the data into the current data. Detailed information of the current data is displayed, and comprises' pictures of the data, titles of the data, values of the data, data sources, the time for putting the data on shelf, the size of the data, the downloading amount of the current data, short description of the data, the integrity (eight indexes) of the current data, collections, relevant data which is recommended to the current data, detailed information of the data, and comment information of accessed users on the current data.
The retrieval of the data comprises the screening of the labels and the searching of keywords and detailed information of the data. When a user clicks on one of the labels, the target data is re-filtered according to the label and the previous filter label.
The target data is the keyword of the data title matched with the label. The title or the detailed information of the searched data can be selected before the search box, the default is the keyword of the title, the keyword of the input data is searched for keyword matching of related data according to the processing of a program, and the keyword matching is displayed in a list form for selection.
The embodiments of the present invention have been described in detail with reference to the accompanying drawings, but the present invention is not limited to the described embodiments. It will be apparent to those skilled in the art that various changes, modifications, substitutions and alterations can be made in these embodiments without departing from the principles and spirit of the invention, and the scope of protection is still within the scope of the invention.
Claims (4)
1. A scientific and creative service platform based on natural language processing technology and big data analysis is characterized in that: the system comprises an infrastructure layer (1), a big data platform layer (2) and a big data application layer (3) which are connected with each other through the Internet;
the infrastructure layer (1) is based on a distributed storage architecture, and an infrastructure framework is established, so that the stable, reliable, high-performance, strong-expansibility and easy-management infrastructure layer (1) is provided for the large data platform layer (2);
the big data platform layer (2) is used for providing big data analysis processing functions including data access, analysis, sharing and platform management;
the big data application layer (3) is used for providing various big data applications to fully show the data processing capacity and the sharing capacity of the big data platform layer (2).
2. The scientific service platform based on natural language processing technology and big data analysis according to claim 1, characterized in that: the infrastructure layer (1) comprises an internet distributed acquisition system, an internet portal website information acquisition access system, a network social forum information acquisition system and a microblog information acquisition system;
the internet distributed acquisition system is used for acquiring internet data by adopting a distributed network acquisition framework;
the internet portal website information acquisition access system is used for analyzing related B/S webpages of various news portal websites, industry field professional portal websites and the like in real time to acquire qualified data by utilizing a distributed webpage acquisition technology and a pattern recognition technology based on an industry field word stock;
the network social forum information acquisition system acquires social forum information in the internet through a webpage acquisition technology;
the microblog information acquisition system is used for acquiring the webpage of the Xinlang microblog by using an acquisition tool, acquiring information issued by microblog users in real time, and carrying out basic statistical analysis on microblog information to prepare for deep utilization in the future.
3. The scientific service platform based on natural language processing technology and big data analysis according to claim 1, characterized in that: the large data platform layer (2) comprises an SOA frame system, a distributed heterogeneous storage system, an efficient algorithm and a distributed computing frame system;
the SOA framework system adopts an SOA framework based on a government affair service bus GSB and government affair data bus GDB dual-bus architecture; the government affair data bus GDB is used for data access and exchange, and the government affair service bus GSB is used for providing a uniform service interface for internal and external.
The distributed heterogeneous storage system adopts a heterogeneous storage scheme combining HDFS, HBase and a cluster relational database which are deeply optimized, and realizes a high-concurrency heterogeneous storage system by utilizing a cache based on a memory exchange technology and a high-performance data middleware;
the efficient algorithm and the distributed computing framework system realize various complex data mining and analyzing requirements by using an efficient distributed computing framework of MapReduce and Spark.
4. The scientific service platform based on natural language processing technology and big data analysis according to claim 1, characterized in that: the big data application layer (3) comprises a metadata management system and a metadata service management system;
the metadata management system is based on a Web browser end, provides a function of importing a metadata file for a background system administrator, provides a function of creating, editing and storing data description metadata for the background system administrator based on a single-edition metadata editor, and provides a metadata management WebService service based on metadata retrieval, metadata storage and metadata modification and deletion functions;
the metadata service management system is based on a Web browser end and provides a function of starting and stopping metadata service for a background system administrator.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110416821.3A CN113220826A (en) | 2021-04-19 | 2021-04-19 | Scientific and creative service platform based on natural language processing technology and big data analysis |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110416821.3A CN113220826A (en) | 2021-04-19 | 2021-04-19 | Scientific and creative service platform based on natural language processing technology and big data analysis |
Publications (1)
Publication Number | Publication Date |
---|---|
CN113220826A true CN113220826A (en) | 2021-08-06 |
Family
ID=77087696
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202110416821.3A Pending CN113220826A (en) | 2021-04-19 | 2021-04-19 | Scientific and creative service platform based on natural language processing technology and big data analysis |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN113220826A (en) |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107958322A (en) * | 2017-10-09 | 2018-04-24 | 中国电子科技集团公司第二十八研究所 | A kind of urban network spatial synthesis governing system |
CN108009294A (en) * | 2017-12-26 | 2018-05-08 | 中电科大数据研究院有限公司 | A kind of big data wisdom credit administers platform architecture |
CN110706141A (en) * | 2019-07-23 | 2020-01-17 | 杭州中软安人网络通信股份有限公司 | E-government affair big data service system |
CN111984717A (en) * | 2020-08-26 | 2020-11-24 | 江西微博科技有限公司 | Big data intelligent government affair platform information management method |
CN112116488A (en) * | 2020-04-28 | 2020-12-22 | 刘革瑞 | Water conservancy big data comprehensive maintenance system |
-
2021
- 2021-04-19 CN CN202110416821.3A patent/CN113220826A/en active Pending
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107958322A (en) * | 2017-10-09 | 2018-04-24 | 中国电子科技集团公司第二十八研究所 | A kind of urban network spatial synthesis governing system |
CN108009294A (en) * | 2017-12-26 | 2018-05-08 | 中电科大数据研究院有限公司 | A kind of big data wisdom credit administers platform architecture |
CN110706141A (en) * | 2019-07-23 | 2020-01-17 | 杭州中软安人网络通信股份有限公司 | E-government affair big data service system |
CN112116488A (en) * | 2020-04-28 | 2020-12-22 | 刘革瑞 | Water conservancy big data comprehensive maintenance system |
CN111984717A (en) * | 2020-08-26 | 2020-11-24 | 江西微博科技有限公司 | Big data intelligent government affair platform information management method |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN110147437B (en) | Knowledge graph-based searching method and device | |
Li et al. | An active crawler for discovering geospatial web services and their distribution pattern–A case study of OGC Web Map Service | |
Das et al. | Big data analytics: A framework for unstructured data analysis | |
CN104850601B (en) | Police service based on chart database analyzes application platform and its construction method in real time | |
Yu et al. | Summary of web crawler technology research | |
CN1858737B (en) | Method and system for data searching | |
CN105007314A (en) | Big data processing system oriented to mass reading data of readers | |
US20160203224A1 (en) | System for analyzing social media data and method of analyzing social media data using the same | |
CN111259220A (en) | Data acquisition method and system based on big data | |
Li | [Retracted] Internet Tourism Resource Retrieval Using PageRank Search Ranking Algorithm | |
CN114637903A (en) | Public opinion data acquisition system for directional target data expansion | |
Ahamed et al. | An Efficient Mechanism for Deep Web Data Extraction Based on Tree‐Structured Web Pattern Matching | |
Xia et al. | Optimizing an index with spatiotemporal patterns to support GEOSS Clearinghouse | |
Bross et al. | Mapping the blogosphere with rss-feeds | |
Suzumura et al. | StreamWeb: Real-time web monitoring with stream computing | |
Huang | Geopubsubhub: A geospatial publish/subscribe architecture for the world-wide sensor web | |
CN113220826A (en) | Scientific and creative service platform based on natural language processing technology and big data analysis | |
Zhou et al. | A distributed text mining system for online web textual data analysis | |
Malik et al. | Ontology and Web Usage Mining towards an Intelligent Web focusing web logs | |
CN114328947A (en) | Knowledge graph-based question and answer method and device | |
CN113407803A (en) | Method for acquiring internet data in one step | |
CN105912584B (en) | Data indexing system based on webpage information data | |
Xu et al. | Method of deep web collection for mobile application store based on category keyword searching | |
Wang et al. | A hunger-based scheduling strategy for distributed crawler | |
Kamath et al. | A bio-inspired, incremental clustering algorithm for semantics-based web service discovery |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20210806 |