CN111008190A - Knowledge collecting, processing and retrieving system - Google Patents

Knowledge collecting, processing and retrieving system Download PDF

Info

Publication number
CN111008190A
CN111008190A CN201911270663.4A CN201911270663A CN111008190A CN 111008190 A CN111008190 A CN 111008190A CN 201911270663 A CN201911270663 A CN 201911270663A CN 111008190 A CN111008190 A CN 111008190A
Authority
CN
China
Prior art keywords
data
knowledge
management
processing
retrieval
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201911270663.4A
Other languages
Chinese (zh)
Inventor
杨春
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Energy Development Research Institute of China Southern Power Grid Co Ltd
Original Assignee
Energy Development Research Institute of China Southern Power Grid Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Energy Development Research Institute of China Southern Power Grid Co Ltd filed Critical Energy Development Research Institute of China Southern Power Grid Co Ltd
Priority to CN201911270663.4A priority Critical patent/CN111008190A/en
Publication of CN111008190A publication Critical patent/CN111008190A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/21Design, administration or maintenance of databases
    • G06F16/214Database migration support
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/21Design, administration or maintenance of databases
    • G06F16/215Improving data quality; Data cleansing, e.g. de-duplication, removing invalid entries or correcting typographical errors
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/23Updating
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2455Query execution
    • G06F16/24552Database cache management
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/25Integrating or interfacing systems involving database management systems
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/28Databases characterised by their database models, e.g. relational or object models
    • G06F16/283Multi-dimensional databases or data warehouses, e.g. MOLAP or ROLAP
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/31Indexing; Data structures therefor; Storage structures
    • G06F16/313Selection or weighting of terms for indexing
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/35Clustering; Classification
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
    • G06Q50/06Energy or water supply

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • General Engineering & Computer Science (AREA)
  • Business, Economics & Management (AREA)
  • Health & Medical Sciences (AREA)
  • Economics (AREA)
  • Human Resources & Organizations (AREA)
  • Quality & Reliability (AREA)
  • Public Health (AREA)
  • Water Supply & Treatment (AREA)
  • General Health & Medical Sciences (AREA)
  • Computational Linguistics (AREA)
  • Marketing (AREA)
  • Primary Health Care (AREA)
  • Strategic Management (AREA)
  • Tourism & Hospitality (AREA)
  • General Business, Economics & Management (AREA)
  • Software Systems (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention discloses a knowledge collection processing and retrieval system, which is used for upgrading and optimizing a knowledge center, particularly optimizing a data bottom layer, realizing data cleaning and integration after upgrading and optimizing, establishing a whole network metadata standard, a data processing index standard, a system integration standard, a knowledge management and service standard and a related standard interface, upgrading an intelligent retrieval function, acquiring service data, establishing a knowledge map and a knowledge community, providing intelligent services such as knowledge tracking analysis and briefing production and the like, assisting the knowledge center to change from a document guarantee transformation to a knowledge service transformation, and completing the change from a passive service to an active service.

Description

Knowledge collecting, processing and retrieving system
Technical Field
The invention relates to a network information processing system, in particular to a knowledge collection processing and retrieval system.
Background
The existing power grid knowledge system is based on information service of massive big data, and needs to deeply integrate and mine massive information, so that more accurate knowledge information and knowledge resources are provided. Therefore, for power grid enterprises, a knowledge and knowledge service platform based on the whole network unification is constructed, resources in the platform are integrated and mined according to business requirements and service characteristics, and functions and service channels of a knowledge system are integrated and optimized, so that the method has important significance. Massive unstructured resources are integrated, the cost of information service is reduced, and the accuracy checking efficiency of user knowledge resources is improved; the repeated construction of an information system is avoided, a data center and a service center are established, and knowledge services are provided uniformly; getting through the association among various resources and improving the quality of information service; constructing a cloud platform architecture of knowledge, and realizing enterprise knowledge service integration; the knowledge service is combined with the knowledge service, an enterprise business knowledge base is built, and knowledge is converted into productivity.
Disclosure of Invention
The invention aims to carry out re-carding and integration optimization on mass data of the current power grid knowledge system, thereby providing efficient knowledge and knowledge service for the requirements of the power industry. .
In order to achieve the purpose, the invention adopts the following technical scheme:
the utility model provides a knowledge collection handles and search system, includes knowledge analysis tracking module, intelligent data mining analysis module, retrieval module, knowledge database, browse retrieval interface, its characterized in that: the data center management application system can manage data, a data access interface and safety access authentication; the data processing and indexing system comprises automatic metadata identification, indexing, processing and management; the knowledge management service module comprises a professional knowledge base, an interest model base and a special knowledge base; the whole system conducts export and import operation on data from different sources based on central heterogeneous data integration, corresponding data in the extractable service system generates a knowledge base and stores the knowledge base in the knowledge base, the system can achieve data preprocessing and updating management, and metadata fields are corresponded and converted to achieve data updating. The data processing indexing system comprises automatic metadata identification and indexing, and extracts and marks resources to be processed through a human-computer interaction interface according to metadata standards; a template can be configured for the resources with fixed format comparison during processing and indexing, and metadata items are automatically identified and automatically extracted according to the template; data processing and management: and knowledge processing and data cleaning preprocessing processing are realized. The data center management application system specifically comprises: data management, namely, a distributed storage mode is adopted, and the system can automatically store in a sub-table mode according to the size of data volume and time; meanwhile, the most suitable storage nodes and storage units are automatically distributed for management according to the storage nodes registered in the data center, and the management specifically comprises data storage node management, data backup and recovery management, data archiving and data cache management; the data access interface is used for facilitating calling and using of later projects and services on data of the data center, and a standard data interface is established according to requirements: a database listing acquisition interface; a data query interface; a data update interface; resource access authorization; resource access authorization; a digital object download interface; and safety access authentication, wherein in order to ensure the safety of data of the data center, an accessor can be authenticated and verified, and only a user registered and authorized in the system can access the authorized resource. The knowledge analysis tracking module is used for carrying out hotspot analysis and tracking on the thematic knowledge set by the user on the basis of local resources and internet resources.
The system also comprises a content management module which comprises unified authentication, authority management and resource copyright management, the whole system is designed according to layers, each layer is communicated with each other in a loose coupling mode, and the whole data framework is based on an SOA system and a cloud storage platform.
Compared with the prior art, the invention has the following advantages: the system has powerful functions and is simple and easy to operate. The information resource databases of various platforms are integrated into a unified platform, so that the functions of cross-database query, customized push and knowledge service are realized, unified, visual and convenient information retrieval and information acquisition means are provided for users, the information service function is expanded, the individuation and networking of information service are realized, and the users can complete various requirements through a unified system. The knowledge resources and services extend to all employee desktops of a company, and become a main window and a main channel for leaders, professional technologies and managers to obtain science, technology, management and innovation sources in daily work; the deep mining and utilization of the industry information resources by the enterprise staff are greatly facilitated.
Drawings
FIG. 1 is a general technical architecture of the system of the present invention
Fig. 2 is a general network architecture of the system of the present invention.
Detailed Description
The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the accompanying drawings of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments.
The bottom layer data architecture is based on an SOA system and a cloud storage platform, a knowledge center bottom layer data architecture is constructed, the whole system is designed according to layers, all the layers are communicated with each other in a loose coupling mode, and the detailed architecture is shown in figure 1. The overall network architecture of the system of the invention is shown in fig. 2, and the three-layer structure is to divide the application function into three parts, namely a presentation layer, a service logic layer and a data layer. Logically making them independent. Compared with the traditional two-layer (C/S) structure, the structure has the following advantages: the system management is simple, and the maintenance workload of the client is greatly reduced; flexible software and hardware system composition is provided; the maintainability and the expandability of the program are improved; the security of the system is increased.
Based on the advanced concept of SOA, a flexible and extensible system architecture is realized, and the combination of openness and extensibility is achieved. The SOA is a leading application system structure with strong integration capability, provides various basic application functions of IT through the concept of business service, can be freely arranged and combined, interconnected and communicated, and can be adjusted by flexibly matching with future and new requirements at any time. The method is combined with the current hottest big data technology, mass big data resources are managed, data analysis and mining technology is utilized to carry out refinement processing on mass data, and useful knowledge is mined in the mass data resources so as to get rid of low-level data application and develop the method to multi-dimensional application of information intelligence and knowledge discovery. The system is based on a cloud platform architecture, and a scalable and extensible enterprise-level cloud platform is realized and comprises a cloud data center and a cloud service platform. The platform function is enhanced by introducing an optimization technology, and a high-performance computing and storing platform is built at low cost, so that the requirements of acquisition, processing, storage and retrieval of mass information of the project are met. And the resource database of the system has: standard database, patent library, scientific and technological achievement library, knowledge product library, periodical library, theory library, electric power book library, south network enterprise library, business production knowledge library, network consulting library, video multimedia library, picture library, electric power mechanism library, electric power student library
For the adjustment and construction of the system, interfaces with other business systems and network resources need to be considered, so that the integration of relevant software and the integration of relevant functions are facilitated, and a user can operate, manage and utilize data in various aspects more conveniently. Data interface with corporate business system: an interface to network resource data; an interface to a southern power grid periodical system; a data interface to a molecular company knowledge system; data interfacing with other systems of the company.
And (3) combing all data of the knowledge center according to the characteristics of the power industry to form a unified metadata standard of the whole network, and establishing a theoretical library, a standard library, a result library and other databases. The method comprises the functions of metadata storage platform transformation, relevant standard construction, business system data acquisition, data preprocessing, updating management and the like.
And establishing a uniform metadata standard of the whole network, automatically collecting metadata of an internal database, and realizing metadata integration and scheduling. For resources that can obtain metadata, the system preferentially adopts a metadata integration mode. The platform is based on a KBase full-text database management system and supports the capability of quickly searching massive information of various unstructured full-text data. Various resources are migrated, synchronized, collected and converted into a KBase full-text database through a MetaServer metadata integration engine and an OpenQuery relational database integration gateway to form a unified metadata warehouse, and WEB retrieval and release of all metadata are realized by adopting the strong full-text retrieval capability of KBase, so that deep integration and value-added service of the resources are realized. The retrieval speed and the retrieval mode are not restricted by different data sources, and the retrieval is integrated in the mode, so that the efficiency is highest. Meanwhile, standard construction of data is carried out, metadata export is carried out, data of an existing system of a knowledge center are exported, migrated and uniformly migrated to a KBase database, and data in an original resource library can be exported in batches in various modes, wherein the data comprise a database gateway, an API (application programming interface), a Web retrieval driver and the like. Various outsourced resources, self-built resources, internal business data and the like are uniformly converted and synchronously stored in a corresponding database table of the data center through multiple ways. According to the characteristics and interface modes of different resource platforms, integration can be carried out through various integration modes. The method comprises the steps of acquiring and combing business data of all business systems of the southern power grid, mining and sorting the production management data of business departments, associating the production management data with related resources of a data center, and playing a certain role in guiding and deciding actual business application of the production management departments.
The data processing indexing system is used for carrying out structuring and fragmenting processing on special and key resources, for example, resources such as electric power standards, scientific and technological achievements and the like complete indexing work on each metadata item and xml data processing in a man-machine interaction mode, and information such as chapters, pictures, tables, formulas, knowledge points and the like of articles is split. The index extraction and the structuralization of the standard data are completed, and the index comparison among the same standards of different enterprises is facilitated.
The method comprises the functions of metadata indexing, automatic identification, knowledge fragmentation realization and the like, and the resources to be processed are extracted and labeled through a human-computer interaction interface according to the metadata standard. And the functions of auxiliary machine indexing and manual indexing and an operation interface are provided, so that the operation of an operator is facilitated.
And for the resources with fixed format comparison, a template can be configured during processing and indexing, and metadata items are automatically identified and automatically extracted according to the template. And (3) realizing knowledge fragmentation: preprocessing processing such as knowledge fragmentation and data cleaning and data XML processing management are realized. And performing more detailed field combing on data resources needing knowledge fragmentation processing, and particularly establishing a field table for fields in the content. Reprocessing the type of resources according to the field table, and processing a data resource into a plurality of pieces of knowledge; the intelligent data mining analysis function of the computer can also be utilized to automatically scan and analyze the data resources according to the field table, automatically extract the related knowledge items, and then manually check to achieve the purpose of knowledge fragmentation. For the knowledge elements, the system can automatically extract the knowledge elements such as nouns, terms, concepts and definitions related to the power grid technology, and perform relevance analysis on the knowledge elements.
The data center management module is the core of the knowledge data center and is used for carrying out unified warehousing, management, organization and service on various types of integrated data including databases and documents. The data import and access are operated through interfaces uniformly provided by the data center, and the data are stored and managed inside the data center through a cloud platform structure, so that the massive unstructured knowledge data center of the southern power grid is constructed. By adopting a distributed storage mode, the system can automatically store in different tables according to the data size and time; and meanwhile, the most suitable storage node and storage unit are automatically allocated for management according to the storage nodes registered in the data center, and the management specifically comprises data storage node management, data backup and recovery management, data archiving and data cache management. The realized functions comprise: data storage management: by adopting a distributed storage mode, the system can automatically store in different tables according to the data size and time. And meanwhile, the most suitable storage node and storage unit are automatically distributed according to the storage nodes registered in the data center. The method specifically comprises data storage node management, data backup and recovery management, data archiving and data cache management.
And (4) security access authentication: for data center security, the visitor can be authenticated and verified, and only users registered and authorized in the system can access the authorized resources. A variety of authentication means are provided including username/password, IP restrictions, certificate authentication, etc.
Resource access authorization: the administrator may assign a database, fields, specific digital objects that the caller may access. Only authorized callers can get the data access result list. And various verification modes such as account passwords, IP addresses and the like are supported. Through the management background, the administrator can authorize the user account.
For the data operation of the system, the control is carried out through the user group and the information classification, the user belongs to different user groups, different user groups can operate different information classifications, and different user groups can execute different data operations on resources, including addition, modification, deletion, browsing and the like, so that information documents under the information classification which can be operated and accessed by the user are defined, the safety control on the data operation is realized, and the condition that different users can only operate the resource documents authorized by the user groups is ensured.
The method mainly comprises the steps of revising a knowledge center portal system, upgrading and reconstructing partial system functions, adjusting the existing knowledge center service portal architecture and page specific layout, designing and customizing a new sub-channel architecture and page layout, upgrading and reconstructing the system functions, and comprises personalized and full-text intelligent retrieval, network-province secondary intelligent combined reference consultation and original text transfer service upgrading and reconstructing, knowledge requirement project function integration and the like. Columns and channels with high reuse rate are reserved, and meanwhile, the original channels and functions need to be perfected. The main manifestations are as follows: and the retrieval function is optimized, so that the method is more intelligent and accurate. The system can preprocess the retrieval conditions of the user, thereby achieving better retrieval effect; and the subscription push function is improved by an algorithm and a strategy, so that accurate push is realized. Garbage information is reduced, and 'fine and less' is achieved; the interface style is consistent with that of the original system, and the exercise adapts to the use habit of the existing user: various data and functions are combed, and the data and functions belonging to the same category are put together, so that a user can conveniently search: and standard data are finely processed, and richer standard services are provided: adding an entrance for business knowledge base query and knowledge submission: integrating advantageous functions in Shenzhen power supply bureau project and Wuhan project: the functions of knowledge sharing and communication are increased.
The functions of the knowledge center website are expanded, and the intelligent knowledge analysis and processing function is provided based on the data of the data center.
Upgrading a result registration system: according to the use feedback of the user to the achievement registration system of the prior knowledge center, the place with good use reverberation of the system is reserved, and the place inconvenient for the user to use is improved.
By using the most popular design style and development technology at present, a result registration system and a member center are upgraded and modified, and the use experience of a user is improved.
And (3) realizing intelligent knowledge analysis and processing: the method comprises the steps of analyzing and tracking knowledge hotspots (including internet data), mining various resources such as periodicals, papers, reports, books, internet data and the like, and mining research hotspots and technical hotspots in a certain period of time through clustering and topic extraction. And continuously tracking and analyzing the hot spot, and mastering the development dynamics and trend of the technology.
On the basis of a data center and an intelligent mining engine, knowledge monitoring and tracking such as research hotspots, frontiers, development trends and the like in the aspect of power grid related technologies are provided for a group aiming at a power grid technology plate, and a referable knowledge brief report is provided for leaders and researchers.
The module is a knowledge topic monitoring and tracking system, so the module mainly monitors and tracks, and the core module mainly comprises the following modules:
the knowledge hotspot analyzing and tracking function is used for mining various resources such as periodicals, papers, reports, books and the like, and mining research hotspots and technical hotspots in a certain period of time through clustering and topic extraction. And continuously tracking and analyzing the hot spot, and mastering the development dynamics and trend of the technology.
The knowledge management and service module is a knowledge base system related to the power grid service, is an open platform, and is used for mining and sorting production management data of a service department, associating the production management data with related resources of a data center, and playing a certain role in guiding and deciding actual service application of the production management department. Such as: the accident recording data of the transformer substation can be built into a special knowledge base, and information such as time, place, reason, solution and the like of each accident is marked. The analysis of the data can help other maintenance personnel to quickly solve the problems on one hand, and can effectively prevent accidents through the data analysis on the other hand.
The system realizes multidimensional evaluation and statistics of academic paper master library resource downloading, reading and accessing times, column access, provincial company access statistics and the like, and the result is stored in the database so as to optimize and adjust the future resource purchasing direction and range and calculate the value of knowledge resources.
The system automatically mines the association relationship among the knowledge through an intelligent mining engine, dynamically extracts the feature vector of each piece of knowledge, retrieves similar indexes according to the feature vector when a user browses the knowledge and dynamically associates the knowledge most relevant to the knowledge.
In addition, the system can excavate user behaviors and access logs, count and classify the knowledge accessed by the users, automatically associate the knowledge which is similar to the user and is accessed by other users in a large amount when accessing a certain knowledge, and thus form a mesh knowledge association diagram.
The subject word mining engine periodically counts the knowledge base, automatically mines subject words of each piece of knowledge, calculates the incidence relation among the subject words, and forms a reticular subject word navigation chart according to the incidence relation and the incidence degree, and the effect is as shown in the following chart, and each click on one subject word leads to outward association by taking the subjects as the center.
The knowledge community develops various consultation services and comprehensive statistical analysis and evaluation functions of daily updating and maintenance according to daily working conditions of two-stage operation and maintenance personnel of the network and provincial companies. And providing chart presentation and comprehensively evaluating the service quality of the secondary operation and maintenance personnel based on the chart presentation. The functions comprise system access, operation log, system operation maintenance statistical data, data analysis and the like.
In order to conveniently acquire the implicit knowledge in the mind of the staff, an internal staff network community is established, and mutual attention, resource uploading and sharing, recommendation and evaluation of individuals are realized; aiming at relevant topics, internal discussion and circles are initiated, implicit knowledge is collected, and production reports are collected. The knowledge community provides an interactive community of knowledge question and answer, knowledge sharing and knowledge precipitation for all users, and the implicit knowledge of experts is mined to form a special subject knowledge base.
The knowledge community realizes the following functions: user communication functions including subject overview and perusal, speech, reply, short message, voting, etc.; auxiliary communication functions, wherein the auxiliary functions comprise essence areas, ranking, retrieval, online user lists, printing, collection and the like; the online discussion function-providing chat room function to facilitate free discussion of users, and providing guest interview function to browse the communication content record and sort the speech; personalized content push-can provide personalized content push function, Email push, layout post subscription, post recommendation function for users; daily management monitoring function-providing comprehensive management function of administrator community parameters and contents, providing monitoring function of buffer pool and access statistics; security control functions-user and group policy management, layout authorization access control, posting information filtering, auditing, IP management, etc. need to be provided.
The invention integrates and integrates a plurality of heterogeneous information resource databases by applying advanced technologies such as heterogeneous data integration technology, intelligent retrieval technology, text mining technology and the like, thereby realizing one-stop cross-database retrieval, multi-language retrieval and intelligent classification of retrieval results; the method comprises the steps of upgrading and optimizing a knowledge center, cleaning and integrating data after upgrading and optimizing, establishing a whole network metadata standard, a data processing index standard, a system integration standard, a data management standard, a service standard and a related standard interface, upgrading an intelligent retrieval function, acquiring business data, establishing a knowledge map and a knowledge community, providing intelligent services such as knowledge tracking analysis and briefing production, assisting the knowledge center to change from a document guarantee transformation to a knowledge service type, and completing the change from a passive service to an active service.

Claims (10)

1. The utility model provides a knowledge collection handles and search system, includes knowledge analysis tracking module, intelligent data mining analysis module, retrieval module, knowledge database, browse retrieval interface, its characterized in that: the data center management application system can manage data, a data access interface and safety access authentication; the data processing and indexing system comprises automatic metadata identification, indexing, processing and management; the knowledge management service module comprises a professional knowledge base, an interest model base and a special knowledge base; the whole system conducts export and import operation on data from different sources based on central heterogeneous data integration, corresponding data in the extractable service system generates a knowledge base and stores the knowledge base in the knowledge base, the system can achieve data preprocessing and updating management, and metadata fields are corresponded and converted to achieve data updating.
2. The knowledge collection processing and retrieval system of claim 1, wherein: the data processing indexing system comprises automatic metadata identification and indexing, and extracts and marks resources to be processed through a human-computer interaction interface according to metadata standards; a template can be configured for the resources with fixed format comparison during processing and indexing, and metadata items are automatically identified and automatically extracted according to the template; data processing and management: and knowledge processing and data cleaning preprocessing processing are realized.
3. The knowledge collection processing and retrieval system of claim 1, wherein: the data center management application system specifically comprises: data management, namely, a distributed storage mode is adopted, and the system can automatically store in a sub-table mode according to the size of data volume and time; meanwhile, the most suitable storage nodes and storage units are automatically distributed for management according to the storage nodes registered in the data center, and the management specifically comprises data storage node management, data backup and recovery management, data archiving and data cache management; the data access interface is used for facilitating calling and using of later projects and services on data of the data center, and a standard data interface is established according to requirements: a database listing acquisition interface; a data query interface; a data update interface; resource access authorization; resource access authorization; a digital object download interface; and safety access authentication, wherein in order to ensure the safety of data of the data center, an accessor can be authenticated and verified, and only a user registered and authorized in the system can access the authorized resource.
4. The knowledge collection processing and retrieval system of claim 1, which provides a user with a variety of personalized intelligent retrieval functions, including retrieval intelligent prompting, intelligent error correction, intelligent conversion, and itemized retrieval intelligent retrieval functions.
5. The knowledge collection processing and retrieval system of claim 1, wherein the knowledge analysis tracking module performs hotspot analysis and tracking on the thematic knowledge set by the user based on local resources and internet resources.
6. The knowledge collection processing and retrieval system of claim 1, the knowledge management services module comprising knowledge bases that primarily navigate and correlate knowledge in different dimensions to form a knowledge network. The professional knowledge base comprises equipment knowledge, post knowledge, department knowledge and technical special knowledge.
7. The knowledge collection and retrieval system of claim 1, further comprising a content management module
Including unified authentication and authority management, resource copyright management, wherein: unified authentication and authority management are realized, unified identity authentication and single sign-on functions are realized, and the roles, authorities and resource security conditions and release of users in the platform are managed; resource copyright management, which is to perform security control and encryption management on documents, and realize the functions of using mode, range management, copy prevention, printing prevention and diffusion propagation prevention of security files by various encryption technologies.
8. The knowledge collection and retrieval system of claim 1, which can perform data mining and intelligent processing on existing projects, data association analysis, automatic clustering of data, classification, indexing, and user behavior analysis and log mining, data mining intelligent processing.
9. The knowledge collection and retrieval system of claim 1, wherein the overall system is designed in layers, each layer communicating with each other via a loose coupling.
10. The knowledge collection and retrieval system of claim 1, wherein the overall data architecture is based on an SOA system and a cloud storage platform.
CN201911270663.4A 2019-12-12 2019-12-12 Knowledge collecting, processing and retrieving system Pending CN111008190A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201911270663.4A CN111008190A (en) 2019-12-12 2019-12-12 Knowledge collecting, processing and retrieving system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201911270663.4A CN111008190A (en) 2019-12-12 2019-12-12 Knowledge collecting, processing and retrieving system

Publications (1)

Publication Number Publication Date
CN111008190A true CN111008190A (en) 2020-04-14

Family

ID=70114281

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201911270663.4A Pending CN111008190A (en) 2019-12-12 2019-12-12 Knowledge collecting, processing and retrieving system

Country Status (1)

Country Link
CN (1) CN111008190A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113094025A (en) * 2021-04-06 2021-07-09 国家电网有限公司客户服务中心 Electric power business hall service system based on knowledge base

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113094025A (en) * 2021-04-06 2021-07-09 国家电网有限公司客户服务中心 Electric power business hall service system based on knowledge base

Similar Documents

Publication Publication Date Title
CN107819824B (en) Urban data opening and information service system and service method
CN104767813B (en) Public's row big data service platform based on openstack
CN101013482A (en) Online integrated management platform for archive and data of constructing project item
CN103929473A (en) Method and system for accessing multiple subsystems and public subsystem with distributed storage personalized data through single APP program
CN102609526A (en) Internet website content management system
CN103745302A (en) Digitalized archival data management system
Zhang et al. Research on the integration of heterogeneous information resources in university management informatization based on data mining algorithms
US20200334314A1 (en) Emergency disposal support system
CN114756567A (en) Intelligent government affair service analysis platform based on big data technology
CN111008190A (en) Knowledge collecting, processing and retrieving system
CN114003634A (en) Big data analysis and retrieval system and method based on ES technology
Gu Integration and optimization of ancient literature information resources based on big data technology
Lv et al. The exploring on university archives management system based on information system
CN111597332A (en) Fiscal encyclopedia application system
Sixin et al. A New Governance Architecture for Government Information Resources Based on Big Data Ecological Environment in China
Kunanets et al. Modeling of consolidated information resource for social data institutions
Zhao et al. Design and Implementation of Enterprise Public Data Management Platform Based on Artificial Intelligence
Ziegler et al. PAL: toward a recommendation system for manuscripts
Ling-ping et al. Application of Three-dimensional Digital Model in Digital Archive Ubiquitous Intelligent Service
Pan et al. An open sharing pattern design of massive power big data
CN110704421A (en) Data processing method, device, equipment and computer readable storage medium
Yun Design and Development of Online Booking Clients for University Books and Materials under the Background of Big Data
Tang The construction of intelligent professional discipline service platforms of libraries
Taylor et al. A new information architecture, website and services for the CMS experiment
Dong Design and Implementation of Internet-oriented News Management System

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
WD01 Invention patent application deemed withdrawn after publication
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20200414