CN112905746A - System archive knowledge mining processing method based on knowledge graph technology - Google Patents
System archive knowledge mining processing method based on knowledge graph technology Download PDFInfo
- Publication number
- CN112905746A CN112905746A CN202110249513.6A CN202110249513A CN112905746A CN 112905746 A CN112905746 A CN 112905746A CN 202110249513 A CN202110249513 A CN 202110249513A CN 112905746 A CN112905746 A CN 112905746A
- Authority
- CN
- China
- Prior art keywords
- knowledge
- entity
- entities
- method based
- processing method
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000005516 engineering process Methods 0.000 title claims abstract description 17
- 238000005065 mining Methods 0.000 title claims abstract description 14
- 238000003672 processing method Methods 0.000 title claims abstract description 14
- 238000002372 labelling Methods 0.000 claims abstract description 24
- 238000000034 method Methods 0.000 claims abstract description 22
- 238000000605 extraction Methods 0.000 claims abstract description 21
- 230000004927 fusion Effects 0.000 claims abstract description 17
- 238000012549 training Methods 0.000 claims abstract description 8
- 238000012550 audit Methods 0.000 claims abstract description 4
- 238000013136 deep learning model Methods 0.000 claims abstract description 4
- 238000012360 testing method Methods 0.000 claims abstract description 4
- 238000001914 filtration Methods 0.000 claims description 3
- 238000012163 sequencing technique Methods 0.000 claims description 3
- 238000010276 construction Methods 0.000 abstract description 5
- 230000008569 process Effects 0.000 description 5
- 230000009471 action Effects 0.000 description 3
- 238000010586 diagram Methods 0.000 description 2
- 238000007726 management method Methods 0.000 description 2
- 238000004519 manufacturing process Methods 0.000 description 2
- 238000009825 accumulation Methods 0.000 description 1
- 238000004458 analytical method Methods 0.000 description 1
- 238000013473 artificial intelligence Methods 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 230000002457 bidirectional effect Effects 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 230000008676 import Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000007639 printing Methods 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/33—Querying
- G06F16/332—Query formulation
- G06F16/3329—Natural language query formulation or dialogue systems
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/33—Querying
- G06F16/3331—Query processing
- G06F16/334—Query execution
- G06F16/3344—Query execution using natural language analysis
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/35—Clustering; Classification
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/36—Creation of semantic tools, e.g. ontology or thesauri
- G06F16/367—Ontology
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/279—Recognition of textual entities
- G06F40/289—Phrasal analysis, e.g. finite state techniques or chunking
- G06F40/295—Named entity recognition
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/30—Semantic analysis
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/044—Recurrent networks, e.g. Hopfield networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Data Mining & Analysis (AREA)
- Artificial Intelligence (AREA)
- Mathematical Physics (AREA)
- General Health & Medical Sciences (AREA)
- Databases & Information Systems (AREA)
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Computing Systems (AREA)
- Biomedical Technology (AREA)
- Biophysics (AREA)
- Software Systems (AREA)
- Evolutionary Computation (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Molecular Biology (AREA)
- Human Computer Interaction (AREA)
- Animal Behavior & Ethology (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
The invention discloses a system archive knowledge mining and processing method based on knowledge graph technology, belonging to the technical field of knowledge graph construction and comprising the following steps of: setting a basic mode of a knowledge base and constructing the knowledge base; labeling data of system documents in an original document set by a method based on an entity and a relation labeling platform, and converting platform text labeling data into text sequence labeling data; taking the text sequence marking data as input, training and testing a deep learning model, and generating an extraction model of the relation between the marking and the entity; extracting incremental system documents based on an extraction model to serve as a pre-labeling result; performing knowledge fusion on a large number of related system names existing in the extracted entity system and the abstract; performing knowledge audit on the entity after knowledge fusion and storing the entity in a knowledge base; the association between the systems, the systems and the sub-systems, and the systems and the clauses remarkably improves the utilization efficiency and the value of the knowledge.
Description
Technical Field
The invention belongs to the technical field of knowledge graph construction, and relates to a system archive knowledge mining processing method based on knowledge graph technology.
Background
In the artificial intelligence era, enterprise system archives become increasingly important recessive assets of enterprises, and are the largest and most complete data resource pool formed by the enterprises in development. How to exert the knowledge value of system documents is the core of the management of a new generation of intelligent enterprises.
For a huge enterprise, after the gradual informatization process of the past decades, the data of a large number of system documents accumulated shows the following three characteristics:
firstly, the method comprises the following steps: the system file amount is large, the digitization degree is high, a large number of electronic documents exist, and the document amount is continuously enlarged along with the continuous enlargement of the service; II, secondly: the knowledge value contained in the document is high, not only the document value of the file itself is high, but also the knowledge value density contained in a single file is very high, for example, the XX production specification of an enterprise contains a large number of normalized knowledge points of the production flow, and the points are scattered in each document and lack sufficient correlation; thirdly, the method comprises the following steps: in the search scenario, because of the close knowledge association between documents (for example, a document at a lower level is made according to a document at an upper level), the conventional knowledge search technology only finds out the document and knowledge related to the text from the search matching perspective, and cannot search out the associated knowledge of the document and the document.
In summary, the construction and retrieval difficulties of the enterprise document knowledge base include: system document data is large in quantity and continuously enlarged in scale, and a document knowledge base needs to support large-scale document import and horizontal expansion; knowledge in system documents needs to be structured, and the establishment of association between the knowledge needs to establish a closed-loop construction mode of manual labeling, model training, pre-labeling and manual auditing, so that the construction efficiency is improved, knowledge retrieval supports knowledge association, expansion and traceability, the knowledge associated to business related documents is supported when the retrieval is supported, the knowledge expansion based on a knowledge base is supported, and the knowledge traceability viewing from the knowledge to source documents is supported.
Disclosure of Invention
The invention aims to: the system provides a professional system archive question-answering robot system based on a semantic analysis technology, and solves the problem that the conventional knowledge retrieval technology can only find out documents and knowledge related to texts from the search matching perspective and can not retrieve the associated knowledge of the documents.
The technical scheme adopted by the invention is as follows:
a system archive knowledge mining processing method based on knowledge graph technology comprises the following steps:
setting a basic mode of a knowledge base and constructing the knowledge base;
labeling data of system documents in an original document set by a method based on an entity and a relation labeling platform, and converting platform text labeling data into text sequence labeling data;
taking the text sequence marking data as input, training and testing a deep learning model, and generating an extraction model of the relation between the marking and the entity;
extracting incremental system documents based on an extraction model to serve as a pre-labeling result;
performing knowledge fusion on a large number of related system names existing in the extracted entity system and the abstract;
and performing knowledge audit on the entity after knowledge fusion and storing the entity in a knowledge base.
Further, the basic schema of the knowledge base is: and taking the system document as a standard document, analyzing the clauses, the sub-systems and the units of the text from the system document, and analyzing the clauses from the sub-systems.
Furthermore, the entity extraction algorithm of the extraction model adopts a Bi-LSTM + CRF model for model training, and the relation extraction algorithm adopts a Simple Bert model.
Further, the Bi-LSTM + CRF model masks the main entity and the guest entity in the sentence by using special characters.
Further, knowledge fusion comprises the following steps:
searching a plurality of text related entities for the extracted entity names by a full text search method;
respectively extracting entity attribute characteristics, entity name text characteristics and relationship characteristics from a plurality of candidate entities and target entities, inputting the entity attribute characteristics, the entity name text characteristics and the relationship characteristics into a binary model for judgment, and outputting fusion probability as a judgment basis for judging whether fusion is performed or not;
the relationship features comprise first-degree relationship features, first-degree entity features and second-degree relationship features of the entities.
Further, the relationship among the systems comprises abolishing, basis, mentioning and correlation, the abolishing and basis is extracted by a keyword triggering and pattern matching method, and the specific steps are as follows:
defining a system relation mode;
positioning sentences in the system making abstract according to the key trigger words, performing entity extraction on the sentences, and extracting a plurality of system names;
and extracting system relationship pairs according to a system relationship mode.
Further, for the query sentence input by the user, the knowledge association retrieval of the system comprises the following steps:
carrying out entity link on a query sentence of a user and a knowledge base, and finding out an entity set A which can be hit by the query sentence;
respectively carrying out first-degree and second-degree exploration on the entities in the set A in a knowledge base, and calculating the weight of the entities after the first-degree and second-degree exploration;
and performing weight sequencing on all the searched candidate entities, filtering out entities hit by entity links, and putting back the entities as an associated recommended entity set.
In summary, due to the adoption of the technical scheme, the invention has the beneficial effects that:
the system archive knowledge mining and processing method based on the knowledge graph technology is characterized in that a knowledge graph is used as a knowledge base to model system documents of enterprises, and system-system associations, system-sub-systems associations and system-clauses associations are performed, so that the utilization efficiency and the value of knowledge are obviously improved; using graph exploration, relationships between regimes and overall context between regime terms can be queried quickly.
Drawings
In order to more clearly illustrate the technical solutions of the embodiments of the present invention, the drawings that are required to be used in the embodiments will be briefly described below, it should be understood that the following drawings only illustrate some embodiments of the present invention and therefore should not be considered as limiting the scope, and that for those skilled in the art, other relevant drawings can be obtained according to the drawings without inventive effort, wherein:
FIG. 1 is a schematic flow diagram of the present invention;
FIG. 2 is a schematic diagram of the basic schema of the knowledge base of the present invention.
Detailed Description
In order to make the objects, technical solutions and advantages of the present invention more apparent, the present invention is described in further detail below with reference to the accompanying drawings and embodiments. It should be understood that the detailed description and specific examples, while indicating the preferred embodiment of the invention, are intended for purposes of illustration only and are not intended to limit the scope of the invention. The components of embodiments of the present invention generally described and illustrated in the figures herein may be arranged and designed in a wide variety of different configurations.
Thus, the following detailed description of the embodiments of the present invention, presented in the figures, is not intended to limit the scope of the invention, as claimed, but is merely representative of selected embodiments of the invention. All other embodiments, which can be derived by a person skilled in the art from the embodiments of the present invention without making any creative effort, shall fall within the protection scope of the present invention.
It is noted that relational terms such as "first" and "second," and the like, may be used solely to distinguish one entity or action from another entity or action without necessarily requiring or implying any actual such relationship or order between such entities or actions. Also, the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus. Without further limitation, an element defined by the phrase "comprising an … …" does not exclude the presence of other identical elements in a process, method, article, or apparatus that comprises the element.
Examples
As shown in fig. 1, a system archive knowledge mining processing method based on the knowledge graph technology according to a preferred embodiment of the present invention includes the following steps:
setting a basic mode of a knowledge base and constructing the knowledge base;
specifically, as shown in fig. 2, the basic schema of the knowledge base is: and taking the system document as a standard document, analyzing the clauses, the sub-systems and the units of the text from the system document, and analyzing the clauses from the sub-systems.
In this embodiment, for example, the "reimbursement management system" is used as an institutional entity, and the "reimbursement system for business trips" in the institutional system may be used as a sub-system. The "clause" entity type is used as the most detailed knowledge expression form, such as "one-line city reimbursement process" can be used as a clause, and the clause includes attributes such as "applicable personnel", "applicable standard", and the like.
Labeling data of system documents in an original document set by a method based on an entity and a relation labeling platform, and converting platform text labeling data into text sequence labeling data;
taking the text sequence marking data as input, training and testing a deep learning model, and generating an extraction model of the relation between the marking and the entity;
preferably, the entity extraction algorithm of the extraction model adopts a Bi-LSTM + CRF model for model training, and the relation extraction algorithm adopts a Simple Bert model; the Bi-LSTM + CRF model masks the host and guest entities in the sentence using special characters.
In this embodiment, the input form of the Simple Bert model is as follows: [ CLS ] sentence [ SEP ] host entity [ SEP ] guest entity [ SEP ]. To prevent overfitting, the host and guest entities in the sentence are masked with special characters, e.g., [ S-PER ] for a host entity of type person name and [ O-LOC ] for a guest entity of type place name. The whole sequence is coded by a Simple Bert model, the obtained hidden vector at each position is spliced with the coding vector of the relative position of the hidden vector and the coding vector of the host entity and the guest entity in the sentence, then the spliced hidden vector is input into a bidirectional LSTM layer, the hidden state at the last moment in each direction is taken and then spliced, and finally, the relation type prediction is realized for a feedforward network layer.
Extracting incremental system documents based on an extraction model to serve as a pre-labeling result; it should be noted that the result of the pre-labeling is used as the continuous accumulation of the labeling data, and in turn, an extraction model with better effect can be trained.
Performing knowledge fusion on a large number of related system names existing in the extracted entity system and the abstract;
specifically, knowledge fusion comprises the following steps:
searching a plurality of text related entities for the extracted entity names by a full text search method;
respectively extracting entity attribute characteristics, entity name text characteristics and relationship characteristics from a plurality of candidate entities and target entities, inputting the entity attribute characteristics, the entity name text characteristics and the relationship characteristics into a binary model for judgment, and outputting fusion probability as a judgment basis for judging whether fusion is performed or not;
the relationship features comprise first-degree relationship features, first-degree entity features and second-degree relationship features of the entities.
And performing knowledge audit on the entity after knowledge fusion and storing the entity in a knowledge base.
The system comprises a system and a method, wherein the system comprises a system and a system, the system comprises a system and a method, the system comprises the following steps:
defining a system relation mode; in practice, the relationship mode may be set as: [ make < B-system > according to | follow ] < A-system >; printing < A system > original < B system > abolishes, etc.
Positioning sentences in the system making abstract according to the key trigger words, performing entity extraction on the sentences, and extracting a plurality of system names;
and extracting system relationship pairs according to a system relationship mode.
In addition, for the query sentence input by the user, the knowledge correlation retrieval step of the system is as follows:
carrying out entity link on a query sentence of a user and a knowledge base, and finding out an entity set A which can be hit by the query sentence;
respectively carrying out first-degree and second-degree exploration on the entities in the set A in a knowledge base, and calculating the weight of the entities after the first-degree and second-degree exploration;
and performing weight sequencing on all the searched candidate entities, filtering out entities hit by entity links, and putting back the entities as an associated recommended entity set.
The above description is only for the purpose of illustrating the preferred embodiments of the present invention and should not be taken as limiting the scope of the present invention, and any modifications, equivalents and improvements made by those skilled in the art within the spirit and principle of the present invention should be included in the scope of the present invention.
Claims (7)
1. A system archive knowledge mining processing method based on knowledge graph technology is characterized in that: the method comprises the following steps:
setting a basic mode of a knowledge base and constructing the knowledge base;
labeling data of system documents in an original document set by a method based on an entity and a relation labeling platform, and converting platform text labeling data into text sequence labeling data;
taking the text sequence marking data as input, training and testing a deep learning model, and generating an extraction model of the relation between the marking and the entity;
extracting incremental system documents based on an extraction model to serve as a pre-labeling result;
performing knowledge fusion on a large number of related system names existing in the extracted entity system and the abstract;
and performing knowledge audit on the entity after knowledge fusion and storing the entity in a knowledge base.
2. The system archive knowledge mining processing method based on the knowledge graph technology as claimed in claim 1, characterized in that: the basic modes of the knowledge base are as follows: and taking the system document as a standard document, analyzing the clauses, the sub-systems and the units of the text from the system document, and analyzing the clauses from the sub-systems.
3. The system archive knowledge mining processing method based on the knowledge graph technology as claimed in claim 1, characterized in that: the entity extraction algorithm of the extraction model adopts a Bi-LSTM + CRF model for model training, and the relation extraction algorithm adopts a Simple Bert model.
4. The system archive knowledge mining processing method based on the knowledge graph technology as claimed in claim 3, wherein: the Bi-LSTM + CRF model masks the host and guest entities in the sentence using special characters.
5. The system archive knowledge mining processing method based on the knowledge graph technology as claimed in claim 1, characterized in that: the knowledge fusion comprises the following steps:
searching a plurality of text related entities for the extracted entity names by a full text search method;
respectively extracting entity attribute characteristics, entity name text characteristics and relationship characteristics from a plurality of candidate entities and target entities, inputting the entity attribute characteristics, the entity name text characteristics and the relationship characteristics into a binary model for judgment, and outputting fusion probability as a judgment basis for judging whether fusion is performed or not;
the relationship features comprise first-degree relationship features, first-degree entity features and second-degree relationship features of the entities.
6. The system archive knowledge mining processing method based on the knowledge graph technology as claimed in claim 1, characterized in that: the relationship among the systems comprises abolishing, basis, mentioning and correlation, the abolishing and basis are extracted by a keyword triggering and pattern matching method, and the specific steps are as follows:
defining a system relation mode;
positioning sentences in the system making abstract according to the key trigger words, performing entity extraction on the sentences, and extracting a plurality of system names;
and extracting system relationship pairs according to a system relationship mode.
7. The system archive knowledge mining processing method based on the knowledge graph technology as claimed in claim 1, characterized in that: for the query sentence input by the user, the knowledge association retrieval of the system comprises the following steps:
carrying out entity link on a query sentence of a user and a knowledge base, and finding out an entity set A which can be hit by the query sentence;
respectively carrying out first-degree and second-degree exploration on the entities in the set A in a knowledge base, and calculating the weight of the entities after the first-degree and second-degree exploration;
and performing weight sequencing on all the searched candidate entities, filtering out entities hit by entity links, and putting back the entities as an associated recommended entity set.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110249513.6A CN112905746A (en) | 2021-03-08 | 2021-03-08 | System archive knowledge mining processing method based on knowledge graph technology |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110249513.6A CN112905746A (en) | 2021-03-08 | 2021-03-08 | System archive knowledge mining processing method based on knowledge graph technology |
Publications (1)
Publication Number | Publication Date |
---|---|
CN112905746A true CN112905746A (en) | 2021-06-04 |
Family
ID=76107936
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202110249513.6A Pending CN112905746A (en) | 2021-03-08 | 2021-03-08 | System archive knowledge mining processing method based on knowledge graph technology |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN112905746A (en) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113886606A (en) * | 2021-12-08 | 2022-01-04 | 北京海致星图科技有限公司 | Data annotation method, device, medium and equipment based on knowledge graph |
CN117668259A (en) * | 2024-02-01 | 2024-03-08 | 华安证券股份有限公司 | Knowledge-graph-based inside and outside data linkage analysis method and device |
Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110826316A (en) * | 2019-11-06 | 2020-02-21 | 北京交通大学 | Method for identifying sensitive information applied to referee document |
CN111428053A (en) * | 2020-03-30 | 2020-07-17 | 西安交通大学 | Tax field knowledge graph construction method |
CN111753099A (en) * | 2020-06-28 | 2020-10-09 | 中国农业科学院农业信息研究所 | Method and system for enhancing file entity association degree based on knowledge graph |
CN111832293A (en) * | 2020-06-24 | 2020-10-27 | 四川大学 | Entity and relation combined extraction method based on head entity prediction |
CN112037920A (en) * | 2020-08-31 | 2020-12-04 | 康键信息技术(深圳)有限公司 | Medical knowledge map construction method, device, equipment and storage medium |
CN112307171A (en) * | 2020-10-30 | 2021-02-02 | 中国电力科学研究院有限公司 | Institutional standard retrieval method and system based on power knowledge base and readable storage medium |
CN112417888A (en) * | 2020-11-26 | 2021-02-26 | 江苏网谱数据科技有限公司 | Method for analyzing sparse semantic relationship by combining BilSTM-CRF algorithm and R-BERT algorithm |
-
2021
- 2021-03-08 CN CN202110249513.6A patent/CN112905746A/en active Pending
Patent Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110826316A (en) * | 2019-11-06 | 2020-02-21 | 北京交通大学 | Method for identifying sensitive information applied to referee document |
CN111428053A (en) * | 2020-03-30 | 2020-07-17 | 西安交通大学 | Tax field knowledge graph construction method |
CN111832293A (en) * | 2020-06-24 | 2020-10-27 | 四川大学 | Entity and relation combined extraction method based on head entity prediction |
CN111753099A (en) * | 2020-06-28 | 2020-10-09 | 中国农业科学院农业信息研究所 | Method and system for enhancing file entity association degree based on knowledge graph |
CN112037920A (en) * | 2020-08-31 | 2020-12-04 | 康键信息技术(深圳)有限公司 | Medical knowledge map construction method, device, equipment and storage medium |
CN112307171A (en) * | 2020-10-30 | 2021-02-02 | 中国电力科学研究院有限公司 | Institutional standard retrieval method and system based on power knowledge base and readable storage medium |
CN112417888A (en) * | 2020-11-26 | 2021-02-26 | 江苏网谱数据科技有限公司 | Method for analyzing sparse semantic relationship by combining BilSTM-CRF algorithm and R-BERT algorithm |
Non-Patent Citations (2)
Title |
---|
深度预习: "simple bert modal 用于短文本关系抽取", 《HTTPS://WWW.CNBLOGS.COM/CHENYUSHENG0803/P/12592775.HTML》 * |
高玲玲 等: "利用人工智能技术开展企业内部规章制度审计", 《审计文摘》 * |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113886606A (en) * | 2021-12-08 | 2022-01-04 | 北京海致星图科技有限公司 | Data annotation method, device, medium and equipment based on knowledge graph |
CN117668259A (en) * | 2024-02-01 | 2024-03-08 | 华安证券股份有限公司 | Knowledge-graph-based inside and outside data linkage analysis method and device |
CN117668259B (en) * | 2024-02-01 | 2024-04-26 | 华安证券股份有限公司 | Knowledge-graph-based inside and outside data linkage analysis method and device |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN105468605B (en) | Entity information map generation method and device | |
CN104216913B (en) | Question answering method, system and computer-readable medium | |
CN109493265A (en) | A kind of Policy Interpretation method and Policy Interpretation system based on deep learning | |
CN110298033A (en) | Keyword corpus labeling trains extracting tool | |
CN106951558B (en) | Data processing method of tax intelligent consultation platform based on deep search | |
Sarawagi et al. | Open-domain quantity queries on web tables: annotation, response, and consensus models | |
CN103678412A (en) | Document retrieval method and device | |
CN111339318B (en) | University computer basic knowledge graph construction method based on deep learning | |
CN111626568B (en) | Knowledge base construction method and knowledge search method and system in natural disaster field | |
CN112417100A (en) | Knowledge graph in Liaodai historical culture field and construction method of intelligent question-answering system thereof | |
JP2023519049A (en) | Method and apparatus for obtaining POI status information | |
CN112905746A (en) | System archive knowledge mining processing method based on knowledge graph technology | |
CN113918725A (en) | Construction method of knowledge graph in water affairs field | |
CN114090861A (en) | Education field search engine construction method based on knowledge graph | |
Humbel et al. | Named-entity recognition for early modern textual documents: a review of capabilities and challenges with strategies for the future | |
CN112597768B (en) | Text auditing method, device, electronic equipment, storage medium and program product | |
CN113434789B (en) | Search sorting method based on multi-dimensional text features and related equipment | |
Geiß et al. | With a little help from my neighbors: person name linking using the Wikipedia social network | |
Fatemi et al. | Record linkage to match customer names: A probabilistic approach | |
Gupta et al. | Document summarisation based on sentence ranking using vector space model | |
CN113536772A (en) | Text processing method, device, equipment and storage medium | |
ElGindy et al. | Capturing place semantics on the geosocial web | |
Qiu et al. | BusinessDetect: an advanced business information mining application for intelligent marketing | |
CN113536133B (en) | Internet data processing method based on single public opinion event | |
Kleb et al. | Ontology based entity disambiguation with natural language patterns |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20210604 |
|
RJ01 | Rejection of invention patent application after publication |