WO2023022655A3 - 知识图谱构建方法、装置、存储介质及电子设备 - Google Patents
知识图谱构建方法、装置、存储介质及电子设备 Download PDFInfo
- Publication number
- WO2023022655A3 WO2023022655A3 PCT/SG2022/050578 SG2022050578W WO2023022655A3 WO 2023022655 A3 WO2023022655 A3 WO 2023022655A3 SG 2022050578 W SG2022050578 W SG 2022050578W WO 2023022655 A3 WO2023022655 A3 WO 2023022655A3
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- knowledge map
- entity
- construction method
- electronic device
- storage medium
- Prior art date
Links
- 238000010276 construction Methods 0.000 title abstract 3
- 239000003607 modifier Substances 0.000 abstract 2
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computing arrangements using knowledge-based models
- G06N5/02—Knowledge representation; Symbolic representation
- G06N5/022—Knowledge engineering; Knowledge acquisition
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computing arrangements using knowledge-based models
- G06N5/02—Knowledge representation; Symbolic representation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/36—Creation of semantic tools, e.g. ontology or thesauri
- G06F16/367—Ontology
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/901—Indexing; Data structures therefor; Storage structures
- G06F16/9024—Graphs; Linked lists
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/279—Recognition of textual entities
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/28—Databases characterised by their database models, e.g. relational or object models
- G06F16/284—Relational databases
- G06F16/288—Entity relationship models
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02D—CLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
- Y02D10/00—Energy efficient computing, e.g. low power processors, power management or thermal management
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- General Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- Data Mining & Analysis (AREA)
- Software Systems (AREA)
- Artificial Intelligence (AREA)
- Evolutionary Computation (AREA)
- Computing Systems (AREA)
- Mathematical Physics (AREA)
- Databases & Information Systems (AREA)
- Life Sciences & Earth Sciences (AREA)
- Animal Behavior & Ethology (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- General Health & Medical Sciences (AREA)
- Machine Translation (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
说明书摘要本公开涉及一种知识图谱构建方法、装置、存储介质及电子设备。该知识图谱构建方法包括:从目标网页的标题文本中识别出实体概念,并从所述目标网页的主体文本中识别出对应所述实体概念的至少一个实体;根据所述标题文本所属语种的语法分析规则,构建所述标题文本的语法分析树,并从所述语法分析树中确定用于修饰所述实体概念的修饰词;根据所述实体概念、所述修饰词以及所述至少一个实体生成知识图谱。采用本公开的这种方式,无需对目标网页进行结构化处理也能构建准确率和召回率高的知识图谱。
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US18/397,227 US20240135196A1 (en) | 2021-08-16 | 2023-12-27 | Method and apparatus for knowledge graph construction, storage medium, and electronic device |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110939279.X | 2021-08-16 | ||
CN202110939279.XA CN113609309B (zh) | 2021-08-16 | 2021-08-16 | 知识图谱构建方法、装置、存储介质及电子设备 |
Related Child Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US18/397,227 Continuation US20240135196A1 (en) | 2021-08-16 | 2023-12-27 | Method and apparatus for knowledge graph construction, storage medium, and electronic device |
Publications (2)
Publication Number | Publication Date |
---|---|
WO2023022655A2 WO2023022655A2 (zh) | 2023-02-23 |
WO2023022655A3 true WO2023022655A3 (zh) | 2023-04-13 |
Family
ID=78308687
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/SG2022/050578 WO2023022655A2 (zh) | 2021-08-16 | 2022-08-15 | 知识图谱构建方法、装置、存储介质及电子设备 |
Country Status (3)
Country | Link |
---|---|
US (1) | US20240135196A1 (zh) |
CN (1) | CN113609309B (zh) |
WO (1) | WO2023022655A2 (zh) |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106156365A (zh) * | 2016-08-03 | 2016-11-23 | 北京智能管家科技有限公司 | 一种知识图谱的生成方法及装置 |
CN106484767A (zh) * | 2016-09-08 | 2017-03-08 | 中国科学院信息工程研究所 | 一种跨媒体的事件抽取方法 |
CN109033160A (zh) * | 2018-06-15 | 2018-12-18 | 东南大学 | 一种知识图谱动态更新方法 |
CN111177591A (zh) * | 2019-12-10 | 2020-05-19 | 浙江工业大学 | 面向可视化需求的基于知识图谱的Web数据优化方法 |
Family Cites Families (17)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8898583B2 (en) * | 2011-07-28 | 2014-11-25 | Kikin Inc. | Systems and methods for providing information regarding semantic entities included in a page of content |
CN104484379B (zh) * | 2014-12-09 | 2018-06-12 | 百度在线网络技术(北京)有限公司 | 确定音乐实体关系的方法和装置及查询处理方法和装置 |
CN105279277A (zh) * | 2015-11-12 | 2016-01-27 | 百度在线网络技术(北京)有限公司 | 知识数据的处理方法和装置 |
CN107169078A (zh) * | 2017-05-10 | 2017-09-15 | 京东方科技集团股份有限公司 | 中医药知识图谱及其建立方法以及计算机系统 |
CN107341215B (zh) * | 2017-06-07 | 2020-05-12 | 北京航空航天大学 | 一种基于分布式计算平台的多源垂直知识图谱分类集成查询系统 |
US11334692B2 (en) * | 2017-06-29 | 2022-05-17 | International Business Machines Corporation | Extracting a knowledge graph from program source code |
CN108376160B (zh) * | 2018-02-12 | 2022-02-18 | 北京大学 | 一种中文知识图谱构建方法和系统 |
CN108959433B (zh) * | 2018-06-11 | 2022-05-03 | 北京大学 | 一种从软件项目数据中提取知识图谱并问答的方法与系统 |
US10846288B2 (en) * | 2018-07-02 | 2020-11-24 | Babylon Partners Limited | Computer implemented method for extracting and reasoning with meaning from text |
CN109033358B (zh) * | 2018-07-26 | 2022-06-10 | 李辰洋 | 新闻聚合与智能实体关联的方法 |
CN109885698A (zh) * | 2019-02-13 | 2019-06-14 | 北京航空航天大学 | 一种知识图谱构建方法及装置、电子设备 |
CN110096599B (zh) * | 2019-04-30 | 2023-03-21 | 长沙知了信息科技有限公司 | 知识图谱的生成方法及装置 |
CN110263351A (zh) * | 2019-06-17 | 2019-09-20 | 深圳前海微众银行股份有限公司 | 一种网页的多语言翻译方法、装置及设备 |
CN111414489B (zh) * | 2020-03-25 | 2023-10-27 | 中金智汇科技有限责任公司 | 知识图谱构建方法、装置、电子设备及可读存储介质 |
CN111813950B (zh) * | 2020-05-20 | 2024-02-27 | 淮阴工学院 | 一种基于神经网络自适应寻优调参的建筑领域知识图谱构建方法 |
CN111723186A (zh) * | 2020-06-23 | 2020-09-29 | 宁波富万信息科技有限公司 | 用于对话系统的基于人工智能的知识图谱生成方法、电子设备 |
CN112507076A (zh) * | 2020-12-14 | 2021-03-16 | 英大传媒投资集团有限公司 | 一种语义分析搜索方法、装置及存储介质 |
-
2021
- 2021-08-16 CN CN202110939279.XA patent/CN113609309B/zh active Active
-
2022
- 2022-08-15 WO PCT/SG2022/050578 patent/WO2023022655A2/zh unknown
-
2023
- 2023-12-27 US US18/397,227 patent/US20240135196A1/en active Pending
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106156365A (zh) * | 2016-08-03 | 2016-11-23 | 北京智能管家科技有限公司 | 一种知识图谱的生成方法及装置 |
CN106484767A (zh) * | 2016-09-08 | 2017-03-08 | 中国科学院信息工程研究所 | 一种跨媒体的事件抽取方法 |
CN109033160A (zh) * | 2018-06-15 | 2018-12-18 | 东南大学 | 一种知识图谱动态更新方法 |
CN111177591A (zh) * | 2019-12-10 | 2020-05-19 | 浙江工业大学 | 面向可视化需求的基于知识图谱的Web数据优化方法 |
Non-Patent Citations (1)
Title |
---|
WANG PEILU, JIANG HAO, XU JINGFANG, ZHANG QI: "Knowledge Graph Construction and Applications for Web Search and Beyond", DATA INTELLIGENCE, vol. 1, no. 4, 1 November 2019 (2019-11-01), pages 333 - 349, XP093061027, DOI: 10.1162/dint_a_00019 * |
Also Published As
Publication number | Publication date |
---|---|
CN113609309A (zh) | 2021-11-05 |
US20240135196A1 (en) | 2024-04-25 |
CN113609309B (zh) | 2024-02-06 |
WO2023022655A2 (zh) | 2023-02-23 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN107430859B (zh) | 将输入映射到表单域 | |
EP3014608B1 (en) | Computer-implemented method, computer-readable medium and system for pronunciation learning | |
US10748528B2 (en) | Language model generating device, language model generating method, and recording medium | |
US20160343366A1 (en) | Speech synthesis model selection | |
EP4113354A3 (en) | Method and apparatus for generating pre-trained language model, electronic device and storage medium | |
CN111859994A (zh) | 机器翻译模型获取及文本翻译方法、装置及存储介质 | |
WO2013009578A3 (en) | Systems and methods for speech command processing | |
EP3832484A3 (en) | Semantics processing method, semantics processing apparatus, electronic device, and medium | |
SG11201900264PA (en) | Method and device of analysis based on model, and computer readable storage medium | |
AU2017408800A1 (en) | Method and system of mining information, electronic device and readable storable medium | |
MX2016005489A (es) | Metodo y aparato para determinar similitud y terminal. | |
US9099091B2 (en) | Method and apparatus of adaptive textual prediction of voice data | |
Zayats et al. | Multi-domain disfluency and repair detection. | |
CN103246641A (zh) | 一种文本语义信息分析系统和方法 | |
EP4348451A1 (en) | Determining topic labels for communication transcripts based on a trained generative summarization model | |
US10599782B2 (en) | Analytical optimization of translation and post editing | |
WO2023134447A9 (zh) | 数据处理的方法和相关设备 | |
JPWO2014065392A1 (ja) | 情報抽出システム、情報抽出方法および情報抽出用プログラム | |
Płaza et al. | Call transcription methodology for contact center systems | |
US10582046B2 (en) | Voice recognition-based dialing | |
WO2023022655A3 (zh) | 知识图谱构建方法、装置、存储介质及电子设备 | |
EP3825897A3 (en) | Method, apparatus, device, storage medium and program for outputting information | |
KR20170010978A (ko) | 통화 내용 패턴 분석을 통한 보이스 피싱 방지 방법 및 장치 | |
CN112148751B (zh) | 用于查询数据的方法和装置 | |
CN108205542A (zh) | 一种歌曲评论的分析方法和系统 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
NENP | Non-entry into the national phase |
Ref country code: DE |