WO2023022655A3 - 知识图谱构建方法、装置、存储介质及电子设备 - Google Patents

知识图谱构建方法、装置、存储介质及电子设备 Download PDF

Info

Publication number
WO2023022655A3
WO2023022655A3 PCT/SG2022/050578 SG2022050578W WO2023022655A3 WO 2023022655 A3 WO2023022655 A3 WO 2023022655A3 SG 2022050578 W SG2022050578 W SG 2022050578W WO 2023022655 A3 WO2023022655 A3 WO 2023022655A3
Authority
WO
WIPO (PCT)
Prior art keywords
knowledge map
entity
construction method
electronic device
storage medium
Prior art date
Application number
PCT/SG2022/050578
Other languages
English (en)
French (fr)
Other versions
WO2023022655A2 (zh
Inventor
熊泓宇
汪罕
高远
冯一琦
刘宾
Original Assignee
脸萌有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 脸萌有限公司 filed Critical 脸萌有限公司
Publication of WO2023022655A2 publication Critical patent/WO2023022655A2/zh
Publication of WO2023022655A3 publication Critical patent/WO2023022655A3/zh
Priority to US18/397,227 priority Critical patent/US20240135196A1/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N5/00Computing arrangements using knowledge-based models
    • G06N5/02Knowledge representation; Symbolic representation
    • G06N5/022Knowledge engineering; Knowledge acquisition
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N5/00Computing arrangements using knowledge-based models
    • G06N5/02Knowledge representation; Symbolic representation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/36Creation of semantic tools, e.g. ontology or thesauri
    • G06F16/367Ontology
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/901Indexing; Data structures therefor; Storage structures
    • G06F16/9024Graphs; Linked lists
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/279Recognition of textual entities
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/28Databases characterised by their database models, e.g. relational or object models
    • G06F16/284Relational databases
    • G06F16/288Entity relationship models
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D10/00Energy efficient computing, e.g. low power processors, power management or thermal management

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • General Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Data Mining & Analysis (AREA)
  • Software Systems (AREA)
  • Artificial Intelligence (AREA)
  • Evolutionary Computation (AREA)
  • Computing Systems (AREA)
  • Mathematical Physics (AREA)
  • Databases & Information Systems (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Animal Behavior & Ethology (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • General Health & Medical Sciences (AREA)
  • Machine Translation (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

说明书摘要本公开涉及一种知识图谱构建方法、装置、存储介质及电子设备。该知识图谱构建方法包括:从目标网页的标题文本中识别出实体概念,并从所述目标网页的主体文本中识别出对应所述实体概念的至少一个实体;根据所述标题文本所属语种的语法分析规则,构建所述标题文本的语法分析树,并从所述语法分析树中确定用于修饰所述实体概念的修饰词;根据所述实体概念、所述修饰词以及所述至少一个实体生成知识图谱。采用本公开的这种方式,无需对目标网页进行结构化处理也能构建准确率和召回率高的知识图谱。
PCT/SG2022/050578 2021-08-16 2022-08-15 知识图谱构建方法、装置、存储介质及电子设备 WO2023022655A2 (zh)

Priority Applications (1)

Application Number Priority Date Filing Date Title
US18/397,227 US20240135196A1 (en) 2021-08-16 2023-12-27 Method and apparatus for knowledge graph construction, storage medium, and electronic device

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202110939279.X 2021-08-16
CN202110939279.XA CN113609309B (zh) 2021-08-16 2021-08-16 知识图谱构建方法、装置、存储介质及电子设备

Related Child Applications (1)

Application Number Title Priority Date Filing Date
US18/397,227 Continuation US20240135196A1 (en) 2021-08-16 2023-12-27 Method and apparatus for knowledge graph construction, storage medium, and electronic device

Publications (2)

Publication Number Publication Date
WO2023022655A2 WO2023022655A2 (zh) 2023-02-23
WO2023022655A3 true WO2023022655A3 (zh) 2023-04-13

Family

ID=78308687

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/SG2022/050578 WO2023022655A2 (zh) 2021-08-16 2022-08-15 知识图谱构建方法、装置、存储介质及电子设备

Country Status (3)

Country Link
US (1) US20240135196A1 (zh)
CN (1) CN113609309B (zh)
WO (1) WO2023022655A2 (zh)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106156365A (zh) * 2016-08-03 2016-11-23 北京智能管家科技有限公司 一种知识图谱的生成方法及装置
CN106484767A (zh) * 2016-09-08 2017-03-08 中国科学院信息工程研究所 一种跨媒体的事件抽取方法
CN109033160A (zh) * 2018-06-15 2018-12-18 东南大学 一种知识图谱动态更新方法
CN111177591A (zh) * 2019-12-10 2020-05-19 浙江工业大学 面向可视化需求的基于知识图谱的Web数据优化方法

Family Cites Families (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8898583B2 (en) * 2011-07-28 2014-11-25 Kikin Inc. Systems and methods for providing information regarding semantic entities included in a page of content
CN104484379B (zh) * 2014-12-09 2018-06-12 百度在线网络技术(北京)有限公司 确定音乐实体关系的方法和装置及查询处理方法和装置
CN105279277A (zh) * 2015-11-12 2016-01-27 百度在线网络技术(北京)有限公司 知识数据的处理方法和装置
CN107169078A (zh) * 2017-05-10 2017-09-15 京东方科技集团股份有限公司 中医药知识图谱及其建立方法以及计算机系统
CN107341215B (zh) * 2017-06-07 2020-05-12 北京航空航天大学 一种基于分布式计算平台的多源垂直知识图谱分类集成查询系统
US11334692B2 (en) * 2017-06-29 2022-05-17 International Business Machines Corporation Extracting a knowledge graph from program source code
CN108376160B (zh) * 2018-02-12 2022-02-18 北京大学 一种中文知识图谱构建方法和系统
CN108959433B (zh) * 2018-06-11 2022-05-03 北京大学 一种从软件项目数据中提取知识图谱并问答的方法与系统
US10846288B2 (en) * 2018-07-02 2020-11-24 Babylon Partners Limited Computer implemented method for extracting and reasoning with meaning from text
CN109033358B (zh) * 2018-07-26 2022-06-10 李辰洋 新闻聚合与智能实体关联的方法
CN109885698A (zh) * 2019-02-13 2019-06-14 北京航空航天大学 一种知识图谱构建方法及装置、电子设备
CN110096599B (zh) * 2019-04-30 2023-03-21 长沙知了信息科技有限公司 知识图谱的生成方法及装置
CN110263351A (zh) * 2019-06-17 2019-09-20 深圳前海微众银行股份有限公司 一种网页的多语言翻译方法、装置及设备
CN111414489B (zh) * 2020-03-25 2023-10-27 中金智汇科技有限责任公司 知识图谱构建方法、装置、电子设备及可读存储介质
CN111813950B (zh) * 2020-05-20 2024-02-27 淮阴工学院 一种基于神经网络自适应寻优调参的建筑领域知识图谱构建方法
CN111723186A (zh) * 2020-06-23 2020-09-29 宁波富万信息科技有限公司 用于对话系统的基于人工智能的知识图谱生成方法、电子设备
CN112507076A (zh) * 2020-12-14 2021-03-16 英大传媒投资集团有限公司 一种语义分析搜索方法、装置及存储介质

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106156365A (zh) * 2016-08-03 2016-11-23 北京智能管家科技有限公司 一种知识图谱的生成方法及装置
CN106484767A (zh) * 2016-09-08 2017-03-08 中国科学院信息工程研究所 一种跨媒体的事件抽取方法
CN109033160A (zh) * 2018-06-15 2018-12-18 东南大学 一种知识图谱动态更新方法
CN111177591A (zh) * 2019-12-10 2020-05-19 浙江工业大学 面向可视化需求的基于知识图谱的Web数据优化方法

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
WANG PEILU, JIANG HAO, XU JINGFANG, ZHANG QI: "Knowledge Graph Construction and Applications for Web Search and Beyond", DATA INTELLIGENCE, vol. 1, no. 4, 1 November 2019 (2019-11-01), pages 333 - 349, XP093061027, DOI: 10.1162/dint_a_00019 *

Also Published As

Publication number Publication date
CN113609309A (zh) 2021-11-05
US20240135196A1 (en) 2024-04-25
CN113609309B (zh) 2024-02-06
WO2023022655A2 (zh) 2023-02-23

Similar Documents

Publication Publication Date Title
CN107430859B (zh) 将输入映射到表单域
EP3014608B1 (en) Computer-implemented method, computer-readable medium and system for pronunciation learning
US10748528B2 (en) Language model generating device, language model generating method, and recording medium
US20160343366A1 (en) Speech synthesis model selection
EP4113354A3 (en) Method and apparatus for generating pre-trained language model, electronic device and storage medium
CN111859994A (zh) 机器翻译模型获取及文本翻译方法、装置及存储介质
WO2013009578A3 (en) Systems and methods for speech command processing
EP3832484A3 (en) Semantics processing method, semantics processing apparatus, electronic device, and medium
SG11201900264PA (en) Method and device of analysis based on model, and computer readable storage medium
AU2017408800A1 (en) Method and system of mining information, electronic device and readable storable medium
MX2016005489A (es) Metodo y aparato para determinar similitud y terminal.
US9099091B2 (en) Method and apparatus of adaptive textual prediction of voice data
Zayats et al. Multi-domain disfluency and repair detection.
CN103246641A (zh) 一种文本语义信息分析系统和方法
EP4348451A1 (en) Determining topic labels for communication transcripts based on a trained generative summarization model
US10599782B2 (en) Analytical optimization of translation and post editing
WO2023134447A9 (zh) 数据处理的方法和相关设备
JPWO2014065392A1 (ja) 情報抽出システム、情報抽出方法および情報抽出用プログラム
Płaza et al. Call transcription methodology for contact center systems
US10582046B2 (en) Voice recognition-based dialing
WO2023022655A3 (zh) 知识图谱构建方法、装置、存储介质及电子设备
EP3825897A3 (en) Method, apparatus, device, storage medium and program for outputting information
KR20170010978A (ko) 통화 내용 패턴 분석을 통한 보이스 피싱 방지 방법 및 장치
CN112148751B (zh) 用于查询数据的方法和装置
CN108205542A (zh) 一种歌曲评论的分析方法和系统

Legal Events

Date Code Title Description
NENP Non-entry into the national phase

Ref country code: DE