CN105550190A - Knowledge graph-oriented cross-media retrieval system - Google Patents


Info

Publication number
CN105550190A
Authority
CN
China
Prior art keywords
semantic
media
data
knowledge
cross
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201510358374.5A
Other languages
Chinese (zh)
Other versions
CN105550190B (en)
Inventor
杨月华
张铃丽
平源
王亚
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Jiangsu Huchuan Technology Co ltd
Original Assignee
Xuchang University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Xuchang University
Priority to CN201510358374.5A
Publication of CN105550190A
Application granted
Publication of CN105550190B
Legal status: Active
Anticipated expiration

Classifications

    • G: PHYSICS
    • G06: COMPUTING; CALCULATING OR COUNTING
    • G06F: ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00: Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90: Details of database functions independent of the retrieved data types
    • G06F16/903: Querying

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

To meet the needs of cross-media semantic description and knowledge acquisition, and to make effective use of the cross-media attributes and associations covered by a knowledge graph, the invention proposes: a cross-media attribute perception model that perceives the natural and social attributes contained in cross-media data and analyzes their associations, establishing a unified description mechanism for cross-media data associations that expresses different types of association in a uniform, quantified way; a method for mapping the different forms of data covered by the knowledge graph into the same semantic space, so that semantic knowledge is expressed consistently; a method that, for query requests expressed in natural language, as multimedia examples, or as combinations of different media types, uses the associations covered by the knowledge graph to analyze the query semantically and understand the user's retrieval intent, thereby retrieving results that better match the user's needs; and a cross-media retrieval system architecture and implementation method that incorporates the knowledge graph.

Description

Knowledge graph-oriented cross-media retrieval system

Technical Field

The invention belongs to the field of information retrieval technology and specifically concerns a knowledge graph-oriented cross-media retrieval system. Introducing a knowledge graph into cross-media retrieval helps obtain contextual data along various dimensions, and further reasoning can even discover features under different contexts, so that the semantics of the user's query are better understood and the returned results better satisfy the user's needs.

Background

At present, the development and popularization of the global network have reached an unprecedented scale. People have become accustomed to looking up all kinds of information on the Internet, and search engines have become the center of the Internet. China's major Internet companies are sparing no effort to improve their search engines, and the national "Core Electronic Devices, High-end Generic Chips and Basic Software" ("He-Gao-Ji") major science and technology project has listed "new-generation search engines and browsers" as an important development direction supported during the Twelfth Five-Year Plan period. However, information on the Internet is growing exponentially and comes in many types. Information in different media forms is linked by intricate associations, and these cross-links give Internet data a cross-media character, which places higher demands on Internet information analysis and retrieval. Introducing a knowledge graph into a cross-media retrieval system helps obtain contextual data along various dimensions, better supports users in expressing retrieval intent through natural language, multimedia examples, or combinations of different media types, and allows further reasoning to discover features under different contexts, enabling more accurate semantic analysis of user queries and more accurate retrieval. The invention therefore presents an implementation scheme for a cross-media retrieval system from the perspective of the knowledge graph.

The knowledge graph was developed by Google after its 2010 acquisition of the open database company Metaweb. At the time, Metaweb focused mainly on linking different textual expressions to the same entity and on exploring the attributes of these entities (for example, a celebrity's age) and the connections among them, ultimately offering a new form of search. Although it cannot fully replace keyword search, Metaweb's indexing and search methods are more efficient at handling natural-language queries. Likewise, in cross-media retrieval, a knowledge graph makes it possible to better understand the user's query request and to summarize content semantically related to the query, finding more accurate and more in-depth information for the user. A knowledge graph also helps users understand the relationships between things. When a user expresses a query in natural language, as multimedia examples, or as a combination of different media types, the query may carry multiple meanings; the knowledge graph can understand the differences and narrow the results to the meaning the user most likely intends. Furthermore, because the knowledge graph builds a complete knowledge system related to the search results, integrating many disciplines and presenting the knowledge related to the query semantics systematically, the user may learn a new fact or a new connection during retrieval, which prompts a series of entirely new queries and gives the search greater depth and breadth. Introducing a knowledge graph into cross-media retrieval therefore plays an important role in improving retrieval performance.

The invention therefore takes the key technologies of knowledge graph-oriented cross-media retrieval as its subject and proposes a perception model for cross-media attributes with a unified quantified expression of multiple associations, a consistent expression of cross-media knowledge, a knowledge graph-based method for semantic analysis of user queries, and an implementation scheme for a knowledge graph-oriented cross-media retrieval system. In the field of information retrieval, judging from current developments in China and abroad, orientation toward knowledge graphs and cross-media has become an inevitable trend, so the invention has great practical value and broad application prospects.

Summary of the Invention

The object of the invention is to provide a cross-media information retrieval tool that introduces a knowledge graph into cross-media retrieval and performs semantic analysis and reasoning based on the cross-media semantic associations and knowledge covered by the knowledge graph, thereby realizing cross-media retrieval. Specifically, the invention comprises the following points.

(1) For the intricate cross-media data on the Internet, a cross-media attribute perception model is established, the associations it covers are analyzed, and a unified description mechanism for cross-media data associations is proposed. The natural and social attributes of cross-media data are obtained through text parsing, entity extraction, metadata analysis, semantic annotation, and user behavior analysis. The complex relationships between natural and social attributes in the cross-media data are then modeled. The modeling takes into account the content associations (same modality), semantic associations (different modalities), temporal associations, structural associations, and other associations that exist among cross-media data. Based on the links between the web pages on which multimedia objects appear, a probabilistic graphical model is used to model cross-media content and links probabilistically, so that different types of association are expressed in a uniform, quantified way.

(2) To meet the needs of cross-media semantic description and knowledge acquisition, a method is proposed for mapping data of different forms into the same semantic label space, achieving a semantically consistent expression. When heterogeneous, complementary media forms such as text and images jointly express one meaning, a mapping is learned that projects the heterogeneous modalities into a single semantic label space, so that similarity between heterogeneous data can be measured directly within one representation framework. An evaluation function based on semantic similarity, semantic coverage, and semantic discrimination is established to assess how suitable each semantic label is for selection. The semantic label information is used to train a classifier for each modality, and the classification results serve as shared features, so that data of different forms can also be mapped into the same semantic label space.

(3) A method is proposed for semantic analysis and reasoning over a query, using the associations covered by the knowledge graph, when the user expresses the query in natural language, as multimedia examples, or as a combination of different media types. The text and multimedia parts of the query are analyzed both separately and jointly, so that the user's intent is resolved at the semantic level. To this end, sufficient cross-media information is first collected from the Internet and a semantic model is built for each media type, so that cross-media data are described by features in the same semantic space. The semantic distributions of the image data and text data are then combined to analyze and identify the semantics of the user's query, and further associated semantics are mined with the help of the knowledge graph. Based on the semantic, temporal, and structural associations covered by the knowledge graph, contextual data along various dimensions related to the query are obtained, and features under different contexts are discovered through reasoning, yielding more complete query semantics.

(4) A cross-media retrieval system architecture and implementation method incorporating a knowledge graph is proposed. Besides the basic components of user query analysis, indexing, retrieval, and ranking, the system builds a knowledge graph knowledge base of a certain scale and integrates it into the system. The query analysis component accepts queries entered as natural language, cross-media examples, or data of different media types. During semantic analysis of a query, each media type entered by the user is analyzed separately, and the knowledge graph is then used for joint semantic analysis and further reasoning, so that contextual knowledge in the knowledge graph, such as time, place, entities, and their social relations, helps to better understand the user's intent. The cross-media hash indexing and ranking components mainly call existing algorithms.

Description of Drawings

Figure 1 shows cross-media attribute perception and association analysis;

Figure 2 shows knowledge graph-based semantic analysis of user queries;

Figure 3 shows the architecture of the knowledge graph-oriented cross-media retrieval system.

Detailed Description

To make the object, technical solution, and advantages of the invention clearer, the invention is described in further detail below with reference to the accompanying drawings.

1. Cross-media attribute perception and association analysis

Knowledge is increasingly disseminated in a cross-media manner. Knowledge and information about the same entity often come from multiple channels, are expressed jointly in several media forms, and carry a variety of natural and social attributes. To exploit the association knowledge contained in cross-media data and use it in cross-media retrieval, the construction of the knowledge graph must consider not only the semantic relations between entities but also the perception of the entities' cross-media attributes: a cross-media attribute perception model is established and association analysis is performed on it. To express different types of association in a uniform, quantified way, to predict latent associations effectively, and to let different associations reinforce one another, a unified description mechanism for cross-media data associations is established.

The entity-related information in question comes from multiple channels (including Weibo, WeChat, forums, news websites, and specialist websites), is expressed jointly in several media forms (text, audio, images, video), and carries a variety of natural attributes (time, place, people, appearance information, and so on) and social attributes (such as popularity, ratings, and preferences). For this information, the high-level semantics of non-text media are extracted using the complementary information in the accompanying text. The natural and social attributes of the cross-media data are then obtained through text parsing, entity extraction, metadata analysis, semantic annotation, and user behavior analysis. New data are classified by a set of support vector machine classifiers, so that objects of the same category are automatically extracted and recognized from noisy web image collections. Alternatively, real-world user attention is modeled by analyzing how network users forward cross-media data: the content and forwarding behavior in Weibo, WeChat, and social network data are analyzed to build a forwarding tree model, and frequent subtrees are used to discover regularities of repetition and tendency in user behavior, so that group attention can be tracked and predicted more accurately. Next, the complex relationships between natural and social attributes in the cross-media data are modeled. The modeling takes into account the content associations (same modality), semantic associations (different modalities), temporal associations, structural associations, and other associations that exist among cross-media data. Based on the links between the web pages on which multimedia objects appear, a probabilistic graphical model is used to model cross-media content and links probabilistically, so that different types of association are expressed in a uniform, quantified way and association prediction for cross-media data becomes possible, as shown in Figure 1.
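As a rough illustration of what a "uniform, quantified expression" of several association types could look like, the sketch below folds per-type association scores into a single link probability with a noisy-OR combination. The text specifies only that a probabilistic graphical model is used, not its form, so the noisy-OR rule, the function name `unified_association`, and the weight values are all assumptions made for this example.

```python
# Hypothetical reliability weights per association type; the source text
# gives no numeric values, so these are placeholders for illustration.
DEFAULT_WEIGHTS = {
    "content": 0.6,     # same-modality content association
    "semantic": 0.8,    # cross-modality semantic association
    "temporal": 0.4,    # temporal association
    "structural": 0.5,  # structural association (e.g. page links)
}


def unified_association(scores, weights=DEFAULT_WEIGHTS):
    """Combine per-type association scores in [0, 1] into one link probability.

    Noisy-OR (an assumed combination rule): the link is absent only if every
    association type independently fails to produce it.
    """
    p_no_link = 1.0
    for kind, score in scores.items():
        if not 0.0 <= score <= 1.0:
            raise ValueError(f"score for {kind!r} must lie in [0, 1]")
        p_no_link *= 1.0 - weights[kind] * score
    return 1.0 - p_no_link


# Two media objects linked by moderate content and semantic evidence.
strength = unified_association({"content": 0.5, "semantic": 0.5})
```

Any rule that maps heterogeneous evidence onto one comparable scale would serve the same purpose; noisy-OR is used here only because additional evidence can never lower the combined strength.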

2. Consistent expression of cross-media knowledge

Existing knowledge representations and knowledge base resources are still largely confined to a single modality and can no longer meet the needs of cross-media semantic description and knowledge acquisition. Therefore, once the constructed knowledge graph covers cross-media attribute knowledge and is to be used in cross-media retrieval, a method is proposed, based on an analysis of how semantic knowledge is expressed in single-modality data, for mapping data of different forms into the same semantic label space, achieving a semantically consistent expression. To extend from a single modality to cross-media knowledge representation, a method is proposed for computing and measuring the content of the various media types in the knowledge graph, which in principle maps the structural information of multiple media types uniformly into a common space for structural analysis, fusion, and reasoning.

Once sufficient cross-media attribute knowledge and associations have been obtained, a consistent representation mechanism for cross-media knowledge is established at different data granularities and knowledge levels so that the knowledge can be used in cross-media retrieval. When heterogeneous, complementary media forms such as text and images jointly express one meaning, a mapping is learned that projects the heterogeneous modalities into a shared subspace, so that similarity between heterogeneous data can be measured directly within one representation framework. For cross-media data that are related in content and semantics, a probabilistic generative model converts data of different media types into a unified latent variable space, and the distribution of the cross-media data over the latent variables serves as its semantic label. An evaluation function based on semantic similarity, semantic coverage, and semantic discrimination is established to assess how suitable each semantic label is for selection, and semantic groups are formed. Using the semantic label information of each semantic group, the same-modality data in different multimedia documents are extracted separately, a classifier is trained for each modality with the group's semantic labels, and the classification results serve as shared features, so that data of different forms can also be mapped into the same semantic label space, achieving a semantically consistent expression.
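The idea of using per-modality classifier outputs as shared features can be sketched as follows. Each modality gets its own classifier over one common label set, and the resulting label-probability vectors are compared directly. The label names, the linear-softmax classifiers, the hand-set weight matrices, and cosine similarity as the measure are all assumptions for illustration; in practice the weights would come from training on the semantic groups described above.

```python
import math

LABELS = ["sports", "music", "travel"]  # hypothetical shared label set


def softmax(scores):
    """Turn raw scores into a probability vector."""
    top = max(scores)
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def to_label_space(features, weights):
    """Linear classifier with one weight row per label: P(label | features)."""
    scores = [sum(w * x for w, x in zip(row, features)) for row in weights]
    return softmax(scores)


def cosine(a, b):
    """Cosine similarity between two vectors in the shared label space."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


# Toy weights standing in for trained classifiers (assumed, not learned).
TEXT_W = [[2.0, 0.0], [0.0, 2.0], [1.0, 1.0]]                  # 2-dim text features
IMAGE_W = [[1.5, 0.0, 0.0], [0.0, 1.5, 0.0], [0.0, 0.0, 1.5]]  # 3-dim image features

text_vec = to_label_space([1.0, 0.0], TEXT_W)            # a "sports"-like text
image_vec = to_label_space([1.0, 0.0, 0.0], IMAGE_W)     # a "sports"-like image
other_image = to_label_space([0.0, 1.0, 0.0], IMAGE_W)   # a "music"-like image

same_topic = cosine(text_vec, image_vec)
different_topic = cosine(text_vec, other_image)
```

Although the text and image features have different dimensions, both end up as vectors over `LABELS`, which is what makes the cross-modality comparison possible.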

The key to selecting a semantic label is computing its semantic relevance to the cross-media content, that is, the match between the semantic label and the semantic model. So that labels can be compared with semantic models directly, each semantic label is represented as a semantic distribution, and the KL distance (Kullback-Leibler divergence) is used to compute the semantic similarity between label and model. The semantic distribution {p(w|l)} of a semantic label l is estimated approximately from the cross-media dataset D as {p(w|l, D)}. The semantic similarity between the semantic label {p(w|l)} and the semantic model {p(w|θ)} can then be computed with the KL distance:

$$S(l, \theta) = -d(\theta \,\|\, l) = -\sum_{w} p(w \mid \theta) \log \frac{p(w \mid \theta)}{p(w \mid l)} \tag{1}$$
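A minimal sketch of the score in equation (1), with distributions stored as word-to-probability dictionaries. The function name `kl_score` and the small floor `eps` (used to avoid division by zero when a label assigns no mass to a word) are assumptions of this example; the source does not say how zero probabilities are handled.

```python
import math


def kl_score(theta, label_dist, eps=1e-12):
    """S(l, theta) = -KL(theta || l); 0 is a perfect match, lower is worse.

    theta and label_dist map words to probabilities. eps is an assumed floor
    for words the label distribution does not cover.
    """
    score = 0.0
    for word, p_theta in theta.items():
        if p_theta == 0.0:
            continue  # terms with zero model probability contribute nothing
        p_label = max(label_dist.get(word, 0.0), eps)
        score -= p_theta * math.log(p_theta / p_label)
    return score


theta = {"goal": 0.5, "match": 0.5}            # toy semantic model
close_label = {"goal": 0.5, "match": 0.5}      # identical distribution
distant_label = {"goal": 0.9, "match": 0.1}    # skewed distribution
```

Ranking candidate labels by `kl_score(theta, label)` in descending order puts the label whose distribution is closest to the semantic model first.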

To ensure that the semantic labels cover the semantic content of the cross-media data well, each newly selected semantic word should cover other parts of the semantics rather than content already covered by the words selected so far. The maximal marginal relevance method is adopted: maximizing the marginal relevance yields semantic words that are both highly relevant and mutually different:

$$\hat{l} = \arg\max_{l \in L - S} \left[ \lambda\, S(l, \theta) - (1 - \lambda) \max_{l' \in S} \mathrm{Sim}(l', l) \right] \tag{2}$$

$$\mathrm{Sim}(l', l) = -d(l' \,\|\, l) = -\sum_{w} p(w \mid l') \log \frac{p(w \mid l')}{p(w \mid l)} \tag{3}$$

Here, S is the set of semantic words already selected.
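The greedy selection implied by equations (2) and (3) can be sketched as below: at each step the candidate with the best trade-off between relevance to the semantic model and redundancy with the already selected words is added. The helper names, the dictionary representation, the `eps` floor, and the default `lam` are assumptions for illustration, and the redundancy term is taken over the already selected words.

```python
import math


def neg_kl(p, q, eps=1e-12):
    """-KL(p || q) over word->probability dicts (eps is an assumed floor)."""
    return -sum(
        pw * math.log(pw / max(q.get(w, 0.0), eps))
        for w, pw in p.items()
        if pw > 0.0
    )


def select_labels(candidates, theta, k, lam=0.7):
    """Greedily pick k labels by maximal marginal relevance.

    candidates maps label name -> distribution; theta is the semantic model.
    """
    selected = []
    remaining = dict(candidates)
    while remaining and len(selected) < k:

        def mmr(name):
            relevance = neg_kl(theta, remaining[name])  # S(l, theta)
            if not selected:
                return lam * relevance
            # Largest similarity to any already selected label: Sim(l', l).
            redundancy = max(
                neg_kl(candidates[s], remaining[name]) for s in selected
            )
            return lam * relevance - (1.0 - lam) * redundancy

        best = max(remaining, key=mmr)
        selected.append(best)
        del remaining[best]
    return selected


theta = {"x": 0.5, "y": 0.5}
candidates = {
    "A": {"x": 0.5, "y": 0.5},
    "A2": {"x": 0.5, "y": 0.5},  # exact duplicate of A
    "B": {"x": 0.8, "y": 0.2},   # less relevant but different
}
diverse = select_labels(candidates, theta, k=2, lam=0.3)
relevance_only = select_labels(candidates, theta, k=2, lam=1.0)
```

With a small `lam` the duplicate is penalized and the different label is preferred as the second pick; with `lam=1.0` the redundancy term vanishes and selection reduces to pure relevance ranking.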

In addition, when several semantic contents are labeled at once, a semantic word should not be highly relevant to more than one of them, so the distinction between different semantic contents, that is, semantic discrimination, must also be considered. In this case, a semantic similarity measure that accounts for discrimination is used:

$$S'(l, \theta_i) = S(l, \theta_i) - \alpha\, S(l, \theta_{-i}) \tag{4}$$

$$S(l, \theta_{-i}) = -d(\theta_{-i} \,\|\, l) \tag{5}$$

Here, $\theta_{-i}$ denotes the other k−1 semantic features besides $\theta_i$, that is, $\theta_{1,\ldots,i-1,i+1,\ldots,k}$, and k is the number of semantic features. The cross-feature semantic similarity is computed with $S'(l, \theta_i)$ and the candidates are ranked accordingly, so that semantic words that are semantically relevant and offer a degree of coverage and discrimination can be generated for multiple semantic contents.
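Equations (4) and (5) can be sketched as follows. The source does not state how the combined distribution for the other k−1 semantic features is formed, so this example assumes a uniform mixture of them; that choice, the helper names, the `eps` floor, and the default `alpha` are all assumptions.

```python
import math


def neg_kl(p, q, eps=1e-12):
    """-KL(p || q) over word->probability dicts (eps is an assumed floor)."""
    return -sum(
        pw * math.log(pw / max(q.get(w, 0.0), eps))
        for w, pw in p.items()
        if pw > 0.0
    )


def discriminative_scores(label_dist, thetas, alpha=0.5):
    """Return S'(l, theta_i) for every semantic feature theta_i.

    theta_{-i} is approximated by the uniform mixture of all other features
    (an assumption; the source leaves this unspecified).
    """
    scores = []
    for i, theta_i in enumerate(thetas):
        others = [t for j, t in enumerate(thetas) if j != i]
        own = neg_kl(theta_i, label_dist)  # S(l, theta_i)
        if others:
            vocab = set().union(*others)
            mixture = {
                w: sum(t.get(w, 0.0) for t in others) / len(others)
                for w in vocab
            }
            rest = neg_kl(mixture, label_dist)  # S(l, theta_{-i})
        else:
            rest = 0.0
        scores.append(own - alpha * rest)
    return scores


thetas = [
    {"goal": 0.9, "song": 0.1},  # a sports-like semantic feature
    {"goal": 0.1, "song": 0.9},  # a music-like semantic feature
]
label = {"goal": 0.9, "song": 0.1}  # candidate label distribution
scores = discriminative_scores(label, thetas)
```

A label that matches one semantic feature and is far from the rest receives a bonus for that feature, which is the intended effect of subtracting the α-weighted term.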

3. Knowledge graph-based semantic analysis of user queries

For the query content entered by the user, the text and multimedia parts must be analyzed both separately and jointly, so that the user's intent is resolved at the semantic level. First, sufficient cross-media information is collected from the Internet and a semantic model is built for each media type, as shown in Figure 2: a textual semantic model described by text words and a visual semantic model described by visual words. These two models are then used to convert both the text data and the image data in the document under analysis into the same semantic space, where they are described as semantic probability distributions. Semantic mapping between data of different media types is then achieved through semantic learning. To establish associations between data of different media types and to uncover the shared subspace among related heterogeneous media data, visual semantic learning is performed with text data for cross-media data that are semantically related, such as images, videos, and other visual data related to the text: the text semantics are described in the form of visual words and a mapping between text semantics and visual semantics is established, so that cross-media data are described by features in the same semantic space.

Once the semantic feature descriptions of the cross-media data are available, the semantic distributions of the image data and text data are combined to analyze and identify the semantics of the user's query, and further associated semantics are mined with the help of the knowledge graph. Based on the semantic, temporal, and structural associations covered by the knowledge graph, contextual data along various dimensions related to the query, such as time, place, entities, and their social relations, are obtained, and features under different contexts are discovered through reasoning, yielding more complete query semantics. Because the reasoning involves cross-media data, the cross-media content is first converted to text form and given a formal representation, using techniques such as image annotation and recognition of the actions of moving objects in video; reasoning is then carried out with text-based reasoning techniques. The conversion requires cross-media data to be processed at the semantic level, which can be done with the cross-media semantic models already built.
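One simple way to picture how a knowledge graph supplies contextual data for a query is a bounded traversal from the entities recognized in the query. The sketch below is only an illustration: the triples, relation names, the `expand_query` function, and the per-hop decay factor of 0.5 are invented for the example, and real reasoning over the graph would be considerably richer.

```python
from collections import defaultdict

# Toy (subject, relation, object) triples; all content is illustrative.
TRIPLES = [
    ("Eiffel Tower", "located_in", "Paris"),
    ("Eiffel Tower", "opened_in", "1889"),
    ("Paris", "capital_of", "France"),
    ("Louvre", "located_in", "Paris"),
]


def build_index(triples):
    """Index triples in both directions so traversal can follow either way."""
    index = defaultdict(list)
    for subject, relation, obj in triples:
        index[subject].append((relation, obj))
        index[obj].append((relation + "_inverse", subject))
    return index


def expand_query(entities, index, hops=1):
    """Collect context nodes within `hops` steps of the query entities.

    Returns node -> weight, where query entities get 1.0 and each extra hop
    halves the weight (an assumed decay, not taken from the source).
    """
    context = {entity: 1.0 for entity in entities}
    frontier = list(entities)
    for hop in range(1, hops + 1):
        next_frontier = []
        for node in frontier:
            for _relation, neighbor in index.get(node, []):
                if neighbor not in context:
                    context[neighbor] = 0.5 ** hop
                    next_frontier.append(neighbor)
        frontier = next_frontier
    return context


index = build_index(TRIPLES)
one_hop = expand_query(["Eiffel Tower"], index)
two_hops = expand_query(["Eiffel Tower"], index, hops=2)
```

Here the query entity could come from either the text part of the query or a label assigned to an example image, since both are expressed in the same semantic space before the graph is consulted.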

4. Knowledge graph-oriented cross-media retrieval system

To realize a knowledge graph-oriented cross-media retrieval system, a system architecture incorporating a knowledge graph is first proposed, as shown in Figure 3. Besides the basic components of user query analysis, indexing, retrieval, and ranking, the system adds components for cross-media attribute perception and association analysis and for consistent expression. First, sufficient multimedia data are collected from the Internet, and the natural and social attributes of the cross-media data are obtained with the cross-media attribute perception model. The entity-object associations and the semantic, temporal, and structural associations among data of the various media types are then analyzed and described. A knowledge graph of a certain scale is built on this basis, and the cross-media knowledge it covers is represented with the proposed consistent expression framework so that it can be used.

The query analysis component accepts queries entered as natural language, cross-media examples, or data of different media types. During semantic analysis of a query, each media type entered by the user is analyzed separately, and the knowledge graph is then used for joint semantic analysis and further reasoning, so that contextual knowledge in the knowledge graph, such as time, place, entities, and their social relations, helps to better understand the user's intent. The cross-media hash indexing and ranking components mainly call existing algorithms.

Claims (5)

1. A knowledge graph-oriented cross-media retrieval system, characterized in that the system covers the following aspects:
cross-media attribute perception and association analysis;
consistent expression of cross-media knowledge;
knowledge graph-based semantic analysis of user queries;
architecture and implementation of the knowledge graph-oriented cross-media retrieval system.
2. The system according to claim 1, characterized in that a cross-media attribute perception model is established, the associations it covers are analyzed, and a unified description mechanism for cross-media data associations is proposed. The natural and social attributes of cross-media data are obtained through text parsing, entity extraction, metadata analysis, semantic annotation, and user behavior analysis; the complex relationships between natural and social attributes in the cross-media data are then modeled, taking into account the content associations (same modality), semantic associations (different modalities), temporal associations, structural associations, and other associations that exist among cross-media data; and, based on the links between the web pages on which multimedia objects appear, a probabilistic graphical model is used to model cross-media content and links probabilistically, so that different types of association are expressed in a uniform, quantified way.
3. The system according to claim 1, characterized in that, to meet the needs of cross-media semantic description and knowledge acquisition, a method is proposed for mapping data of different modalities to the same semantic label space, thereby achieving semantically consistent expression. When heterogeneous, complementary media modalities such as text and images jointly express one semantic, mapping relations are learned so that the heterogeneous modal information is mapped into one semantic label space, which allows similarity to be measured directly between heterogeneous data under a single expression framework. An evaluation function is built from semantic similarity, semantic coverage and semantic space calibration to evaluate the candidate semantic labels. The semantic label information is used to train a classifier for each modality, and the classification results are used as shared features, so that data of different modalities can also be mapped to the same semantic label space, thereby achieving semantically consistent expression.
4. The system according to claim 1, characterized in that a method is proposed for performing semantic analysis and reasoning on a query request, in combination with the associations contained in the knowledge graph, when the user expresses the request in natural language, as multimedia samples, or as a combination of different types of media data. For the query content entered by the user, the text part and the multimedia part of the query are analyzed both separately and jointly, so that the user's query intent is analyzed at the semantic level. To this end, sufficient cross-media information is first collected from the Internet and a semantic model is built for each media type, so that cross-media data are described by features in the same semantic space. The semantic distributions of the image data and the text data are then analyzed together to identify the semantics of the user query, and further associated semantics are mined in combination with the knowledge graph. Based on the semantic, temporal and structural associations of data contained in the knowledge graph, multi-dimensional context data related to the query content are obtained, and features under different contexts are discovered by reasoning, which yields more complete query semantics.
5. The system according to claim 1, characterized in that, in addition to elements such as user query analysis, indexing, retrieval and ranking, the system also creates a knowledge graph knowledge base of a certain scale and integrates it into the system. In the user query analysis part, the system supports queries entered as natural language, cross-media samples, or data of different media types. During query semantic analysis, each type of media data entered by the user is analyzed semantically on its own, and joint semantic analysis and further reasoning are then performed on the data in combination with the knowledge graph, so that the user's query intent is better understood from contextual knowledge on the knowledge graph such as time, place, entities and their social relations.
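As an illustration of the mapping to a shared semantic label space described in claim 3, the following Python sketch trains one classifier per modality and uses each classifier's label probabilities as the shared feature. It is a sketch under stated assumptions, not the method of the patent: the nearest-centroid classifier, the softmax over negative distances, the cosine similarity measure, the toy feature vectors and the names `train_centroids`, `to_label_space` and `cosine` are all chosen here for illustration only.

```python
import math


def train_centroids(samples):
    """Train a nearest-centroid classifier: one mean vector per label."""
    sums, counts = {}, {}
    for vec, label in samples:
        acc = sums.setdefault(label, [0.0] * len(vec))
        for i, x in enumerate(vec):
            acc[i] += x
        counts[label] = counts.get(label, 0) + 1
    return {lab: [s / counts[lab] for s in acc] for lab, acc in sums.items()}


def to_label_space(vec, centroids, labels):
    """Map a modality-specific vector to probabilities over the shared labels."""
    scores = [-math.dist(vec, centroids[lab]) for lab in labels]
    top = max(scores)  # subtract the maximum for numerical stability
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def cosine(a, b):
    """Cosine similarity between two vectors in the shared label space."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


LABELS = ["cat", "car"]  # hypothetical shared semantic labels

# Text features are 2-dimensional and image features 3-dimensional on purpose:
# the raw feature spaces are heterogeneous, the label space is common.
text_model = train_centroids([
    ([1.0, 0.0], "cat"), ([0.8, 0.2], "cat"),
    ([0.0, 1.0], "car"), ([0.2, 0.8], "car"),
])
image_model = train_centroids([
    ([5.0, 0.0, 0.0], "cat"), ([4.0, 1.0, 0.0], "cat"),
    ([0.0, 0.0, 5.0], "car"), ([0.0, 1.0, 4.0], "car"),
])

text_query = to_label_space([0.9, 0.1], text_model, LABELS)
cat_image = to_label_space([4.5, 0.5, 0.0], image_model, LABELS)
car_image = to_label_space([0.0, 0.5, 4.5], image_model, LABELS)

# Both modalities now live in the same 2-dimensional label space,
# so a text query can be compared with images directly.
sim_cat = cosine(text_query, cat_image)
sim_car = cosine(text_query, car_image)
```

Because the text query lies near the "cat" text centroid, its label vector is closer to that of the "cat" image than to that of the "car" image, which is the cross-modal comparison the shared label space is meant to enable.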
CN201510358374.5A 2015-06-26 2015-06-26 Knowledge graph-oriented cross-media retrieval system Active CN105550190B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201510358374.5A CN105550190B (en) 2015-06-26 2015-06-26 Cross-media retrieval system towards knowledge mapping

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201510358374.5A CN105550190B (en) 2015-06-26 2015-06-26 Cross-media retrieval system towards knowledge mapping

Publications (2)

Publication Number Publication Date
CN105550190A CN105550190A (en) 2016-05-04
CN105550190B CN105550190B (en) 2019-03-29

Family

ID=55829379

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201510358374.5A Active CN105550190B (en) 2015-06-26 2015-06-26 Knowledge graph-oriented cross-media retrieval system

Country Status (1)

Country Link
CN (1) CN105550190B (en)

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20130246328A1 (en) * 2010-06-22 2013-09-19 Peter Joseph Sweeney Methods and devices for customizing knowledge representation systems
CN103488713A (en) * 2013-09-10 2014-01-01 浙江大学 Cross-modal search method capable of directly measuring similarity of different modal data
CN103593792A (en) * 2013-11-13 2014-02-19 复旦大学 Individual recommendation method and system based on Chinese knowledge mapping
CN104035917A (en) * 2014-06-10 2014-09-10 复旦大学 Knowledge graph management method and system based on semantic space mapping
CN104166684A (en) * 2014-07-24 2014-11-26 北京大学 Cross-media retrieval method based on uniform sparse representation

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
蔡思: "基于概率的跨媒体检索方法研究", 《中国优秀硕士学位论文全文数据库信息科技辑》 *
赵耀 等: "跨媒体时代的知识表达---感知、关联及一致性表示", 《中国计算机学会通讯》 *

Cited By (94)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107369098A (en) * 2016-05-11 2017-11-21 华为技术有限公司 The treating method and apparatus of data in social networks
CN107423820A (en) * 2016-05-24 2017-12-01 清华大学 The knowledge mapping of binding entity stratigraphic classification represents learning method
CN106156365B (en) * 2016-08-03 2019-06-18 北京儒博科技有限公司 A kind of generation method and device of knowledge mapping
CN106156365A (en) * 2016-08-03 2016-11-23 北京智能管家科技有限公司 A kind of generation method and device of knowledge mapping
CN109716286A (en) * 2016-08-16 2019-05-03 电子湾有限公司 Determine the item with confirmed feature
CN107783973A (en) * 2016-08-24 2018-03-09 慧科讯业有限公司 Method, device and system for monitoring internet media event based on industry knowledge map database
CN107783973B (en) * 2016-08-24 2022-02-25 慧科讯业有限公司 Method, device and system for monitoring internet media event based on industry knowledge map database
CN107967267A (en) * 2016-10-18 2018-04-27 中兴通讯股份有限公司 A kind of knowledge mapping construction method, apparatus and system
CN108009182B (en) * 2016-10-28 2020-03-10 京东方科技集团股份有限公司 Information extraction method and device
CN108009182A (en) * 2016-10-28 2018-05-08 京东方科技集团股份有限公司 A kind of information extracting method and device
US10657330B2 (en) 2016-10-28 2020-05-19 Boe Technology Group Co., Ltd. Information extraction method and apparatus
CN106776370A (en) * 2016-12-05 2017-05-31 哈尔滨工业大学(威海) Cloud storage method and device based on the assessment of object relevance
CN106844334B (en) * 2016-12-20 2022-07-15 网易(杭州)网络有限公司 Method and equipment for evaluating conversation robot intelligence
CN106844334A (en) * 2016-12-20 2017-06-13 网易(杭州)网络有限公司 Method and apparatus for evaluating and testing session robotic intelligence
CN106776564A (en) * 2016-12-21 2017-05-31 张永成 The method for recognizing semantics and system of a kind of knowledge based collection of illustrative plates
CN107247739B (en) * 2017-05-10 2019-11-01 浙江大学 A kind of financial bulletin text knowledge extracting method based on factor graph
CN107247739A (en) * 2017-05-10 2017-10-13 浙江大学 A kind of financial publication text knowledge extracting method based on factor graph
CN107273418A (en) * 2017-05-11 2017-10-20 浙江大学 A kind of across Noumenon property chain inference method based on cloud platform
CN107291828A (en) * 2017-05-27 2017-10-24 北京百度网讯科技有限公司 Spoken inquiry analytic method, device and storage medium based on artificial intelligence
US10698932B2 (en) 2017-05-27 2020-06-30 Beijing Baidu Netcom Science And Technology Co., Ltd. Method and apparatus for parsing query based on artificial intelligence, and storage medium
CN108959328A (en) * 2017-05-27 2018-12-07 株式会社理光 Processing method, device and the electronic equipment of knowledge mapping
CN108959328B (en) * 2017-05-27 2021-12-21 株式会社理光 Knowledge graph processing method and device and electronic equipment
CN107038261A (en) * 2017-05-28 2017-08-11 海南大学 A kind of processing framework resource based on data collection of illustrative plates, Information Atlas and knowledge mapping can Dynamic and Abstract Semantic Modeling Method
CN107330520A (en) * 2017-06-09 2017-11-07 上海电力学院 The object Affording acquisition inference method that a kind of knowledge based storehouse is represented
CN107256271B (en) * 2017-06-27 2020-04-03 鲁东大学 Cross-modal hash retrieval method based on mapping dictionary learning
CN107256271A (en) * 2017-06-27 2017-10-17 鲁东大学 Cross-module state Hash search method based on mapping dictionary learning
CN107748754A (en) * 2017-09-15 2018-03-02 广州唯品会研究院有限公司 A kind of knowledge mapping improving method and device
US11769064B2 (en) 2017-11-21 2023-09-26 Google Llc Onboarding of entity data
CN110741389A (en) * 2017-11-21 2020-01-31 谷歌有限责任公司 Improved access to entity data
CN108090167B (en) * 2017-12-14 2020-11-10 畅捷通信息技术股份有限公司 Data retrieval method, system, computing device and storage medium
CN108090167A (en) * 2017-12-14 2018-05-29 畅捷通信息技术股份有限公司 Method, system, computing device and the storage medium of data retrieval
CN110275898A (en) * 2018-03-16 2019-09-24 埃森哲环球解决方案有限公司 Use the integrated monitoring and communication system of the explanatory equipment management of knowledge based figure
CN110275898B (en) * 2018-03-16 2023-07-21 埃森哲环球解决方案有限公司 Integrated monitoring and communication system using knowledge graph-based interpretive device management
CN108491502A (en) * 2018-03-21 2018-09-04 腾讯科技(深圳)有限公司 A kind of method, terminal, server and the storage medium of news tracking
CN108549667B (en) * 2018-03-23 2022-04-08 绍兴诺雷智信息科技有限公司 Semantic retrieval method for structural engineering design knowledge
CN108549667A (en) * 2018-03-23 2018-09-18 绍兴诺雷智信息科技有限公司 A kind of semantic retrieving method of structuring engineering design knowledge
CN110689033A (en) * 2018-07-05 2020-01-14 第四范式(北京)技术有限公司 Data acquisition method, device and equipment for model training and storage medium
CN110970112A (en) * 2018-09-29 2020-04-07 九阳股份有限公司 Method and system for constructing knowledge graph for nutrition and health
CN110970112B (en) * 2018-09-29 2024-03-12 九阳股份有限公司 Knowledge graph construction method and system for nutrition and health
CN110968776A (en) * 2018-09-30 2020-04-07 北京国双科技有限公司 Policy knowledge recommendation method, device storage medium and processor
CN109522465A (en) * 2018-10-22 2019-03-26 国家电网公司 The semantic searching method and device of knowledge based map
CN109697233B (en) * 2018-12-03 2023-06-20 中电科大数据研究院有限公司 Knowledge graph system construction method
CN109697233A (en) * 2018-12-03 2019-04-30 中电科大数据研究院有限公司 A kind of knowledge mapping system building method
CN109710923A (en) * 2018-12-06 2019-05-03 浙江大学 Cross-language entity matching method based on cross-media information
CN109710923B (en) * 2018-12-06 2020-09-01 浙江大学 Cross-language entity matching method based on cross-media information
CN109710776B (en) * 2018-12-29 2022-10-28 中国科学技术大学 The construction method of the knowledge map of the album
CN109710776A (en) * 2018-12-29 2019-05-03 中国科学技术大学 The construction method of the knowledge map of the album
CN112115270A (en) * 2019-06-20 2020-12-22 国电南瑞科技股份有限公司 Method for constructing transformer knowledge graph ontology relation model
CN112115270B (en) * 2019-06-20 2022-09-16 北京南瑞数字技术有限公司 Method for constructing transformer knowledge graph ontology relation model
CN110647804A (en) * 2019-08-09 2020-01-03 中国传媒大学 Violent video identification method, computer system and storage medium
CN110489565A (en) * 2019-08-15 2019-11-22 广州拓尔思大数据有限公司 Based on the object root type design method and system in domain knowledge map ontology
CN110489565B (en) * 2019-08-15 2023-05-16 广州拓尔思大数据有限公司 Method and system for designing object root type in domain knowledge graph body
CN110457502A (en) * 2019-08-21 2019-11-15 京东方科技集团股份有限公司 Construct knowledge mapping method, man-machine interaction method, electronic equipment and storage medium
CN110532404A (en) * 2019-09-03 2019-12-03 北京百度网讯科技有限公司 One provenance multimedia determines method, apparatus, equipment and storage medium
CN110532404B (en) * 2019-09-03 2023-08-04 北京百度网讯科技有限公司 Source multimedia determining method, device, equipment and storage medium
CN110532341A (en) * 2019-09-03 2019-12-03 华东师范大学 Spatial information space-time big data constraint expression method
CN110597992A (en) * 2019-09-10 2019-12-20 腾讯科技(深圳)有限公司 Semantic reasoning method and device based on knowledge graph and electronic equipment
CN110597992B (en) * 2019-09-10 2023-08-29 腾讯科技(深圳)有限公司 Semantic reasoning method and device based on knowledge graph and electronic equipment
CN110750656A (en) * 2019-10-29 2020-02-04 上海德拓信息技术股份有限公司 Multimedia detection method based on knowledge graph
CN110928961A (en) * 2019-11-14 2020-03-27 出门问问(苏州)信息科技有限公司 Multi-mode entity linking method, equipment and computer readable storage medium
CN110928961B (en) * 2019-11-14 2023-04-28 出门问问(苏州)信息科技有限公司 Multi-mode entity linking method, equipment and computer readable storage medium
CN112836060A (en) * 2019-11-25 2021-05-25 中国科学技术信息研究所 Map construction method and device for scientific and technological innovation data
CN112836060B (en) * 2019-11-25 2023-11-24 中国科学技术信息研究所 Atlas construction method and apparatus for technological innovation data
CN111221984B (en) * 2020-01-15 2024-03-01 北京百度网讯科技有限公司 Multi-mode content processing method, device, equipment and storage medium
CN111221984A (en) * 2020-01-15 2020-06-02 北京百度网讯科技有限公司 Multimodal content processing method, device, equipment and storage medium
CN111680207A (en) * 2020-03-11 2020-09-18 华中科技大学鄂州工业技术研究院 A method and apparatus for determining a user's search intent
CN111680207B (en) * 2020-03-11 2023-08-04 华中科技大学鄂州工业技术研究院 A method and device for determining user search intent
CN111460169A (en) * 2020-03-27 2020-07-28 科大讯飞股份有限公司 Semantic expression generation method, device and equipment
CN111460169B (en) * 2020-03-27 2023-06-02 科大讯飞股份有限公司 Semantic expression generation method, device and equipment
CN111680173A (en) * 2020-05-31 2020-09-18 西南电子技术研究所(中国电子科技集团公司第十研究所) CMR model for uniformly retrieving cross-media information
CN111680173B (en) * 2020-05-31 2024-02-23 西南电子技术研究所(中国电子科技集团公司第十研究所) CMR model for unified searching cross-media information
CN111666425A (en) * 2020-06-10 2020-09-15 深圳开思时代科技有限公司 Automobile accessory searching method based on semantic knowledge
CN111666425B (en) * 2020-06-10 2023-04-18 深圳开思时代科技有限公司 Automobile accessory searching method based on semantic knowledge
CN111708745A (en) * 2020-06-18 2020-09-25 全球能源互联网研究院有限公司 A cross-media data sharing representation method and user behavior analysis method and system
CN111708745B (en) * 2020-06-18 2023-04-21 全球能源互联网研究院有限公司 Cross-media data sharing representation method and user behavior analysis method and system
CN112084339B (en) * 2020-08-11 2023-11-24 同济大学 Traffic knowledge graph construction method based on cross-media data
CN112084339A (en) * 2020-08-11 2020-12-15 同济大学 Traffic knowledge graph construction method based on cross-media data
CN112749289A (en) * 2020-12-31 2021-05-04 重庆空间视创科技有限公司 Multi-mode-based knowledge graph retrieval system and method
CN112732969A (en) * 2021-01-14 2021-04-30 珠海格力电器股份有限公司 Image semantic analysis method and device, storage medium and electronic equipment
CN112948547B (en) * 2021-01-26 2024-04-09 中国石油大学(北京) Logging knowledge graph construction query method, device, equipment and storage medium
CN112948547A (en) * 2021-01-26 2021-06-11 中国石油大学(北京) Logging knowledge graph construction query method, device, equipment and storage medium
CN113010701A (en) * 2021-02-25 2021-06-22 北京四达时代软件技术股份有限公司 Video-centered fused media content recommendation method and device
CN113157882A (en) * 2021-03-31 2021-07-23 山东大学 Knowledge graph path retrieval method and device with user semantics as center
CN113157882B (en) * 2021-03-31 2022-05-31 山东大学 User semantic-centric knowledge graph path retrieval method and device
CN113111161B (en) * 2021-04-09 2023-09-08 北京语言大学 Cross-media association analysis method
CN113111161A (en) * 2021-04-09 2021-07-13 北京语言大学 Cross-media association analysis method
CN113254678B (en) * 2021-07-14 2021-10-01 北京邮电大学 Training method of cross-media retrieval model, cross-media retrieval method and device thereof
CN113254678A (en) * 2021-07-14 2021-08-13 北京邮电大学 Training method of cross-media retrieval model, cross-media retrieval method and equipment thereof
CN114781400A (en) * 2022-06-17 2022-07-22 之江实验室 A method and device for semantic expression of cross-media knowledge
CN114781400B (en) * 2022-06-17 2022-09-09 之江实验室 Cross-media knowledge semantic expression method and device
CN115374765A (en) * 2022-10-27 2022-11-22 浪潮通信信息系统有限公司 Computing power network 5G data analysis system and method based on natural language processing
CN115374765B (en) * 2022-10-27 2023-06-02 浪潮通信信息系统有限公司 Computing power network 5G data analysis system and method based on natural language processing
CN117150031A (en) * 2023-07-24 2023-12-01 青海师范大学 A processing method and system for multi-modal data
CN117252262A (en) * 2023-09-28 2023-12-19 四川大学 Knowledge graph construction and patent information retrieval method and device

Also Published As

Publication number Publication date
CN105550190B (en) 2019-03-29

Similar Documents

Publication Publication Date Title
CN105550190A (en) Knowledge graph-oriented cross-media retrieval system
CN102902821B (en) The image high-level semantics mark of much-talked-about topic Network Based, search method and device
US9305083B2 (en) Author disambiguation
CN110990590A (en) Dynamic financial knowledge map construction method based on reinforcement learning and transfer learning
US20170228459A1 (en) Method and device for mobile searching based on artificial intelligence
CN110910175B (en) Image generation method for travel ticket product
CN114238573A (en) Information pushing method and device based on text countermeasure sample
CN101364239A (en) A classification catalog automatic construction method and related system
CN108765383A (en) Video presentation method based on depth migration study
Miao et al. A dynamic financial knowledge graph based on reinforcement learning and transfer learning
Kacprzak et al. Making sense of numerical data-semantic labelling of web tables
CN118260717A (en) Internet low-orbit satellite information mining method, system, device and medium
Li Construction of Internet of Things English terms model and analysis of language features via deep learning
CN114661951B (en) Video processing method, device, computer equipment and storage medium
Derungs et al. Mining nearness relations from an n-grams Web corpus in geographical space
Nie et al. Cross-domain semantic transfer from large-scale social media
CN111104492B (en) Civil aviation field automatic question and answer method based on layering Attention mechanism
CN114942981B (en) Question and answer query method and device, electronic equipment and computer readable storage medium
CN114238735B (en) Intelligent internet data acquisition method
Berg et al. Do you see what I see? Measuring the semantic differences in image‐recognition services' outputs
Li et al. Research on hot news discovery model based on user interest and topic discovery
Hu et al. GeoEntity-type constrained knowledge graph embedding for predicting natural-language spatial relations
Jalal et al. A web content mining application for detecting relevant pages using Jaccard similarity
Lytras et al. Innovations, developments, and applications of semantic web and information systems
Bernasconi et al. NOTAE: NOT A writtEn word but graphic symbols.

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
TR01 Transfer of patent right

Effective date of registration: 20201225

Address after: 210012 18 / F, A1 North building, 32 Fengzhan Road, Yuhuatai District, Nanjing City, Jiangsu Province

Patentee after: Jiangsu Huchuan Technology Co.,Ltd.

Address before: 461000 No. 88 Bayi Road, Henan, Xuchang

Patentee before: XUCHANG University

TR01 Transfer of patent right
PE01 Entry into force of the registration of the contract for pledge of patent right

Denomination of invention: A Cross Media Retrieval System for Knowledge Graph

Effective date of registration: 20231130

Granted publication date: 20190329

Pledgee: Nanjing Bank Co.,Ltd. Nanjing Financial City Branch

Pledgor: Jiangsu Huchuan Technology Co.,Ltd.

Registration number: Y2023980067998

PE01 Entry into force of the registration of the contract for pledge of patent right