CN109033160A - A kind of knowledge mapping dynamic updating method - Google Patents

A kind of knowledge mapping dynamic updating method Download PDF

Info

Publication number
CN109033160A
CN109033160A CN201810627957.7A CN201810627957A CN109033160A CN 109033160 A CN109033160 A CN 109033160A CN 201810627957 A CN201810627957 A CN 201810627957A CN 109033160 A CN109033160 A CN 109033160A
Authority
CN
China
Prior art keywords
entity
updated
extension
knowledge mapping
entities
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201810627957.7A
Other languages
Chinese (zh)
Inventor
漆桂林
李丞
李林
吴天星
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Southeast University
Original Assignee
Southeast University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Southeast University filed Critical Southeast University
Priority to CN201810627957.7A priority Critical patent/CN109033160A/en
Publication of CN109033160A publication Critical patent/CN109033160A/en
Pending legal-status Critical Current

Links

Landscapes

  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention discloses a kind of knowledge mapping dynamic updating methods, for solving the stationary problem between encyclopaedic knowledge map and its data source.The present invention is using the Hot Contents on WWW as starting point;Name entity is therefrom extracted as kind of a fructification, these usual entities are the entities that possible update.Then crawl other entities relevant with kind fructification are used as extension entity on encyclopaedia website.Then, a certain number of entities are obtained from encyclopaedia website and carry out Feature Engineering, and excavate the more new information of substance feature characterization using machine learning algorithm, construct fallout predictor;The high entity of update probability is filtered out from extension entity using fallout predictor.Finally, in the case where the limitation of data source access number, realizing that the dynamic of knowledge mapping updates using the high extension entity of kind of fructification and update probability as the object updated.

Description

A kind of knowledge mapping dynamic updating method
Technical field
The invention belongs to knowledge mapping fields, are related to a kind of encyclopaedic knowledge map dynamic updating method.
Background technique
Knowledge mapping can be constructed by various forms of data sources, and wherein encyclopaedic knowledge map is exactly to pass through encyclopaedia What the data on website were built.Usually, data the managing and maintaining due to specification on encyclopaedia website, data cover Wide, content is accurate, and the knowledge mapping built also has better quality.But due to the knowledge and information on encyclopaedia website Variation, already existing data can face out-of-date problem in encyclopaedic knowledge map;In addition, for occurring on encyclopaedia website Completely new vocabulary, knowledge mapping can not timely include them.
In order to guarantee the not out-of-date update it is necessary to keep data and data source in knowledge mapping of the data in knowledge mapping It is synchronous as far as possible.A kind of basic method is the data by obtaining entire data source, to rebuild knowledge mapping, this It is the way that current most of encyclopaedic knowledge maps are updated.Certain encyclopaedia websites, such as wikipedia can periodically provide complete The data set and data change information stood, provide a great convenience the acquisition of data.But also some encyclopaedia websites are simultaneously Such data set, such as Baidupedia, interaction encyclopaedia will not be provided;If it is desired to updating to encyclopaedia data, crawler can only be passed through Crawl some or all of data.Do so some drawbacks: first, although wikipedia provides the variation letter of data set Breath, but these data sets are usually to be provided using the moon as the period;In this way, knowledge mapping still cannot timely keep up with encyclopaedia enough The information change of website.Second, for not providing the encyclopaedia website of data set downloading, if grabbing the number of whole station by crawler According to for the entity of ten million number of levels, machine runs without interruption one month and also different surely crawled.No matter third, It is downloaded either with or without data set, obtains whole data sets all very greatly, this can expend very more bandwidth;And encyclopaedia website, Also it can limit frequently access is crawled, so that the acquisition of new data is highly difficult.So finding a kind of knowledge mapping dynamic Update method, this method keep the timeliness of knowledge mapping data under conditions of avoiding bandwidth resources from wasting, and have very big Research significance.
Summary of the invention
Technical problem: the present invention provides a kind of knowledge mapping dynamic updating method, and this method avoid network bandwidths and meter The waste for calculating resource, greatly reduces the time lag of data in knowledge mapping, can it is automatic, frequently, efficiently to knowledge graph Spectrum is updated, and knowledge mapping may be implemented, and frequently efficient more new task is also protected while ensure that lesser amount of access The quality for updating work is demonstrate,proved.
Technical solution: the present invention searches vocabulary as starting point using the heat of popular title and search engine on social network sites, from These titles and heat, which are searched, excavates entity word in word, as kind of a fructification, these usual entities are the entities that possible update. Then related entities are grabbed on encyclopaedia website to be extended.Then, by carrying out Feature Engineering in certain amount entity information, Based on these features, the fallout predictor that can predict future update situation is trained with machine learning method.Finally, just with seed Entity and the higher extension entity of update probability are realized in the case where the limitation of data source access number as the object updated The dynamic of knowledge mapping updates.
Knowledge mapping dynamic updating method of the invention, includes the following steps:
1) vocabulary is searched as starting point, from these titles and heat using the popular title on social network sites and the heat on search engine It searches and extracts entity in word, using these entities as kind fructification to be updated;
2) internal chaining in the Abstract and Infobox on the encyclopaedia page of crawl kind fructification, as extension entity;
3) entity word is grabbed from encyclopaedia website, therefrom extracts feature, with history renewal frequency for Label value, building Training dataset establishes regression model using machine learning method, trains fallout predictor from the training dataset of building;
4) using the renewal frequency of fallout predictor prediction extension entity, forecast updating frequency higher preceding K are picked out Entity is extended, K is that extension entity updates the upper limit of the number value;
5) the extension entity that kind of fructification and the step 4) are picked out carries out more knowledge mapping as upgating object Newly.
In the preferred embodiment of dynamic updating method of the present invention, in the step 1), entity is extracted as follows:
Name entity 1-a) is extracted first using name Entity recognition to the title grabbed;
1-b) then utilize participle technique, from title obtain identification less than candidate entity word list;
Part-of-speech tagging 1-c) is carried out to candidate entity word, weeding out auxiliary word, number, these do not have the candidate vocabulary of practical significance, Then verify whether these candidate words are entity words on encyclopaedia website, using entity word and the name entity of extraction as seed reality Body.
In the preferred embodiment of dynamic updating method of the present invention, training data is constructed according to following characteristics in the step 3) Collection:
Number of weeks existing for 3-1): entity is present in the time on encyclopaedia website, and new entity is more likely to be updated;
3-2) update times: the number that entity is updated in total, the number being updated in the past is more, it is following more may by after It is continuous to update;
3-3) browsing time: the number that an entity is seen is more, illustrates more welcome, is also more likely to be updated;
3-4) number of links: easier by it when the hyperlink that a physical page includes other resources or entity is more His resource or entity influence;
3-5) internal chaining number: if being linked to other entities, the change of other entities may also will affect oneself;
3-6) content of pages length: an article is longer, and content is abundanter, easier to be updated;
3-7) clip Text length: abstract contains the description most succinct to entity, makes a summary longer, information content is bigger;
3-8) average all renewal frequency: past updates more frequent, then future may also update it is frequent.
In the preferred embodiment of dynamic updating method of the present invention, according to following specific steps to knowledge mapping in the step 5) It is updated:
Firstly, more content of the new seed entity in knowledge mapping, extension entity is then updated according to priority and is being known Know the content in map: for the entity being originally not present in knowledge mapping, being inserted into knowledge mapping with highest priority;It is right The entity existing for those scripts provides forecast updating frequency first with fallout predictor, then preferentially to upper subsynchronous expectation later The maximum extension entity of renewal frequency is updated, until all extension entities have all been updated completion or extension entity update The extension entity that quantity reaches setting updates the upper limit of the number value K, and wherein K is determined according to network bandwidth and computing resource.
In the preferred embodiment of dynamic updating method of the present invention, priority when extension entity updates is calculate by the following formula excellent First grade value determines that priority value is bigger, then priority is higher:
E [u (x)]=P (x) × (tnow-ts(x))
Wherein, E [u (x)] be to the priority value that is updated of extension entity, u (x) be it is upper it is subsynchronous after update when Between, P (x) is the renewal frequency predicted by fallout predictor, tnowFor current time, tsIt (x) is the time updated entity x last time, when When x is novel entities, ts(x)=- ∞.
Encyclopaedic knowledge map dynamic updating method of the invention does not need periodically to obtain whole numbers on encyclopaedia website According to and frequently knowledge mapping can be updated with fraction of amount of access, avoid existing method well to network bandwidth Waste, also greatly reduce the time lag of data in knowledge mapping.
The present invention finds the entity that may have occurred update using enlightening rule;And by selecting feature, excavate Relationship between entity and renewal frequency out, constructs fallout predictor, further finds the entity that may be had updated.Realize energy Enough automatic, frequently, the dynamic updating method being efficiently updated to knowledge mapping reduces in knowledge mapping renewal process not Necessary update.Problem is expressed as follows:
Wherein, tnIt (x) is the last time renewal time of entity x on encyclopaedia website, tsIt (x) is to synchronize the entity x last time Time is (if x is novel entities, ts(x)=- ∞), the intention of formula expression are as follows: be up to K's in the number of processing entities Under the conditions of, after subsynchronous in searching as much as possible, the entity that is updated on encyclopaedia website.
The utility model has the advantages that compared with prior art, the present invention having the advantage that
Compared to the update mode of current most of knowledge mappings, the present invention is efficiently realized to knowledge mapping part more Newly, it updates those reality and is more likely to the entity changed on encyclopaedia website.When existing encyclopaedic knowledge map updates, mostly Need to obtain data whole in nearest data source;No matter whether encyclopaedia website provides data set downloading, this can all be wasted largely Network bandwidth.In the present invention, it is updated every time from network hotspot, extracts kind of a fructification, then sought on encyclopaedia website It looks for related entities to be extended, finally the entity picked out is updated;Demand very little of the whole process to Internet resources, section Network bandwidth is saved.It is based on the judgement of network hotspot and some heuristic rules and fallout predictor simultaneously, so that choosing update Entity quality has guarantee, is updated to those entities with greater need for update.In addition, due to the data volume to access every time It is very small, it may be implemented frequently to be updated the data in knowledge mapping, greatly reduce the lag of knowledge mapping data Property.
It is proved by experimental analysis, using knowledge mapping dynamic updating method proposed by the present invention, knowledge graph may be implemented The frequent efficient more new task of spectrum.Based on it is didactic rule and sequence of the fallout predictor to renewal frequency so that those with greater need for The entity of update is selected out, while ensure that lesser amount of access, also ensures the quality for updating work.The present invention In, plant the validity of fructification, the validity of the extension entity that fallout predictor is picked out is in accuracy rate, recall rate, F1 value, and The outstanding effect of the update method is all sufficiently demonstrated by the indexs such as MAP, nDCG, AUC.
Detailed description of the invention
Fig. 1 is general frame schematic diagram of the invention;
Fig. 2 is the flow diagram of more new algorithm in the present invention.
Specific embodiment
With reference to embodiments and Figure of description, the implementation process that the present invention will be described in detail.
The present invention is the table entity link method based on multiple knowledge base, including following 5 steps:
1) vocabulary is searched as starting point using the heat of popular title and search engine on social network sites, is searched from these titles and heat Entity is extracted in word, using these entities as kind fructification to be updated.This process is the discovery of kind of fructification, is walked in detail It is rapid as follows:
(1) hot spot title is grabbed, heat searches vocabulary
Because of the hot topic quality with higher of the real-time hot spot and Baidu's discussion bar of search dog search, in the present embodiment with They are as the source for planting fructification;The hot spot of search dog search and the topic numbers of Baidu's discussion bar be all it is constant, the two is shared 50 titles.In addition, Baidupedia homepage, which can also provide heat, searches entry, these can directly be grabbed as seed entity.
(2) kind of a fructification is extracted from title
Entity recognition is named to title first, and has selected following classification as the entity class retained: ' PERSON ', ' LOCATION ', ' ORGANIZATION ', ' MISC ', ' CITY ', ' STATE_OR_PROVINCE ', ' COUNTRY’.’RELIGION’.’IDEOLOGY’。
Meanwhile also picking out some entity class and being abandoned, predominantly digital, the classifications such as time, itself difficulty has reality Meaning: ' MONEY ', ' NUMBER ', ' ORDINAL ', ' PERCENT ', ' DATE ', ' TIME ', ' SET ', ' DURATION ' ' TITLE’。
In addition, carrying out part-of-speech tagging to the word in title, remain to clean subsequent word segmentation result.
Name Entity recognition, which is only used only, can omit many entities.In order to improve recall rate, participle technique is used, has been come More candidate seed entities (word) are obtained, the effect for segmenting generation is as follows, can more comprehensively obtain entity candidate word.
Search engine mode Accurate model
Total eclipse of the moon will appear the sky Total eclipse/total eclipse of the moon/appears/sky Total eclipse of the moon/appear/the sky
Easy thousand imperial or royal seal achievement of melt exposure Easily/melt/thousand imperial or royal seals/achievement/exposure Thousand imperial or royal seals of easy melt/achievement/exposure
Intend closed management in the Ming Tombs 13/tri- mounds/Ming Tombs/quasi-/closing/management The Ming Tombs/quasi-/closing/management
Although participle increases candidate entity word, also band is come in many meaningless words.By choosing some meaningless words Property (for example, auxiliary word, number):
' AD ', ' AS ', ' BA ', ' CC ', ' CD ', ' CS ', ' DEC ', ' DEG ', ' DER ', ' DEV ', ' DT ', ' ETC ', ' IJ ', ' JJ ', ' LB ', ' LC ', ' M ', ' OD ', ' ON ', ' P ', ' PN ', ' PU ', ' SB ', ' SP ', ' VA ', ' VC ', ' VE ', ' VV ', ' X '
Then, using part-of-speech tagging information, to be cleaned to word segmentation result.So far time as much as possible is had been obtained for Entity word is selected, finally, they are directly retrieved on Baidupedia as entity word, it is whether genuine real to verify them Body;If there is corresponding entry, then it is assumed that be entity word, be not otherwise just entity word.Kind fructification, which extracts, to be completed.
2) the related internal chaining on the encyclopaedia page of crawl kind fructification, as extension entity.
(1) extension of fructification is planted
The present invention passes through the internal chaining for selecting kind of fructification, to select and plant the relevant entity of fructification.Specifically, this reality Apply the internal chaining for having selected to include in abstract and Infobox two parts in the kind fructification encyclopaedia page in example.It is interior in abstract Link and kind fructification are more relevant, and Infobox contains the association attributes of kind of fructification.The physical correlation extended in this way is more Height, and will not excessively be expanded too much because of internal chaining.
(2) extension entity has actually occurred the ratio updated.
The present embodiment acquires more days seed solid datas by this method, and is extended.After one month, Collect these extension nearest month ratios updated of entity.The update ratio of available extension entity is on 30% left side It is right.This result shows that:
It one, is effective by the method that related entities are found in abstract and Infobox internal chaining.
Two, extend in entity and only 30% updated in the recent period, directly synchronize in this way still suffer from 70% extension Entity does not update, synchronizes to them, still results in waste.Fallout predictor is needed to screen and be more likely to update Entity.
3) a certain number of entity words are grabbed from encyclopaedia, therefrom extract feature, are Label with history renewal frequency Value constructs training dataset.Regression model is established using machine learning method for training data, as fallout predictor.
In the present embodiment, tens of thousands of physical pages are grabbed from Baidupedia, and therefrom extracted following feature:
Feature 1: existing number of weeks: entity is present in the time on encyclopaedia website, and new entity is more likely to be updated.
Feature 2: update times: the number that entity is updated in total, the number being updated in the past is more, the following more possible quilt Continue to update.
Feature 3: browsing time: the number that an entity is seen is more, illustrates more welcome, is also more likely to be updated.
Feature 4: number of links: when the hyperlink that a physical page includes other resources or entity is more, easier quilt Other resources or entity influence.
Feature 5: internal chaining number: if being linked to other entities, the change of other entities may also be will affect self.
Feature 6: content of pages length: an article is longer, and content is abundanter, easier to be updated.
Feature 7: clip Text length: abstract contains the description most succinct to entity, makes a summary longer, information content is bigger.
Feature 8: average week renewal frequency: past updates more frequent, then future may also update it is frequent.
In the present embodiment, the validity of test feature engineering by the following method:
The label value in training sample, class categories are changed into: specifically, what the last one month was updated, y (e) > 0, as positive example;The last one moon, there is no update, y (e)=0, as counter-example.Then chi-square value (χ 2) and information are calculated Gain (IG:Information Gain).Chi-square value can be used to correlation of the judging characteristic between class label, value Smaller to illustrate that correlation is lower, feature is more useless.Information gain indicates to learn the information of feature X and make the letter of class label Cease the degree of uncertainty reduction.Feature is more useful, and information gain value is bigger.The test result of the present embodiment meets expection and sets Think.
Regression model is as prediction model using in machine learning, and the fallout predictor in the present embodiment has used ridge regression and random Two kinds of models of forest, and using history week renewal frequency as forecast updating frequency, construct benchmark model.Ridge regression is in linear regression On the basis of be added to regularization term, can reduce the risk of over-fitting.Random forest is carried out using decision tree as base learner It is integrated, while the selection of random attribute is introduced in the training process, so that finally integrated Generalization Capability is because of individual study Between diversity factor increase and promoted.
4) entity for being more likely to update is picked out from extension entity using fallout predictor.
In the present embodiment, the feature of extension entity is extracted, the renewal frequency of prediction is then provided using fallout predictor, and will more Newly frequency is descending is ranked up, and K extension entity before being selected according to network bandwidth and computing resource situation, network bandwidth is got over Greatly, computing resource is abundanter, and K is bigger, and the maximum value that K can be obtained is to extend the number of entity;Otherwise, network bandwidth is small, calculates Resource is few, in order to guarantee update validity, K is set it is a little bit smaller, can take extension physical quantities half.
In the present embodiment, it tests first with Hold-out model to fallout predictor: training dataset is divided into two Part, wherein 90% is used to train, 10% is used to test.It calculates on test set, all renewal frequencies predicted and actual week are more MSE between new frequency.Then determine whether each test entity occurs after time stamp T by different threshold It updates, precision-racall curve is obtained with this, and judge the robustness of regressor using AUC.
The present embodiment also assesses the effect of fallout predictor on true data set: judging that can fallout predictor be The entity updated has been actually occurred, higher update probability is predicted.
In time t, grab popular title, then therefrom extract several fructification, by these kind of fructification into Row extension, obtains extending entity accordingly;Then, it is predicted against extension entity, and according to the frequency of prediction height It is sorted.Then, in t to t+30 days, one month time, the update status of these extension entities is collected.Most Zhongdao t It until+30, checks in this month, how many extension entity is updated altogether.Later, it is updated for these Extension entity judged, look at they whether be predicted device be discharged to front.The present embodiment has used following index to carry out Assessment:
MAP: Average Accuracy, for assessing the effect of retrieval ranking.The effect of assessment prediction ranking is carried out in the present embodiment Fruit, in the top to be considered to retrieve relevant, then having actually occurred, the entity ranking updated is more forward, and MAP value will be got over It is high.
NDCG: common ranking evaluation index, for the accuracy for ranking of testing and assessing.In the present embodiment, fallout predictor returns The ranking of entity, each entity correspond to a score value, these score values are exactly gain (Gain) value.These scores It is added, is exactly CumulativeGain (storage gain).Those have actually occurred those of update entity, should be put into front. Therefore, when being added these scores, each is needed divided by an incremental number, that is, loses value, and obtain DCG, finally It is normalized.
Area below AUC:Precision-Recall curve, biggish AUC represent preferable effect.
Precision@n: n entity in the top has actually occurred the ratio updated.
Recall@n: having actually occurred in the entity updated, how much has been discharged to preceding n.
Fl@n: the harmonic-mean based on Precision and Recall, more attention smaller value.
In the multiple test performance of the present embodiment, in most cases Random Forest model is all showed best, therefore this Invention is using Random Forest model as fallout predictor model.
5) using kind of fructification and the extension entity picked out as upgating object, knowledge mapping is updated.

Claims (5)

1. a kind of knowledge mapping dynamic updating method, which is characterized in that method includes the following steps:
1) vocabulary is searched as starting point using the popular title on social network sites and the heat on search engine, searches word from these titles and heat In extract entity, using these entities as kind fructification to be updated;
2) internal chaining in the Abstract and Infobox on the encyclopaedia page of crawl kind fructification, as extension entity;
3) entity word is grabbed from encyclopaedia website, therefrom extracts feature, with history renewal frequency for Label value, building training Data set establishes regression model using machine learning method, trains fallout predictor from the training dataset of building;
4) using the renewal frequency of fallout predictor prediction extension entity, the higher preceding K extension of forecast updating frequency is picked out Entity, K are that extension entity updates the upper limit of the number value;
5) the extension entity that kind of fructification and the step 4) are picked out is updated knowledge mapping as upgating object.
2. knowledge mapping dynamic updating method according to claim 1, which is characterized in that in the step 1), according to such as Under type extracts entity:
Name entity 1-a) is extracted first using name Entity recognition to the title grabbed;
1-b) then utilize participle technique, from title obtain identification less than candidate entity word list;
Part-of-speech tagging 1-c) is carried out to candidate entity word, weeding out auxiliary word, number, these do not have the candidate vocabulary of practical significance, then Verify whether these candidate words are entity words on encyclopaedia website, using entity word and the name entity of extraction as kind of a fructification.
3. knowledge mapping dynamic updating method according to claim 1, which is characterized in that according to following in the step 3) Feature construction training dataset:
Number of weeks existing for 3-1): entity is present in the time on encyclopaedia website, and new entity is more likely to be updated;
3-2) update times: the number that entity is updated in total, the number being updated in the past is more, and future may more continue more Newly;
3-3) browsing time: the number that an entity is seen is more, illustrates more welcome, is also more likely to be updated;
3-4) number of links: easier to be provided by other when the hyperlink that a physical page includes other resources or entity is more Source or entity influence;
3-5) internal chaining number: if being linked to other entities, the change of other entities may also will affect oneself;
3-6) content of pages length: an article is longer, and content is abundanter, easier to be updated;
3-7) clip Text length: abstract contains the description most succinct to entity, makes a summary longer, information content is bigger;
3-8) average all renewal frequency: past updates more frequent, then future may also update it is frequent.
4. knowledge mapping dynamic updating method according to claim 1, which is characterized in that according to as follows in the step 5) Specific steps are updated knowledge mapping:
Firstly, more content of the new seed entity in knowledge mapping, then updates extension entity in knowledge graph according to priority Content in spectrum: it for the entity being originally not present in knowledge mapping, is inserted into knowledge mapping with highest priority;For that Entity existing for a little scripts, provides forecast updating frequency first with fallout predictor, then preferentially updates to upper subsynchronous expectation later The maximum extension entity of frequency is updated, until all extension entities have all been updated completion or extension entity update quantity The extension entity for reaching setting updates the upper limit of the number value K, and wherein K is determined according to network bandwidth and computing resource.
5. knowledge mapping dynamic updating method according to claim 4, which is characterized in that when the extension entity updates The priority priority value that is calculate by the following formula determine that priority value is bigger, then priority is higher:
E [u (x)]=P (x) × (tnow-ts(x))
Wherein, E [u (x)] be to the priority value that is updated of extension entity, u (x) be it is upper it is subsynchronous after renewal time, P It (x) is the renewal frequency predicted by fallout predictor, tnowFor current time, tsIt (x) is the time updated entity x last time, when x is When novel entities, ts(x)=- ∞.
CN201810627957.7A 2018-06-15 2018-06-15 A kind of knowledge mapping dynamic updating method Pending CN109033160A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810627957.7A CN109033160A (en) 2018-06-15 2018-06-15 A kind of knowledge mapping dynamic updating method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201810627957.7A CN109033160A (en) 2018-06-15 2018-06-15 A kind of knowledge mapping dynamic updating method

Publications (1)

Publication Number Publication Date
CN109033160A true CN109033160A (en) 2018-12-18

Family

ID=64609573

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810627957.7A Pending CN109033160A (en) 2018-06-15 2018-06-15 A kind of knowledge mapping dynamic updating method

Country Status (1)

Country Link
CN (1) CN109033160A (en)

Cited By (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109800288A (en) * 2019-01-22 2019-05-24 杭州师范大学 A kind of the scientific research analysis of central issue and prediction technique of knowledge based map
CN110008353A (en) * 2019-04-09 2019-07-12 福建奇点时空数字科技有限公司 A kind of construction method of dynamic knowledge map
CN110162640A (en) * 2019-04-28 2019-08-23 北京百度网讯科技有限公司 Novel entities method for digging, device, computer equipment and storage medium
CN110598002A (en) * 2019-08-14 2019-12-20 广州视源电子科技股份有限公司 Knowledge graph library construction method and device, computer storage medium and electronic equipment
CN110851612A (en) * 2019-08-29 2020-02-28 国家计算机网络与信息安全管理中心 Encyclopedic knowledge-based mobile application knowledge graph composite completion method and device
CN111488467A (en) * 2020-04-30 2020-08-04 北京建筑大学 Construction method and device of geographical knowledge graph, storage medium and computer equipment
CN111626821A (en) * 2020-05-26 2020-09-04 山东大学 Product recommendation method and system for realizing customer classification based on integrated feature selection
CN111859238A (en) * 2020-07-27 2020-10-30 平安科技(深圳)有限公司 Method and device for predicting data change frequency based on model and computer equipment
CN112035682A (en) * 2020-09-28 2020-12-04 上海松鼠课堂人工智能科技有限公司 Character model analysis processing method and system based on knowledge graph
CN113065353A (en) * 2021-03-16 2021-07-02 北京金堤征信服务有限公司 Entity identification method and device
CN113239208A (en) * 2021-05-06 2021-08-10 广东博维创远科技有限公司 Mark training model based on knowledge graph
CN113553436A (en) * 2020-04-23 2021-10-26 广东博智林机器人有限公司 Knowledge graph updating method and device, electronic equipment and storage medium
CN113609309A (en) * 2021-08-16 2021-11-05 脸萌有限公司 Knowledge graph construction method and device, storage medium and electronic equipment
CN115408534A (en) * 2022-08-23 2022-11-29 连连银通电子支付有限公司 Knowledge graph updating method, device, equipment and storage medium
CN116502711A (en) * 2023-06-27 2023-07-28 北京智谱华章科技有限公司 Knowledge graph construction and dynamic expansion method, device, equipment and medium

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106503035A (en) * 2016-09-14 2017-03-15 海信集团有限公司 A kind of data processing method of knowledge mapping and device
CN107346346A (en) * 2017-08-26 2017-11-14 海南大学 Learner competencies modeling and learning process Optimal Management System based on data collection of illustrative plates, Information Atlas and knowledge mapping

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106503035A (en) * 2016-09-14 2017-03-15 海信集团有限公司 A kind of data processing method of knowledge mapping and device
CN107346346A (en) * 2017-08-26 2017-11-14 海南大学 Learner competencies modeling and learning process Optimal Management System based on data collection of illustrative plates, Information Atlas and knowledge mapping

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
JIAQING LIANG 等: "How to Keep a Knowledge Base Synchronized with Its Encyclopedia Source", 《PROCEEDINGS OF THE TWENTY-SIXTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE (IJCAI-17)》 *

Cited By (24)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109800288A (en) * 2019-01-22 2019-05-24 杭州师范大学 A kind of the scientific research analysis of central issue and prediction technique of knowledge based map
CN110008353A (en) * 2019-04-09 2019-07-12 福建奇点时空数字科技有限公司 A kind of construction method of dynamic knowledge map
CN110162640A (en) * 2019-04-28 2019-08-23 北京百度网讯科技有限公司 Novel entities method for digging, device, computer equipment and storage medium
CN110598002A (en) * 2019-08-14 2019-12-20 广州视源电子科技股份有限公司 Knowledge graph library construction method and device, computer storage medium and electronic equipment
CN110851612A (en) * 2019-08-29 2020-02-28 国家计算机网络与信息安全管理中心 Encyclopedic knowledge-based mobile application knowledge graph composite completion method and device
CN110851612B (en) * 2019-08-29 2023-08-18 国家计算机网络与信息安全管理中心 Mobile application knowledge graph composite completion method and device based on encyclopedia knowledge
CN113553436A (en) * 2020-04-23 2021-10-26 广东博智林机器人有限公司 Knowledge graph updating method and device, electronic equipment and storage medium
CN111488467A (en) * 2020-04-30 2020-08-04 北京建筑大学 Construction method and device of geographical knowledge graph, storage medium and computer equipment
CN111488467B (en) * 2020-04-30 2022-04-05 北京建筑大学 Construction method and device of geographical knowledge graph, storage medium and computer equipment
CN111626821A (en) * 2020-05-26 2020-09-04 山东大学 Product recommendation method and system for realizing customer classification based on integrated feature selection
CN111626821B (en) * 2020-05-26 2024-03-12 山东大学 Product recommendation method and system for realizing customer classification based on integrated feature selection
CN111859238A (en) * 2020-07-27 2020-10-30 平安科技(深圳)有限公司 Method and device for predicting data change frequency based on model and computer equipment
CN111859238B (en) * 2020-07-27 2024-07-16 平安科技(深圳)有限公司 Model-based method, device and computer equipment for predicting data change frequency
CN112035682A (en) * 2020-09-28 2020-12-04 上海松鼠课堂人工智能科技有限公司 Character model analysis processing method and system based on knowledge graph
CN113065353A (en) * 2021-03-16 2021-07-02 北京金堤征信服务有限公司 Entity identification method and device
CN113065353B (en) * 2021-03-16 2024-04-02 北京金堤征信服务有限公司 Entity identification method and device
CN113239208A (en) * 2021-05-06 2021-08-10 广东博维创远科技有限公司 Mark training model based on knowledge graph
CN113609309A (en) * 2021-08-16 2021-11-05 脸萌有限公司 Knowledge graph construction method and device, storage medium and electronic equipment
CN113609309B (en) * 2021-08-16 2024-02-06 脸萌有限公司 Knowledge graph construction method and device, storage medium and electronic equipment
WO2023022655A3 (en) * 2021-08-16 2023-04-13 脸萌有限公司 Knowledge map construction method and apparatus, storage medium, and electronic device
CN115408534B (en) * 2022-08-23 2023-12-12 连连银通电子支付有限公司 Knowledge graph updating method, device, equipment and storage medium
CN115408534A (en) * 2022-08-23 2022-11-29 连连银通电子支付有限公司 Knowledge graph updating method, device, equipment and storage medium
CN116502711B (en) * 2023-06-27 2023-09-26 北京智谱华章科技有限公司 Knowledge graph construction and dynamic expansion method, device, equipment and medium
CN116502711A (en) * 2023-06-27 2023-07-28 北京智谱华章科技有限公司 Knowledge graph construction and dynamic expansion method, device, equipment and medium

Similar Documents

Publication Publication Date Title
CN109033160A (en) A kind of knowledge mapping dynamic updating method
CN103176985B (en) The most efficient a kind of internet information crawling method
CN106777274B (en) A kind of Chinese tour field knowledge mapping construction method and system
US7424484B2 (en) Path-based ranking of unvisited web pages
CN102902806B (en) A kind of method and system utilizing search engine to carry out query expansion
CN102567464B (en) Based on the knowledge resource method for organizing of expansion thematic map
CN110704624B (en) Geographic information service metadata text multi-level multi-label classification method
CN107169001A (en) A kind of textual classification model optimization method based on mass-rent feedback and Active Learning
CN104866557B (en) A kind of personalized instant learning theoretical based on constructive learning supports System and method for
CN107577759A (en) User comment auto recommending method
CN103955842B (en) A kind of online advertisement commending system and method towards mass media data
CN106156372B (en) A kind of classification method and device of internet site
CN101350011A (en) Method for detecting search engine cheat based on small sample set
CN105718579A (en) Information push method based on internet-surfing log mining and user activity recognition
CN103744956B (en) A kind of diversified expanding method of key word
CN106204156A (en) A kind of advertisement placement method for network forum and device
CN107885793A (en) A kind of hot microblog topic analyzing and predicting method and system
CN102708096A (en) Network intelligence public sentiment monitoring system based on semantics and work method thereof
CN106066873A (en) A kind of travel information based on body recommends method
CN106227788A (en) Database query method based on Lucene
CN103049542A (en) Domain-oriented network information search method
CN104182412A (en) Webpage crawling method and webpage crawling system
CN108710672B (en) Theme crawler method based on incremental Bayesian algorithm
CN107833142A (en) Academic social networks scientific research cooperative person recommends method
CN104239496A (en) Collaborative filtering method based on integration of fuzzy weight similarity measurement and clustering

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20181218