CN109815340A - A kind of construction method of national culture information resources knowledge mapping - Google Patents

A kind of construction method of national culture information resources knowledge mapping Download PDF

Info

Publication number
CN109815340A
CN109815340A CN201910042744.2A CN201910042744A CN109815340A CN 109815340 A CN109815340 A CN 109815340A CN 201910042744 A CN201910042744 A CN 201910042744A CN 109815340 A CN109815340 A CN 109815340A
Authority
CN
China
Prior art keywords
attribute
participle
word segmentation
knowledge mapping
information resources
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201910042744.2A
Other languages
Chinese (zh)
Inventor
甘健侯
王俊
周菊香
文斌
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Yunnan University YNU
Yunnan Normal University
Original Assignee
Yunnan Normal University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Yunnan Normal University filed Critical Yunnan Normal University
Priority to CN201910042744.2A priority Critical patent/CN109815340A/en
Publication of CN109815340A publication Critical patent/CN109815340A/en
Pending legal-status Critical Current

Links

Landscapes

  • Machine Translation (AREA)

Abstract

The present invention relates to a kind of construction methods of national culture information resources knowledge mapping, belong to knowledge mapping technical field.First with Chinese word segmentation system and the customized dictionary of user is segmented to the entry data in the ethnic group's voluminous dictionary data being collected into and part-of-speech tagging, then the entry data after participle and part-of-speech tagging are detected, if continuative participle is threshold value of the quantity not less than setting of individual character, then carry out artificial participle operation, and the customized dictionary of user that artificial word segmentation result is added to Chinese word segmentation system, until no neologisms, then attributes extraction is carried out to the entry data after correct participle, to construct domain knowledge map, repeated detection is carried out to domain knowledge map again, deleting duplicated data, by the domain knowledge map and resource link after storage, finally stored.

Description

A kind of construction method of national culture information resources knowledge mapping
Technical field
The present invention relates to a kind of construction methods of national culture information resources knowledge mapping, belong to knowledge mapping technology neck Domain.
Background technique
National culture is a national spiritual wealth, and national culture is preserved, and can not only descendant be allowed to pass Outstanding culture is held, national culture can also be made to leave colourful one in history.Traditional national culture passes through number Word technology is converted into digitized coding form, then stores, transmits, duplication, reproducing and even create, being allowed to be rendered as " living Culture ", this will become protection national cultural heritage inexorable trend and ethnic mountainous regions digital development innovation think Road.
Knowledge mapping technology causes the extensive concern of scholars in recent years, can be by internet by knowledge mapping Information representation provides a kind of preferably tissue, management and use information at the form closer to the human cognitive world Mode.
In order to push national culture information resources gradually to develop, need to construct national culture information resources knowledge mapping, but The method to the building of English knowledge mapping external at present can not be completely suitable for the building of Chinese knowledge mapping, national culture information The knowledge mapping building of resource is also less, and there is an urgent need to national culture information resources knowledge mapping construction methods.
Summary of the invention
The technical problem to be solved in the present invention is to provide a kind of construction methods of national culture information resources knowledge mapping, use To solve the above problems.
The technical scheme is that a kind of construction method of national culture information resources knowledge mapping, first with the Chinese Language Words partition system and the customized dictionary of user to the entry data in the ethnic group's voluminous dictionary data being collected into carry out participle and Then part-of-speech tagging detects the entry data after participle and part-of-speech tagging, if continuative participle be the quantity of individual character not Less than the threshold value of setting, then artificial participle operation is carried out, and artificial word segmentation result is added to the user of Chinese word segmentation system certainly Dictionary is defined, until no neologisms, attributes extraction then is carried out to the entry data after correct participle, is known to construct field Know map, repeated detection carried out to domain knowledge map again, deleting duplicated data, by after storage domain knowledge map with Resource link is finally stored.
Specific steps are as follows:
Step 1: collect ethnic group's entry data, construct ethnic group's entry database, using Chinese word segmentation system and The customized dictionary of user is segmented to the entry data in the ethnic group's entry database being collected into and part-of-speech tagging, and goes Except punctuation mark;
Step 2: and then the data after participle and part-of-speech tagging are detected, if continuative participle be the quantity of individual character not Less than the threshold value of setting, then artificial participle operation is carried out, and artificial word segmentation result is added to the user of Chinese word segmentation system certainly Dictionary is defined, step 1 is repeated, until no neologisms;
Step 3: attributes extraction being carried out to the data after correct participle, to construct domain knowledge map;
Step 4: repeated detection being carried out to domain knowledge map, deleting duplicated data is stored;
Step 5: by the domain knowledge map and resource link after storage.
Words partition system in the step 1 and step 2 is NLPIR Chinese word segmentation system
The text data after participle and part-of-speech tagging is detected in the step 2 method particularly includes:
1. defining word segmentation result set S (S1,S2,……,Sm);
2. to word segmentation result S each in set SiNumber of words is counted, set number of words result C (C is obtained1,C2,……,Cm), Middle Ci=len (Si), and 1≤i≤m;
3. given threshold k meets 2≤k≤m;
4. selected subset closes P from S, P meets formula (1) and formula (2)
J-i+1≤k < m (2)
Illustrate the S in SiTo SjPosition have continuous k number of words be 1 participle, pass through setting k value, it is believed that continuous number of words Participle for 1 is neologisms x, x={ a Si,Si+1…Si+k},Si∈S;
4. defining new set of words W is W=(x1,x2…xn), and user is added to if it is neologisms to W row manual examination and verification In customized dictionary.
The mode that the threshold value k is set is from big to small, k=m when being arranged for the first time successively successively decreases, until k=1, often Step 2 is repeated after subthreshold setting, until all neologisms are added in the customized dictionary of user.
The purpose that operation is manually segmented in the step 2 is that the word combination for judging that continuous k is individual character is a neologisms.
Attributes extraction is classified one by one according to word segmentation result and part-of-speech tagging in the step 3, all the elements Attributes extraction is carried out, and indicates attribute-name, forms the triple of " theme-attribute-name-attribute value ", i.e. knowledge mapping.
Repeatability detection is divided into following several types in affiliated step 4:
Class1: the same attribute of same entity has multiple attribute values, if some attribute value includes other attribute values, This eliminate by comprising attribute value;
Type 2: the same attribute of same entity has multiple attribute values, if mutual exclusion between attribute value, basis possesses this The quantity of attribute value judged, the more reservation of attribute value, and submits manual examination and verification;
Type 3: the same attribute of same entity has multiple attribute values, if mutual exclusion between attribute value, possesses the attribute value Quantity it is also identical, then completely submit manual examination and verification.
The storage of domain knowledge map is to be deposited with the mode of relational database to chart database (such as: Neo4j) in affiliated step 4 The mode of storage knowledge mapping is simulated.
Relational database structure design is as follows:
Node table (Number, nodename, node label)
Entity table (Number, affiliated node serial number, entity name)
Property Name table (Number, affiliated node serial number, affiliated entity number, Property Name)
Attribute value table (Number, affiliated node serial number, affiliated entity number, affiliated Property Name number, attribute value)
Relation table (Number, start node number, destination node number, relationship)
The beneficial effects of the present invention are: by building national culture information resources knowledge mapping, by the nationality in internet Cultural information is expressed as the form closer to the human cognitive world, convenient for national culture information is managed and is utilized.
Specific embodiment
With reference to embodiment, the invention will be further described.
A kind of construction method of national culture information resources knowledge mapping, comprising:
Step 1: collect ethnic group's entry data, construct ethnic group's entry database, using Chinese word segmentation system and The customized dictionary of user is segmented to the entry data in the ethnic group's entry database being collected into and part-of-speech tagging, and goes Except punctuation mark;
Step 2: and then the data after participle and part-of-speech tagging are detected, if continuative participle be the quantity of individual character not Less than the threshold value of setting, then artificial participle operation is carried out, and artificial word segmentation result is added to the user of Chinese word segmentation system certainly Dictionary is defined, step 1 is repeated, until no neologisms;
Step 3: attributes extraction being carried out to the data after correct participle, to construct domain knowledge map;
Step 4: repeated detection being carried out to domain knowledge map, deleting duplicated data is stored;
Step 5: by the domain knowledge map and resource link after storage.
Words partition system in the step 1 and step 2 is NLPIR Chinese word segmentation system
The text data after participle and part-of-speech tagging is detected in the step 2 method particularly includes:
1. defining word segmentation result set S (S1,S2,……,Sm);
2. to word segmentation result S each in set SiNumber of words is counted, set number of words result C (C is obtained1,C2,……,Cm), Middle Ci=len (Si), and 1≤i≤m;
3. given threshold k meets 2≤k≤m;
4. selected subset closes P from S, P meets formula (1) and formula (2)
J-i+1≤k < m (2)
Illustrate the S in SiTo SjPosition have continuous k number of words be 1 participle, pass through setting k value, it is believed that continuous number of words Participle for 1 is neologisms x, x={ a Si,Si+1…Si+k},Si∈S;
4. defining new set of words W is W=(x1,x2…xn), and user is added to if it is neologisms to W row manual examination and verification In customized dictionary.
The mode that the threshold value k is set is from big to small, k=m when being arranged for the first time successively successively decreases, until k=1, often Step 2 is repeated after subthreshold setting, until all neologisms are added in the customized dictionary of user.
Attributes extraction is classified one by one according to word segmentation result and part-of-speech tagging in the step 3, all the elements Attributes extraction is carried out, and indicates attribute-name, forms the triple of " theme-attribute-name-attribute value ", i.e. knowledge mapping.
Repeatability detection is divided into following several types in affiliated step 4:
Class1: the same attribute of same entity has multiple attribute values, if some attribute value includes other attribute values, This eliminate by comprising attribute value;
Type 2: the same attribute of same entity has multiple attribute values, if mutual exclusion between attribute value, basis possesses this The quantity of attribute value judged, the more reservation of attribute value, and submits manual examination and verification;
Type 3: the same attribute of same entity has multiple attribute values, if mutual exclusion between attribute value, possesses the attribute value Quantity it is also identical, then completely submit manual examination and verification.
The storage of domain knowledge map is the mode with relational database to chart database stored knowledge map in affiliated step 4 Mode simulated.
Embodiment 1: entry content: river right bank tributary is held in both hands in [brave handful river] south.Positioned at Lincang City Zhenkang County, the north and Nujiang phase Neighbour, it is western, western to have a common boundary with southern Burma.
1, it segments: being segmented entry using Chinese word segmentation system are as follows: " [/ brave/handful/river /]/south/handful/river/right side/bank/ Stream/./ positioned at/Lincang City/Zhenkang County/,/northern/with/Nujiang/adjacent/,/western/,/western/and/south/Burma/friendship Boundary/.", " [[punctuation mark]/brave [noun]/handful [verb]/river [noun] /] [punctuate]/southern [noun of locality]/is held in both hands after part-of-speech tagging [verb]/river [the noun]/right side [noun of locality]/bank [noun]/tributary [noun]/.[punctuation mark]/it is located at [verb]/Lincang City [noun]/Zhenkang County [noun]/,/northern [noun of locality]/with [preposition]/Nujiang [noun]/adjacent [verb]/, [punctuate symbol Number]/western [noun of locality]/, [punctuation mark]/western [noun of locality]/hand over [preposition] by/southern [noun of locality]/Burma [noun]/ Boundary's [verb]/.[punctuation mark] " obtains " brave [noun]/handful [verb]/river [noun]/southern [orientation after removing punctuation mark Word]/hold in both hands [verb]/river [noun]/right [noun of locality]/bank [noun]/tributary [noun]/be located at [verb]/Lincang City [noun]/ Zhenkang County [noun]/the north [noun of locality]/[square with [preposition]/Nujiang [noun]/adjacent [verb]/western part [noun of locality]/western part Position word]/have a common boundary [verb] with [preposition]/south [noun of locality]/Burma [noun]/".
2, detect: define word segmentation result collection be combined into S (it is brave, hold in both hands, river, south is held in both hands, and river is right, and bank, tributary is located at, Lincang City, Zhenkang County, it is northern, with, Nujiang is adjacent, and it is western, it is western, with south, Burma, boundary).It unites to word segmentation result each in set S Number of words is counted, set number of words result C (1,1,1,1,1,1,1,1,2,2,3,3,2,1,2,2,2,2,1,2,2,2) is obtained.Thus may be used Know m=22, sets k value as 22, operated according to step 2, when k value is reduced to 3,And 3-1+1 ≤ 3 < 22 meet simultaneously, it is believed that one neologisms x={ brave, to hold in both hands, river } when the participle that continuous number of words is 1, that is, find " brave/to hold in both hands/ River " is the set that continuous individual character is all 3, and all words found are defined as set W=(river is held in both hands in brave handful river, south), are carried out artificial Confirm that these are proper nouns after audit, adds in access customer custom words library.
3, result after segmenting again are as follows: " brave handfuls river/south handful the river/right side/bank/tributary/positioned at/Lincang City/Zhenkang County/the north/ With/Nujiang/adjacent/western/western/with/south/Burma/boundary " set without continuous individual character 2.
4, carry out attribute labeling using word segmentation result, such as: " brave handfuls river/south handful the river/right side/bank/tributary/positioned at/Lincang City/ Zhenkang County/the north/and/Nujiang/adjacent/western/western part/and/south/Burma/boundary " participle forms a series of three after marking Tuple:
1. brave handful river, river right bank tributary is held in both hands in --- affiliated river --- south;
2. brave handful river --- address --- Lincang City Zhenkang County, northern adjacent with Nujiang, western, western to hand over southern Burma Boundary;
3. brave handful river, --- adjacent river --- is northern adjacent with Nujiang;
4. brave handful river --- border land --- is western, western and south Burma has a common boundary;
5, duplicate attribute detection is carried out.
Class1: the same attribute of same entity has multiple attribute values, if some attribute value includes other attribute values, This eliminate by comprising attribute value.Such as:
1. brave handful river --- address --- Lincang City Zhenkang County, northern adjacent with Nujiang, western, western to hand over southern Burma Boundary;
2. brave handful river --- address --- Lincang City Zhenkang County;
Then, it eliminates 2., retains 1.;
Type 2: the same attribute of same entity has multiple attribute values, if mutual exclusion between attribute value, basis possesses this The quantity of attribute value judged, the more reservation of attribute value, and submits manual examination and verification.Such as:
1. brave handful river --- address --- Lincang City Zhenkang County, northern adjacent with Nujiang, western, western to hand over southern Burma Boundary;
2. brave handful river --- address --- Lincang City Zhenkang County;
3. the county Cang Yuan, brave handful river --- address --- Lincang City;
Then, it eliminates 3., retains 1. 2., and submit manual examination and verification, carry out supplement verification using other data;
Type 3: the same attribute of same entity has multiple attribute values, if mutual exclusion between attribute value, possesses the attribute value Quantity it is also identical, then completely submit manual examination and verification.Such as:
1. brave handful river --- address --- Lincang City Zhenkang County;
2. the county Cang Yuan, brave handful river --- address --- Lincang City;
Manual examination and verification are then submitted completely, carry out supplement verification using other data.
6, linking for knowledge mapping and resource is carried out, when due to the building of all knowledge mappings is extracted by resource , the unique resource address of each resource is formd, the attribute addition for each knowledge mapping is hyperlinked in resource, To carry out, attribute is verified and resource is checked.
7, knowledge mapping storage is carried out using relational database.Such as:
Node table: N001, river, river;N002, soil, ground
Entity table: E001, N001, brave handful river
Property Name table: P001, N001, E001, address
Attribute value table: V001, N001, E001, P001, Lincang City Zhenkang County
Relation table: R001, N001, N002 are irrigated
Embodiment 2: entry content: [card] Lahu name, i.e. " reed is long " in Was's name, are three Buddhist patriarch period Wa nationality areas Government post name.
1, it segments: being segmented entry using Chinese word segmentation system are as follows: " [/ card/a little /]/drawing/blessing/name/,/i.e./Was/name/ In// "/reed/length/"/,/for/tri-/Buddhist patriarch/period/Wa/area// government post/name/.", " [[punctuate symbol after part-of-speech tagging Number]/card [noun]/a little [quantifiers] /] [punctuation mark]/drawing [verb]/blessing [nominal morpheme]/name [nominal morpheme]/, [mark Point symbol]/i.e. [verb]/Was [distinction word]/name [quantifier]/in [noun of locality]/[auxiliary word]/" [punctuation mark]/reed [people Name]/long [nominal morpheme]/" [punctuation mark]/, [punctuation mark]/it is [preposition]/tri- [number]/Buddhist patriarch [noun]/period [noun]/Wa [other proper names]/area [noun]/[auxiliary word]/government post [noun]/name [quantifier]/.[punctuation mark].
2, detect: define word segmentation result collection be combined into S (card is drawn, blessing, name a bit, that is, Was, name, in, it is three that reed is long, Buddhist patriarch, period, Wa, area, government post, name).Number of words is counted to word segmentation result each in set S, obtains set number of words knot Fruit C (1,1,1,1,1,1,1,1,1,1,1,1,1,1,2,2,2,2,1,2,1).It can thus be appreciated that m=21, k value is set as 21, according to Step 2 is operated, when k value is reduced to 2,Meet simultaneously with the < of 2-1+1≤2 21, it is believed that continuous Number of words be 1 participle when one neologisms x={ card, a little }, that is, finding " card/a little " is the set that continuous individual character is all 2, owning The word found is defined as set W=(card, Lahu, reed are long), confirms that these are proper nouns after carrying out manual examination and verification, is added into In the customized dictionary of user.
3, result after segmenting again are as follows: " card/Lahu/name // i.e./Was/name/in// reed grow/be/tri-/Buddhist patriarch/when Phase/Wa/area// government post/name " set without continuous individual character 2.
4, carry out attribute labeling using word segmentation result, such as: " card/Lahu/name/i.e./Was/name/in// reed grow/be/ Three/Buddhist patriarch/period/Wa/area// government post/name " segment, form a series of triples after mark:
--- source is national --- Lahu name 1. card;
2. reed is long for card --- Was ---;
3. blocking --- period --- three Buddhist patriarch's period;
4. blocking --- area --- Wa nationality area
5. blocking the government post name of --- explanation --- three Buddhist patriarch period Wa nationality area;
5, duplicate attribute detection is carried out.
Class1: the same attribute of same entity has multiple attribute values, if some attribute value includes other attribute values, This eliminate by comprising attribute value.Such as:
1. blocking the government post name of --- explanation --- three Buddhist patriarch period Wa nationality area;
2. blocking --- explanation --- government post name;
Then, it eliminates 2., retains 1.;
Type 2: the same attribute of same entity has multiple attribute values, if mutual exclusion between attribute value, basis possesses this The quantity of attribute value judged, the more reservation of attribute value, and submits manual examination and verification.Such as:
1. blocking the government post name of --- explanation --- three Buddhist patriarch period Wa nationality area;
2. blocking --- explanation --- Wa nationality area government post name;
3. blocking the government post name of --- explanation --- deep blue source area;
Then, it eliminates 3., retains 1. 2., and submit manual examination and verification, carry out supplement verification using other data;
Type 3: the same attribute of same entity has multiple attribute values, if mutual exclusion between attribute value, possesses the attribute value Quantity it is also identical, then completely submit manual examination and verification.Such as:
1. blocking --- area --- Wa nationality area;
2. blocking --- area --- deep blue source area;
Manual examination and verification are then submitted completely, carry out supplement verification using other data.
6, linking for knowledge mapping and resource is carried out, when due to the building of all knowledge mappings is extracted by resource , the unique resource address of each resource is formd, the attribute addition for each knowledge mapping is hyperlinked in resource, To carry out, attribute is verified and resource is checked.
7, knowledge mapping storage is carried out using relational database.Such as:
Node table: N001, government post, river;N002, place
Entity table: E001, N001, card
Property Name table: P001, N001, E001, period
Attribute value table: V001, N001, E001, P001, three Buddhist patriarch's period
Relation table: R001, N001, N002 are subordinate to
Above, the embodiment of the present invention is explained in detail, but the present invention is not limited to above-mentioned embodiment party Formula can also be made without departing from the purpose of the present invention within the knowledge of a person skilled in the art Various change out.

Claims (7)

1. a kind of construction method of national culture information resources knowledge mapping, it is characterised in that:
Step 1: collecting ethnic group's entry data, construct ethnic group's entry database, utilize Chinese word segmentation system and user Customized dictionary carries out participle and part-of-speech tagging to the entry data in the ethnic group's entry database being collected into, and removes mark Point symbol;
Step 2: and then the data after participle and part-of-speech tagging are detected, if the quantity that continuative participle is individual character is not less than The threshold value of setting then carries out artificial participle operation, and the user that artificial word segmentation result is added to Chinese word segmentation system is customized Dictionary repeats step 1, until no neologisms;
Step 3: attributes extraction being carried out to the data after correct participle, to construct domain knowledge map;
Step 4: repeated detection being carried out to domain knowledge map, deleting duplicated data is stored;
Step 5: by the domain knowledge map and resource link after storage.
2. the construction method of national culture information resources knowledge mapping according to claim 1, it is characterised in that: the step Rapid 1 and step 2 in Words partition system be NLPIR Chinese word segmentation system.
3. the construction method of national culture information resources knowledge mapping according to claim 1, it is characterised in that: the step The text data after participle and part-of-speech tagging is detected in rapid 2 method particularly includes:
1. defining word segmentation result set S (S1,S2,……,Sm);
2. to word segmentation result S each in set SiNumber of words is counted, set number of words result C (C is obtained1,C2,……,Cm), wherein Ci= len(Si), and 1≤i≤m;
3. given threshold k meets 2≤k≤m;
4. selected subset closes P from S, P meets formula (1) and formula (2)
J-i+1≤k < m (2)
Illustrate the S in SiTo SjTo have continuous k number of words be 1 participle for position, pass through setting k value, it is believed that continuous number of words is 1 Participle be neologisms x, x={ a Si,Si+1…Si+k},Si∈S;
4. defining new set of words W is W=(x1,x2…xn), and are added to by user and is made by oneself if it is neologisms for W row manual examination and verification In adopted dictionary.
4. the construction method of national culture information resources knowledge mapping according to claim 3, it is characterised in that: the threshold The mode that value k is set is from big to small, k=m when being arranged for the first time successively successively decreases, until k=1, weight after every subthreshold setting Multiple step 2, until all neologisms are added in the customized dictionary of user.
5. the construction method of national culture information resources knowledge mapping according to claim 1, it is characterised in that: the step Attributes extraction is classified one by one according to word segmentation result and part-of-speech tagging in rapid 3, and all the elements are all carried out attributes extraction, and It indicates attribute-name, forms the triple of " theme-attribute-name-attribute value ", i.e. knowledge mapping.
6. the construction method of national culture information resources knowledge mapping according to claim 1, it is characterised in that:
Repeatability detection is divided into following several types in affiliated step 4:
Class1: the same attribute of same entity has multiple attribute values, if some attribute value includes other attribute values, this disappears Except by comprising attribute value;
Type 2: the same attribute of same entity has multiple attribute values, if mutual exclusion between attribute value, basis possesses the attribute The quantity of value judged, the more reservation of attribute value, and submits manual examination and verification;
Type 3: the same attribute of same entity has multiple attribute values, if mutual exclusion between attribute value, possesses the number of the attribute value Measure also identical, then submission manual examination and verification completely.
7. the construction method of national culture information resources knowledge mapping according to claim 1, it is characterised in that: affiliated step The storage of domain knowledge map is to carry out mould with mode of the mode of relational database to chart database stored knowledge map in rapid 4 It is quasi-.
CN201910042744.2A 2019-01-17 2019-01-17 A kind of construction method of national culture information resources knowledge mapping Pending CN109815340A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910042744.2A CN109815340A (en) 2019-01-17 2019-01-17 A kind of construction method of national culture information resources knowledge mapping

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910042744.2A CN109815340A (en) 2019-01-17 2019-01-17 A kind of construction method of national culture information resources knowledge mapping

Publications (1)

Publication Number Publication Date
CN109815340A true CN109815340A (en) 2019-05-28

Family

ID=66604348

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910042744.2A Pending CN109815340A (en) 2019-01-17 2019-01-17 A kind of construction method of national culture information resources knowledge mapping

Country Status (1)

Country Link
CN (1) CN109815340A (en)

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111143574A (en) * 2019-12-05 2020-05-12 大连民族大学 Query and visualization system construction method based on minority culture knowledge graph
CN111324691A (en) * 2020-01-06 2020-06-23 大连民族大学 Intelligent question-answering method for minority nationality field based on knowledge graph
CN111611799A (en) * 2020-05-07 2020-09-01 北京智通云联科技有限公司 Dictionary and sequence labeling model based entity attribute extraction method, system and equipment
CN111681769A (en) * 2020-08-17 2020-09-18 耀方信息技术(上海)有限公司 Medicine word segmentation searching method and system
CN111930958A (en) * 2020-07-13 2020-11-13 车智互联(北京)科技有限公司 Graph database construction method, computing device and readable storage medium
CN112417100A (en) * 2020-11-20 2021-02-26 大连民族大学 Knowledge graph in Liaodai historical culture field and construction method of intelligent question-answering system thereof
WO2021147786A1 (en) * 2020-01-20 2021-07-29 京东方科技集团股份有限公司 Knowledge graph construction method and apparatus, storage medium, and electronic device

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104133848A (en) * 2014-07-01 2014-11-05 中央民族大学 Tibetan language entity knowledge information extraction method
US9607098B2 (en) * 2014-06-02 2017-03-28 Wal-Mart Stores, Inc. Determination of product attributes and values using a product entity graph
CN107665252A (en) * 2017-09-27 2018-02-06 深圳证券信息有限公司 A kind of method and device of creation of knowledge collection of illustrative plates
CN108509420A (en) * 2018-03-29 2018-09-07 赵维平 Gu spectrum and ancient culture knowledge mapping natural language processing method
CN108694177A (en) * 2017-04-06 2018-10-23 北大方正集团有限公司 Knowledge mapping construction method and system

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9607098B2 (en) * 2014-06-02 2017-03-28 Wal-Mart Stores, Inc. Determination of product attributes and values using a product entity graph
CN104133848A (en) * 2014-07-01 2014-11-05 中央民族大学 Tibetan language entity knowledge information extraction method
CN108694177A (en) * 2017-04-06 2018-10-23 北大方正集团有限公司 Knowledge mapping construction method and system
CN107665252A (en) * 2017-09-27 2018-02-06 深圳证券信息有限公司 A kind of method and device of creation of knowledge collection of illustrative plates
CN108509420A (en) * 2018-03-29 2018-09-07 赵维平 Gu spectrum and ancient culture knowledge mapping natural language processing method

Cited By (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111143574A (en) * 2019-12-05 2020-05-12 大连民族大学 Query and visualization system construction method based on minority culture knowledge graph
CN111324691A (en) * 2020-01-06 2020-06-23 大连民族大学 Intelligent question-answering method for minority nationality field based on knowledge graph
WO2021147786A1 (en) * 2020-01-20 2021-07-29 京东方科技集团股份有限公司 Knowledge graph construction method and apparatus, storage medium, and electronic device
CN111611799A (en) * 2020-05-07 2020-09-01 北京智通云联科技有限公司 Dictionary and sequence labeling model based entity attribute extraction method, system and equipment
CN111611799B (en) * 2020-05-07 2023-06-02 北京智通云联科技有限公司 Entity attribute extraction method, system and equipment based on dictionary and sequence labeling model
CN111930958A (en) * 2020-07-13 2020-11-13 车智互联(北京)科技有限公司 Graph database construction method, computing device and readable storage medium
CN111930958B (en) * 2020-07-13 2023-12-01 车智互联(北京)科技有限公司 Graph database construction method, computing device and readable storage medium
CN111681769A (en) * 2020-08-17 2020-09-18 耀方信息技术(上海)有限公司 Medicine word segmentation searching method and system
CN112417100A (en) * 2020-11-20 2021-02-26 大连民族大学 Knowledge graph in Liaodai historical culture field and construction method of intelligent question-answering system thereof

Similar Documents

Publication Publication Date Title
CN109815340A (en) A kind of construction method of national culture information resources knowledge mapping
CN106777274B (en) A kind of Chinese tour field knowledge mapping construction method and system
CN112347222B (en) Method and system for converting non-standard address into standard address based on knowledge base reasoning
CN104679885B (en) A kind of user&#39;s search string organization names recognition method based on semantic feature model
CN110020433B (en) Industrial and commercial high-management name disambiguation method based on enterprise incidence relation
CN107368471B (en) Method for extracting place name address from webpage text
CN109885824A (en) A kind of Chinese name entity recognition method, device and the readable storage medium storing program for executing of level
CN109299469B (en) Method for identifying complex address in long text
WO2021189977A1 (en) Address coding method and apparatus, and computer device and computer-readable storage medium
CN106909611A (en) A kind of hotel&#39;s automatic matching method based on Text Information Extraction
Christen et al. A probabilistic geocoding system based on a national address file
CN109033225A (en) Chinese address identifying system
CN109800349A (en) The data processing method and device of content quantization news value are issued based on user
CN112527933A (en) Chinese address association method based on space position and text training
CN109344263A (en) A kind of address matching method
CN105117494B (en) Spatial entities mapping method in fuzzy context
CN106777118B (en) A kind of quick abstracting method of geographical vocabulary based on fuzzy dictionary tree
CN116680448A (en) Medical health institution information data standardization method based on deep neural network
CN117151659B (en) Ecological restoration engineering full life cycle tracing method based on large language model
CN114707003B (en) Method, equipment and storage medium for disambiguating names of paper authors
CN115185986A (en) Method and device for matching provincial and urban area address information, computer equipment and storage medium
CN105447104A (en) Knowledge map generating method and apparatus
AU2020101024A4 (en) Multi-language oriented general method for calculating place name semanteme similarity and use thereof
Xiang Region2vec: An Approach for Urban Land Use Detection by Fusing Multiple Features
CN110060472A (en) Road traffic accident localization method, system, readable storage medium storing program for executing and equipment

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination