CN109815340A - A construction method for a knowledge graph of ethnic culture information resources - Google Patents
A construction method for a knowledge graph of ethnic culture information resources
- Publication number
- CN109815340A (application number CN201910042744.2A)
- Authority
- CN
- China
- Prior art keywords
- attribute
- segmented word
- word segmentation
- knowledge graph
- information resources
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Landscapes
- Machine Translation (AREA)
Abstract
The invention relates to a construction method for a knowledge graph of ethnic culture information resources and belongs to the technical field of knowledge graphs. First, a Chinese word segmentation system and a user-defined dictionary are used to segment and part-of-speech tag the entry data in the collected ethnic minority dictionary data. The segmented and tagged entry data are then checked: if the number of consecutive single-character tokens is not less than a set threshold, manual segmentation is performed and the manual segmentation results are added to the user-defined dictionary of the Chinese word segmentation system, until no new words remain. Attributes are then extracted from the correctly segmented entry data to build a domain knowledge graph. The domain knowledge graph is checked for duplicates, duplicate data are deleted, and the graph is stored. Finally, the stored domain knowledge graph is linked to the resources.
Description
Technical field
The invention relates to a construction method for a knowledge graph of ethnic culture information resources and belongs to the technical field of knowledge graphs.
Background technique
Ethnic culture is the spiritual wealth of an ethnic group. Preserving it not only allows later generations to inherit an outstanding culture, but also lets that culture leave a colourful mark on history. Traditional ethnic culture can be converted by digital technology into digitally encoded form and then stored, transmitted, copied, reproduced and even re-created, so that it is presented as a "living culture". This is becoming an inevitable trend in protecting ethnic cultural heritage and an innovative approach to digital development in ethnic mountainous regions.
Knowledge graph technology has attracted wide attention from scholars in recent years. A knowledge graph can express information on the internet in a form closer to the way humans understand the world, and provides a better way to organize, manage and use information.
To advance the development of ethnic culture information resources, a knowledge graph of those resources needs to be built. However, the methods currently used abroad to build English knowledge graphs are not fully applicable to Chinese knowledge graphs, and few knowledge graphs have been built for ethnic culture information resources, so a construction method for such knowledge graphs is urgently needed.
Summary of the invention
The technical problem to be solved by the invention is to provide a construction method for a knowledge graph of ethnic culture information resources that addresses the problems described above.
The technical solution of the invention is a construction method for a knowledge graph of ethnic culture information resources. First, a Chinese word segmentation system and a user-defined dictionary are used to segment and part-of-speech tag the entry data in the collected ethnic minority dictionary data. The segmented and tagged entry data are then checked: if the number of consecutive single-character tokens is not less than the set threshold, manual segmentation is performed and the manual segmentation results are added to the user-defined dictionary of the Chinese word segmentation system, until no new words remain. Attributes are then extracted from the correctly segmented entry data to build a domain knowledge graph. The domain knowledge graph is checked for duplicates, duplicate data are deleted, and the graph is stored. Finally, the stored domain knowledge graph is linked to the resources.
Specific steps are as follows:
Step 1: Collect ethnic minority entry data and build an ethnic minority entry database. Using a Chinese word segmentation system and a user-defined dictionary, segment and part-of-speech tag the entry data in the database, and remove punctuation marks;
Step 2: Check the segmented and tagged data. If the number of consecutive single-character tokens is not less than the set threshold, perform manual segmentation, add the manual segmentation results to the user-defined dictionary of the Chinese word segmentation system, and repeat Step 1 until no new words remain;
Step 3: Extract attributes from the correctly segmented data to build the domain knowledge graph;
Step 4: Check the domain knowledge graph for duplicates, delete duplicate data, and store the graph;
Step 5: Link the stored domain knowledge graph to the resources.
The word segmentation system used in Step 1 and Step 2 is the NLPIR Chinese word segmentation system.
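The effect of the user-defined dictionary on segmentation can be sketched as follows. This is a minimal illustration only: `segment` is a hypothetical toy longest-match segmenter written for this example, not the NLPIR API, and the sample entry string and dictionary words are assumptions chosen to show how unknown words fall apart into single characters until they are added to the dictionary.

```python
from typing import List, Set


def segment(text: str, user_dict: Set[str]) -> List[str]:
    """Toy forward longest-match segmenter (a stand-in, not NLPIR).

    At each position it takes the longest dictionary word that matches;
    a character covered by no dictionary word becomes a one-character token.
    """
    max_len = max((len(word) for word in user_dict), default=1)
    tokens: List[str] = []
    i = 0
    while i < len(text):
        for size in range(min(max_len, len(text) - i), 0, -1):
            piece = text[i:i + size]
            if size == 1 or piece in user_dict:
                tokens.append(piece)
                i += size
                break
    return tokens


# Illustrative entry text: two river names, then "right bank" and "tributary".
entry = "勐捧河南捧河右岸支流"
before = segment(entry, {"支流"})                      # names split into single characters
after = segment(entry, {"支流", "勐捧河", "南捧河"})   # names kept whole
```

Comparing `before` and `after` shows why the method repeats Step 1 after each dictionary update: the same entry is segmented differently once the reviewed words are known.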
The specific method for checking the segmented and tagged text data in Step 2 is:
1. Define the segmentation result set $S = (S_1, S_2, \ldots, S_m)$;
2. Count the characters of each segmentation result $S_i$ in $S$ to obtain the character-count set $C = (C_1, C_2, \ldots, C_m)$, where $C_i = \mathrm{len}(S_i)$ and $1 \le i \le m$;
3. Set a threshold $k$ satisfying $2 \le k \le m$;
4. Select a subset $P$ from $S$ such that $P$ satisfies formula (1) and formula (2):
$j - i + 1 \le k < m$ (2)
This means that positions $S_i$ to $S_j$ of $S$ hold $k$ consecutive tokens whose character count is 1. With $k$ set, the run of consecutive single-character tokens is treated as a new word $x$, $x = \{S_i, S_{i+1}, \ldots, S_{i+k}\}$, $S_i \in S$;
5. Define the new-word set $W = (x_1, x_2, \ldots, x_n)$ and submit $W$ for manual review; each item confirmed as a new word is added to the user-defined dictionary.
The threshold $k$ is set from large to small: $k = m$ on the first pass, then decreasing step by step until $k = 1$. Step 2 is repeated after each threshold setting until all new words have been added to the user-defined dictionary.
The purpose of manual segmentation in Step 2 is to judge whether a combination of $k$ consecutive single characters is a new word.
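The detection step and the decreasing threshold schedule can be sketched as below. This is one possible reading, not the patented procedure itself: it treats a candidate as a maximal run of at least k consecutive single-character tokens (following "not less than the set threshold"), and it stops the sweep at k = 2 in line with the stated range 2 ≤ k ≤ m. The function names and sample tokens are hypothetical, and the manual review of candidates is left outside the code.

```python
from typing import Dict, List, Tuple


def find_single_char_runs(tokens: List[str], k: int) -> List[Tuple[int, int]]:
    """Return inclusive 0-based (start, end) index pairs of maximal runs
    of at least k consecutive single-character tokens."""
    counts = [len(t) for t in tokens]          # C_i = len(S_i)
    runs: List[Tuple[int, int]] = []
    start = None
    for idx, c in enumerate(counts + [0]):     # trailing 0 closes a final run
        if c == 1:
            if start is None:
                start = idx
        else:
            if start is not None and idx - start >= k:
                runs.append((start, idx - 1))
            start = None
    return runs


def sweep_thresholds(tokens: List[str]) -> Dict[int, List[Tuple[int, int]]]:
    """Try k = m, m-1, ..., 2 and record each run at the first k that detects it."""
    seen = set()
    found: Dict[int, List[Tuple[int, int]]] = {}
    for k in range(len(tokens), 1, -1):
        for run in find_single_char_runs(tokens, k):
            if run not in seen:
                seen.add(run)
                found.setdefault(k, []).append(run)
    return found


# Placeholder tokens: single letters stand for single-character tokens.
sample = ["a", "b", "c", "dd", "e", "f", "ggg"]
candidates = sweep_thresholds(sample)
```

Each detected run is only a candidate: a reviewer still decides how the run splits into words and which of them enter the user-defined dictionary.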
In Step 3, attribute extraction classifies the content item by item according to the segmentation results and part-of-speech tags. Attributes are extracted from all content and each is labelled with an attribute name, forming "subject - attribute name - attribute value" triples, i.e. the knowledge graph.
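A triple of this kind can be held in a very small data structure. The sketch below is an assumption made for illustration: the `Triple` type, the `group_values` helper and the sample values are hypothetical and are not part of the described method. Grouping values by subject and attribute name is shown because the later duplicate check compares the values of one attribute of one entity.

```python
from collections import defaultdict
from typing import Dict, List, NamedTuple, Tuple


class Triple(NamedTuple):
    subject: str      # the entry (entity) being described
    attribute: str    # the attribute name assigned during labelling
    value: str        # the attribute value extracted from the entry text


def group_values(triples: List[Triple]) -> Dict[Tuple[str, str], List[str]]:
    """Collect all values recorded for each (subject, attribute) pair."""
    grouped: Dict[Tuple[str, str], List[str]] = defaultdict(list)
    for t in triples:
        grouped[(t.subject, t.attribute)].append(t.value)
    return dict(grouped)


# Hypothetical sample triples.
triples = [
    Triple("River A", "address", "County X"),
    Triple("River A", "address", "County Y"),
    Triple("River A", "parent river", "River B"),
]
by_attribute = group_values(triples)
```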
In Step 4, duplicate detection covers the following types:
Type 1: the same attribute of the same entity has several attribute values; if one attribute value contains the others, the contained attribute values are eliminated;
Type 2: the same attribute of the same entity has several attribute values; if the attribute values are mutually exclusive, the decision is based on how many records hold each attribute value: the value held by more records is kept, and the case is submitted for manual review;
Type 3: the same attribute of the same entity has several attribute values; if the attribute values are mutually exclusive and are held by the same number of records, the case is submitted entirely for manual review.
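The three types above can be sketched as a single resolution function. This is an interpretation under stated assumptions rather than the method's definitive logic: "contains" is taken to mean plain substring containment, "mutually exclusive" to mean that neither value contains the other, and the support of a value to be the number of recorded values it contains (itself included). The function name `resolve_values` and the sample values are hypothetical.

```python
from typing import List, Tuple


def resolve_values(values: List[str]) -> Tuple[List[str], bool]:
    """Resolve the values of one attribute of one entity.

    Returns (values_to_keep, needs_manual_review).
    """
    distinct = list(dict.fromkeys(values))     # drop exact repeats, keep order
    # Type 1: a value contained in another value is eliminated.
    maximal = [v for v in distinct
               if not any(v != w and v in w for w in distinct)]
    if len(maximal) <= 1:
        return maximal, False
    # Remaining values exclude one another: count the support of each.
    support = {v: sum(1 for w in values if w in v) for v in maximal}
    best = max(support.values())
    winners = [v for v in maximal if support[v] == best]
    # Type 2: one value has more support, keep it and flag for review.
    # Type 3: support is tied, keep the tied values and flag for review.
    return winners, True


type1 = resolve_values(["A County, next to river R", "A County"])
type2 = resolve_values(["A County, next to river R", "A County", "B County"])
type3 = resolve_values(["A County", "B County"])
```

In this sketch the containment rule runs before counting, so a contained value only adds support to the value that contains it. Both Type 2 and Type 3 return a review flag, because the text sends both cases to manual review.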
In Step 4, the domain knowledge graph is stored in a relational database that simulates the way a graph database (e.g. Neo4j) stores a knowledge graph.
The relational database structure is designed as follows:
Node table (number, node name, node label)
Entity table (number, owning node number, entity name)
Attribute name table (number, owning node number, owning entity number, attribute name)
Attribute value table (number, owning node number, owning entity number, owning attribute name number, attribute value)
Relation table (number, start node number, target node number, relation)
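The five tables can be sketched with Python's built-in `sqlite3` module. The English table and column names, the choice of SQLite, and the sample rows are assumptions made for this illustration; the text only fixes which fields each table holds.

```python
import sqlite3

SCHEMA = """
CREATE TABLE node (id TEXT PRIMARY KEY, name TEXT, label TEXT);
CREATE TABLE entity (id TEXT PRIMARY KEY,
    node_id TEXT REFERENCES node(id), name TEXT);
CREATE TABLE property_name (id TEXT PRIMARY KEY,
    node_id TEXT REFERENCES node(id),
    entity_id TEXT REFERENCES entity(id), name TEXT);
CREATE TABLE property_value (id TEXT PRIMARY KEY,
    node_id TEXT REFERENCES node(id),
    entity_id TEXT REFERENCES entity(id),
    property_id TEXT REFERENCES property_name(id), value TEXT);
CREATE TABLE relation (id TEXT PRIMARY KEY,
    start_node_id TEXT REFERENCES node(id),
    end_node_id TEXT REFERENCES node(id), relation TEXT);
"""


def build_demo_db() -> sqlite3.Connection:
    """Create an in-memory database and load a few illustrative rows."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(SCHEMA)
    conn.execute("INSERT INTO node VALUES ('N001', 'river', 'river')")
    conn.execute("INSERT INTO node VALUES ('N002', 'land', 'land')")
    conn.execute("INSERT INTO entity VALUES ('E001', 'N001', 'Mengpeng River')")
    conn.execute(
        "INSERT INTO property_name VALUES ('P001', 'N001', 'E001', 'address')")
    conn.execute(
        "INSERT INTO property_value VALUES "
        "('V001', 'N001', 'E001', 'P001', 'Zhenkang County, Lincang City')")
    conn.execute(
        "INSERT INTO relation VALUES ('R001', 'N001', 'N002', 'irrigates')")
    return conn


def read_triples(conn: sqlite3.Connection):
    """Rebuild (entity, attribute name, attribute value) triples by joining."""
    return conn.execute(
        """
        SELECT e.name, p.name, v.value
        FROM property_value AS v
        JOIN property_name AS p ON p.id = v.property_id
        JOIN entity AS e ON e.id = v.entity_id
        ORDER BY v.id
        """
    ).fetchall()


demo = build_demo_db()
rows = read_triples(demo)
```

The node and relation tables play the role of graph nodes and edges, while the entity, attribute name and attribute value tables hold what a graph database would keep as node properties.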
The beneficial effects of the invention are: by building a knowledge graph of ethnic culture information resources, ethnic culture information on the internet is expressed in a form closer to the way humans understand the world, which makes that information easier to manage and use.
Specific embodiment
The invention is further described below with reference to specific embodiments.
A construction method for a knowledge graph of ethnic culture information resources, comprising:
Step 1: Collect ethnic minority entry data and build an ethnic minority entry database. Using a Chinese word segmentation system and a user-defined dictionary, segment and part-of-speech tag the entry data in the database, and remove punctuation marks;
Step 2: Check the segmented and tagged data. If the number of consecutive single-character tokens is not less than the set threshold, perform manual segmentation, add the manual segmentation results to the user-defined dictionary of the Chinese word segmentation system, and repeat Step 1 until no new words remain;
Step 3: Extract attributes from the correctly segmented data to build the domain knowledge graph;
Step 4: Check the domain knowledge graph for duplicates, delete duplicate data, and store the graph;
Step 5: Link the stored domain knowledge graph to the resources.
The word segmentation system used in Step 1 and Step 2 is the NLPIR Chinese word segmentation system.
The specific method for checking the segmented and tagged text data in Step 2 is:
1. Define the segmentation result set $S = (S_1, S_2, \ldots, S_m)$;
2. Count the characters of each segmentation result $S_i$ in $S$ to obtain the character-count set $C = (C_1, C_2, \ldots, C_m)$, where $C_i = \mathrm{len}(S_i)$ and $1 \le i \le m$;
3. Set a threshold $k$ satisfying $2 \le k \le m$;
4. Select a subset $P$ from $S$ such that $P$ satisfies formula (1) and formula (2):
$j - i + 1 \le k < m$ (2)
This means that positions $S_i$ to $S_j$ of $S$ hold $k$ consecutive tokens whose character count is 1. With $k$ set, the run of consecutive single-character tokens is treated as a new word $x$, $x = \{S_i, S_{i+1}, \ldots, S_{i+k}\}$, $S_i \in S$;
5. Define the new-word set $W = (x_1, x_2, \ldots, x_n)$ and submit $W$ for manual review; each item confirmed as a new word is added to the user-defined dictionary.
The threshold $k$ is set from large to small: $k = m$ on the first pass, then decreasing step by step until $k = 1$. Step 2 is repeated after each threshold setting until all new words have been added to the user-defined dictionary.
In Step 3, attribute extraction classifies the content item by item according to the segmentation results and part-of-speech tags. Attributes are extracted from all content and each is labelled with an attribute name, forming "subject - attribute name - attribute value" triples, i.e. the knowledge graph.
In Step 4, duplicate detection covers the following types:
Type 1: the same attribute of the same entity has several attribute values; if one attribute value contains the others, the contained attribute values are eliminated;
Type 2: the same attribute of the same entity has several attribute values; if the attribute values are mutually exclusive, the decision is based on how many records hold each attribute value: the value held by more records is kept, and the case is submitted for manual review;
Type 3: the same attribute of the same entity has several attribute values; if the attribute values are mutually exclusive and are held by the same number of records, the case is submitted entirely for manual review.
In Step 4, the domain knowledge graph is stored in a relational database that simulates the way a graph database stores a knowledge graph.
Embodiment 1: Entry content: [Mengpeng River (勐捧河)] A right-bank tributary of the Nanpeng River (南捧河). Located in Zhenkang County, Lincang City; adjacent to the Nujiang River in the north; in the west, bordering southern Myanmar.
1. Segmentation: the Chinese word segmentation system segments the entry as "[/Meng/peng/he/]/Nan/peng/he/right/bank/tributary/./located in/Lincang City/Zhenkang County/,/north/with/Nujiang/adjacent/,/west/,/west/with/south/Myanmar/border/.", where each slash-separated item is one Chinese token. After part-of-speech tagging this becomes "[ [punctuation]/Meng [noun]/peng [verb]/he [noun]/] [punctuation]/Nan [locality noun]/peng [verb]/he [noun]/right [locality noun]/bank [noun]/tributary [noun]/. [punctuation]/located in [verb]/Lincang City [noun]/Zhenkang County [noun]/, [punctuation]/north [locality noun]/with [preposition]/Nujiang [noun]/adjacent [verb]/, [punctuation]/west [locality noun]/, [punctuation]/west [locality noun]/with [preposition]/south [locality noun]/Myanmar [noun]/border [verb]/. [punctuation]". Removing the punctuation marks gives "Meng [noun]/peng [verb]/he [noun]/Nan [locality noun]/peng [verb]/he [noun]/right [locality noun]/bank [noun]/tributary [noun]/located in [verb]/Lincang City [noun]/Zhenkang County [noun]/north [locality noun]/with [preposition]/Nujiang [noun]/adjacent [verb]/west [locality noun]/west [locality noun]/with [preposition]/south [locality noun]/Myanmar [noun]/border [verb]".
2. Detection: define the segmentation result set S = (Meng, peng, he, Nan, peng, he, right, bank, tributary, located in, Lincang City, Zhenkang County, north, with, Nujiang, adjacent, west, west, with, south, Myanmar, border). Counting the Chinese characters of each segmentation result in S gives the character-count set C = (1,1,1,1,1,1,1,1,2,2,3,3,2,1,2,2,2,2,1,2,2,2). Hence m = 22. Set k to 22 and proceed according to Step 2. When k has been reduced to 3, formula (1) and $3 - 1 + 1 \le 3 < 22$ are satisfied simultaneously, so the run of consecutive single-character tokens is treated as a new word x = {Meng, peng, he}; that is, "Meng/peng/he" is found to be a set of 3 consecutive single characters. All words found are defined as the set W = (Mengpeng River, Nanpeng River). Manual review confirms that these are proper nouns, and they are added to the user-defined dictionary.
3. The result after re-segmentation is "Mengpeng River/Nanpeng River/right/bank/tributary/located in/Lincang City/Zhenkang County/north/with/Nujiang/adjacent/west/west/with/south/Myanmar/border", and no further set of 2 consecutive single characters is found.
4. Attribute labelling is carried out on the segmentation result. For example, after segmentation and labelling, "Mengpeng River/Nanpeng River/right/bank/tributary/located in/Lincang City/Zhenkang County/north/with/Nujiang/adjacent/west/west/with/south/Myanmar/border" yields a series of triples:
(1) Mengpeng River --- parent river --- right-bank tributary of the Nanpeng River;
(2) Mengpeng River --- address --- Zhenkang County, Lincang City; adjacent to the Nujiang River in the north; in the west, bordering southern Myanmar;
(3) Mengpeng River --- adjacent river --- adjacent to the Nujiang River in the north;
(4) Mengpeng River --- border area --- in the west, bordering southern Myanmar;
5. Duplicate attribute detection is carried out.
Type 1: the same attribute of the same entity has several attribute values; if one attribute value contains the others, the contained attribute values are eliminated. For example:
(1) Mengpeng River --- address --- Zhenkang County, Lincang City; adjacent to the Nujiang River in the north; in the west, bordering southern Myanmar;
(2) Mengpeng River --- address --- Zhenkang County, Lincang City;
Then (2) is eliminated and (1) is kept;
Type 2: the same attribute of the same entity has several attribute values; if the attribute values are mutually exclusive, the decision is based on how many records hold each attribute value: the value held by more records is kept, and the case is submitted for manual review. For example:
(1) Mengpeng River --- address --- Zhenkang County, Lincang City; adjacent to the Nujiang River in the north; in the west, bordering southern Myanmar;
(2) Mengpeng River --- address --- Zhenkang County, Lincang City;
(3) Mengpeng River --- address --- Cangyuan County, Lincang City;
Then (3) is eliminated and (1) and (2) are kept; the case is submitted for manual review, and other data are used for supplementary verification;
Type 3: the same attribute of the same entity has several attribute values; if the attribute values are mutually exclusive and are held by the same number of records, the case is submitted entirely for manual review. For example:
(1) Mengpeng River --- address --- Zhenkang County, Lincang City;
(2) Mengpeng River --- address --- Cangyuan County, Lincang City;
The case is then submitted entirely for manual review, and other data are used for supplementary verification.
6. The knowledge graph is linked to the resources. Because every knowledge graph is extracted from a resource when it is built, and each resource has a unique resource address, a hyperlink to the resource is added to each attribute of the knowledge graph so that attributes can be verified and resources viewed.
7. The knowledge graph is stored in a relational database. For example:
Node table: N001, river, river; N002, land, land
Entity table: E001, N001, Mengpeng River
Attribute name table: P001, N001, E001, address
Attribute value table: V001, N001, E001, P001, Zhenkang County, Lincang City
Relation table: R001, N001, N002, irrigates
Embodiment 2: Entry content: [Kaxie (卡些)] A Lahu name, i.e. "reed long" among the Wa names; the name of an official post in the Wa area during the Three Buddhist Patriarch period.
1. Segmentation: the Chinese word segmentation system segments the entry as "[/Ka/xie/]/La/hu/name/,/i.e./Wa/name/in/of/"/reed/long/"/,/is/three/Buddhist patriarch/period/Wa ethnic group/area/of/official post/name/.", where each slash-separated item is one Chinese token. After part-of-speech tagging this becomes "[ [punctuation]/Ka [noun]/xie [quantifier]/] [punctuation]/La [verb]/hu [nominal morpheme]/name [nominal morpheme]/, [punctuation]/i.e. [verb]/Wa [distinguishing word]/name [quantifier]/in [locality noun]/of [auxiliary]/" [punctuation]/reed [person name]/long [nominal morpheme]/" [punctuation]/, [punctuation]/is [preposition]/three [numeral]/Buddhist patriarch [noun]/period [noun]/Wa ethnic group [other proper noun]/area [noun]/of [auxiliary]/official post [noun]/name [quantifier]/. [punctuation]".
2. Detection: define the segmentation result set S = (Ka, xie, La, hu, name, i.e., Wa, name, in, of, reed, long, is, three, Buddhist patriarch, period, Wa ethnic group, area, of, official post, name). Counting the Chinese characters of each segmentation result in S gives the character-count set C = (1,1,1,1,1,1,1,1,1,1,1,1,1,1,2,2,2,2,1,2,1). Hence m = 21. Set k to 21 and proceed according to Step 2. When k has been reduced to 2, formula (1) and $2 - 1 + 1 \le 2 < 21$ are satisfied simultaneously, so the run of consecutive single-character tokens is treated as a new word x = {Ka, xie}; that is, "Ka/xie" is found to be a set of 2 consecutive single characters. All words found are defined as the set W = (Kaxie, Lahu, reed long). Manual review confirms that these are proper nouns, and they are added to the user-defined dictionary.
3. The result after re-segmentation is "Kaxie/Lahu/name/i.e./Wa/name/in/of/reed long/is/three/Buddhist patriarch/period/Wa ethnic group/area/of/official post/name", and no further set of 2 consecutive single characters is found.
4. Attribute labelling is carried out on the segmentation result. For example, after segmentation and labelling, "Kaxie/Lahu/name/i.e./Wa/name/in/of/reed long/is/three/Buddhist patriarch/period/Wa ethnic group/area/of/official post/name" yields a series of triples:
(1) Kaxie --- source ethnic group --- Lahu name;
(2) Kaxie --- Wa name --- reed long;
(3) Kaxie --- period --- Three Buddhist Patriarch period;
(4) Kaxie --- area --- Wa area;
(5) Kaxie --- explanation --- name of an official post in the Wa area during the Three Buddhist Patriarch period;
5. Duplicate attribute detection is carried out.
Type 1: the same attribute of the same entity has several attribute values; if one attribute value contains the others, the contained attribute values are eliminated. For example:
(1) Kaxie --- explanation --- name of an official post in the Wa area during the Three Buddhist Patriarch period;
(2) Kaxie --- explanation --- name of an official post;
Then (2) is eliminated and (1) is kept;
Type 2: the same attribute of the same entity has several attribute values; if the attribute values are mutually exclusive, the decision is based on how many records hold each attribute value: the value held by more records is kept, and the case is submitted for manual review. For example:
(1) Kaxie --- explanation --- name of an official post in the Wa area during the Three Buddhist Patriarch period;
(2) Kaxie --- explanation --- name of an official post in the Wa area;
(3) Kaxie --- explanation --- name of an official post in the Cangyuan area;
Then (3) is eliminated and (1) and (2) are kept; the case is submitted for manual review, and other data are used for supplementary verification;
Type 3: the same attribute of the same entity has several attribute values; if the attribute values are mutually exclusive and are held by the same number of records, the case is submitted entirely for manual review. For example:
(1) Kaxie --- area --- Wa area;
(2) Kaxie --- area --- Cangyuan area;
The case is then submitted entirely for manual review, and other data are used for supplementary verification.
6. The knowledge graph is linked to the resources. Because every knowledge graph is extracted from a resource when it is built, and each resource has a unique resource address, a hyperlink to the resource is added to each attribute of the knowledge graph so that attributes can be verified and resources viewed.
7. The knowledge graph is stored in a relational database. For example:
Node table: N001, official post, river; N002, place
Entity table: E001, N001, Kaxie
Attribute name table: P001, N001, E001, period
Attribute value table: V001, N001, E001, P001, Three Buddhist Patriarch period
Relation table: R001, N001, N002, belongs to
The embodiments of the invention have been described in detail above, but the invention is not limited to these embodiments. Within the knowledge of a person of ordinary skill in the art, various changes may also be made without departing from the purpose of the invention.
Claims (7)
1. A construction method for a knowledge graph of ethnic culture information resources, characterized in that:
Step 1: Collect ethnic minority entry data and build an ethnic minority entry database. Using a Chinese word segmentation system and a user-defined dictionary, segment and part-of-speech tag the entry data in the database, and remove punctuation marks;
Step 2: Check the segmented and tagged data. If the number of consecutive single-character tokens is not less than the set threshold, perform manual segmentation, add the manual segmentation results to the user-defined dictionary of the Chinese word segmentation system, and repeat Step 1 until no new words remain;
Step 3: Extract attributes from the correctly segmented data to build the domain knowledge graph;
Step 4: Check the domain knowledge graph for duplicates, delete duplicate data, and store the graph;
Step 5: Link the stored domain knowledge graph to the resources.
2. The construction method for a knowledge graph of ethnic culture information resources according to claim 1, characterized in that: the word segmentation system in Step 1 and Step 2 is the NLPIR Chinese word segmentation system.
3. The construction method for a knowledge graph of ethnic culture information resources according to claim 1, characterized in that the specific method for checking the segmented and tagged text data in Step 2 is:
1. Define the segmentation result set $S = (S_1, S_2, \ldots, S_m)$;
2. Count the characters of each segmentation result $S_i$ in $S$ to obtain the character-count set $C = (C_1, C_2, \ldots, C_m)$, where $C_i = \mathrm{len}(S_i)$ and $1 \le i \le m$;
3. Set a threshold $k$ satisfying $2 \le k \le m$;
4. Select a subset $P$ from $S$ such that $P$ satisfies formula (1) and formula (2):
$j - i + 1 \le k < m$ (2)
This means that positions $S_i$ to $S_j$ of $S$ hold $k$ consecutive tokens whose character count is 1. With $k$ set, the run of consecutive single-character tokens is treated as a new word $x$, $x = \{S_i, S_{i+1}, \ldots, S_{i+k}\}$, $S_i \in S$;
5. Define the new-word set $W = (x_1, x_2, \ldots, x_n)$ and submit $W$ for manual review; each item confirmed as a new word is added to the user-defined dictionary.
4. The construction method for a knowledge graph of ethnic culture information resources according to claim 3, characterized in that: the threshold $k$ is set from large to small, with $k = m$ on the first pass and then decreasing step by step until $k = 1$; Step 2 is repeated after each threshold setting until all new words have been added to the user-defined dictionary.
5. The construction method for a knowledge graph of ethnic culture information resources according to claim 1, characterized in that: in Step 3, attribute extraction classifies the content item by item according to the segmentation results and part-of-speech tags; attributes are extracted from all content and each is labelled with an attribute name, forming "subject - attribute name - attribute value" triples, i.e. the knowledge graph.
6. The construction method for a knowledge graph of ethnic culture information resources according to claim 1, characterized in that duplicate detection in Step 4 covers the following types:
Type 1: the same attribute of the same entity has several attribute values; if one attribute value contains the others, the contained attribute values are eliminated;
Type 2: the same attribute of the same entity has several attribute values; if the attribute values are mutually exclusive, the decision is based on how many records hold each attribute value: the value held by more records is kept, and the case is submitted for manual review;
Type 3: the same attribute of the same entity has several attribute values; if the attribute values are mutually exclusive and are held by the same number of records, the case is submitted entirely for manual review.
7. The construction method for a knowledge graph of ethnic culture information resources according to claim 1, characterized in that: in Step 4, the domain knowledge graph is stored in a relational database that simulates the way a graph database stores a knowledge graph.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910042744.2A CN109815340A (en) | 2019-01-17 | 2019-01-17 | A kind of construction method of national culture information resources knowledge mapping |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910042744.2A CN109815340A (en) | 2019-01-17 | 2019-01-17 | A kind of construction method of national culture information resources knowledge mapping |
Publications (1)
Publication Number | Publication Date |
---|---|
CN109815340A true CN109815340A (en) | 2019-05-28 |
Family
ID=66604348
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201910042744.2A Pending CN109815340A (en) | 2019-01-17 | 2019-01-17 | A kind of construction method of national culture information resources knowledge mapping |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN109815340A (en) |
Cited By (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111143574A (en) * | 2019-12-05 | 2020-05-12 | 大连民族大学 | Query and visualization system construction method based on minority culture knowledge graph |
CN111324691A (en) * | 2020-01-06 | 2020-06-23 | 大连民族大学 | Intelligent question-answering method for minority nationality field based on knowledge graph |
CN111611799A (en) * | 2020-05-07 | 2020-09-01 | 北京智通云联科技有限公司 | Dictionary and sequence labeling model based entity attribute extraction method, system and equipment |
CN111681769A (en) * | 2020-08-17 | 2020-09-18 | 耀方信息技术(上海)有限公司 | Medicine word segmentation searching method and system |
CN111930958A (en) * | 2020-07-13 | 2020-11-13 | 车智互联(北京)科技有限公司 | Graph database construction method, computing device and readable storage medium |
CN112417100A (en) * | 2020-11-20 | 2021-02-26 | 大连民族大学 | Knowledge graph in Liaodai historical culture field and construction method of intelligent question-answering system thereof |
WO2021147786A1 (en) * | 2020-01-20 | 2021-07-29 | 京东方科技集团股份有限公司 | Knowledge graph construction method and apparatus, storage medium, and electronic device |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104133848A (en) * | 2014-07-01 | 2014-11-05 | 中央民族大学 | Tibetan language entity knowledge information extraction method |
US9607098B2 (en) * | 2014-06-02 | 2017-03-28 | Wal-Mart Stores, Inc. | Determination of product attributes and values using a product entity graph |
CN107665252A (en) * | 2017-09-27 | 2018-02-06 | 深圳证券信息有限公司 | A kind of method and device of creation of knowledge collection of illustrative plates |
CN108509420A (en) * | 2018-03-29 | 2018-09-07 | 赵维平 | Gu spectrum and ancient culture knowledge mapping natural language processing method |
CN108694177A (en) * | 2017-04-06 | 2018-10-23 | 北大方正集团有限公司 | Knowledge mapping construction method and system |
-
2019
- 2019-01-17 CN CN201910042744.2A patent/CN109815340A/en active Pending
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US9607098B2 (en) * | 2014-06-02 | 2017-03-28 | Wal-Mart Stores, Inc. | Determination of product attributes and values using a product entity graph |
CN104133848A (en) * | 2014-07-01 | 2014-11-05 | 中央民族大学 | Tibetan language entity knowledge information extraction method |
CN108694177A (en) * | 2017-04-06 | 2018-10-23 | 北大方正集团有限公司 | Knowledge mapping construction method and system |
CN107665252A (en) * | 2017-09-27 | 2018-02-06 | 深圳证券信息有限公司 | A kind of method and device of creation of knowledge collection of illustrative plates |
CN108509420A (en) * | 2018-03-29 | 2018-09-07 | 赵维平 | Gu spectrum and ancient culture knowledge mapping natural language processing method |
Cited By (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111143574A (en) * | 2019-12-05 | 2020-05-12 | 大连民族大学 | Query and visualization system construction method based on minority culture knowledge graph |
CN111324691A (en) * | 2020-01-06 | 2020-06-23 | 大连民族大学 | Intelligent question-answering method for minority nationality field based on knowledge graph |
WO2021147786A1 (en) * | 2020-01-20 | 2021-07-29 | 京东方科技集团股份有限公司 | Knowledge graph construction method and apparatus, storage medium, and electronic device |
CN111611799A (en) * | 2020-05-07 | 2020-09-01 | 北京智通云联科技有限公司 | Dictionary and sequence labeling model based entity attribute extraction method, system and equipment |
CN111611799B (en) * | 2020-05-07 | 2023-06-02 | 北京智通云联科技有限公司 | Entity attribute extraction method, system and equipment based on dictionary and sequence labeling model |
CN111930958A (en) * | 2020-07-13 | 2020-11-13 | 车智互联(北京)科技有限公司 | Graph database construction method, computing device and readable storage medium |
CN111930958B (en) * | 2020-07-13 | 2023-12-01 | 车智互联(北京)科技有限公司 | Graph database construction method, computing device and readable storage medium |
CN111681769A (en) * | 2020-08-17 | 2020-09-18 | 耀方信息技术(上海)有限公司 | Medicine word segmentation searching method and system |
CN112417100A (en) * | 2020-11-20 | 2021-02-26 | 大连民族大学 | Knowledge graph in Liaodai historical culture field and construction method of intelligent question-answering system thereof |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN109815340A (en) | | A kind of construction method of national culture information resources knowledge mapping |
CN106777274B (en) | | A kind of Chinese tour field knowledge mapping construction method and system |
CN112347222B (en) | | Method and system for converting non-standard address into standard address based on knowledge base reasoning |
CN104679885B (en) | | A kind of user's search string organization names recognition method based on semantic feature model |
CN110020433B (en) | | Industrial and commercial high-management name disambiguation method based on enterprise incidence relation |
CN107368471B (en) | | Method for extracting place name address from webpage text |
CN109885824A (en) | | A kind of Chinese name entity recognition method, device and the readable storage medium storing program for executing of level |
CN109299469B (en) | | Method for identifying complex address in long text |
WO2021189977A1 (en) | | Address coding method and apparatus, and computer device and computer-readable storage medium |
CN106909611A (en) | | A kind of hotel's automatic matching method based on Text Information Extraction |
Christen et al. | | A probabilistic geocoding system based on a national address file |
CN109033225A (en) | | Chinese address identifying system |
CN109800349A (en) | | The data processing method and device of content quantization news value are issued based on user |
CN112527933A (en) | | Chinese address association method based on space position and text training |
CN109344263A (en) | | A kind of address matching method |
CN105117494B (en) | | Spatial entities mapping method in fuzzy context |
CN106777118B (en) | | A kind of quick abstracting method of geographical vocabulary based on fuzzy dictionary tree |
CN116680448A (en) | | Medical health institution information data standardization method based on deep neural network |
CN117151659B (en) | | Ecological restoration engineering full life cycle tracing method based on large language model |
CN114707003B (en) | | Method, equipment and storage medium for disambiguating names of paper authors |
CN115185986A (en) | | Method and device for matching provincial and urban area address information, computer equipment and storage medium |
CN105447104A (en) | | Knowledge map generating method and apparatus |
AU2020101024A4 (en) | | Multi-language oriented general method for calculating place name semanteme similarity and use thereof |
Xiang | | Region2vec: An Approach for Urban Land Use Detection by Fusing Multiple Features |
CN110060472A (en) | | Road traffic accident localization method, system, readable storage medium storing program for executing and equipment |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||