CN108595421A - A kind of abstracting method, the apparatus and system of Chinese entity associated relationship - Google Patents

A kind of abstracting method, the apparatus and system of Chinese entity associated relationship Download PDF

Info

Publication number
CN108595421A
CN108595421A CN201810329836.4A CN201810329836A CN108595421A CN 108595421 A CN108595421 A CN 108595421A CN 201810329836 A CN201810329836 A CN 201810329836A CN 108595421 A CN108595421 A CN 108595421A
Authority
CN
China
Prior art keywords
relative
relatival
entity
text
fact object
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201810329836.4A
Other languages
Chinese (zh)
Other versions
CN108595421B (en
Inventor
李德彦
晋耀红
吴相博
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
China Science and Technology (Beijing) Co., Ltd.
Original Assignee
Beijing Shenzhou Taiyue Software Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Shenzhou Taiyue Software Co Ltd filed Critical Beijing Shenzhou Taiyue Software Co Ltd
Priority to CN201810329836.4A priority Critical patent/CN108595421B/en
Publication of CN108595421A publication Critical patent/CN108595421A/en
Application granted granted Critical
Publication of CN108595421B publication Critical patent/CN108595421B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/253Grammatical analysis; Style critique

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • General Health & Medical Sciences (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Machine Translation (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

This application discloses a kind of abstracting method, the apparatus and systems of Chinese entity associated relationship, according to relatival nexus nature in Chinese text, it extracts in text with the relevant target agent entity of the relative and target by fact object, further according to relative and the corresponding target agent entity of relative and target by fact object, the corresponding Chinese entity associated relationship of the relative in text is generated.Technical solution provided by the embodiments of the present application, unstructured Chinese text is divided into different words and expressions according to different nexus natures, further reduce each position range of relatival target agent entity and target where by fact object, to improve search precision and search speed, operand is reduced.In addition, the technical solution in the embodiment of the present application, also uses the division rule on Chinese syntactic level, the fault relationships word and false entries of some redundancies are largely filtered out, improves the accuracy rate for extracting relative and extracting entity.

Description

A kind of abstracting method, the apparatus and system of Chinese entity associated relationship
Technical field
This application involves natural language processing technique field more particularly to a kind of extraction sides of Chinese entity associated relationship Method, apparatus and system.
Background technology
With the high speed development of internet with the promotion at full speed of economic level, it is desirable to control when formulating corporate strategy First chance must just have and have a remarkable sense of smell, and hold more relevant informations, the pass as much as possible held between enterprise and enterprise System and the relationship between enterprise and individual can make most rational planning with aid decision person.
Existing enterprise's association identification technology generally more depends on the collection data of standardization and structuring.However This mode has significant limitation, and as text message source updates, slow, delay is high, and the structuring of data can expend The more time goes to screen information and arranged, it is possible to most timely information is unable to get, in addition, text envelope now Breath is that magnanimity increases, if valuable information is manually collected and arranged from these text messages, not only process is very numerous Best opportunity trivial and that decision can be missed.
In addition, above-mentioned this technology is only only applicable to the Chinese text of standardization and structuring, when one non-knot of processing When structure Chinese text, this mode obviously can not be competent at.Furthermore most unstructured Chinese text information can't at present Single incidence relation is only existed, the relevant unstructured Chinese text letter captured of the enterprise actually existed in internet The clause of breath is typically more complex, and the incidence relation of multipair different attribute, existing association identification may be included in a word Technology can not also improve the accuracy of relation recognition from syntactic level.So how literary from non-structured complicated Chinese Incidence relation is accurately extracted in this, becomes a urgent problem to be solved.
Invention content
This application provides a kind of abstracting method, the apparatus and systems of Chinese entity associated relationship, to solve the prior art In cannot accurately from unstructured Chinese text extract incidence relation the problem of.
On the one hand, the embodiment of the present application provides a kind of abstracting method of Chinese entity associated relationship, including:
Extract the relative in text;
If the relatival quantity extracted is more than 1, each relatival nexus nature is determined;
According to each relatival nexus nature, the corresponding target agent of each relative is extracted successively from text Entity and target are by fact object;
According to the relative and the corresponding target agent entity of the relative and target by fact object, Chinese is generated Entity associated relationship.
Second aspect, the embodiment of the present application provide a kind of draw-out device of Chinese entity associated relationship, and described device includes:
Relative abstraction module, for extracting the relative in text;
Property determining module determines each relatival relationship if the relatival quantity for extracting is more than 1 Property;
Target entity abstraction module, for according to each described relatival nexus nature, being extracted successively from text The corresponding target agent entity of each relative and target are by fact object;
Incidence relation generation module, for according to the relative and the corresponding target agent entity of the relative and Target is generated Chinese entity associated relationship by fact object.
The third aspect, the embodiment of the present application provide a kind of extraction system of Chinese entity associated relationship, the system comprises Memory and processor;
The memory is used to store the executable program of the processor;
The processor is configured as:
Extract the relative in text;
If the relatival quantity extracted is more than 1, each relatival nexus nature is determined;
According to each described relatival nexus nature, the corresponding target of each relative is extracted successively from text Agent entity and target are by fact object;
According to the relative and the corresponding target agent entity of the relative and target by fact object, Chinese is generated Entity associated relationship.
By above technical scheme it is found that abstracting method, the device of Chinese entity associated relationship provided by the embodiments of the present application And system, according to relatival nexus nature in Chinese text, extract in text with the relevant target agent entity of the relative With target text is generated further according to relative and the corresponding target agent entity of relative and target by fact object by fact object In the corresponding Chinese entity associated relationship of the relative.Technical solution provided by the embodiments of the present application, by unstructured Chinese text This is divided into different words and expressions according to different nexus natures, further reduces each relatival target agent entity and mesh Position range of the mark where by fact object reduces operand to improve search precision and search speed.In addition, the application is real The technical solution in example is applied, the division rule on Chinese syntactic level is also used, largely filters out the mistake of some redundancies Accidentally relative and false entries improve the accuracy rate for extracting relative and extracting entity.
Description of the drawings
In order to illustrate more clearly of the technical solution of the application, attached drawing needed in case study on implementation will be made below Simply introduce, it should be apparent that, for those of ordinary skills, in the premise of not making the creative labor property Under, other drawings may also be obtained based on these drawings.
Fig. 1 is a kind of abstracting method flow chart of Chinese entity associated relationship provided by the embodiments of the present application;
Fig. 2 is the flow chart of the step 102 in a preferred embodiment provided by the embodiments of the present application;
Fig. 3 is the flow chart of the step 102 in second preferred embodiment provided by the embodiments of the present application;
Fig. 4 is the flow chart of the step 102 in third preferred embodiment provided by the embodiments of the present application;
Fig. 5 is the flow chart of the step 102 in the 4th preferred embodiment provided by the embodiments of the present application;
Fig. 6 is a kind of structure chart of the draw-out device of Chinese entity associated relationship provided by the embodiments of the present application;
Fig. 7 is a kind of schematic diagram of the extraction system of Chinese entity associated relationship provided by the embodiments of the present application.
Specific implementation mode
In order to make those skilled in the art better understand the technical solutions in the application, below in conjunction with attached drawing, it is right Technical solution in the embodiment of the present application is clearly and completely described.
Structured message is the information that the database that we usually contact is managed, including production, business, transaction, client The record of information etc..Unstructured information, technical term are content, and the information covered is more extensive, can be divided into:Operation Content:Such as contract, invoice, letter and purchase records;Department's content:Such as document processing, electrical form, briefing file and electronics postal Part;Web content:The information of such as HTML and XML formats;Multimedia content:Such as sound, film, figure.
The magnanimity information occurred on Internet is probably divided into structuring, semi-structured and three kinds unstructured.Certainly Chinese Text message is also in this way, the position of structured message such as electronic commerce information, the property of information and the appearance of magnitude is fixed 's;Subdivision channel in semi-structured information such as professional website, the suitable specification of grammer of title and text, the model of keyword Enclose suitable limitation;Non-structured information such as BLOG and BBS, all the elements are all unpredictable.
Since current most of enterprise's association identification technology generally more depends on the Chinese of standardization and structuring Text message, and more accurate recognition methods is lacked for non-structured Chinese text information, so, the application is implemented Example provides a kind of abstracting method of Chinese entity associated relationship, and referring specifically to Fig. 1, this method includes:
Step 100, the relative in text is extracted;Each section, there are in the Chinese text of Relation extraction value, is necessarily deposited In relative, then before extracting text entities incidence relation, the relative in text is just first extracted, to determine text Present in relationship.In general, relative can be noun, can also be verb.
Chinese text information is a series of information with certain meaning of one's words got up by word combinations, for meaning of one's words complexity Chinese text information, it is desirable to extract the entity therein with incidence relation, it is necessary first to first determine text in there are which A or which relationship, to realize the purpose accurately extracted.
Optionally, after the relative in extracting text, also to judge that the relative whether there is and be closed in predefined It is in library.There are the relative of magnanimity in predefined relationship library, these relatives are all from processed a large amount of text envelope It is obtained in breath, wherein can also include corresponding with the relevant collaboration word of relative, some relatival attributes and relative Relationship etc., each relative has specific attribute and specific relationship, and has specific provider location relationship. Predefined relationship library can provide certain reference to extract the relative in Chinese text, if the relative being drawn into exists In predefined relationship library, then the attribute of some and relative that can be directly from relationship library corresponding to call relation word Other parameters, also avoid the process for re-establishing relative attribute and parameter in this way so that the entire entity associated that extracts is closed Process early period of system is quicker, in addition, due to having first relative property and parameter as a comparison, posterior relative is taken out It takes and the acquisition of relative property can be more accurate.
Further, relatival attribute includes relatival meaning, relatival part of speech and relatival nexus nature Etc., and according to relatival meaning, relatival part of speech and relatival nexus nature etc. can further obtain with The specific position of the relevant entity of relative, these are all stored in advance in predefined relationship library, when to extract relative It can quickly use.
If the relative is present in predefined relationship library, it is determined that the relatival nexus nature.It is general next It says, the meaning of one's words relationship in Chinese text depends primarily on relatival nexus nature, so the relative in extracting text Later, relatival nexus nature is also further judged, according to nexus nature, to be further processed to text.
If not finding the relative in predefined relationship library, illustrate, not predefined before the relative Relationship stores in library, is also searched in predefined relationship library with the relevant other information of the relative less than at this moment can then selecting It selects and abandons to the relatival further operating, that is, judging that the relative is invalid;Alternatively, establishing and being somebody's turn to do in predefined relationship library The relevant information of relative, including:Pass corresponding with the relevant collaboration word of the relative, the relatival attribute and the relative System etc. carries out next step operation, since this is relatival about this after establishing range of information, then to the relative Information it has been established that so, if extract entity associated relationship next time, then encounter the relative, then can rapidly from The relatival relevant information is extracted in predefined relationship library, therefore, the process that relationship word information is established can be enriched predefined Relationship library keeps its content more comprehensive.
Step 101, if the relatival quantity extracted is more than 1, each relatival nexus nature is determined.If The relative quantity extracted is more than 1, illustrates the entity associated relationship more than one in this section of text, for this relative number The case where amount is more than 1, it will be clear that each relatival nexus nature, so as to hereafter in the text according to relatival relationship Property extracts entity associated relationship.
In addition, after extracting relative, all relatives in text can also be stored together production Methods word Set, a relationship set of words correspond to one section of text, and the relationship word order in relationship set of words goes out in the text with relative Existing sequence consensus, in addition to this, relationship set of words also record with relative it is relevant cooperate with word, relatival nexus nature and Cooperate with the nexus nature of word.
It, can be according to relatival sequence in relationship set of words and relatival relational after production Methods set of words Matter, extracts the corresponding target agent entity of each relative and target by fact object successively from text, and this mode can be with Not only accurate that entity is extracted in an orderly manner again, for the Chinese text of relationship complexity, also save many entities extractions Time, improve efficiency.
Step 102, according to each relatival nexus nature, it is corresponding to extract each relative successively from text Target agent entity and target are by fact object.Relatival nexus nature is usually divided into verb active relationship, noun forward direction is closed System, the passive relationship of verb and noun inverse relationship etc..Relative in verb active relationship is typically the verb of an active, example Such as, " purchase ", " merger " and " spending more money on " etc.;Relative in noun positive relationship is typically a positive noun, for example, " controlling shareholder " and " investor " etc.;Relative in the passive relationship of verb is usually made of two parts, and a part is collaboration word, Another part is relative main body, and collaboration word indicates passive relationship, and relative main body is still a verb, for example, " quilt ... Purchase " and " by ... annex " etc., here, " quilt " and " by " is all to cooperate with word, indicates passive relationship, and " purchase " and " merger " For relatival main body;Relative in noun inverse relationship is also divided into collaboration word and relative main body two parts, cooperates with word Indicating inverse relationship, relative main body is a noun, for example, " as ... controlling shareholder " and " becoming ... holding people " Deng, wherein " as " and " becoming " is collaboration word, indicates the inverse relationship of noun, " controlling shareholder " and " holding people " is relationship Word main body.
Generally there are one agent entities and one for each relative by fact object, and agent entity is to constitute entity associated The masters of relationship are the passive side for constituting entity associated relationship by fact object, i.e., agent entity is relatival subject, and by Fact object is relatival object.In the text of complex relationship, due to relative have it is multiple, then each relatival agent Other relatival agent entities near entity and determination by fact object and the relative and there is relationship by fact object, needed The target agent entity and target of relationship by objective (RBO) word are determined according to the relatival agent entity of other in text and by fact object By fact object.
It is worth noting that the corresponding agent entity of each relative and be usually fixed by the position of fact object , specific position changes according to relatival nexus nature.In verb active relationship, agent be physically located at relative it Before, word denoting the receiver of an action is physically located at after relative, for example A purchases B, and A here is exactly agent entity, and B is exactly by fact object.Dynamic In the passive relationship of word, word denoting the receiver of an action is physically located at before collaboration word, and agent entity is then located between collaboration word and relative main body, such as B is purchased by A, and B here is exactly by fact object, and A is agent entity.In noun positive relationship, with verb active relationship, agent It being physically located at before relative, word denoting the receiver of an action is physically located at after relative, for example the purchase people of A is B, and A is exactly agent entity here, B is by fact object.In noun inverse relationship, with the passive relationship of verb, word denoting the receiver of an action is physically located at before collaboration word, agent entity position Between collaboration word and relative main body, such as purchase people of the B as A, B here is by fact object, and A is agent entity.
Step 103, Chinese is generated by fact object according to relative and the corresponding target agent entity of relative and target Entity associated relationship.
It is worth noting that in the technical solution of the application, when the relatival quantity extracted is more than 1, determine every One relatival nexus nature extracts each pass successively then according to each relatival nexus nature from text The corresponding target agent entity of copula and target are corresponded to by fact object and generate Chinese entity associated relationship.But when one section of text When relative quantity in this is only one, the technical solution of the application stands good, for the text of relationship complexity, Only a kind of process of relatival text of processing is with regard to fairly simple, without considering other relatival nexus natures and related reality The position of body only carries out judging to extract with entity to the relative itself.For example, to text, " Wanda's sport is purchased IRONMAN series competitions " carry out the extraction of text entities incidence relation, can first extract relative " purchase ", then judge The relative is verb active relationship, further according in verb active relationship, the position of target agent entity and target by fact object Relationship extracts target agent entity " Wanda's sport " and target by fact object " IRONMAN series competitions ", in finally regenerating Literary entity associated relationship " Wanda's sport->Purchase->IRONMAN series competitions ".
The abstracting method of Chinese entity associated relationship provided by the embodiments of the present application, according to relatival pass in Chinese text It is property, extracts in text with the relevant target agent entity of the relative and target by fact object, further according to relative and pass The corresponding target agent entity of copula and target are generated the corresponding Chinese entity associated of the relative in text and closed by fact object System.Unstructured Chinese text is divided into according to different nexus natures different by technical solution provided by the embodiments of the present application Words and expressions further reduces each position range of relatival target agent entity and target where by fact object, so as to Search precision and search speed are improved, operand is reduced.In addition, the technical solution in the embodiment of the present application, also uses Chinese Division rule on syntactic level largely filters out the fault relationships word and false entries of some redundancies, improves extraction Relative and the accuracy rate for extracting entity.
In the preferred embodiment of the application, by taking verb active relationship as an example, step 102 is explained further, such as Fig. 2 Shown, step 102 can specifically include:
Step 201, if relatival nexus nature is verb active relationship, in the text find be located at relative it It is preceding and be located in relatival first object relative and text distance relation word after relative it is farthest second Relationship by objective (RBO) word.
With text, " the IRONMAN series under world's iron man's house flag have been purchased in last year, Wanda's sport under Wanda For thing ", in the text exist three relatives, be respectively " under ", " purchase " and " under ", in the preferred embodiment I Study verb active relationship, so after the nexus nature to three relationships judges, determine " under " be noun master Dynamic relationship, and " purchase " is verb active relationship.Further according to described in step 201, due to before " purchase " near " purchase " Relative be " under ", thus first object relative be " under ";Due to after " purchase " and distance " purchase " most Remote relative be also " under ", thus the second relationship by objective (RBO) word also be " under ".
Step 202, first object relatival first is extracted in the text by the of fact object and the second relationship by objective (RBO) word Two by fact object.
Due to first object relative " under " be noun positive relationship, so " under " the first word denoting the receiver of an action be physically located at " under " after, before " purchase ", and the first agent be physically located at " under " before, after determining entity position, so One agent is physically located in " last year, Wanda group " this section of text, further Entity recognition, it may be determined that " Wanda group " is " under " the first agent entity, and the first word denoting the receiver of an action is physically located in " Wanda's sport " this section of text, after identification, it may be determined that " Wanda's sport " be " under " first by fact object.
Second relationship by objective (RBO) word be " under ", so " under " the second agent be physically located at " purchase " and " under " between " iron man company of the world " text in, by Entity recognition, it may be determined that the second agent entity is " iron man company of the world ", Second word denoting the receiver of an action be physically located at " under " after " IRONMAN series competitions " text in, the second word denoting the receiver of an action can be determined after identification Entity is " IRONMAN series competitions ".
Step 203, using first by fact object as relatival target agent entity and second by fact object as close The target of copula is by fact object.
So after above-mentioned steps 201 and step 202, the target agent entity of " purchase " is " Wanda's sport ", and The target of " purchase " is " IRONMAN series competitions " by fact object.
And then according to step 103, according to relative " purchase " and " purchase " corresponding target agent entity " Wanda By fact object " IRONMAN series competitions ", it is " Wanda's sport-to generate Chinese entity associated relationship for sport " and target>Purchase-> IRONMAN series competitions ".
Optionally, in the above content it is found that using first by fact object as relatival target agent entity, Yi Ji Two are included by the detailed process of fact object as relatival target by fact object:Respectively to first by fact object and the second word denoting the receiver of an action Entity carries out Entity recognition;Using first after Entity recognition by fact object as relatival target agent entity and entity After identification second by fact object as relatival target by fact object.In fact, the step of Entity recognition, is in step 202 The synchronous requirement carried out or all meet the embodiment of the present application in step 203, can realize and identify segment Chinese text The purpose of middle entity.Due to extract first by fact object and second by fact object process inherently determine provider location mistake Journey can only actually determine the range where entity, and the definite of entity and entity could be really determined after Entity recognition Position, so this process of Entity recognition can increase the accuracy of entire entity associated Relation extraction process.
In addition, in above-mentioned steps 202, if not finding relative before " purchase " or after " purchase ", say Bright first object relative or the second relationship by objective (RBO) word are not present, at this time, it may be necessary near " purchase " before finding " purchase " Entity as target agent entity, or find " purchase " later the farthest entity of distance " purchase " as target by the fact Body.For example, in the text of " the IRONMAN series competitions of iron man company of the world are purchased in Wanda's sport of Wanda group ", " receive Purchase " is front and back without other relatives, so the entity " Wanda's sport " before finding " purchase " near " purchase " is used as mesh Agent entity is marked, finding " purchase ", the farthest entity " IRONMAN series competitions " of distance " purchase " is used as target by the fact later Body.
In second preferred embodiment of the application, by taking noun positive relationship as an example, step 102 is explained further, such as Shown in Fig. 3, step 102 can specifically include:
Step 301, if relatival nexus nature is noun positive relationship, in the text find be located at relative it It is preceding and to be located at relative described in distance after relative in relatival first object relative and text farthest Second relationship by objective (RBO) word.
By taking text " the controlling shareholder C of the subsidiary B of A purchases D " as an example, it is with noun positive relationship word " controlling shareholder " , the first object relative in text before " controlling shareholder " near " controlling shareholder " is " subsidiary ", in " holding stock The second farthest relationship by objective (RBO) word of distance " controlling shareholder " is " purchase " after east ".
Step 302, first object relatival first is extracted in the text by the of fact object and the second relationship by objective (RBO) word Two by fact object.
The first agent entity of first object relative " subsidiary " is " A " in text, and first by fact object is " B ", the Second agent entity of two relationship by objective (RBO) words be " C ", second by fact object be " D ".
Step 303, using first by fact object as relatival target agent entity and second by fact object as close The target of copula is by fact object.Then the target agent entity of " controlling shareholder " is " A ", and target is " D " by fact object.
Further according to step 103, according to relative " controlling shareholder " and " controlling shareholder " corresponding target agent entity " A " With target by fact object " D ", it is " A- to generate Chinese entity associated relationship>Controlling shareholder->D”.
Optionally, in the above content it is found that using first by fact object as relatival target agent entity, Yi Ji Two are included by the detailed process of fact object as relatival target by fact object:Respectively to first by fact object and the second word denoting the receiver of an action Entity carries out Entity recognition;Using first after Entity recognition by fact object as relatival target agent entity and entity After identification second by fact object as relatival target by fact object.In fact, the step of Entity recognition, is in step 302 The synchronous requirement carried out or all meet the embodiment of the present application in step 303, can realize and identify segment Chinese text The purpose of middle entity.Due to extract first by fact object and second by fact object process inherently determine provider location mistake Journey can only actually determine the range where entity, and the definite of entity and entity could be really determined after Entity recognition Position, so this process of Entity recognition can increase the accuracy of entire entity associated Relation extraction process.
In addition, being needed in the text if not finding other relatives before or after relative " controlling shareholder " Find " controlling shareholder " before near " controlling shareholder " entity be used as target agent entity, or searching " controlling shareholder " it Afterwards distance " controlling shareholder " farthest entity as target by fact object.
In the third preferred embodiment of the application, by taking the passive relationship of verb as an example, step 102 is explained further, such as Shown in Fig. 4, step 102 can specifically include:
Step 401, if relatival nexus nature is the passive relationship of verb, relative is decomposed into collaboration word and closed Copula main body.
Using text, " controlling shareholder of U.S. TV Programs manufacturing company Dick as company A is furnished funds for by the subsidiary B of Wanda group 1000000000 dollars (about 7,800,000,000 Hongkong dollar) is purchased " for, there are the relative of the passive relationship of verb " quilt ... purchase " in the text, In " quilt " be collaboration word, " purchase " be relative main body.
Step 402, it finds in text before being located at collaboration word and near the first object relative of collaboration word, and text It is located at before relative main body and near the second relationship by objective (RBO) word of relative main body in this.
Near the first object relative of collaboration word " as ... holding stock before finding collaboration word " quilt " in the text The second relationship by objective (RBO) word " subsidiary " near " purchase " between collaboration word " quilt " and relative main body " purchase " is found in east ".
Step 403, first object relatival first is extracted in the text by the of fact object and the second relationship by objective (RBO) word Two by fact object.
First object relative is noun inverse relationship " as ... controlling shareholder ", at this point, this relatival first by Fact object is located in the text " U.S. TV Programs manufacturing company Dick " before " as ", passes through Entity recognition process, it may be determined that " beautiful Television production company of state Dick " is first object relatival first by fact object.Second relationship by objective (RBO) word " subsidiary " is run after fame Word positive relationship, at this time relatival second word denoting the receiver of an action be physically located at after " subsidiary " " B furnishes funds for 1,000,000,000 dollars (about 7,800,000,000 Hongkong dollar) " text in, after Entity recognition, second by fact object be " B ".
Step 404, using first by fact object as relatival target by fact object and second by fact object as close The target agent entity of copula.
By after step 403, the first of acquisition by fact object is " U.S. TV Programs manufacturing company Dick ", and second by the fact Body is " B ", so, the target of relative " quilt ... purchase " is " U.S. TV Programs manufacturing company Dick " by fact object, and target is applied Fact object is " B ".
Further according to step 103, entity associated relationship " B- can be generated>Purchase->U.S. TV Programs manufacturing company Dick ".
Optionally, in the above content it is found that using first by fact object as relatival target by fact object, Yi Ji Two detailed processes by fact object as relatival target agent entity, including:Respectively to first by fact object and second by Fact object carries out Entity recognition;Using first after Entity recognition by fact object as relatival target by fact object, Yi Jishi Body identification after second by fact object as relatival target agent entity.In fact, the step of Entity recognition, is in step 403 The middle synchronous requirement carried out or all meet the embodiment of the present application in step 404 can be realized and identify segment Chinese text The purpose of entity in this.Due to extracting first provider location is inherently determined by the process of fact object by fact object and second Process can only actually determine the range where entity, and really entity and entity could be determined really after Entity recognition Position is cut, so this process of Entity recognition can increase the accuracy of entire entity associated Relation extraction process.
In addition, if the text including relative " quilt ... purchase " is that " U.S. TV Programs manufacturing company Dick is collected by Wanda Group furnishes funds for 1,000,000,000 dollars (about 7,800,000,000 Hongkong dollar) purchase ", then other relatives are just not present before collaboration word " quilt ", need at this time Identify that the entity " U.S. TV Programs manufacturing company Dick " before " quilt " in text near " quilt " is used as target by fact object;Together Other relatives are also not present between collaboration word " quilt " and relative main body " purchase ", then identify " quilt " and " purchase " for reason Between near " purchase " entity " Wanda group " be used as target agent entity.So the entity associated relationship ultimately generated For " Wanda group->Purchase->U.S. TV Programs manufacturing company Dick ".
In the 4th preferred embodiment of the application, by taking noun inverse relationship as an example, step 102 is explained further, such as Shown in Fig. 5, step 102 can specifically include:
Step 501, if relatival nexus nature is noun inverse relationship, relative is decomposed into collaboration word and closed Copula main body.
Using text, " subsidiary's second of practical Heat & Control Pty Ltd.'s first company of Gansu Power Company as State Grid Corporation of China is public For the wholly-owned subsidiary of department ", wherein " as ... wholly-owned subsidiary " is the relative of noun inverse relationship, and " as " is It is relative main body to cooperate with word, " wholly-owned subsidiary ".
Step 502, it finds in text before being located at collaboration word and near the first object relative of collaboration word, and text It is located at before relative main body and near the second relationship by objective (RBO) word of relative main body in this.
The first object relative near " as " before finding " as " in the text is " practical Heat & Control Pty Ltd. ", Between " as " and " wholly-owned subsidiary ", the second relationship by objective (RBO) word near " wholly-owned subsidiary " is " subsidiary ".
Step 503, first object relatival first is extracted in the text by the of fact object and the second relationship by objective (RBO) word Two by fact object.
First object relative " practical Heat & Control Pty Ltd. " is noun positive relationship, and first by fact object is " first company ". First agent entity is " Gansu Power Company ";Second relationship by objective (RBO) word " subsidiary " be noun positive relationship, second by Fact object is located among " company B " text, by Entity recognition, it may be determined that " company B " be " subsidiary " second by Fact object, the second agent are physically located among " State Grid Corporation of China " text, can determine that " national grid is public after Entity recognition Department " is the second agent entity.
Step 504, using first by fact object as relatival target by fact object and second by fact object as close The target agent entity of copula.
By after step 503, determine first by fact object be " first company " as relative " as ... wholly-owned son is public The target of department " by fact object, second be " company B " as relative " as ... wholly-owned subsidiary " by fact object target Agent entity, so being " company B-according to the entity associated relationship that step 103 generates>Wholly-owned subsidiary->First company ".
Optionally, in the above content it is found that using first by fact object as relatival target by fact object, Yi Ji Two detailed processes by fact object as relatival target agent entity, including:Respectively to first by fact object and second by Fact object carries out Entity recognition;Using first after Entity recognition by fact object as relatival target by fact object, Yi Jishi Body identification after second by fact object as relatival target agent entity.In fact, the step of Entity recognition, is in step 503 The middle synchronous requirement carried out or all meet the embodiment of the present application in step 504 can be realized and identify segment Chinese text The purpose of entity in this.Due to extracting first provider location is inherently determined by the process of fact object by fact object and second Process can only actually determine the range where entity, and really entity and entity could be determined really after Entity recognition Position is cut, so this process of Entity recognition can increase the accuracy of entire entity associated Relation extraction process.
In addition, if when text is " wholly-owned subsidiary of the Gansu Power Company as State Grid Corporation of China ", in text Other relatives are not present before the collaboration word " as " of noun inverse relationship word " as ... wholly-owned subsidiary ", then identify Entity " Gansu Power Company " before " as " near " as " is as relative " as ... wholly-owned subsidiary " Other relationships are also not present by fact object, then due to cooperateing in target between word " as " and relative main body " wholly-owned subsidiary " Word, so identifying that the entity " State Grid Corporation of China " between collaboration word and relative main body near relative main body is used as pass Copula " as ... wholly-owned subsidiary " target agent entity.The entity associated relationship ultimately generated is " State Grid Corporation of China-> Wholly-owned subsidiary->Gansu Power Company ".
In above preferred embodiment, said respectively to how the relative of different nexus natures carries out entity relation extraction It is bright, have for multiple relatival complicated Chinese texts for above-mentioned, needs to carry out entity pass to each relative respectively The extraction of connection relationship, then the corresponding entity associated relationship of all relatives constitute all entity in this section of Chinese text and close Connection relationship.
" series of the IRONMAN under world's iron man's house flag has been purchased in last year, Wanda's sport under Wanda for example, text In race ", there are three relatives " under ", " purchase " and " under ", and three relatival nexus natures are name respectively Word positive relationship, verb active relationship and noun positive relationship carry out entity to these three relatives respectively according to nexus nature The extraction and generation of incidence relation can obtain three entity associated relationships, be respectively:" Wanda group->Under->Wanda's body Educate ", " Wanda's sport->Purchase->IRONMAN series competitions " and " iron man company of the world->Under->IRONMAN series competitions ".
" U.S. TV Programs manufacturing company Dick furnishes funds for 10 to text as the controlling shareholder of company A by the subsidiary B of Wanda group Hundred million dollars (about 7,800,000,000 Hongkong dollar) is purchased " in, exist " as ... controlling shareholder ", " quilt ... purchase " and " subsidiary " three are closed Copula, and three relatival nexus natures are noun inverse relationship, the passive relationship of verb and noun positive relationship, root respectively The extraction and generation that respectively these three relatives are carried out with entity associated relationship according to nexus nature, can obtain three entity associateds Relationship is respectively:" company A->Controlling shareholder->U.S. TV Programs manufacturing company Dick ", " B->Purchase->U.S. TV Programs make public Take charge of Dick " and " Wanda group->Subsidiary->B”.
Text " subsidiary company B of the practical Heat & Control Pty Ltd.'s first company of Gansu Power Company as State Grid Corporation of China Wholly-owned subsidiary " in, there are " practical Heat & Control Pty Ltd. ", " as ... wholly-owned subsidiary " and " subsidiary " three relatives, And three relatival nexus natures are noun positive relationship, noun inverse relationship and noun positive relationship respectively, according to pass It is the extraction and generation that property respectively carries out these three relatives entity associated relationship, three entity associateds can be obtained and closed It is, is respectively:" Gansu Power Company->Practical Heat & Control Pty Ltd.->First company ", " company B->Wholly-owned subsidiary->First company " " State Grid Corporation of China->Subsidiary->Company B ".
By above technical scheme it is found that the abstracting method of Chinese entity associated relationship provided by the embodiments of the present application, according to Relatival nexus nature in Chinese text extracts in text with the relevant target agent entity of the relative and target by the fact Body generates the relative pair in text further according to relative and the corresponding target agent entity of relative and target by fact object The Chinese entity associated relationship answered.Technical solution provided by the embodiments of the present application, by unstructured Chinese text according to different passes It is that property is divided into different words and expressions, further reduces each relatival target agent entity and target by fact object institute Position range reduce operand to improve search precision and search speed.In addition, the technology in the embodiment of the present application Scheme, also uses the division rule on Chinese syntactic level, largely filters out the fault relationships word and mistake of some redundancies Accidentally entity improves the accuracy rate for extracting relative and extracting entity.
Referring to Fig. 6, the embodiment of the present application also provides a kind of draw-out device of Chinese entity associated relationship, including:
Relative abstraction module 601, for extracting the relative in text;
Property determining module 602 determines that each is relatival if the relatival quantity for extracting is more than 1 Nexus nature;
Target entity abstraction module 603, for according to each relatival nexus nature, being extracted successively from text every The corresponding target agent entity of one relative and target are by fact object;
Incidence relation generation module 604, for according to relative and the corresponding target agent entity of relative and target By fact object, Chinese entity associated relationship is generated.
Optionally, target entity abstraction module 603 further includes:Verb active relationship entity abstraction module, is used for,
If relatival nexus nature is verb active relationship, in the text find be located at relative before and most Positioned at the second farthest relationship by objective (RBO) of distance relation word after relative in relatival first object relative and text Word;
First object relatival first is extracted in the text by the second of fact object and the second relationship by objective (RBO) word by the fact Body;
Using first by fact object as relatival target agent entity and second by fact object as relatival mesh Mark is by fact object.
Optionally, target entity abstraction module 603 further includes:Noun positive relationship entity abstraction module, is used for,
If relatival nexus nature is noun positive relationship, in the text find be located at relative before and most Positioned at the second farthest relationship by objective (RBO) of distance relation word after relative in relatival first object relative and text Word;
First object relatival first is extracted in the text by the second of fact object and the second relationship by objective (RBO) word by the fact Body;
Using first by fact object as relatival target agent entity and second by fact object as relatival mesh Mark is by fact object.
Optionally, target entity abstraction module 603 further includes:The passive relationship entity abstraction module of verb, is used for,
If relatival nexus nature is the passive relationship of verb, relative is decomposed into collaboration word and relative master Body;
It finds before being located at collaboration word in text and is located in the first object relative and text of collaboration word Before relative main body and near the second relationship by objective (RBO) word of relative main body;
First object relatival first is extracted in the text by the second of fact object and the second relationship by objective (RBO) word by the fact Body;
Using first by fact object as relatival target by fact object and second by fact object as relatival mesh Mark agent entity.
Optionally, target entity abstraction module 603 further includes:Noun inverse relationship entity abstraction module, is used for,
If relatival nexus nature is noun inverse relationship, relative is decomposed into collaboration word and relative master Body;
It finds before being located at collaboration word in text and is located in the first object relative and text of collaboration word Before relative main body and near the second relationship by objective (RBO) word of relative main body;
First object relatival first is extracted in the text by the second of fact object and the second relationship by objective (RBO) word by the fact Body;
Using first by fact object as relatival target by fact object and second by fact object as relatival mesh Mark agent entity.
Optionally, described device further includes:
Relative judgment module, is used for, and judges that relative whether there is in predefined relationship library;
If relative is present in predefined relationship library, it is determined that relatival nexus nature.
Optionally, verb active relationship entity abstraction module or noun positive relationship entity abstraction module include:
First instance identification module, for being carried out Entity recognition by fact object by fact object and second to first respectively;
Using first after Entity recognition by fact object as after relatival target agent entity and Entity recognition Two by fact object as relatival target by fact object.
Optionally, the passive relationship entity abstraction module of verb or noun inverse relationship entity abstraction module include:
Second instance identification module, for being carried out Entity recognition by fact object by fact object and second to first respectively;
Using first after Entity recognition by fact object as relatival target after by fact object and Entity recognition Two by fact object as relatival target agent entity.
Referring to Fig. 7, the embodiment of the present application also provides a kind of extraction system of Chinese entity associated relationship, the system comprises Memory 701 and processor 702;
Memory 701 is used to store the executable program of processor 702;
Processor 702 is configured as:
Extract the relative in text;
If the relatival quantity extracted is more than 1, each relatival nexus nature is determined;
According to each relatival nexus nature, the corresponding target agent of each relative is extracted successively from text Entity and target are by fact object;
According to relative and the corresponding target agent entity of relative and target by fact object, Chinese entity associated is generated Relationship.
By above technical scheme it is found that abstracting method, the device of Chinese entity associated relationship provided by the embodiments of the present application And system, according to relatival nexus nature in Chinese text, extract in text with the relevant target agent entity of the relative With target text is generated further according to relative and the corresponding target agent entity of relative and target by fact object by fact object In the corresponding Chinese entity associated relationship of the relative.Technical solution provided by the embodiments of the present application, by unstructured Chinese text This is divided into different words and expressions according to different nexus natures, further reduces each relatival target agent entity and mesh Position range of the mark where by fact object reduces operand to improve search precision and search speed.In addition, the application is real The technical solution in example is applied, the division rule on Chinese syntactic level is also used, largely filters out the mistake of some redundancies Accidentally relative and false entries improve the accuracy rate for extracting relative and extracting entity.
The application can be used in numerous general or special purpose computing system environments or configuration.Such as:Personal computer, service Device computer, handheld device or portable device, laptop device, multicomputer system, microprocessor-based system, top set Box, programmable consumer-elcetronics devices, network PC, minicomputer, mainframe computer including any of the above system or equipment Distributed computing environment etc..
The application can describe in the general context of computer-executable instructions executed by a computer, such as program Module.Usually, program module includes routines performing specific tasks or implementing specific abstract data types, program, object, group Part, data structure etc..The application can also be put into practice in a distributed computing environment, in these distributed computing environments, by Task is executed by the connected remote processing devices of communication network.In a distributed computing environment, program module can be with In the local and remote computer storage media including storage device.
It should be noted that herein, the relational terms of such as " first " and " second " or the like are used merely to one A entity or operation with another entity or operate distinguish, without necessarily requiring or implying these entities or operation it Between there are any actual relationship or orders.Moreover, the terms "include", "comprise" or its any other variant are intended to Cover non-exclusive inclusion, so that the process, method, article or equipment including a series of elements includes not only those Element, but also include other elements that are not explicitly listed, or further include for this process, method, article or setting Standby intrinsic element.
Those skilled in the art will readily occur to its of the application after considering specification and putting into practice application disclosed herein Its embodiment.This application is intended to cover any variations, uses, or adaptations of the application, these modifications, purposes or Person's adaptive change follows the general principle of the application and includes the undocumented common knowledge in the art of the application Or conventional techniques.The description and examples are only to be considered as illustrative, and the true scope and spirit of the application are by following Claim is pointed out.
It should be understood that the application is not limited to the precision architecture for being described above and being shown in the accompanying drawings, and And various modifications and changes may be made without departing from the scope thereof.Scope of the present application is only limited by the accompanying claims.

Claims (10)

1. a kind of abstracting method of Chinese entity associated relationship, which is characterized in that including:
Extract the relative in text;
If the relatival quantity extracted is more than 1, each relatival nexus nature is determined;
According to each relatival nexus nature, the corresponding target agent entity of each relative is extracted successively from text With target by fact object;
According to the relative and the corresponding target agent entity of the relative and target by fact object, Chinese entity is generated Incidence relation.
2. according to the method described in claim 1, it is characterized in that, described according to each relatival nexus nature, from text The step of corresponding target agent entity of each relative and target are by fact object is extracted successively in this, including:
If the relatival nexus nature is verb active relationship, in the text find be located at relative before and most It is located at relative is farthest described in distance after relative second in the relatival first object relative and text Relationship by objective (RBO) word;
Extract in the text the first object relatival first by fact object and the second relationship by objective (RBO) word second by Fact object;
Using described first by fact object as the relatival target agent entity and described second by fact object as institute Relatival target is stated by fact object.
3. according to the method described in claim 1, it is characterized in that, described according to each relatival nexus nature, from text The step of corresponding target agent entity of each relative and target are by fact object is extracted successively in this, including:
If the relatival nexus nature is noun positive relationship, in the text find be located at relative before and most It is located at relative is farthest described in distance after relative second in the relatival first object relative and text Relationship by objective (RBO) word;
Extract in the text the first object relatival first by fact object and the second relationship by objective (RBO) word second by Fact object;
Using described first by fact object as the relatival target agent entity and described second by fact object as institute Relatival target is stated by fact object.
4. according to the method described in claim 1, it is characterized in that, described according to each relatival nexus nature, from text The step of corresponding target agent entity of each relative and target are by fact object is extracted successively in this, including:
If the relatival nexus nature is the passive relationship of verb, the relative is decomposed into collaboration word and relative Main body;
It finds before being located at the collaboration word in text and in the first object relative and text of the collaboration word Before the relative main body and near the second relationship by objective (RBO) word of the relative main body;
Extract in the text the first object relatival first by fact object and the second relationship by objective (RBO) word second by Fact object;
Using described first by fact object as the relatival target by fact object and described second by fact object as institute State relatival target agent entity.
5. according to the method described in claim 1, it is characterized in that, described according to each relatival nexus nature, from text The step of corresponding target agent entity of each relative and target are by fact object is extracted successively in this, including:
If the relatival nexus nature is noun inverse relationship, the relative is decomposed into collaboration word and relative Main body;
It finds before being located at the collaboration word in text and in the first object relative and text of the collaboration word Before the relative main body and near the second relationship by objective (RBO) word of the relative main body;
Extract in the text the first object relatival first by fact object and the second relationship by objective (RBO) word second by Fact object;
Using described first by fact object as the relatival target by fact object and described second by fact object as institute State relatival target agent entity.
6. according to claim 2-5 any one of them methods, which is characterized in that after the relative extracted in text, Further include:
Judge that the relative whether there is in predefined relationship library;
If the relative is present in predefined relationship library, it is determined that the relatival nexus nature.
7. according to claim 2-3 any one of them methods, which is characterized in that it is described using first by fact object as the pass The target agent entity of copula and second by fact object as the relatival target by fact object the step of, including:
Respectively Entity recognition is carried out by fact object to described first by fact object and described second;
Using first after Entity recognition by fact object as after the relatival target agent entity and Entity recognition Two by fact object as the relatival target by fact object.
8. according to claim 4-5 any one of them methods, which is characterized in that it is described using first by fact object as the pass The target of copula by fact object and second by fact object as the relatival target agent entity the step of, including:
Respectively Entity recognition is carried out by fact object to described first by fact object and described second;
Using first after Entity recognition by fact object as the relatival target after by fact object and Entity recognition Two by fact object as the relatival target agent entity.
9. a kind of draw-out device of Chinese entity associated relationship, which is characterized in that described device includes:
Relative abstraction module, for extracting the relative in text;
Property determining module determines that each is relatival relational if the relatival quantity for extracting is more than 1 Matter;
Target entity abstraction module, for according to each described relatival nexus nature, being extracted successively from text each The corresponding target agent entity of a relative and target are by fact object;
Incidence relation generation module, for according to the relative and the corresponding target agent entity of the relative and target By fact object, Chinese entity associated relationship is generated.
10. a kind of extraction system of Chinese entity associated relationship, which is characterized in that the system comprises memories and processor;
The memory is used to store the executable program of the processor;
The processor is configured as:
Extract the relative in text;
If the relatival quantity extracted is more than 1, each relatival nexus nature is determined;
According to each described relatival nexus nature, the corresponding target agent of each relative is extracted successively from text Entity and target are by fact object;
According to the relative and the corresponding target agent entity of the relative and target by fact object, Chinese entity is generated Incidence relation.
CN201810329836.4A 2018-04-13 2018-04-13 Method, device and system for extracting Chinese entity association relationship Active CN108595421B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810329836.4A CN108595421B (en) 2018-04-13 2018-04-13 Method, device and system for extracting Chinese entity association relationship

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201810329836.4A CN108595421B (en) 2018-04-13 2018-04-13 Method, device and system for extracting Chinese entity association relationship

Publications (2)

Publication Number Publication Date
CN108595421A true CN108595421A (en) 2018-09-28
CN108595421B CN108595421B (en) 2022-04-08

Family

ID=63622268

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810329836.4A Active CN108595421B (en) 2018-04-13 2018-04-13 Method, device and system for extracting Chinese entity association relationship

Country Status (1)

Country Link
CN (1) CN108595421B (en)

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110489496A (en) * 2019-07-22 2019-11-22 腾讯科技(深圳)有限公司 A kind of data processing method, device, electronic equipment and storage medium
CN111046656A (en) * 2019-11-15 2020-04-21 北京三快在线科技有限公司 Text processing method and device, electronic equipment and readable storage medium
CN111079433A (en) * 2019-11-29 2020-04-28 北京奇艺世纪科技有限公司 Event extraction method and device and electronic equipment
CN111310446A (en) * 2020-01-15 2020-06-19 中科鼎富(北京)科技发展有限公司 Information extraction method and device for referee document
CN113420120A (en) * 2021-06-24 2021-09-21 平安科技(深圳)有限公司 Training method, extracting method, device and medium of key information extracting model
CN116975284A (en) * 2023-07-05 2023-10-31 厦门渊亭信息科技有限公司 Entity relation extraction method and device based on priori knowledge and storage medium

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102508830A (en) * 2011-11-28 2012-06-20 北京工商大学 Method and system for extracting social network from news document
CN103268311A (en) * 2012-11-07 2013-08-28 上海大学 Event-structure-based Chinese statement analysis method
WO2015080561A1 (en) * 2013-11-27 2015-06-04 Mimos Berhad A method and system for automated relation discovery from texts
WO2015194934A2 (en) * 2014-06-18 2015-12-23 Mimos Berhad A system and method for entity resolution
CN106777275A (en) * 2016-12-29 2017-05-31 北京理工大学 Entity attribute and property value extracting method based on many granularity semantic chunks
CN107357933A (en) * 2017-08-04 2017-11-17 刘应波 A kind of label for multi-source heterogeneous science and technology information resource describes method and apparatus
CN107368470A (en) * 2017-06-27 2017-11-21 北京神州泰岳软件股份有限公司 A kind of method and apparatus for extracting enterprises organizational structure information
CN107608948A (en) * 2017-10-16 2018-01-19 北京神州泰岳软件股份有限公司 A kind of construction method and device of Text Information Extraction model

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102508830A (en) * 2011-11-28 2012-06-20 北京工商大学 Method and system for extracting social network from news document
CN103268311A (en) * 2012-11-07 2013-08-28 上海大学 Event-structure-based Chinese statement analysis method
WO2015080561A1 (en) * 2013-11-27 2015-06-04 Mimos Berhad A method and system for automated relation discovery from texts
WO2015194934A2 (en) * 2014-06-18 2015-12-23 Mimos Berhad A system and method for entity resolution
CN106777275A (en) * 2016-12-29 2017-05-31 北京理工大学 Entity attribute and property value extracting method based on many granularity semantic chunks
CN107368470A (en) * 2017-06-27 2017-11-21 北京神州泰岳软件股份有限公司 A kind of method and apparatus for extracting enterprises organizational structure information
CN107357933A (en) * 2017-08-04 2017-11-17 刘应波 A kind of label for multi-source heterogeneous science and technology information resource describes method and apparatus
CN107608948A (en) * 2017-10-16 2018-01-19 北京神州泰岳软件股份有限公司 A kind of construction method and device of Text Information Extraction model

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
杨月蓉: "连动句和兼语句中的语义关系——兼论连动式与兼语式的区别", 《西南师范大学学报(哲学社会科学版)》 *
马晓静 等: "基于内容分析的网络新闻中社会网络自动抽取", 《科研信息化技术与应用》 *

Cited By (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110489496A (en) * 2019-07-22 2019-11-22 腾讯科技(深圳)有限公司 A kind of data processing method, device, electronic equipment and storage medium
CN111046656A (en) * 2019-11-15 2020-04-21 北京三快在线科技有限公司 Text processing method and device, electronic equipment and readable storage medium
CN111046656B (en) * 2019-11-15 2023-07-14 北京三快在线科技有限公司 Text processing method, text processing device, electronic equipment and readable storage medium
CN111079433A (en) * 2019-11-29 2020-04-28 北京奇艺世纪科技有限公司 Event extraction method and device and electronic equipment
CN111079433B (en) * 2019-11-29 2023-10-27 北京奇艺世纪科技有限公司 Event extraction method and device and electronic equipment
CN111310446A (en) * 2020-01-15 2020-06-19 中科鼎富(北京)科技发展有限公司 Information extraction method and device for referee document
CN111310446B (en) * 2020-01-15 2023-11-24 鼎富智能科技有限公司 Information extraction method and device for judge document
CN113420120A (en) * 2021-06-24 2021-09-21 平安科技(深圳)有限公司 Training method, extracting method, device and medium of key information extracting model
CN113420120B (en) * 2021-06-24 2024-05-31 平安科技(深圳)有限公司 Training method, extraction method, equipment and medium for key information extraction model
CN116975284A (en) * 2023-07-05 2023-10-31 厦门渊亭信息科技有限公司 Entity relation extraction method and device based on priori knowledge and storage medium
CN116975284B (en) * 2023-07-05 2024-09-13 厦门渊亭信息科技有限公司 Entity relation extraction method and device based on priori knowledge and storage medium

Also Published As

Publication number Publication date
CN108595421B (en) 2022-04-08

Similar Documents

Publication Publication Date Title
CN108595421A (en) A kind of abstracting method, the apparatus and system of Chinese entity associated relationship
CN107766371B (en) Text information classification method and device
CN102207948B (en) Method for generating incident statement sentence material base
Inel et al. Crowdtruth: Machine-human computation framework for harnessing disagreement in gathering annotated data
CN101706794B (en) Information browsing and retrieval method based on semantic entity-relationship model and visualized recommendation
CN105468605A (en) Entity information map generation method and device
Gao et al. PAID: Prioritizing app issues for developers by tracking user reviews over versions
CN101661513A (en) Detection method of network focus and public sentiment
CN110263248A (en) A kind of information-pushing method, device, storage medium and server
CN104685495A (en) A system and method for automatic generation of information-rich content from multiple microblogs, each microblog containing only sparse information
US10769196B2 (en) Method and apparatus for displaying electronic photo, and mobile device
US20220277044A1 (en) Systems, methods, and devices for generating real-time analytics
CN111737443B (en) Answer text processing method and device and key text determining method
CN109039710B (en) Routing data auditing method, device, server and storage medium
CN102073641A (en) Method, device and program for processing consumer-generated media information
CN113642867A (en) Method and system for assessing risk
Tahmasebi A Study on Word2Vec on a Historical Swedish Newspaper Corpus.
US8620918B1 (en) Contextual text interpretation
Bhole et al. Extracting named entities and relating them over time based on Wikipedia
CN102929948A (en) List page identification system and method
CN115757720A (en) Project information searching method, device, equipment and medium based on knowledge graph
CN115269771A (en) Big data analysis system based on semantics
KR102041915B1 (en) Database module using artificial intelligence, economic data providing system and method using the same
JP6975118B2 (en) Extractor and program
CN113127574A (en) Service data display method, system, equipment and medium based on knowledge graph

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
TA01 Transfer of patent application right
TA01 Transfer of patent application right

Effective date of registration: 20190906

Address after: Room 630, 6th floor, Block A, Wanliu Xingui Building, 28 Wanquanzhuang Road, Haidian District, Beijing

Applicant after: China Science and Technology (Beijing) Co., Ltd.

Address before: 100089 Beijing city Haidian District wanquanzhuang Road No. 28 Wanliu new building block A Room 601

Applicant before: Beijing Shenzhou Taiyue Software Co., Ltd.

CB02 Change of applicant information
CB02 Change of applicant information

Address after: 230000 zone B, 19th floor, building A1, 3333 Xiyou Road, hi tech Zone, Hefei City, Anhui Province

Applicant after: Dingfu Intelligent Technology Co., Ltd

Address before: Room 630, 6th floor, Block A, Wanliu Xingui Building, 28 Wanquanzhuang Road, Haidian District, Beijing

Applicant before: DINFO (BEIJING) SCIENCE DEVELOPMENT Co.,Ltd.

GR01 Patent grant
GR01 Patent grant