CN109308323A - A kind of construction method, device and the equipment of causality knowledge base - Google Patents
A kind of construction method, device and the equipment of causality knowledge base Download PDFInfo
- Publication number
- CN109308323A CN109308323A CN201811494944.3A CN201811494944A CN109308323A CN 109308323 A CN109308323 A CN 109308323A CN 201811494944 A CN201811494944 A CN 201811494944A CN 109308323 A CN109308323 A CN 109308323A
- Authority
- CN
- China
- Prior art keywords
- causality
- entity
- sentence
- knowledge base
- cause
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Abstract
The invention discloses construction method, device, equipment and the computer readable storage mediums of a kind of causality knowledge base, this method comprises: being processed into multiple sentences by obtaining text data in data source, and by text data;It determines to identify the entity pair for including in cause and effect sentence, and determination includes to identify that the collection of obtained entity pair is combined into causality knowledge base there are causal cause and effect sentence in the multiple sentences obtained by processing;Entity is to including reason entity and result entity;It is every at the time of by obtaining text data in data source to pass through preset time period, then judge whether the total amount of data changed in data source reaches data-quantity threshold, if it is, then return to execution is by obtaining text data in data source the step of, if it is not, then determining without rebuilding building causality knowledge base.To which the frame based on eternal study ensure that the real-time effectiveness of causality knowledge base, and then it ensure that accuracy when realizing prediction of result based on causality knowledge base.
Description
Technical field
The present invention relates to data analysis technique fields, more specifically to a kind of building side of causality knowledge base
Method, device, equipment and computer readable storage medium.
Background technique
With the rapid development of Internet technology, author of each user as data shows each " individual ", promotees
The explosive growth of data is made.
As the carrier of expression self daily behavior and social affection, a large amount of individual is contained in text data for life
The summary of experience living, wherein including incidence relation or even causality between things abundant.Causality is contacted between things
Main form, be widely used in the fields such as economic, medical treatment, military and safety.Generally for prediction of result is realized, need
Creation includes the causality knowledge base of reason and result correspondent entity, so for need realize prediction the reason of, by because
It is determined in fruit relational knowledge base corresponding with the reason as a result, realizing prediction of result.Wherein, causality knowledge base is pre- in result
Play the role of very important in survey, be typically based on certain data in the prior art and construct after obtaining causality knowledge base,
Then realized every time using the causality knowledge base as the later period prediction of result based on causality knowledge base, but due to cause and effect
Relationship may change because of time change, this to change including causal increase, reduction and change etc., and existing skill
Subsequent each prediction of result is realized based on the causality knowledge base after constructing causality knowledge base in art, it is clear that can deposit
In the lower problem of accuracy.
In conclusion when realizing that the technical solution of causality construction of knowledge base has realization prediction of result in the prior art
The lower problem of accuracy.
Summary of the invention
The object of the present invention is to provide a kind of construction method of causality knowledge base, device, equipment and computer-readable
Storage medium is able to solve and realizes the existing realization prediction of result of the technical solution of causality construction of knowledge base in the prior art
When the lower problem of accuracy.
To achieve the goals above, the invention provides the following technical scheme:
A kind of construction method of causality knowledge base, comprising:
Multiple sentences are processed by obtaining text data in data source, and by the text data;
It determines in the multiple sentences obtained by processing there are causal cause and effect sentence, identifies in the cause and effect sentence and include
Entity pair, and determination includes that the obtained collection of entity pair of identification is combined into causality knowledge base;Wherein, the entity is to packet
Include reason entity and result entity;
It is every at the time of by obtaining text data in the data source to pass through preset time period, then judge the data source
Whether the total amount of data of middle variation reaches data-quantity threshold, if it is, it is described by obtaining textual data in data source to return to execution
According to the step of, if it is not, then determine without rebuild building causality knowledge base.
Preferably, determine whether any sentence is that there are causal cause and effect sentences, comprising:
It determines that any sentence is current statement, whether judges in current statement comprising in clear causality prompt set of words
Any clear causality prompt word for including, if it is, determining that current statement is cause and effect sentence, if it is not, then determining current
Sentence is not cause and effect sentence;Wherein, the clear causality prompt word is to show to determine that there are causal causalities
Prompt word.
Preferably, before to determine current statement not be cause and effect sentence, further includes:
If not including any clear causality for including in clear causality prompt set of words in current statement to mention
Show word, then judges whether any FUZZY RELATION OF CAUSE AND EFFECT comprising including in FUZZY RELATION OF CAUSE AND EFFECT prompt set of words mentions in current statement
Show word, if it is, current statement is converted to the classifier for being input to and being pre-created after feature vector, if the classifier
The result of output is preset value, it is determined that current statement is cause and effect sentence, if the result of classifier output is not preset value,
It then determines that current statement is not cause and effect sentence, determines the step of current statement is not cause and effect sentence if it is not, then executing;Wherein,
The FUZZY RELATION OF CAUSE AND EFFECT prompt word is to show there may be causal causality prompt word, and the classifier is to utilize
Feature vector that multiple sentences with FUZZY RELATION OF CAUSE AND EFFECT prompt word are converted to and corresponding sentence whether be cause and effect sentence mark
Label training obtains.
Preferably, determination includes that the collection for the entity pair that identification obtains is combined into after causality knowledge base, further includes:
Will the obtained entity of identification to cartesian product pairing is carried out, determine obtained multiple entities to being novel entities
It is right;
To the novel entities to comprising entity carry out cluster operation, obtain multiple being made of causality entity sets
Set pair, wherein comprising being classified as of a sort reason entity or knot in cluster operation in each causality entity sets
Sporocarp;
Retain it is each set to comprising the co-occurrence frequency in the data source be greater than frequency threshold novel entities pair, delete
Other entities pair.
Preferably, further includes:
The co-occurrence frequency of the novel entities pair of each reservation is added into the causality knowledge base.
Preferably, further includes:
The support of each novel entities pair is calculated according to the following formula, and the support of each novel entities pair is added to institute
It states in causality knowledge base:
SupportNum=(α * Adverb+ β * SentenceType+ γ * Emotion) * Negative;
Wherein, α, β and γ are preset weight coefficient, α > β > γ, and alpha+beta+γ=1;SupportNum is
Support, Adverb are that novel entities correspond to the degree adverb reciprocal fraction for including in sentence, and SentenceType is novel entities pair
The causality prompt word reciprocal fraction for including in sentence is answered, Emotion is that novel entities correspond to the emotion word pair for including in sentence
Score is answered, Negative is that novel entities correspond to the negative word reciprocal fraction for including in sentence.
Preferably, further includes:
Reason entity to be predicted is compared with each reason entity in newest obtained causality knowledge base, and is exported
Novel entities corresponding with entity the reason of reason Entities Matching to be predicted to comprising result entity, novel entities are to corresponding co-occurrence frequency
Rate and novel entities are to corresponding support.
A kind of construction device of causality knowledge base, comprising:
Preprocessing module is used for: being processed into multiple languages by obtaining text data in data source, and by the text data
Sentence;
Module is constructed, is used for: determining to identify institute there are causal cause and effect sentence in the multiple sentences obtained by processing
The entity pair for including in cause and effect sentence is stated, and determination includes to identify that the collection of obtained entity pair is combined into causality knowledge base;Its
In, the entity is to including reason entity and result entity;
Incremental learning module, is used for: every at the time of by obtaining text data in the data source to pass through preset time
Section, then judge whether the total amount of data changed in the data source reaches data-quantity threshold, if it is, return execute it is described by
The step of text data is obtained in data source, if it is not, then determining without rebuilding building causality knowledge base.
A kind of building equipment of causality knowledge base, comprising:
Memory, for storing computer program;
Processor realizes the building of the as above any one causality knowledge base when for executing the computer program
The step of method.
A kind of computer readable storage medium is stored with computer program on the computer readable storage medium, described
The step of as above construction method of any one causality knowledge base is realized when computer program is executed by processor.
The present invention provides construction method, device, equipment and the computer-readable storage mediums of a kind of causality knowledge base
Matter, wherein this method comprises: being processed into multiple sentences by obtaining text data in data source, and by the text data;By
It manages and determines to identify the entity pair for including in the cause and effect sentence there are causal cause and effect sentence in obtained multiple sentences, and
Determination includes that the collection for the entity pair that identification obtains is combined into causality knowledge base;Wherein, the entity is to including reason entity
And result entity;It is every at the time of by obtaining text data in the data source to pass through preset time period, then judge the number
Whether reach data-quantity threshold according to the total amount of data changed in source, if it is, it is described by obtaining text in data source to return to execution
The step of notebook data, if it is not, then determining without rebuilding building causality knowledge base.In technical solution disclosed in the present application,
The text data in data source is obtained, the text data that will acquire is processed into after sentence, will exist in the cause and effect sentence in sentence
Entity to identifying processing, causality knowledge base is combined into obtain the collection comprising entity pair, to be based on the causality knowledge
Causal identification is realized in library, and then realizes prediction of result;And per after a period of time, then judging to become in data source
Whether the data volume of change is enough, if it is, thinking causality present in data source, there may be more variations, in turn
Again realize otherwise the building of causality knowledge base then waits until next judgement.As it can be seen that technical solution disclosed in the present application
In, building complete causality knowledge base after, can also periodically judge the causality in data source whether may have occurred compared with
More variations rebuilds causality knowledge base when more variation may occur with the causality in data source, thus
Frame based on this eternal study ensure that causality knowledge base matches with the causality in data source, so that cause and effect
Relational knowledge base has real-time effectiveness, and then ensure that accuracy when realizing prediction of result based on causality knowledge base.
Detailed description of the invention
In order to more clearly explain the embodiment of the invention or the technical proposal in the existing technology, to embodiment or will show below
There is attached drawing needed in technical description to be briefly described, it should be apparent that, the accompanying drawings in the following description is only this
The embodiment of invention for those of ordinary skill in the art without creative efforts, can also basis
The attached drawing of offer obtains other attached drawings.
Fig. 1 is a kind of flow chart of the construction method of causality knowledge base provided in an embodiment of the present invention;
Fig. 2 is cluster operation acquired results in a kind of construction method of causality knowledge base provided in an embodiment of the present invention
Exemplary diagram;
Fig. 3 is a kind of structural schematic diagram of the construction device of causality knowledge base provided in an embodiment of the present invention.
Specific embodiment
Following will be combined with the drawings in the embodiments of the present invention, and technical solution in the embodiment of the present invention carries out clear, complete
Site preparation description, it is clear that described embodiments are only a part of the embodiments of the present invention, instead of all the embodiments.It is based on
Embodiment in the present invention, it is obtained by those of ordinary skill in the art without making creative efforts every other
Embodiment shall fall within the protection scope of the present invention.
Referring to Fig. 1, it illustrates a kind of streams of the construction method of causality knowledge base provided in an embodiment of the present invention
Cheng Tu may include:
S11: multiple sentences are processed by obtaining text data in data source, and by text data.
A kind of execution subject of the construction method of causality knowledge base provided in an embodiment of the present invention can be corresponding
Construction device;Under big data environment, selection is covered abundant in content, is related to that field is extensive, and by the data of big well-established approval
It is the most important thing for carrying out knowledge extraction, the present embodiment select Chinese wikipedia as data source, real-time update, can be random
Access, be maximum network resources system on internet, and most contents all pass through different user multiple edit validation it is complete
At the comprehensive and accuracy for having fully ensured that data.Natural language processing is carried out to the text data obtained in data source, is obtained
To corresponding multiple sentences, sentence set XML file can be made of obtained whole sentences, it is this that text data is handled
Technical solution to sentence is consistent with the realization principle for corresponding to technical solution in the prior art, and details are not described herein.
S12: determining in the multiple sentences obtained by processing there are causal cause and effect sentence, identifies in cause and effect sentence and includes
Entity pair, and determination includes that the obtained collection of entity pair of identification is combined into causality knowledge base;Wherein, entity is to including original
Because of entity and result entity.
Wherein, entity is Chinese vocabulary, such as teacher, temperature gos up, incurs loss all is entity;Cause and effect sentence be there are because
The sentence of fruit relationship includes showing the entity (reason entity) of reason and showing the entity (result entity) of result in cause and effect sentence,
After determining cause and effect sentence, by identified in cause and effect sentence it includes entity pair, so that obtaining the collection comprising entity pair is combined into cause and effect
Relational knowledge base.Wherein, the realization principle one for identifying the entity pair for including in cause and effect sentence and corresponding to technical solution in the prior art
It causes, details are not described herein.It obtains in causality knowledge base, can realize causal knowledge based on causality knowledge base
Not, and then prediction of result is realized, specifically, when there is the reason of needing realization prediction, by reason entity and causality
Reason entity in knowledge base is compared, if existing in causality knowledge base and needing the reason of predicting Entities Matching
The reason of (identical) entity, it is determined that the result entity that entity centering corresponding with the reason of predicting entity is needed includes be and need
The reason of predicting entity corresponds to result entity namely the bright result of the fructufy body surface is to need the reason of predicting corresponding knot
Fruit.
S13: it is every at the time of by obtaining text data in data source to pass through preset time period, then judge to become in data source
Whether the total amount of data of change reaches data-quantity threshold, if it is, the step of executing by obtaining text data in data source is returned,
If it is not, then determining without rebuilding building causality knowledge base.
Wherein, preset time period and the specific value of data-quantity threshold can be set according to actual needs, by
At the time of once by obtaining text data in database, every data volume for then judging to change in data source by certain time
It is whether enough, to thereby determine whether to need to rebuild causality knowledge base.It should be noted that data-quantity threshold can root
It is set according to actual needs, the data volume changed in data source is enough it may be considered that may have occurred in data source more
Causality variation, at this time carry out causality knowledge base reconstruction, and building obtain new causality knowledge base,
It then needs to realize prediction of result using newest obtained causality knowledge base when needing to carry out prediction of result again.As it can be seen that this Shen
Please the variable quantity timing based on data in data source in embodiment judge whether the building for needing to carry out causality knowledge base, and
The reconstruction that causality knowledge base is realized when the data volume for determining variation is sufficiently large, to ensure that causality knowledge base
The data for having occurred and that variation can be met, be also to construct complete and practical causality knowledge base, the present embodiment is using every
When more causality variation may occur in data source, then this eternal study of reconstruction of causality knowledge base is carried out
Frame, increment type decimation pattern ensure that the real-time effectiveness of causality knowledge base.
In technical solution disclosed in the present application, the text data in data source is obtained, the text data that will acquire is processed into
After sentence, by entity present in the cause and effect sentence in sentence to identifying processing, cause and effect is combined into obtain the collection comprising entity pair
Relational knowledge base, to realize causal identification based on the causality knowledge base;And per after a period of time, then judging
Whether changed data volume is enough in data source, if it is, thinking that causality present in data source may deposit
In more variation, and then the building of causality knowledge base is realized again, otherwise, then wait until next judgement.As it can be seen that the application
In disclosed technical solution, after causality knowledge base is completed in building, it can also periodically judge that the causality in data source is
It is no to may have occurred more variation, cause and effect pass is rebuild when more variation may occur with the causality in data source
It is knowledge base, so that the frame based on this eternal study ensure that causality knowledge base and the causality phase in data source
Matching so that causality knowledge base has real-time effectiveness, and then ensure that and realize that result is pre- based on causality knowledge base
Accuracy when survey.
A kind of construction method of causality knowledge base provided in an embodiment of the present invention determines whether any sentence is presence
Causal cause and effect sentence may include:
It determines that any sentence is current statement, whether judges in current statement comprising in clear causality prompt set of words
Any clear causality prompt word for including, if it is, determining that current statement is cause and effect sentence, if it is not, then determining current
Sentence is not cause and effect sentence;Wherein, specifying causality prompt word is to show to determine that there are the prompts of causal causality
Word.
It should be noted that clear causality prompt word is to show that corresponding sentence determines that there are causal cause and effect passes
Be prompt word, such as because, lead to word;It can be prompted to summarize the clear causality obtained in advance by staff
Set of words, as long as thus any clear causality prompt in sentence comprising including in clear causality prompt set of words
Word, then it is assumed that corresponding sentence is to determine that there are causal cause and effect sentences, otherwise, then it is assumed that corresponding sentence is not cause and effect sentence.From
And it can quickly and effectively determine whether sentence is cause and effect sentence in this way.
A kind of construction method of causality knowledge base provided in an embodiment of the present invention, determining current statement not is cause and effect language
Before sentence, can also include:
If not including any clear causality for including in clear causality prompt set of words in current statement to mention
Show word, then judges whether any FUZZY RELATION OF CAUSE AND EFFECT comprising including in FUZZY RELATION OF CAUSE AND EFFECT prompt set of words mentions in current statement
Show word, if it is, current statement is converted to the classifier for being input to and being pre-created after feature vector, if classifier exports
Result be preset value, it is determined that current statement be cause and effect sentence, if classifier output result be not preset value, it is determined that when
Preceding sentence is not cause and effect sentence, determines the step of current statement is not cause and effect sentence if it is not, then executing;Wherein, fuzzy causation
Relationship prompt word is to show there may be causal causality prompt word, and classifier is to have fuzzy causation using multiple
Whether feature vector that the sentence of relationship prompt word is converted to and corresponding sentence are that the label training of cause and effect sentence obtains.
It should be noted that FUZZY RELATION OF CAUSE AND EFFECT prompt word be show corresponding sentence there may be but cannot completely really
Surely whether there is causal causality prompt word, such as then, following word;It can be summarized in advance for staff
FUZZY RELATION OF CAUSE AND EFFECT out prompts set of words, when thus not including clear causality prompt word in sentence, in if statement
Include any FUZZY RELATION OF CAUSE AND EFFECT prompt word for including in FUZZY RELATION OF CAUSE AND EFFECT prompt set of words, then it is assumed that corresponding sentence is possible
There are causal cause and effect sentences, otherwise, then it is assumed that corresponding sentence is not cause and effect sentence.To be further ensured that whether judge sentence
For the accuracy of cause and effect sentence.In addition, existing technology is only limitted to carry out relationship in the sentence for containing causality prompt word
Extraction, therefore comprehensively, accurately identification causality prompt word for promoted obtain causal quality have it is great
Help.The present embodiment is based on Chinese literature knowledge and comprehensively summarizes existing causality prompt word, to accurately divide
Class difference causality, so carry out through this embodiment sentence whether be cause and effect sentence multiple judgement.Additionally need explanation
Be, in the present embodiment using classifier realize sentence whether be cause and effect sentence judgement, actually be use LTP natural language processing skill
Art, according to the judgement of the realizations such as part-of-speech rule, syntax dependence, principal component analysis.It specifically, can be by manually marking instruction
Practice collection, the training sample for including in training set be the feature vector being converted by the sentence comprising FUZZY RELATION OF CAUSE AND EFFECT prompt word and
This feature vector corresponds to whether sentence is that the label of cause and effect sentence forms, to can identify using training set training is available
The classifier of cause and effect sentence, and then realize the identification of FUZZY RELATION OF CAUSE AND EFFECT prompt word, accuracy is higher.Wherein, classifier can be
The classifier that NB Algorithm is realized, in simple terms, whether the sentence containing FUZZY RELATION OF CAUSE AND EFFECT prompt word is cause and effect sentence
Identification problem can directly be defined as two classification problems, i.e., are as follows: be or be not cause and effect sentence (0 | 1 problem).Based on engineering
Learning method can convert sentence to feature vector, for example the feature vector of sentence X indicates are as follows: X=(x1, x2...xn), xi (i
It is indicated for 1 to the numeralization for n) being sentence word sequence;Similarly, the categorised decision variable C={ 1,0 } of cause and effect sentence, 1 expression sentence are
Cause and effect sentence, 0 indicates that sentence is non-causal sentence.Using machine learning algorithm to given one group of training set { { X1 }, { X2 } ...
{ Xn } }, it whether is cause and effect sentence according to following equation learning objective sentence.
f:X→C
Using the method for machine learning based on the training set Study strategies and methods f manually marked.So that giving new sentence just
Can judge whether it is cause and effect sentence.
In addition, the NB Algorithm due to text classification has stable classification effectiveness and higher classification performance,
Classifier is realized using NB Algorithm in the present embodiment.The algorithm is less sensitive to missing data, and feature vector is each
Component is relatively independent relative to decision variable, so that the adaptability of the algorithm compares analogous algorithms with complexity with apparent excellent
Gesture.There is the characteristic of division of the sentence of FUZZY RELATION OF CAUSE AND EFFECT prompt word by observing, it is known that corresponding sentence is based primarily upon cause and effect pass
It is prompt word, context-sensitive morphology, syntactic feature, due to the randomness and diversity that Chinese expression is semantic, so these are special
Weak dependence is presented between sign, corresponding Chinese expresses weight justice and most of cause and effect sentence is caused not have complete syntactic structure, i.e., are as follows: instruction
Practice the loss of the data of sample.To sum up feature carries out the differentiation of sentence using NB Algorithm.It can be calculated by following equation
Its posterior probability:
A kind of construction method of causality knowledge base provided in an embodiment of the present invention, determination include the reality that identification obtains
The collection of body pair is combined into after causality knowledge base, can also include:
Will the obtained entity of identification to cartesian product pairing is carried out, determine obtained multiple entities to being novel entities
It is right;
To novel entities to comprising entity carry out cluster operation, obtain multiple set being made of causality entity sets
It is right, wherein comprising being classified as of a sort reason entity or fructufy in cluster operation in each causality entity sets
Body;
Retain it is each set to comprising the co-occurrence frequency in data source be greater than frequency threshold novel entities pair, delete other
Entity pair.
Entity obtained in step S12 is to may be due to lengthy and jumbled and meaningless, it is difficult to form knowledge.Therefore the present embodiment
In based on the similitude of entity to the entity of identification to doing cluster operation.Specifically, to identification obtain whole entities (including
Reason entity and result entity) cartesian product pairing is carried out, multiple novel entities are obtained, cluster behaviour is carried out to this multiple novel entities
Make, obtain corresponding multiple set pair, each set centering includes the causality entity sets and result of reason entity composition
Entity composition causality entity sets, and then only by entity representative in each causality entity sets,
Greater than the novel entities of the frequency threshold set according to actual needs to reservation, remaining is then deleted co-occurrence frequency as in data source
It removes.Wherein, the co-occurrence frequency of novel entities pair is that novel entities account in data source the sentence sum occurred jointly in data source
The probability for the whole sentences sum for including, so that it is determined that by the most representative entity of the entity centering being all likely to occur
It is right, and then ensure that the accuracy that prediction of result is realized using causality knowledge base.
It illustrates below and the present embodiment is specifically described, such as sentence are as follows: due to climate warming, snowcap melts, draws
It sends out snow property mountain torrents molten, causes more areas disaster-stricken, make the common people by serious financial consequences.Indicate causal entity to including:
Temperature rise snow melting mountain torrents → disaster-stricken incur loss.To entity to cartesian product pairing is carried out, obtain as follows
Each novel entities pair:
Data source, which is based on, based on TF-IDF method counts each novel entities centering reason part and result part in data source
In co-occurrence frequency.It is superseded lower than being carried out as trustless causality if certain threshold value if its co-occurrence frequency;
Otherwise retained as trusted causality.Retain its co-occurrence frequency as later period causal trusted simultaneously
Spend index.To novel entities to comprising whole entities carry out cluster operation, obtain effect picture as shown in Figure 2, will most generation
The entity of table is retained, and when input " overcasting ", " lightning accompanied by peals of thunder ", following result can be obtained:
Wherein, 0.38,0.52,0.78 is the co-occurrence frequency for corresponding to novel entities pair, and alternatively referred to as thus kind reason leads to this
The intensity that kind result occurs.
A kind of construction method of causality knowledge base provided in an embodiment of the present invention can also include:
The co-occurrence frequency of the novel entities pair of each reservation is added into causality knowledge base.
It should be noted that the co-occurrence frequency of novel entities pair can consider that novel entities cause it to correspond to result corresponding reason
Intensity, therefore the co-occurrence frequency of novel entities pair is added into causality knowledge base, is inquired when can be for needing, into
The perfect causality knowledge base of one step.
A kind of construction method of causality knowledge base provided in an embodiment of the present invention can also include:
Calculate the support of each novel entities pair according to the following formula, and by the support of each novel entities pair be added to because
In fruit relational knowledge base:
SupportNum=(α * Adverb+ β * SentenceType+ γ * Emotion) * Negative;
Wherein, α, β and γ are preset weight coefficient, α > β > γ, and alpha+beta+γ=1;SupportNum is
Support, Adverb are that novel entities correspond to the degree adverb reciprocal fraction for including in sentence, and SentenceType is novel entities pair
The causality prompt word reciprocal fraction for including in sentence is answered, Emotion is that novel entities correspond to the emotion word pair for including in sentence
Score is answered, Negative is that novel entities correspond to the negative word reciprocal fraction for including in sentence.
Wherein, the specific value of every weight coefficient and the score value of various words can be set according to actual needs
Fixed, the present embodiment on the basis of causality, mentions in excavating sentence according to emotion word, degree adverb, causality in sentence
Show that word and negative word calculate causal support between reason entity and corresponding result entity.It should be noted that cause and effect
The degree namely reason part that the support or intensity of relationship refer to that reason entity influence result entity occurs in cause and effect sentence have
Great probability leads to the generation of result;And support and intensity are to indicate above-mentioned implication from different perspectives.
Degree adverb refers to the adverbial word that adverbial word or adjective are modified or limited in degree in sentence, is used for
Express semantic intensity or word denoting the receiver of an action degree in sentence.The semantic component served as according to degree adverb in text, it is known that either opposite
Degree adverb or degree absolute adverbial word, all various trait sentimental polarity degree in expression text.Therefore, it is based on Chinese knowledge
The magnitude classification method of middle degree adverb requires according to causality support, degree adverb is made the appropriate adjustments and is assigned respectively
It, specifically can be as shown in table 1 with different polarity numbers (score):
1 degree adverb polarity number of table
In sentence emotion word be expression actor tendentiousness is passed judgement on to word denoting the receiver of an action person, have certain emotion, because
Also there is a degree of polarity effect, so the present embodiment is closed using Chinese emotion word as cause and effect is influenced in the expression of fruit relationship
It is that a weak factor of support is paid attention to.It can be that each emotion word sets corresponding score previously according to actual needs,
In general, emotion word indicate pass judgement on tendentiousness be more obvious, emotion it is heavier, corresponding score is bigger;It can also be based on
Emotion word is divided into 7 major class and 20 groups by the ontology library of the third party's emotion vocabulary obtained in advance;And by the polarity of emotion word
It is divided into 9,7,5,3,1 five ranks by descending order, wherein the classification chart of emotion word can be as shown in table 2, and each emotion word exists
There is different ranks in different groups, rank of each emotion word in all groups can be weighted at this time
To value be then corresponding emotion word score, weight coefficient can be set according to actual needs, such as " happiness " this emotion
Word rank in " happy " this group is 9, and rank is 7 in this group in " feeling at ease ", etc..
The classification of 2 emotion word of table
For causality between expressing entity in Chinese based on different causality prompt words, the cause and effect pass of expression
System will have different semantic intensity.The causal semanitics of rigorous auxiliary type causality prompt word expression are opposite be based on it is fuzzy because
The causal semanitics of fruit relationship prompt word expression will have stronger support.Such theory can be explained to a certain extent are as follows: base
There is stronger certainty in the causality that Chinese corpus extracts in rigorous causality prompt word.The present embodiment is based on this
Kind theory is according to the difference of causality prompt word, and the causality to extract is assigned to different cause and effect supports, such as 3 institute of table
Show:
3 causality prompt word polarity number of table
Cause and effect prompt word part of speech | Polarity number | Cause and effect prompt word part of speech | Polarity number |
Nested cause and effect conjunction | 0.7 | The adverbial word of table cause and effect | 0.3 |
Single conjunction of table cause and effect | 0.5 | The verb of table cause and effect | 0.3 |
The preposition of table cause and effect | 0.1 | The verb of table result production | 0.6 |
Causality generally can be divided into positive association and two kinds of negative customers;It is having the same that positive association often shows as cause and effect
Trend feature, i.e. cause and effect have the tendency that increase or reduction simultaneously;Negative customers then have different growths to become on the contrary, showing as the two
Gesture;I.e. reason causes the generation of result and reason to inhibit the generation of result.The identification of such semantic results
Causal judgement is carried out based on negative word herein.If expressing in the sentence of cause and effect and negative word occur, then it represents that such
Causality has inhibition relationship.Therefore it is recognized in the present embodiment based on negative word causal positive and negative.Specifically, negate
Word reciprocal fraction is that there are negative words in if statement, it is determined that negative word reciprocal fraction is 1, otherwise then determines negative word pair
Answering score is -1, and negates that vocabulary can be as shown in table 4:
The Chinese negative word of table 4
To be indicated by calculating the degree adverb, causality prompt word, emotion word and the negative word that include in sentence
The support of influence degree between causality, and record into causality knowledge base, thus further perfect cause and effect pass
It is knowledge base.It should be noted that if the sentence comprising some novel entities pair be it is multiple, this multiple sentence are new for this
Support of the entity to the mean value of calculated support as the novel entities pair;In addition, obtaining the intensity and branch of novel entities pair
After degree of holding, it can be deduced that such as the causality of several classifications in table 5, wherein strong reason → strong resulting class is corresponding new real
The support and intensity of body pair are respectively greater than or are equal to corresponding threshold value, and weak reason → strong resulting class corresponds to novel entities pair
Support is greater than corresponding threshold value, intensity is less than corresponding threshold value, and weak reason → weak resulting class corresponds to the support of novel entities pair
And intensity is respectively less than or equal to corresponding threshold value, the support that strong reason → weak resulting class corresponds to novel entities pair is less than pair
Threshold value, intensity is answered to be greater than corresponding threshold value, wherein every threshold value can be set according to actual needs, and can also will be each
The classification of novel entities pair is also added in causality knowledge base, for inquiry.
5 causality category table of table
Strong reason → strong result | Weak reason → weak result |
Weak reason → strong result | Strong reason → weak result |
A kind of construction method of causality knowledge base provided in an embodiment of the present invention can also include:
Reason entity to be predicted is compared with each reason entity in newest obtained causality knowledge base, and is exported
Novel entities corresponding with entity the reason of reason Entities Matching to be predicted to comprising result entity, novel entities are to corresponding co-occurrence frequency
Rate and novel entities are to corresponding support.
Wherein, any to need the reason of realizing prediction of result correspondent entity and can be used as reason entity to be predicted, it is based on
Newest obtained causality knowledge base is capable of determining that (same or similar degree is greater than preparatory root with reason Entities Matching to be predicted
According to the similarity threshold of actual needs setting) the reason of entity, and then determine the corresponding result entity of the reason entity be with to
It predicts the corresponding result entity of reason entity, and then realizes prediction of result, and will result corresponding with reason entity to be predicted
Intensity, support and the classification of the affiliated novel entities pair of entity export, to realize the comprehensive and integrity of result output.
In addition, realize the result correspondent entity of reason prediction if necessary, then it can be using the entity as knot to be predicted
Sporocarp, so that result entity to be predicted is compared with each result entity in newest obtained causality knowledge base, and
Novel entities corresponding with the matched result entity of result entity to be predicted are exported to entity, novel entities the reason of including to corresponding total
Existing frequency and novel entities are to corresponding support;Wherein, with the matched result entity of result entity to be predicted can for to pre-
Survey the result entity that the same or similar degree of result entity is greater than the similarity threshold previously according to actual needs setting, Jin Ershi
Existing reason prediction, certainly can also export the classification of novel entities pair corresponding with result entity to be predicted, to realize knot
The comprehensive and integrity of fruit output.
Technical solution disclosed in the present application excavates the causality between entity, under internet big data background with cause and effect
The form of relationship entity pair is presented to user, on the one hand alleviates the puzzlement that people are brought by " information overload ", on the other hand
The advantage for taking full advantage of big data has pushed the change of information technology;Meanwhile using a kind of increment type duration machine learning
Frame carries out increment extraction to causality entity, improves the real-time effectiveness of causality knowledge base.As it can be seen that the application is public
The technical solution opened is under internet big data background, using eternal learning framework, increment type decimation pattern, at the same learn just,
Negative customers and causalnexus intensity, with building and gradually complete causality knowledge base, to qualitative reasoning and north because explaining offer
It supports.In the environment of " internet "+" big data ", it can make full use of community network resource, acquire Various types of data in real time;Institute
The causality knowledge base of building, by effectively in promote information-intensive society process, be public demands and scientific and technological resources basic data
Shared to provide big data analysis service, industrial application prospect is wide.
The embodiment of the invention also provides a kind of construction devices of causality knowledge base, as shown in figure 3, may include:
Preprocessing module 11, is used for: being processed into multiple sentences by obtaining text data in data source, and by text data;
Module 12 is constructed, is used for: determining that there are causal cause and effect sentence, identifications in the multiple sentences obtained by processing
The entity pair for including in cause and effect sentence, and determination includes to identify that the collection of obtained entity pair is combined into causality knowledge base;Wherein,
Entity is to including reason entity and result entity;
Incremental learning module 13, is used for: it is every at the time of by obtaining text data in data source to pass through preset time period,
Then judge whether the total amount of data changed in data source reaches data-quantity threshold, executes if it is, returning by being obtained in data source
The step of taking text data, if it is not, then determining without rebuilding building causality knowledge base.
A kind of construction device of causality knowledge base provided in an embodiment of the present invention, building module may include:
First judging unit, is used for: determine that any sentence is current statement, judge in current statement whether comprising it is clear because
Any clear causality prompt word for including in fruit relationship prompt set of words, if it is, determine that current statement is cause and effect sentence,
If it is not, then determining that current statement is not cause and effect sentence;Wherein, specifying causality prompt word is to show to determine that there are cause and effect passes
The causality prompt word of system.
A kind of construction device of causality knowledge base provided in an embodiment of the present invention, building module can also include:
Second judgment unit is used for: before to determine current statement not be cause and effect sentence, if do not included in current statement bright
Whether any clear causality prompt word for including in true causality prompt set of words then judges in current statement comprising mould
Any FUZZY RELATION OF CAUSE AND EFFECT prompt word for including in paste causality prompt set of words, if it is, converting current statement to
The classifier being pre-created is input to after feature vector, if the result of classifier output is preset value, it is determined that current statement
For cause and effect sentence, if the result of classifier output is not preset value, it is determined that current statement is not cause and effect sentence, if it is not, then
It indicates that the first judging unit executes and determines the step of current statement is not cause and effect sentence;Wherein, FUZZY RELATION OF CAUSE AND EFFECT prompt word is
Show there may be causal causality prompt word, classifier is to have FUZZY RELATION OF CAUSE AND EFFECT prompt word using multiple
Whether feature vector that sentence is converted to and corresponding sentence are that the label training of cause and effect sentence obtains.
A kind of construction device of causality knowledge base provided in an embodiment of the present invention can also include:
Reprocess module, be used for: determination includes that the collection for the entity pair that identification obtains is combined into after causality knowledge base,
Will the obtained entity of identification to cartesian product pairing is carried out, determine obtained multiple entities to being novel entities pair;To new reality
Body to comprising entity carry out cluster operation, obtain multiple set pair being made of causality entity sets, wherein Mei Geyin
Comprising being classified as of a sort reason entity or result entity in cluster operation in fruit relationship entity set;Retain each set
To comprising the co-occurrence frequency in data source be greater than frequency threshold novel entities pair, delete other entities pair.
A kind of construction device of causality knowledge base provided in an embodiment of the present invention can also include:
Adding module is used for: the co-occurrence frequency of the novel entities pair of each reservation is added into causality knowledge base.
A kind of construction device of causality knowledge base provided in an embodiment of the present invention can also include:
Computing module is used for: calculating the support of each novel entities pair according to the following formula, and by each novel entities pair
Support is added into causality knowledge base:
SupportNum=(α * Adverb+ β * SentenceType+ γ * Emotion) * Negative;
Wherein, α, β and γ are preset weight coefficient, α > β > γ, and alpha+beta+γ=1;SupportNum is
Support, Adverb are that novel entities correspond to the degree adverb reciprocal fraction for including in sentence, and SentenceType is novel entities pair
The causality prompt word reciprocal fraction for including in sentence is answered, Emotion is that novel entities correspond to the emotion word pair for including in sentence
Score is answered, Negative is that novel entities correspond to the negative word reciprocal fraction for including in sentence.
A kind of construction device of causality knowledge base provided in an embodiment of the present invention can also include:
Comparison module is used for: by each reason entity in reason entity to be predicted and newest obtained causality knowledge base
Be compared, and export corresponding with entity the reason of reason Entities Matching to be predicted novel entities to comprising result entity, new reality
Body is to corresponding co-occurrence frequency and novel entities to corresponding support.
The embodiment of the invention also provides a kind of building equipment of causality knowledge base, may include:
Memory, for storing computer program;
Processor realizes the step of the as above construction method of any one causality knowledge base when for executing computer program
Suddenly.
The embodiment of the invention also provides a kind of computer readable storage medium, it is stored on computer readable storage medium
Computer program realizes the step of the as above construction method of any one causality knowledge base when computer program is executed by processor
Suddenly.
It should be noted that construction device, equipment and the meter of a kind of causality knowledge base provided in an embodiment of the present invention
The explanation of relevant portion refers to a kind of causality knowledge base provided in an embodiment of the present invention in calculation machine readable storage medium storing program for executing
The detailed description of corresponding part in construction method, details are not described herein.In addition above-mentioned technical proposal provided in an embodiment of the present invention
In with correspond to the consistent part of technical solution realization principle and unspecified in the prior art, in order to avoid excessively repeat.
The foregoing description of the disclosed embodiments can be realized those skilled in the art or using the present invention.To this
A variety of modifications of a little embodiments will be apparent for a person skilled in the art, and the general principles defined herein can
Without departing from the spirit or scope of the present invention, to realize in other embodiments.Therefore, the present invention will not be limited
It is formed on the embodiments shown herein, and is to fit to consistent with the principles and novel features disclosed in this article widest
Range.
Claims (10)
1. a kind of construction method of causality knowledge base characterized by comprising
Multiple sentences are processed by obtaining text data in data source, and by the text data;
It determines to identify the reality for including in the cause and effect sentence there are causal cause and effect sentence in the multiple sentences obtained by processing
Body pair, and determination includes to identify that the collection of obtained entity pair is combined into causality knowledge base;Wherein, the entity is to including original
Because of entity and result entity;
It is every at the time of by obtaining text data in the data source to pass through preset time period, then judge to become in the data source
Whether the total amount of data of change reaches data-quantity threshold, if it is, it is described by obtaining text data in data source to return to execution
Step, if it is not, then determining without rebuilding building causality knowledge base.
2. the method according to claim 1, wherein determining whether any sentence is that there are causal causes and effects
Sentence, comprising:
It determines that any sentence is current statement, whether judges in current statement comprising including in clear causality prompt set of words
Any clear causality prompt word, if it is, determine current statement be cause and effect sentence, if it is not, then determine current statement
It is not cause and effect sentence;Wherein, the clear causality prompt word is to show to determine that there are the prompts of causal causality
Word.
3. according to the method described in claim 2, it is characterized in that, before determining that current statement is not cause and effect sentence, further includes:
If not including any clear causality prompt word for including in clear causality prompt set of words in current statement,
Then judge in current statement whether to include any FUZZY RELATION OF CAUSE AND EFFECT prompt word for including in FUZZY RELATION OF CAUSE AND EFFECT prompt set of words,
If it is, current statement is converted to the classifier for being input to and being pre-created after feature vector, if the classifier exports
Result be preset value, it is determined that current statement be cause and effect sentence, if the classifier output result be not preset value, really
Determining current statement not is cause and effect sentence, determines the step of current statement is not cause and effect sentence if it is not, then executing;Wherein, described
FUZZY RELATION OF CAUSE AND EFFECT prompt word is to show there may be causal causality prompt word, and the classifier is using multiple
Feature vector that sentence with FUZZY RELATION OF CAUSE AND EFFECT prompt word is converted to and corresponding sentence whether be cause and effect sentence label instruction
It gets.
4. the method according to claim 1, wherein determination include the obtained collection of entity pair of identification be combined into because
After fruit relational knowledge base, further includes:
Will the obtained entity of identification to cartesian product pairing is carried out, determine obtained multiple entities to being novel entities pair;
To the novel entities to comprising entity carry out cluster operation, obtain multiple set being made of causality entity sets
It is right, wherein comprising being classified as of a sort reason entity or fructufy in cluster operation in each causality entity sets
Body;
Retain it is each set to comprising the co-occurrence frequency in the data source be greater than frequency threshold novel entities pair, delete other
Entity pair.
5. according to the method described in claim 4, it is characterized by further comprising:
The co-occurrence frequency of the novel entities pair of each reservation is added into the causality knowledge base.
6. according to the method described in claim 4, it is characterized by further comprising:
Calculate the support of each novel entities pair according to the following formula, and by the support of each novel entities pair be added to it is described because
In fruit relational knowledge base:
SupportNum=(α * Adverb+ β * SentenceType+ γ * Emotion) * Negative;
Wherein, α, β and γ are preset weight coefficient, α > β > γ, and alpha+beta+γ=1;SupportNum is to support
Degree, Adverb are that novel entities correspond to the degree adverb reciprocal fraction for including in sentence, and SentenceType is that novel entities correspond to language
The causality prompt word reciprocal fraction for including in sentence, Emotion are that novel entities correspond to corresponding point of the emotion word for including in sentence
Number, Negative are that novel entities correspond to the negative word reciprocal fraction for including in sentence.
7. according to the method described in claim 6, it is characterized by further comprising:
Reason entity to be predicted is compared with each reason entity in newest obtained causality knowledge base, and export with to
The reason of predicting reason Entities Matching entity correspond to novel entities to comprising result entity, novel entities to corresponding co-occurrence frequency and
Novel entities are to corresponding support.
8. a kind of construction device of causality knowledge base characterized by comprising
Preprocessing module is used for: being processed into multiple sentences by obtaining text data in data source, and by the text data;
Construct module, be used for: determining in the multiple sentences obtained by processing there are causal cause and effect sentence, identify it is described because
The entity pair for including in fruit sentence, and determination includes to identify that the collection of obtained entity pair is combined into causality knowledge base;Wherein, institute
Entity is stated to including reason entity and result entity;
Incremental learning module, is used for: it is every at the time of by obtaining text data in the data source to pass through preset time period, then
Judge whether the total amount of data changed in the data source reaches data-quantity threshold, if it is, it is described by data to return to execution
The step of text data is obtained in source, if it is not, then determining without rebuilding building causality knowledge base.
9. a kind of building equipment of causality knowledge base characterized by comprising
Memory, for storing computer program;
Processor realizes the causality knowledge base as described in any one of claim 1 to 7 when for executing the computer program
Construction method the step of.
10. a kind of computer readable storage medium, which is characterized in that be stored with computer on the computer readable storage medium
Program, realizing the causality knowledge base as described in any one of claim 1 to 7 when the computer program is executed by processor
The step of construction method.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201811494944.3A CN109308323A (en) | 2018-12-07 | 2018-12-07 | A kind of construction method, device and the equipment of causality knowledge base |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201811494944.3A CN109308323A (en) | 2018-12-07 | 2018-12-07 | A kind of construction method, device and the equipment of causality knowledge base |
Publications (1)
Publication Number | Publication Date |
---|---|
CN109308323A true CN109308323A (en) | 2019-02-05 |
Family
ID=65222443
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201811494944.3A Pending CN109308323A (en) | 2018-12-07 | 2018-12-07 | A kind of construction method, device and the equipment of causality knowledge base |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN109308323A (en) |
Cited By (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110377759A (en) * | 2019-07-22 | 2019-10-25 | 中国工商银行股份有限公司 | Event relation map construction method and device |
CN110674308A (en) * | 2019-08-23 | 2020-01-10 | 上海科技发展有限公司 | Scientific and technological word list expansion method, device, terminal and medium based on grammar mode |
CN111428052A (en) * | 2020-03-30 | 2020-07-17 | 中国科学技术大学 | Method for constructing educational concept graph with multiple relations from multi-source data |
CN112100312A (en) * | 2019-06-18 | 2020-12-18 | 国际商业机器公司 | Intelligent extraction of causal knowledge from data sources |
CN112287111A (en) * | 2020-12-18 | 2021-01-29 | 腾讯科技(深圳)有限公司 | Text processing method and related device |
CN112543897A (en) * | 2019-03-13 | 2021-03-23 | 欧姆龙株式会社 | Analysis device, analysis method, and analysis program |
CN113033809A (en) * | 2021-04-16 | 2021-06-25 | 复旦大学 | Common sense causal reasoning method and system based on weak evidence aggregation |
CN113642321A (en) * | 2021-06-28 | 2021-11-12 | 浙江工业大学 | Financial field-oriented causal relationship extraction method and system |
CN113742445A (en) * | 2021-07-16 | 2021-12-03 | 中国科学院自动化研究所 | Text recognition sample obtaining method and device and text recognition method and device |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8976063B1 (en) * | 2014-04-29 | 2015-03-10 | Google Inc. | Automated detection of vehicle parking and location |
CN104735074A (en) * | 2015-03-31 | 2015-06-24 | 江苏通付盾信息科技有限公司 | Malicious URL detection method and implement system thereof |
CN105550288A (en) * | 2015-12-10 | 2016-05-04 | 百度在线网络技术(北京)有限公司 | Database system updating method and management system |
CN106022018A (en) * | 2016-05-14 | 2016-10-12 | 丁贤根 | CMS object-oriented artificial intelligence information secrecy system |
CN107783973A (en) * | 2016-08-24 | 2018-03-09 | 慧科讯业有限公司 | The methods, devices and systems being monitored based on domain knowledge spectrum data storehouse to the Internet media event |
-
2018
- 2018-12-07 CN CN201811494944.3A patent/CN109308323A/en active Pending
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8976063B1 (en) * | 2014-04-29 | 2015-03-10 | Google Inc. | Automated detection of vehicle parking and location |
CN104735074A (en) * | 2015-03-31 | 2015-06-24 | 江苏通付盾信息科技有限公司 | Malicious URL detection method and implement system thereof |
CN105550288A (en) * | 2015-12-10 | 2016-05-04 | 百度在线网络技术(北京)有限公司 | Database system updating method and management system |
CN106022018A (en) * | 2016-05-14 | 2016-10-12 | 丁贤根 | CMS object-oriented artificial intelligence information secrecy system |
CN107783973A (en) * | 2016-08-24 | 2018-03-09 | 慧科讯业有限公司 | The methods, devices and systems being monitored based on domain knowledge spectrum data storehouse to the Internet media event |
Non-Patent Citations (2)
Title |
---|
杨攀飞: ""因果关系知识库的研究与构建"", 《中国优秀博硕士学位论文全文数据库(硕士) 信息科技辑》 * |
焦玉英 等: ""合作数字参考服务中的知识库建设——DREW与DCVRS的Knowledge Base"", 《图书情报知识》 * |
Cited By (15)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN112543897B (en) * | 2019-03-13 | 2024-02-02 | 欧姆龙株式会社 | Analysis device, analysis method, and storage medium |
CN112543897A (en) * | 2019-03-13 | 2021-03-23 | 欧姆龙株式会社 | Analysis device, analysis method, and analysis program |
CN112100312A (en) * | 2019-06-18 | 2020-12-18 | 国际商业机器公司 | Intelligent extraction of causal knowledge from data sources |
CN110377759B (en) * | 2019-07-22 | 2022-02-11 | 中国工商银行股份有限公司 | Method and device for constructing event relation graph |
CN110377759A (en) * | 2019-07-22 | 2019-10-25 | 中国工商银行股份有限公司 | Event relation map construction method and device |
CN110674308A (en) * | 2019-08-23 | 2020-01-10 | 上海科技发展有限公司 | Scientific and technological word list expansion method, device, terminal and medium based on grammar mode |
CN111428052A (en) * | 2020-03-30 | 2020-07-17 | 中国科学技术大学 | Method for constructing educational concept graph with multiple relations from multi-source data |
CN111428052B (en) * | 2020-03-30 | 2023-06-16 | 中国科学技术大学 | Method for constructing education conceptual diagram with multiple relations from multi-source data |
CN112287111A (en) * | 2020-12-18 | 2021-01-29 | 腾讯科技(深圳)有限公司 | Text processing method and related device |
CN113033809A (en) * | 2021-04-16 | 2021-06-25 | 复旦大学 | Common sense causal reasoning method and system based on weak evidence aggregation |
CN113033809B (en) * | 2021-04-16 | 2023-01-17 | 复旦大学 | Common sense causal reasoning method and system based on weak evidence aggregation |
CN113642321A (en) * | 2021-06-28 | 2021-11-12 | 浙江工业大学 | Financial field-oriented causal relationship extraction method and system |
CN113642321B (en) * | 2021-06-28 | 2024-03-29 | 浙江工业大学 | Financial field-oriented causal relationship extraction method and system |
CN113742445A (en) * | 2021-07-16 | 2021-12-03 | 中国科学院自动化研究所 | Text recognition sample obtaining method and device and text recognition method and device |
CN113742445B (en) * | 2021-07-16 | 2022-09-27 | 中国科学院自动化研究所 | Text recognition sample obtaining method and device and text recognition method and device |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN109308323A (en) | A kind of construction method, device and the equipment of causality knowledge base | |
Hashimoto et al. | Topic detection using paragraph vectors to support active learning in systematic reviews | |
CN106503055B (en) | A kind of generation method from structured text to iamge description | |
Schwarz | ldagibbs: A command for topic modeling in Stata using latent Dirichlet allocation | |
Snyder et al. | Interactive learning for identifying relevant tweets to support real-time situational awareness | |
JP5904559B2 (en) | Scenario generation device and computer program therefor | |
Dima et al. | Automatic noun compound interpretation using deep neural networks and word embeddings | |
WO2015093539A1 (en) | Complex predicate template gathering device, and computer program therefor | |
Jain et al. | A particle swarm optimized learning model of fault classification in Web-Apps | |
CN110851593A (en) | Complex value word vector construction method based on position and semantics | |
Whitehouse et al. | Evaluation of fake news detection with knowledge-enhanced language models | |
Parde et al. | A corpus of metaphor novelty scores for syntactically-related word pairs | |
Wu et al. | Maximum entropy-based sentiment analysis of online product reviews in Chinese | |
Singh et al. | Utilizing Advanced Artificial Intelligence for Early Detection of Epidemic Outbreaks through Global Data Analysis | |
Ruposh et al. | A computational approach of recognizing emotion from Bengali texts | |
Yang et al. | Learning topic-oriented word embedding for query classification | |
Jeong et al. | Discovery of research interests of authors over time using a topic model | |
Isaeva et al. | Neurostimulation for Finding Metaphor in Terminology | |
Bai et al. | Gated character-aware convolutional neural network for effective automated essay scoring | |
Voronov et al. | Forecasting popularity of news article by title analyzing with BN-LSTM network | |
Das et al. | Graph-based text summarization and its application on COVID-19 twitter data | |
Alsheri et al. | MOOCSent: a sentiment predictor for massive open online courses | |
Ullah et al. | Unveiling the Power of Deep Learning: A Comparative Study of LSTM, BERT, and GRU for Disaster Tweet Classification | |
El Kah et al. | Arabic authorship attribution on twitter: what is really matters? | |
Ma et al. | Aspect-Based Attention LSTM for Aspect-Level Sentiment Analysis |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20190205 |
|
RJ01 | Rejection of invention patent application after publication |