CN109918646B - Method, system and device for judging causal relationship of chapters - Google Patents

Method, system and device for judging causal relationship of chapters Download PDF

Info

Publication number
CN109918646B
CN109918646B CN201910089352.1A CN201910089352A CN109918646B CN 109918646 B CN109918646 B CN 109918646B CN 201910089352 A CN201910089352 A CN 201910089352A CN 109918646 B CN109918646 B CN 109918646B
Authority
CN
China
Prior art keywords
target text
registration event
causal relationship
experience
registration
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201910089352.1A
Other languages
Chinese (zh)
Other versions
CN109918646A (en
Inventor
向露
刘洋
张家俊
周玉
宗成庆
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Institute of Automation of Chinese Academy of Science
Original Assignee
Institute of Automation of Chinese Academy of Science
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Institute of Automation of Chinese Academy of Science filed Critical Institute of Automation of Chinese Academy of Science
Priority to CN201910089352.1A priority Critical patent/CN109918646B/en
Publication of CN109918646A publication Critical patent/CN109918646A/en
Application granted granted Critical
Publication of CN109918646B publication Critical patent/CN109918646B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Landscapes

  • Machine Translation (AREA)

Abstract

The invention belongs to the technical field of natural language processing, and particularly relates to a method, a system and a device for judging discourse causal relationship, aiming at solving the problem of judging discourse causal relationship in robot interaction. The method comprises the following steps: based on a language activation model, acquiring a registration event with the highest matching degree for each target text in an input target text pair; based on the registration event corresponding to each target text, calculating the correlation between the two registration events according to the stored registration event sequence in each scene; and calculating the causal relationship of the target text pair based on the correlation of the target text pair and the two registration events. The method and the device can accurately judge the cause and effect relationship of the input target text pair.

Description

Method, system and device for judging causal relationship of chapters
Technical Field
The invention belongs to the technical field of natural language processing, and particularly relates to a method, a system and a device for judging causal relationship of chapters.
Background
The chapters are almost inseparable from the daily communication of people. People more frequently use discourse relationships to express and convey semantic relationships (e.g., causal relationships, progressive relationships, turning relationships, etc.) between contexts. Meanwhile, as the robot gradually plays an increasingly important role in the daily life of people, how to make the robot understand the chapter relationship in the daily expression of people becomes an irremediable problem. However, the difficulty of the problem is that a good theory and model framework does not exist, and the existing method is limited at a text level and difficult to acquire the inherent semantics of the text, so that the existing method still has many problems.
Recently, in the field of artificial intelligence, researchers have proposed an idea of episodic language learning, which considers that "the language semantics mainly come from the environment", and based on this idea, if a machine is expected to understand the language, the machine should be sensible and interactive, wherein the judgment of the causal relationship of the chapters plays an important role in the interaction of the robot.
Disclosure of Invention
In order to solve the above problems in the prior art, that is, to solve the problem of chapter cause and effect relationship judgment in robot interaction, a first aspect of the present invention provides a chapter cause and effect relationship judgment method, including:
step S10, based on the language activation model, obtaining the registration event with the highest matching degree for each target text in the input target text; the target text pair is two input target texts;
step S20, based on the registration event corresponding to each target text, calculating the correlation between two registration events according to the stored registration event sequence in each scene; the registration event sequence in the scene is a registration event sequence with Boolean time sequence characteristics, which is constructed based on scene structured experience information;
step S30, calculating the causal relationship of the target text pair based on the correlation between the target text pair and the two registration events obtained in step S20;
the language activation model is obtained by training a machine translation model through experience-language activation training corpora; the experience-language activation corpus is constructed based on registered events in the robot experience.
In some preferred embodiments, step S30, "calculating the causal relationship of the target text pair" includes:
fr=softmax(tanh(Wc*[s1;s2]+Wt*feat+b))
wherein f isrThe causal relationship probability value comprises a causal relationship probability and a non-causal relationship probability; s1、s2The sentence vectors are obtained by two target texts in the target text pair through a text coding model; wcA parameter matrix of a preset text vector; wtA parameter matrix which is a preset Boolean time sequence characteristic; b is a preset offset.
In some preferred embodiments, step S20 "calculate the correlation between two registration events" is performed by:
Figure BDA0001962783850000021
wherein featIs the calculated correlation; e.g. of the type1、e2Respectively matching a first target text and a second target text in a target text pair with a registration event; p (e)1) E of experience information structured for all stored scenes1The probability of occurrence; p (e)2|e1) To appear e1E in the structured empirical information of2The probability of occurrence of.
In some preferred embodiments, the registration event,
ei={obji},i∈Ri
wherein e isiIs the ith registration event; objiThe object in the activated state in the registration event is selected; riThe numbers of all objects in the language activation model are signed for the objects in the registration event.
In some preferred embodiments, the empirically-linguistically-activated corpus is expressed as
Ej={obji}:{LSi},i∈Ri
Wherein E isjActivating a corpus for a jth experience-language; LS (least squares)iAs an object objiCorresponding language characters.
In some preferred embodiments, the event sequence is registered in each scenario in step S20, and the obtaining method includes:
step A10, in the working environment of the robot, the object information obtained by the interaction between the robot and the object is obtained by the perception device of the robot; the object information comprises object attributes and object state information obtained by interaction between the robot and the object;
and A20, performing structuring processing on the object information acquired in the step A10, organizing and storing the object information according to time sequence, and using the object information as the structured experience information obtained by the operation working environment of the robot.
And A30, removing the object in the non-activated state in the structured experience information to obtain a registration event sequence in the corresponding scene.
In some preferred embodiments, the structured empirical information is expressed as
E0-t={f0,f1,f2,…,ft}
ft={obj1:a1;…obji:ai;…objn:an;}
Wherein E is0-tAll experience from time 0 to time t; f. oftAll the object information input for the t moment; objiIs the ith object, aiAs an object objiActive state of ai When 1, it means that the ith object is activated, aiWhen 0 indicates that the ith object is not activated, i ∈ [1, n]。
The second aspect of the invention provides a chapter causal relationship judging system, which comprises a registration event matching module, a correlation calculation module and a causal relationship calculation module;
the registration event matching module is configured to respectively acquire a registration event with the highest matching degree for each target text in the input target text based on the language activation model; the target text pair is two input target texts;
the correlation calculation module is configured to calculate the correlation between the two registration events according to the stored registration event sequences in each scene based on the registration event corresponding to each target text; the registration event sequence in the scene is a registration event sequence with Boolean time sequence characteristics, which is constructed based on scene structured experience information;
the causal relationship calculation module is configured to calculate a causal relationship of the target text pair based on the target text pair and the correlation between the two registration events obtained in step S20;
the language activation model is obtained by training a machine translation model through experience-language activation training corpora; the experience-language activation corpus is constructed based on registered events in the robot experience.
In a third aspect of the present invention, a storage device is provided, in which a plurality of programs are stored, the programs being adapted to be loaded and executed by a processor to implement the discourse cause and effect relationship determination method described above.
In a fourth aspect of the present invention, a processing apparatus is provided, which includes a processor, a storage device; a processor adapted to execute various programs; a storage device adapted to store a plurality of programs; the program is adapted to be loaded and executed by a processor to implement the discourse cause and effect determination method described above.
The invention has the beneficial effects that:
correlation calculation is carried out through the registration event sequence of the scene, and the causal relationship of the target text pair is calculated by further combining the registration events of the target text pair, so that an accurate causal relationship judgment result can be obtained.
Drawings
Other features, objects and advantages of the present application will become more apparent upon reading of the following detailed description of non-limiting embodiments thereof, made with reference to the accompanying drawings in which:
FIG. 1 is a schematic flow chart of a chapter cause and effect relationship determination method according to an embodiment of the present invention;
FIG. 2 is an example of chapter causality determination according to an embodiment of the present invention.
Detailed Description
In order to make the objects, technical solutions and advantages of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the accompanying drawings, and it is apparent that the described embodiments are some, but not all embodiments of the present invention. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
The present application will be described in further detail with reference to the following drawings and examples. It is to be understood that the specific embodiments described herein are merely illustrative of the relevant invention and not restrictive of the invention. It should be noted that, for convenience of description, only the portions related to the related invention are shown in the drawings.
It should be noted that the embodiments and features of the embodiments in the present application may be combined with each other without conflict.
The method has the basic idea that a curing network is constructed aiming at the robot by utilizing the signal input of a robot working platform, and the text semantics are better modeled by utilizing the deep neural network technology so as to achieve the aim of judging the final causal relationship.
The invention relates to a method for judging the causal relationship of sections, which comprises the following steps as shown in figure 1:
step S10, based on the language activation model, obtaining the registration event with the highest matching degree for each target text in the input target text; the target text pair is two input target texts;
step S20, based on the registration event corresponding to each target text, calculating the correlation between two registration events according to the stored registration event sequence in each scene; the registration event sequence in the scene is a registration event sequence with Boolean time sequence characteristics, which is constructed based on scene structured experience information;
step S30, calculating the causal relationship of the target text pair based on the correlation between the target text pair and the two registration events obtained in step S20;
the language activation model is obtained by training a machine translation model through experience-language activation training corpora; the experience-language activation corpus is constructed based on registered events in the robot experience.
In order to more clearly illustrate the causal relationship determination method of the discourse of the present invention, the following three aspects are detailed in detail: 1. constructing a registration event sequence in each scene; 2. constructing a language activation model; 3. judging the causal relationship of the sections.
1. Construction of registration event sequences in each scene
Step A10, in the working environment of the robot, the object information obtained by the interaction between the robot and the object is obtained by the perception device of the robot; the object information comprises object attributes and object state information obtained by interaction between the robot and the object.
The robot is placed in a working environment, and a sensing device of the robot is used for receiving signals in the environment (namely object information obtained by interaction between the robot and an object). And continuously updating the acquired information through the continuous interaction of the robot in the working environment. For example, in a scene, there are a large number of objects interacting with the robot. The objects themselves have attributes, with the attributes having corresponding states. In particular, the state of the object changes after the robot interacts with the object. The robot needs to perceive these attributes and states to obtain information about these objects. The attribute and state of an object "cow" such as a robot face are displayed and expressed, and the object is denoted as obj2, and name is cow).
And A20, performing structuring processing on the object information acquired in the step A10, organizing and storing the object information according to time sequence, and using the object information as the structured experience information obtained by the operation working environment of the robot.
Object information resulting from the robot-object interaction from step a10 is stored in the long-term memory of the robot. The specific implementation method comprises the following steps: the robot stores these collected object attribute state information in long-term memory. In long-term memory, we reserve corresponding boolean nodes for the attributes and states of different objects.
The structuring process carries out structuring processing on the signals and stores the signals; the specific implementation scheme is as follows: structure points are reserved for the information Boolean nodes inside the robot to store information. And, these information will be organized and stored in chronological order. That is, let the input at a time t be ftThen the expression of the structured empirical information is:
E0-t={f0,f1,f2,…,ft}
wherein E is0-tAll experience from time 0 to time t; f. oftAll object information (attribute, status) input for the t-th time.
Meanwhile, the experience at each time includes activation information (expressed by boolean values) of all objects, and the formal expression is:
ft={obj1:a1;…obji:ai;…objn:an}
wherein objiIs the ith object, aiAs an object objiActive state of aiWhen 1, it means that the ith object is activated, aiWhen 0 indicates that the ith object is not activated, i ∈ [1, n]。
E.g. at time 5 f5={obj1:0;obj2:1;obj3:1;obj 41 represents obj in all objects at that time2、obj3、obj4In an activated state, obj1In an inactive state.
And A30, removing the object in the non-activated state in the structured experience information to obtain a registration event sequence in the corresponding scene.
For example, f above5Corresponding to the registration sequence T in the scene5={obj2;obj3;obj4;}。
2. And (5) constructing a language activation model.
(1) Registration events in the robot experience are constructed.
According to the experience of the robot, these objects exhibit a boolean state, and if the robot receives a corresponding signal in the working environment, the corresponding object is activated. In the case where objects belonging to an event are all activated, the event is an activated state. The registration event expression is as follows:
ei={obji},i∈Ri
wherein e isiIs the ith registration event; objiThe object in the activated state in the registration event is selected; riThe numbers of all objects in the language activation model are signed for the objects in the registration event.
For example, based on f above5The obtained registration event is e5={obj2;obj3;obj4At this time Ri=[2,3,4]. When e is5When activated, may be denoted as e5:1={obj2:1;obj3:1;obj4:1;}。
(2) Based on the constructed enrollment events, an experience-language activated corpus is constructed.
This step can be viewed as a symbolic translation process, and thus, the corpus is in the form of parallel corpora. The method specifically comprises the following steps: further experiential curing of internal objects into a sequence of symbols, e.g. obj15Obj _ 15. When the text mentions that reference is made to this symbol, the symbol and associated concepts are activated. For example, in our actual experiment, obj15(i.e., Obj _15) indicates "cow" in chinese, and "cow" in english. This step constructs a parallel corpus of such tokens corresponding to empirical internal symbols, as follows:
Ej={obji}:{LSi},i∈Ri
wherein E isjActivating a corpus for a jth experience-language; LS (least squares)iAs an object objCorresponding language characters.
For example, if the object obj2、obj5、obj15The corresponding language characters are Adam, attack and cow, and the training language chat can be expressed as Adam, attack and cow
obj2,obj5,obj15∶Adam attack cow
(3) And training the machine translation model through experience-language activation training corpora to obtain a language activation model.
Training corpora are activated according to experience-language, and training is based on a machine translation model. In this embodiment, a sequence-to-sequence neural network machine translation model is adopted, and of course, other machine translation models may be adopted.
3. Judgment of causal relationship of chapters
Step S10, based on the language activation model, obtaining the registration event with the highest matching degree for each target text in the input target text; the target text pair is two target texts.
The construction process of the language activation model has been described in detail above, where the corresponding registration event can be obtained directly by changing the model, and the key process of the model for processing text can be shown below.
(1) The input text is translated by the language activation model into a sequence of symbols of the registration event, i.e. the object symbols contained in the registration event inside the robot. The formula is as follows:
y1,y2,y3,…,yj=P(yi|x1,x2,x3,…,xn)
wherein, y1,y2,y3,…,yjIs a target symbol sequence; x is the number of1,x2,x3,…,xnIs a sequence of language symbols; p (y)i|x1,x2,x3,…,xn) To decode (i.e., translate) the object symbols according to the language symbol sequence.
In some embodiments, two word vector sets Emb may be constructed for the linguistic end and the empirical notation end, respectivelylAnd Embe
(2) The language activation model activates corresponding experience information of the robot according to the input target text. And matching the registration events stored in all robots according to the translated object symbol sequences, and selecting the event with the highest matching degree as an activated event. In this case, a long-short time memory model LSTM may be used as a tool for generating a symbol sequence vector to generate a sequence vector of target symbols, as shown in the following formula
LSTM(Embe[obj1],Embe[obj2],…,Embe[obj])=ve
Then, the event with the highest matching degree is obtained according to the cosine similarity of the vectors.
Figure BDA0001962783850000101
Wherein v iseThe translated object symbol sequence for the target text passes through the LSTM network and EmbeObtaining a vector; v. ofiPassing LSTM network and Emb for ith registration event in language activation modeleObtaining a vector; sim (v)i,ve) Is the cosine similarity between the two vectors.
And selecting the registration event with the highest cosine similarity as an activation event, namely the registration event corresponding to the input target text.
Step S20, based on the registration event corresponding to each target text, calculating the correlation between two registration events according to the stored registration event sequence in each scene; the registration event sequence in the scene is a registration event sequence with Boolean time sequence characteristics, which is constructed based on scene structured experience information.
Removing the object in the non-activated state in the structured experience information to obtain a registration event sequence in the corresponding scene
When the causal relation is judged, two sentences are input each time, so that two registration events need to be searched at one time. Simultaneously according to the obtained event e1、e2The corresponding boolean temporal features of the two events need to be obtained in the sequence of registered events in each scene stored by the robot in order to calculate the correlation of the two registered events.
The correlation of the two registration events is calculated as follows:
Figure BDA0001962783850000102
wherein featIs the calculated correlation; e.g. of the type1、e2Respectively matching a first target text and a second target text in a target text pair with a registration event; p (e)1) E of experience information structured for all stored scenes1The probability of occurrence; p (e)2|e1) To appear e1E in the structured empirical information of2The probability of occurrence of.
And step S30, calculating the causal relationship of the target text pair based on the correlation between the target text pair and the two registration events obtained in the step S20.
The method for calculating the causal relationship of the target text pair comprises the following steps:
fr=softmax(tanh(Wc*[s1;s2]+Wt*feat+b))
wherein f isrThe causal relationship probability values comprise causal relationship probability Cause and Non-causal relationship probability Non-Cause; s1、s2The sentence vectors are obtained by two target texts in the target text pair through a text coding model; wcA parameter matrix of a preset text vector; wtA parameter matrix which is a preset Boolean time sequence characteristic; b is a preset offset.
In this embodiment, a softmax classification model is adopted, and the softmax function outputs probability values of two classifications of the two classifications, and the largest one is taken as output. Such as: softmax will output a probability value of 0.7 for Cause and 0.3 for Non-Cause, and we select Cause as output.
The text coding model may adopt LSTM (long-short time memory model), BOW (mean bag of words model based on word vectors), CNN (convolutional neural network model), and may also adopt other models.
As shown in fig. 2, the target text pair of this example input is S1: adam anchors the cow with shock, S2: and Adam gains some beef. Acquiring a registration event e1 of the target text pair through a language activation model in Part1, wherein the registration event e1 is Adam anchors cow chord, e 2: adam gains some beef. In Part2, based on the perception signal sequence received by the robot in each scene (S plus number represents the ith scene, each scene is composed of many continuous time instants, such as T9-T10-T11, etc., each time instant contains a large number of events), finding out that e1 appears in scenes S87 and S72, e2 appears in scenes S54, S87 and S72, and locating the time instant of occurrence through an arrow in the figure (i.e. the event is located in the perception signal sequence in the robot memory), e1 locates at time instants T0 and T1 of scenes S87 and S72, e2 locates at time instants T11 of scenes S54, and T2 and T3 of S72 of S87, and calculating the correlation of two registered events; respectively carrying out text coding on target texts S1 and S2 at Part3 to obtain sentence vectors; and carrying out causal judgment on the Part4 based on the correlation obtained by the Part2 and the two sentence vectors obtained by the Part3, and outputting a judgment result.
The code implementation of the invention can be realized by adopting a python programming language and a TensorFlow deep learning framework, and the development platform is Ubuntu Linux 16.04. Meanwhile, the invention can also rely on the general artificial intelligence platform Malmo released by Microsoft as the working environment of the robot. The system implementation can also run on the Windows operating system, since the written program does not use any platform-dependent code.
The invention discloses a chapter causal relationship judging system of an embodiment, which comprises a registration event matching module, a correlation calculation module and a causal relationship calculation module;
the registration event matching module is configured to respectively acquire a registration event with the highest matching degree for each target text in the input target text based on the language activation model; the target text pair is two input target texts;
the correlation calculation module is configured to calculate the correlation between the two registration events according to the stored registration event sequences in each scene based on the registration event corresponding to each target text; the registration event sequence in the scene is a registration event sequence with Boolean time sequence characteristics, which is constructed based on scene structured experience information;
the causal relationship calculation module is configured to calculate a causal relationship of the target text pair based on the target text pair and the correlation between the two registration events obtained in step S20;
the language activation model is obtained by training a machine translation model through experience-language activation training corpora; the experience-language activation corpus is constructed based on registered events in the robot experience.
It can be clearly understood by those skilled in the art that, for convenience and brevity of description, the specific working process and related description of the system described above may refer to the corresponding process in the foregoing method embodiments, and will not be described herein again.
It should be noted that the discourse cause and effect relationship determination system provided in the foregoing embodiment is only illustrated by the division of the functional modules, and in practical applications, the functions may be allocated by different functional modules according to needs, that is, the modules or steps in the embodiment of the present invention are further decomposed or combined, for example, the modules in the foregoing embodiment may be combined into one module, or may be further split into multiple sub-modules, so as to complete all or part of the functions described above. The names of the modules and steps involved in the embodiments of the present invention are only for distinguishing the modules or steps, and are not to be construed as unduly limiting the present invention.
The storage device of an embodiment of the present invention stores a plurality of programs, and the programs are suitable for being loaded and executed by a processor to realize the discourse cause and effect relationship judging method.
The processing device of one embodiment of the invention comprises a processor, a storage device; a processor adapted to execute various programs; a storage device adapted to store a plurality of programs; the program is adapted to be loaded and executed by a processor to implement the discourse cause and effect determination method described above.
It can be clearly understood by those skilled in the art that, for convenience and brevity of description, the specific working processes and related descriptions of the storage device and the processing device described above may refer to the corresponding processes in the foregoing method embodiments, and are not described herein again.
And (3) experimental verification:
in the corpus constructed in the experiment, each example is composed of two sentences, and each sentence describes a separate event. By training on this corpus, the method of the present invention is compared to traditional plain text-based methods. Table 1 shows the basic statistics of the corpus of the causal relationship of chapters constructed in this experiment.
Attached table 1
Cause and effect relationship Non-causal relationship Total number of Degree of identity
Number of 1619 3233 4852 98.2%
The attached table 2 shows the results of the method for judging the causal relationship between the invention and several existing chapters. In the upper part of the table, three kinds of neural network methods (BOW, CNN, LSTM) for processing sentence information are mainly used at present. The experiment takes two sentences as input, and examines whether the target model has the capability of judging whether the sentence pair is causal or not. It can be seen that the present invention (Our Model) performs significantly better on the data set than the conventional text method, where Model + BOW, Model + CNN, and Model + LSTM in table 2 respectively indicate that the BOW, CNN, and LSTM text encoding models are used in step S30 of the present invention method.
The obvious difference from the traditional text method is that the invention uses experience information collected from the robot platform. This process is very similar to human beings, so the key to the present invention is to propose a new approach, distinguished from the traditional thinking, which emphasizes the importance of the experience related to text.
Attached table 2
Figure BDA0001962783850000141
Those of skill in the art would appreciate that the various illustrative modules, method steps, and modules described in connection with the embodiments disclosed herein may be implemented as electronic hardware, computer software, or combinations of both, and that programs corresponding to the software modules, method steps may be located in Random Access Memory (RAM), memory, Read Only Memory (ROM), electrically programmable ROM, electrically erasable programmable ROM, registers, hard disk, a removable disk, a CD-ROM, or any other form of storage medium known in the art. To clearly illustrate this interchangeability of electronic hardware and software, various illustrative components and steps have been described above generally in terms of their functionality. Whether such functionality is implemented as electronic hardware or software depends upon the particular application and design constraints imposed on the solution. Skilled artisans may implement the described functionality in varying ways for each particular application, but such implementation decisions should not be interpreted as causing a departure from the scope of the present invention.
The terms "first," "second," and the like are used for distinguishing between similar elements and not necessarily for describing or implying a particular order or sequence.
The terms "comprises," "comprising," or any other similar term are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus.
So far, the technical solutions of the present invention have been described in connection with the preferred embodiments shown in the drawings, but it is easily understood by those skilled in the art that the scope of the present invention is obviously not limited to these specific embodiments. Equivalent changes or substitutions of related technical features can be made by those skilled in the art without departing from the principle of the invention, and the technical scheme after the changes or substitutions can fall into the protection scope of the invention.

Claims (10)

1. A method for judging causal relationship of chapters is characterized by comprising the following steps:
step S10, based on the language activation model, obtaining the registration event with the highest matching degree for each target text in the input target text; the target text pair is two input target texts;
step S20, based on the registration event corresponding to each target text, calculating the correlation between two registration events according to the stored registration event sequence in each scene; the registration event sequence in the scene is a registration event sequence with Boolean time sequence characteristics, which is constructed based on scene structured experience information;
step S30, calculating the causal relationship of the target text pair based on the correlation between the target text pair and the two registration events obtained in step S20;
the language activation model is obtained by training a machine translation model through experience-language activation training corpora; the experience-language activation corpus is constructed based on registered events in the robot experience.
2. The discourse causal relationship determination method according to claim 1, wherein the step S30 of calculating the causal relationship of the target text pair comprises:
fr=softmax(tanh(Wc*[s1;s2]+Wt*feat+b))
wherein f isrThe causal relationship probability value comprises a causal relationship probability and a non-causal relationship probability; s1、s2The sentence vectors are obtained by two target texts in the target text pair through a text coding model; wcA parameter matrix of a preset text vector; wtA parameter matrix which is a preset Boolean time sequence characteristic; b is a preset offset.
3. The discourse causal relationship determination method according to claim 1, wherein step S20 "calculating the correlation between two registered events" comprises:
Figure FDA0002514750750000021
wherein featIs the calculated correlation; e.g. of the type1、e2Respectively matching a first target text and a second target text in a target text pair with a registration event; p (e)1) E of experience information structured for all stored scenes1The probability of occurrence; p (e)2|e1) To appear e1E in the structured empirical information of2The probability of occurrence of.
4. The discourse cause and effect relationship determination method according to claim 1, wherein the registration event,
ei={obji},i∈Ri
wherein e isiIs the ith registration event; objiThe object in the activated state in the registration event is selected; riThe numbers of all objects in the language activation model are signed for the objects in the registration event.
5. The discourse causal relationship determination method of claim 4, wherein the empirical-language-activated corpus is expressed as
Ej={obji}:{LSi},i∈Ri
Wherein E isjActivating a corpus for a jth experience-language; LS (least squares)iAs an object objiCorresponding language characters.
6. The discourse cause and effect relationship judgment method according to claim 1, wherein the event sequence is registered in each scene in step S20, and the obtaining method comprises:
step A10, in the working environment of the robot, the object information obtained by the interaction between the robot and the object is obtained by the perception device of the robot; the object information comprises object attributes and object state information obtained by interaction between the robot and the object;
step A20, carrying out structuring processing on the object information obtained in step A10, organizing and storing the object information according to time sequence, and using the object information as structured experience information obtained by the operation working environment of the robot;
and A30, removing the object in the non-activated state in the structured experience information to obtain a registration event sequence in the corresponding scene.
7. The method of claim 6, wherein the structured empirical information is expressed as
E0-t={f0,f1,f2,...,ft}
ft={obj1:a1;...obji:ai;...objn:an;}
Wherein E is0-tAll experience from time 0 to time t; f. oftAll the object information input for the t moment; objiIs the ith object, aiAs an object objiActive state of aiWhen 1, it means that the ith object is activated, aiWhen 0 indicates that the ith object is not activated, i ∈ [1, n]。
8. The system is characterized by comprising a registration event matching module, a correlation calculation module and a causal relationship calculation module;
the registration event matching module is configured to respectively acquire a registration event with the highest matching degree for each target text in the input target text based on the language activation model; the target text pair is two input target texts;
the correlation calculation module is configured to calculate the correlation between the two registration events according to the stored registration event sequences in each scene based on the registration event corresponding to each target text; the registration event sequence in the scene is a registration event sequence with Boolean time sequence characteristics, which is constructed based on scene structured experience information;
the causal relationship calculation module is configured to calculate a causal relationship of the target text pair based on the target text pair and the correlation between the two registration events obtained by the correlation calculation module;
the language activation model is obtained by training a machine translation model through experience-language activation training corpora; the experience-language activation corpus is constructed based on registered events in the robot experience.
9. A storage device having stored therein a plurality of programs, wherein the programs are adapted to be loaded and executed by a processor to implement the discourse cause and effect judgment method of claims 1-7.
10. A processing device comprising a processor, a storage device; a processor adapted to execute various programs; a storage device adapted to store a plurality of programs; wherein the program is adapted to be loaded and executed by a processor to implement the chapter cause and effect judgment method of claims 1-7.
CN201910089352.1A 2019-01-30 2019-01-30 Method, system and device for judging causal relationship of chapters Active CN109918646B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910089352.1A CN109918646B (en) 2019-01-30 2019-01-30 Method, system and device for judging causal relationship of chapters

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910089352.1A CN109918646B (en) 2019-01-30 2019-01-30 Method, system and device for judging causal relationship of chapters

Publications (2)

Publication Number Publication Date
CN109918646A CN109918646A (en) 2019-06-21
CN109918646B true CN109918646B (en) 2020-08-11

Family

ID=66961026

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910089352.1A Active CN109918646B (en) 2019-01-30 2019-01-30 Method, system and device for judging causal relationship of chapters

Country Status (1)

Country Link
CN (1) CN109918646B (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113408730B (en) * 2020-03-17 2024-04-26 北京达佳互联信息技术有限公司 Causal relationship generation method and device and electronic equipment

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6993535B2 (en) * 2001-06-18 2006-01-31 International Business Machines Corporation Business method and apparatus for employing induced multimedia classifiers based on unified representation of features reflecting disparate modalities
US7685099B2 (en) * 2007-06-28 2010-03-23 Microsoft Corporation Forecasting time-independent search queries
US8793715B1 (en) * 2007-12-18 2014-07-29 The Nielsen Company (Us), Llc Identifying key media events and modeling causal relationships between key events and reported feelings
CN104298225A (en) * 2014-09-25 2015-01-21 中国石油化工股份有限公司 Modeling and graphical displaying method for causal relationship reasoning model of unusual working conditions of chemical process
CN104462229A (en) * 2014-11-13 2015-03-25 苏州大学 Event classification method and device
CN108228158A (en) * 2018-01-18 2018-06-29 东南大学 A kind of framework behavior pattern recognition method based on ontology
CN108920473A (en) * 2018-07-04 2018-11-30 中译语通科技股份有限公司 A kind of data enhancing machine translation method based on similar word and synonym replacement

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20070094219A1 (en) * 2005-07-14 2007-04-26 The Boeing Company System, method, and computer program to predict the likelihood, the extent, and the time of an event or change occurrence using a combination of cognitive causal models with reasoning and text processing for knowledge driven decision support
US8423346B2 (en) * 2007-09-05 2013-04-16 Electronics And Telecommunications Research Institute Device and method for interactive machine translation
JP5100445B2 (en) * 2008-02-28 2012-12-19 株式会社東芝 Machine translation apparatus and method
CN104268133B (en) * 2014-09-11 2018-02-13 北京交通大学 machine translation method and system
CN107944027B (en) * 2017-12-12 2020-03-31 苏州思必驰信息科技有限公司 Method and system for creating semantic key index
CN108959651A (en) * 2018-08-03 2018-12-07 北京捷易迅信息技术有限公司 A method of user's intention assessment based on transfer learning

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6993535B2 (en) * 2001-06-18 2006-01-31 International Business Machines Corporation Business method and apparatus for employing induced multimedia classifiers based on unified representation of features reflecting disparate modalities
US7685099B2 (en) * 2007-06-28 2010-03-23 Microsoft Corporation Forecasting time-independent search queries
US8793715B1 (en) * 2007-12-18 2014-07-29 The Nielsen Company (Us), Llc Identifying key media events and modeling causal relationships between key events and reported feelings
CN104298225A (en) * 2014-09-25 2015-01-21 中国石油化工股份有限公司 Modeling and graphical displaying method for causal relationship reasoning model of unusual working conditions of chemical process
CN104462229A (en) * 2014-11-13 2015-03-25 苏州大学 Event classification method and device
CN108228158A (en) * 2018-01-18 2018-06-29 东南大学 A kind of framework behavior pattern recognition method based on ontology
CN108920473A (en) * 2018-07-04 2018-11-30 中译语通科技股份有限公司 A kind of data enhancing machine translation method based on similar word and synonym replacement

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
基于短语的统计机器翻译模型训练中若干关键问题的研究;梁华参;《中国博士学位论文全文数据库 信息科技辑》;20140115;I138-75 *
基于神经网络的自然语言语义表达及推理方法研究;刘权;《中国博士学位论文全文数据库 信息科技辑》;20170915;I138-77 *
维吾尔语事件间因果关系抽取;周兴发;《中国优秀硕士学位论文全文数据库 信息科技辑》;20190115;I138-4494 *

Also Published As

Publication number Publication date
CN109918646A (en) 2019-06-21

Similar Documents

Publication Publication Date Title
CN111078836B (en) Machine reading understanding method, system and device based on external knowledge enhancement
CN110427461B (en) Intelligent question and answer information processing method, electronic equipment and computer readable storage medium
CN110765775B (en) Self-adaptive method for named entity recognition field fusing semantics and label differences
Shinghal et al. Experiments in text recognition with the modified Viterbi algorithm
CN110349572A (en) A kind of voice keyword recognition method, device, terminal and server
CN111368078A (en) Model training method, text classification device and storage medium
CN110472002B (en) Text similarity obtaining method and device
WO2018196718A1 (en) Image disambiguation method and device, storage medium, and electronic device
CN109948140B (en) Word vector embedding method and device
CN106649739B (en) Multi-round interactive information inheritance identification method and device and interactive system
CN111858898A (en) Text processing method and device based on artificial intelligence and electronic equipment
CN112667796B (en) Dialogue reply method and device, electronic equipment and readable storage medium
CN109918646B (en) Method, system and device for judging causal relationship of chapters
CN110909174B (en) Knowledge graph-based method for improving entity link in simple question answering
CN115099189A (en) Speech recognition model based on parallel computation and determining method
CN113761151A (en) Synonym mining method, synonym mining device, synonym question answering method, synonym question answering device, computer equipment and storage medium
CN113705207A (en) Grammar error recognition method and device
CN116362242A (en) Small sample slot value extraction method, device, equipment and storage medium
CN110968702A (en) Method and device for extracting matter relationship
CN115687576A (en) Keyword extraction method and device represented by theme constraint
CN115455169A (en) Knowledge graph question-answering method and system based on vocabulary knowledge and semantic dependence
CN111507098B (en) Ambiguous word recognition method and device, electronic equipment and computer-readable storage medium
CN111783473B (en) Method and device for identifying best answer in medical question and answer and computer equipment
CN110533035B (en) Student homework page number identification method based on text matching
CN113763934A (en) Training method and device of audio recognition model, storage medium and electronic equipment

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant