JPWO2020261002A5

Info

Publication number: JPWO2020261002A5
Application number: JP2021575889A
Authority: JP
Publication date: 2022-10-25

Claims

1. A processor-implemented method, the method comprising:
importing an emphasized training text that includes a first plurality of training nodes;
importing a non-emphasized training text that includes a second plurality of training nodes;
one-hot encoding the emphasized and non-emphasized training texts;
training a projection model using the emphasized and non-emphasized training texts;
processing the emphasized training text using the projection model;
training a classifier model using the processed emphasized training text;
importing new text that includes a plurality of new nodes;
one-hot encoding the new text;
processing the new text using the projection model; and
determining, using the classifier model, whether one of the plurality of new nodes is in a desired class.
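
The data flow recited in claim 1 can be sketched in Python as follows. This is an illustrative sketch only: the claims do not say what kind of projection model or classifier model is used, so the inverse-frequency column weighting and the nearest-centroid classifier below are placeholder assumptions, and all function names and toy data are hypothetical. The point it shows is which data each stage is fitted on: the encoder and the projection see both the emphasized and the non-emphasized text, while the classifier sees only the emphasized text.

```python
import math

def fit_one_hot(*texts):
    """Assign one column per distinct node token seen in any training text."""
    vocab = sorted({tok for text in texts for tok in text})
    return {tok: i for i, tok in enumerate(vocab)}

def one_hot(index, tok):
    vec = [0.0] * len(index)
    if tok in index:  # tokens never seen in training encode as all zeros
        vec[index[tok]] = 1.0
    return vec

def fit_projection(vectors):
    """Placeholder projection (assumption): per-column 1/sqrt(count) weights."""
    counts = [sum(col) for col in zip(*vectors)]
    return [1.0 / math.sqrt(c) if c else 0.0 for c in counts]

def project(weights, vec):
    return [w * x for w, x in zip(weights, vec)]

def fit_classifier(vectors, labels):
    """Placeholder classifier (assumption): one centroid per class."""
    centroids = {}
    for label in (True, False):
        rows = [v for v, y in zip(vectors, labels) if y == label]
        centroids[label] = [sum(col) / len(rows) for col in zip(*rows)]
    return centroids

def classify(centroids, vec):
    def dist2(center):
        return sum((a - b) ** 2 for a, b in zip(vec, center))
    return dist2(centroids[True]) < dist2(centroids[False])

# Hypothetical toy data: True marks an emphasized (in-class) node.
emphasized = [("if", True), ("symptoms", True), ("worsen", True),
              ("patient", False), ("has", False), ("fever", False)]
non_emphasized = ["if", "fever", "returns", "call"]
new_text = ["if", "fever"]

emph_tokens = [tok for tok, _ in emphasized]
labels = [flag for _, flag in emphasized]

index = fit_one_hot(emph_tokens, non_emphasized)      # encoder sees both texts
emph_vecs = [one_hot(index, t) for t in emph_tokens]
plain_vecs = [one_hot(index, t) for t in non_emphasized]
weights = fit_projection(emph_vecs + plain_vecs)      # projection sees both texts
classifier = fit_classifier(                          # classifier sees labeled text only
    [project(weights, v) for v in emph_vecs], labels)
predictions = [classify(classifier, project(weights, one_hot(index, t)))
               for t in new_text]
print(predictions)
```

With this toy data the node "if" is assigned to the desired class and the node "fever" is not.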

2. The method of claim 1, wherein the desired class is a member of a hypothetical text span.

3. The method of claim 1 or claim 2, further comprising outputting emphasized new text indicating each of the plurality of new nodes that is in the desired class.

4. The method of any one of claims 1 to 3, further comprising training a one-hot encoder using the emphasized and non-emphasized training texts.

5. The method of any one of claims 1 to 4, further comprising:
processing the processed emphasized training text with the classifier model to determine whether each node is in the desired class;
comparing the determination of whether each node is in the desired class with the emphasis of each node; and
adjusting the classifier model to increase the number of determinations that agree with the emphasis.
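
The determine, compare, and adjust loop of claim 5 can be illustrated with a small sketch. The claims do not name an update rule, so the perceptron-style update below is an assumption chosen for brevity, and the vectors and emphasis flags are hypothetical toy data.

```python
def predict(weights, bias, vec):
    """Determine whether a node is in the desired class."""
    return sum(w * x for w, x in zip(weights, vec)) + bias > 0

def agreement(weights, bias, vectors, emphasis):
    """Count determinations that match the emphasis of each node."""
    return sum(predict(weights, bias, v) == e for v, e in zip(vectors, emphasis))

def adjust(vectors, emphasis, max_epochs=20):
    """Perceptron-style adjustment (assumption): nudge the model on each mismatch."""
    weights, bias = [0.0] * len(vectors[0]), 0.0
    for _ in range(max_epochs):
        mistakes = 0
        for vec, emph in zip(vectors, emphasis):
            if predict(weights, bias, vec) != emph:
                step = 1.0 if emph else -1.0
                weights = [w + step * x for w, x in zip(weights, vec)]
                bias += step
                mistakes += 1
        if mistakes == 0:  # every determination already agrees with the emphasis
            break
    return weights, bias

# Hypothetical processed training vectors and their emphasis flags.
vectors = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0], [0.0, 0.0]]
emphasis = [True, False, True, False]

before = agreement([0.0, 0.0], 0.0, vectors, emphasis)
weights, bias = adjust(vectors, emphasis)
after = agreement(weights, bias, vectors, emphasis)
print(before, after)
```

On this toy data the untrained model agrees with the emphasis on 2 of 4 nodes, and the adjusted model agrees on all 4.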

6. The method of claim 1, further comprising:
performing feature selection; and
removing nodes from the emphasized and non-emphasized training texts based on the feature selection prior to training the projection model.

7. A training method executed by a processor, the method comprising:
importing an emphasized training text that includes a first plurality of training nodes;
importing a non-emphasized training text that includes a second plurality of training nodes;
converting the emphasized training text into an emphasized training conversion table;
converting the non-emphasized training text into a non-emphasized training conversion table;
training a one-hot encoder with the emphasized and non-emphasized training conversion tables;
one-hot encoding the emphasized training conversion table to generate emphasized training vectors;
one-hot encoding the non-emphasized training conversion table to generate non-emphasized training vectors;
training a projection model using the emphasized and non-emphasized training vectors;
processing the emphasized training vectors using the projection model to generate processed emphasized training vectors; and
training a classifier model using the processed emphasized training vectors, wherein the classifier model determines whether a node is in a desired class.
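
The conversion-table and one-hot encoding steps of claim 7 can be sketched as below. The claims do not define the columns of a conversion table, so the two columns used here (a node's word and the preceding word) are hypothetical, as are the function names and the toy texts. The sketch shows the encoder being fitted on both tables so that both sets of vectors share the same columns.

```python
def text_to_table(tokens):
    """Hypothetical conversion: one row per node, with its word and the previous word."""
    rows, prev = [], "<s>"
    for tok in tokens:
        rows.append({"word": tok, "prev": prev})
        prev = tok
    return rows

def fit_encoder(*tables):
    """Train the one-hot encoder: one output column per (table column, value) pair."""
    pairs = sorted({item for table in tables for row in table for item in row.items()})
    return {pair: i for i, pair in enumerate(pairs)}

def encode(encoder, table):
    """One-hot encode every row of a conversion table into a vector."""
    vectors = []
    for row in table:
        vec = [0] * len(encoder)
        for item in row.items():
            if item in encoder:  # values unseen during fitting are skipped
                vec[encoder[item]] = 1
        vectors.append(vec)
    return vectors

emphasized_table = text_to_table(["if", "fever"])
non_emphasized_table = text_to_table(["no", "fever"])

encoder = fit_encoder(emphasized_table, non_emphasized_table)
emphasized_vectors = encode(encoder, emphasized_table)
non_emphasized_vectors = encode(encoder, non_emphasized_table)
print(emphasized_vectors)
```

Each vector has exactly one active entry per table column, so every row here sums to 2.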

8. The method of claim 7, wherein the desired class is a member of a hypothetical text span or a member of a factual text span.

9. The method of claim 7 or claim 8, further comprising:
converting the emphasized training text into an emphasized parse tree; and
converting the non-emphasized training text into a non-emphasized parse tree.

10. The method of any one of claims 7 to 9, further comprising:
processing the processed emphasized training vectors with the classifier model to determine whether each node is in the desired class;
comparing the determination of whether each node is in the desired class with the emphasis of each node; and
adjusting the classifier model to increase the number of determinations that agree with the emphasis.

11. The method, further comprising:
performing feature selection; and
removing columns from the emphasized and non-emphasized training vectors based on the feature selection prior to training the projection model.
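
Column removal based on feature selection can be illustrated as follows. The claims do not state a selection criterion, so dropping columns that take the same value in every training vector is an assumption made for this sketch, and the vectors are hypothetical toy data.

```python
def select_columns(vectors):
    """Feature selection (assumption): keep columns that take more than one value."""
    return [j for j, col in enumerate(zip(*vectors)) if len(set(col)) > 1]

def keep(vectors, columns):
    """Remove every column not chosen by the feature selection."""
    return [[v[j] for j in columns] for v in vectors]

# Hypothetical one-hot training vectors.
emphasized_vectors = [[1, 0, 1], [0, 0, 1]]
non_emphasized_vectors = [[1, 0, 1]]

# Select on both sets together so they keep identical columns.
columns = select_columns(emphasized_vectors + non_emphasized_vectors)
emphasized_reduced = keep(emphasized_vectors, columns)
non_emphasized_reduced = keep(non_emphasized_vectors, columns)
print(columns, emphasized_reduced, non_emphasized_reduced)
```

Only the first column varies across the toy vectors, so it is the only one that survives before the projection model would be trained.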

12. A system for finding nodes in a span, the system comprising:
a plurality of emphasized parse trees representing labeled natural language text;
a plurality of non-emphasized parse trees representing unlabeled natural language text;
a new parse tree representing new natural language text; and
a natural language processing (NLP) learning machine configured to process the plurality of emphasized parse trees, the plurality of non-emphasized parse trees, and the new parse tree, the NLP learning machine comprising:
a computing processor; and
a memory coupled to the computing processor, the memory including instructions that, when executed, cause the computing processor to perform:
importing an emphasized training text that includes a first plurality of training nodes;
importing a non-emphasized training text that includes a second plurality of training nodes;
one-hot encoding the emphasized and non-emphasized training texts;
training a projection model using the emphasized and non-emphasized training texts;
processing the emphasized training text using the projection model;
training a classifier model using the processed emphasized training text;
importing new text that includes a plurality of new nodes;
one-hot encoding the new text;
processing the new text using the projection model; and
determining, using the classifier model, whether one of the plurality of new nodes is in a desired class.

13. The system of claim 12, wherein the desired class is a member of a hypothetical text span.

14. The system of claim 12 or claim 13, wherein the memory further comprises instructions for causing the computing processor to output emphasized new text indicating each of the plurality of new nodes that is in the desired class.

15. The system of any one of claims 12 to 14, wherein the memory further comprises instructions for causing the computing processor to train a one-hot encoder using the emphasized and non-emphasized training texts.

16. The system of any one of claims 12 to 15, further comprising instructions for:
processing the processed emphasized training vector with the classifier model to determine whether each node is in the desired class;
comparing the determination of whether each node is in the desired class with the emphasis of each node; and
adjusting the classifier model to increase the number of determinations that agree with the emphasis.

17. A system for finding nodes in a span, the system comprising:
a plurality of emphasized parse trees representing labeled natural language text;
a plurality of non-emphasized parse trees representing unlabeled natural language text;
a new parse tree representing new natural language text; and
a natural language processing (NLP) learning machine configured to process the plurality of emphasized parse trees, the plurality of non-emphasized parse trees, and the new parse tree, the NLP learning machine comprising:
a computing processor; and
a memory coupled to the computing processor, the memory containing instructions that cause the computing processor to perform:
converting the emphasized training text into an emphasized training conversion table;
converting the non-emphasized training text into a non-emphasized training conversion table;
training a one-hot encoder with the emphasized and non-emphasized training conversion tables;
one-hot encoding the emphasized training conversion table to generate emphasized training vectors;
one-hot encoding the non-emphasized training conversion table to generate non-emphasized training vectors;
training a projection model using the emphasized and non-emphasized training vectors;
processing the emphasized training vectors using the projection model to generate processed emphasized training vectors; and
training a classifier model using the processed emphasized training vectors, wherein the classifier model determines whether a node is in a desired class.

18. The system of claim 17, wherein the desired class is a member of a hypothetical text span or a member of a factual text span.

19. The system of claim 17 or claim 18, wherein the memory further comprises instructions for causing the computing processor to perform:
converting the emphasized training text into an emphasized parse tree; and
converting the non-emphasized training text into a non-emphasized parse tree.

20. The system of any one of claims 17 to 19, further comprising instructions for:
processing the processed emphasized training vectors with the classifier model to determine whether each node is in the desired class;
comparing the determination of whether each node is in the desired class with the emphasis of each node; and
adjusting the classifier model to increase the number of determinations that agree with the emphasis.

21. A computer program product for causing a processor to perform the steps of the method according to any one of claims 1 to 11.