CN111753552B

CN111753552B - NLP-based training mode and recognition mode dynamic switching method

Info

Publication number: CN111753552B
Application number: CN202010624878.8A
Authority: CN
Inventors: 阮平达; 王磊
Original assignee: Zhejiang Baiying Technology Co Ltd
Current assignee: Zhejiang Baiying Technology Co Ltd
Priority date: 2020-07-02
Filing date: 2020-07-02
Publication date: 2024-04-19
Anticipated expiration: 2040-07-02
Also published as: CN111753552A

Abstract

The invention relates to a training mode and recognition mode dynamic switching method based on NLP, which comprises the following steps: s1: when the NLP request passes through the NLP gateway, routing is carried out according to the request type; s2: judging the request type, if the request is a semantic recognition request, entering a recognition mode, selecting the NLP service in idle or in recognition, outputting a semantic recognition result and returning a recognition result, and if the NLP service in idle or in recognition is not available, continuing to try to select the NLP service until the request is overtime; s3: judging the request type, if the request is a request of the corpus publishing type, entering a training model, adding the corpus into a publishing queue, and directly returning the result. The invention can display whether the NLP service currently provides the recognition mode or the training mode, and switch the flow according to the state of the current NLP service, thereby solving the problem of inaccurate semantic recognition caused by the NLP service of which the recognition request is mistakenly input into the training mode.

Description

NLP-based training mode and recognition mode dynamic switching method

Technical Field

The invention relates to the field of data processing, in particular to a training mode and recognition mode dynamic switching method based on NLP.

Background

NLP is an important direction in the fields of computer science and artificial intelligence. It is studying various theories and methods that enable effective communication between a person and a computer in natural language. Natural language processing is a discipline that integrates linguistics, computer science, and mathematics. NLP is composed of two main technical areas: natural language understanding and natural language generation.

The main objective of natural language understanding direction is to help the machine better understand the language of people, including semantic understanding of basic lexical and syntactic, and high-level understanding of requirements, chapters and emotion levels.

The direction of natural language generation is mainly aimed at helping a machine generate a language which can be understood by a person, such as text generation, automatic digest and the like.

NLP technology is based on big data, knowledge graph, machine learning, linguistics and other technologies and resources, and can form a specific application system of machine translation, deep question-answering and dialogue system, thereby serving various actual services and products

At present, most intentions and semantic recognition adopt NLP service clusters, and the NLP service occupies a large amount of CPU resources in the training mode process, so that no idle resources are needed to process the semantic recognition request, but if the entry of the semantic recognition request is not closed, the semantic recognition result cannot be processed if the recognition request enters, and the recognition rate is greatly reduced;

When NLP service is processing recognition request, if corpus is issued to NLP service, the request processing semantic recognition is directly interrupted, NLP service is triggered to perform model training, and training is not performed after recognition request processing is finished, so that recognition rate is reduced

When a problem occurs at present, since the NLP service does not have an elegant switching mode, the overall recognition rate is disturbed, so that the recognition rate is unstable and suddenly low, and meanwhile, the false impression of inaccurate recognition of a model obtained by NLP service training is also caused.

Disclosure of Invention

In order to solve the problems, the invention provides a method for dynamically switching between a training mode and an identification mode based on NLP, which can display whether the NLP service currently provides the identification mode or the training mode, and switch the flow according to the state of the current NLP service, thereby solving the problem of inaccurate semantic identification caused by the fact that the identification request is mistakenly input into the NLP service of the training mode.

The technical scheme of the invention is as follows:

A method for dynamically switching training modes and recognition modes based on NLP, comprising the following steps:

s1: when the NLP request passes through the NLP gateway, routing is carried out according to the request type;

S2: judging the request type, if the request is a semantic recognition request, entering a recognition mode, selecting the NLP service in idle or in recognition, outputting a semantic recognition result and returning a recognition result, and if the NLP service in idle or in recognition is not available, continuing to try to select the NLP service until the request is overtime;

s3: judging the request type, if the request is a request of the corpus publishing type, entering a training model, adding the corpus into a publishing queue, and directly returning the result.

Preferably, the NLP service includes 4 states, specifically: in idle, it means any type of request can be accepted; in the synchronization, the service is in a locked state and does not accept any new request when the service is in data synchronization; in training, the service is shown to be in corpus training, and is in a locked state at the moment and does not accept any new request; in recognition, the function of semantic recognition is provided by the representation service, and the request of the semantic recognition type can be accepted.

Preferably, the step S3 asynchronously performs the following procedures: and adding the corpus into a release queue, periodically checking the task queue by a corpus task processor, pushing the corpus data into the NLP service to perform model training when the corpus task processor has data and has idle NLP service, actively reporting the state of the NLP service, automatically pushing the data of the training result to a training result buffer area after the model training is finished, and sequentially synchronizing the data into the NLP service until all the NLP service completes the data synchronization of the latest model, and emptying the training result buffer area.

Preferably, the synchronization process of the NLP specifically includes: if the idle NLP service exists, starting to synchronize a single NLP service and reporting the state, reporting the state after the single NLP service is synchronized, judging whether all NLP services are synchronized, if so, emptying a training result buffer area, otherwise, starting the next synchronization process; if no NLP service exists in the idle state, the next synchronization process is started.

Preferably, the NLP service registers the current state into the NLP gateway, and actively synchronizes the state of the NLP service into the NLP gateway every 60 seconds.

The beneficial effects of the invention are as follows: the method comprises the steps that whether an NLP service currently provides an identification mode or a training mode is displayed in real time, and the NLP gateway can switch flow according to the state of the current NLP service, so that the problem of inaccurate semantic identification caused by the fact that an identification request is mistakenly input into the NLP service of the training mode is solved; the problem of inaccurate semantic recognition caused by the fact that a request for semantic recognition is forcedly interrupted and an error result is returned in the switching process is solved; the NLP gateway can be used for adjusting the request flow control of recognition and training, ensuring the voice recognition request flow and improving the resource utilization rate of the machine.

Drawings

FIG. 1 is a flow chart of a method according to an embodiment of the invention.

Detailed Description

Embodiments of the present invention will be further described below with reference to the accompanying drawings.

The invention provides a training mode and recognition mode dynamic switching method based on NLP, firstly NLP service registers the current state into NLP gateway, and actively synchronizes the state of NLP into NLP gateway every 60 s.

Among them, NLP services have four states:

in idle, it means any type of request can be accepted;

In the synchronization, the service is in a locked state and does not accept any new request when the service is in data synchronization;

In training, the service is shown to be in corpus training, and is in a locked state at the moment and does not accept any new request;

in recognition, the function of semantic recognition is provided by the representation service, and the request of the semantic recognition type can be accepted.

The NLP gateway has functions of request flow control, request type routing, service state synchronization and service heartbeat detection, the NLP gateway can be used for adjusting the request flow control for recognition and training, 90% NLP service nodes are switched into a recognition mode when busy, voice recognition request flow is guaranteed, a training mode is started when idle, NLP training functions are provided, and the resource utilization rate of the machine is improved.

As shown in fig. 1, the specific implementation flow of the present invention is:

When an NLP request passes through an NLP gateway, the gateway carries out routing through a request type, at the moment, a request of a semantic recognition type is input, NLP service in idle or in recognition is automatically selected, if no NLP service in idle or in recognition at the moment is available, the NLP service continues to be tried to be selected until the request is overtime, if the NLP service is selected, voice recognition is started, meanwhile, the state of the NLP service is actively reported to the NLP gateway, the NLP service is displayed as a recognition mode at the moment, and a result is returned after the recognition is finished.

When a request of corpus release type (namely training request) is input, firstly, corpus is added into a release queue, a result is directly returned, then a corpus task queue processor continuously checks a task queue every few seconds, if data exists and idle NLP services exist, corpus data is pushed into the NLP services for model training, meanwhile, the state of the NLP services is actively reported, namely, a training mode is entered, the situation that semantic recognition cannot be processed due to the fact that a request of semantic recognition enters the NLP services at the moment is avoided, an error result is returned, after model training is finished, the corpus is automatically pushed into a training result buffer area, if the idle NLP services are found, the corpus task queue processor actively and sequentially synchronizes to the NLP services until all the NLP services complete data synchronization of the latest model, and then the training result buffer area is emptied.

The training request and the identification request are not simultaneously input into the NPL service, and when any type of request is input, the NPL gateway decides whether to execute the request according to the real-time state of the NPL service.

Finally, it should be noted that: the above examples are only specific embodiments of the present invention, and are not intended to limit the scope of the present invention, but it should be understood by those skilled in the art that the present invention is not limited thereto, and that the present invention is described in detail with reference to the foregoing examples: any person skilled in the art may modify or easily conceive of the technical solution described in the foregoing embodiments, or perform equivalent substitution of some of the technical features, while remaining within the technical scope of the present disclosure; such modifications, changes or substitutions do not depart from the spirit and scope of the corresponding technical solutions. Are intended to be encompassed within the scope of the present invention. Therefore, the protection scope of the present invention shall be subject to the protection scope of the claims.

Claims

1.A method for dynamically switching between a training mode and an identification mode based on NLP, comprising the steps of:

S3: judging the request type, entering a training model if the request type is a corpus publishing type request and NLP service is idle, adding the corpus into a publishing queue, and directly returning a result.

2. The method for dynamically switching between training mode and recognition mode based on NLP according to claim 1, wherein the NLP service comprises 4 states, specifically: in idle, it means any type of request can be accepted; in the synchronization, the service is in a locked state and does not accept any new request when the service is in data synchronization; in training, the service is shown to be in corpus training, and is in a locked state at the moment and does not accept any new request; in recognition, the function of semantic recognition is provided by the representation service, and the request of the semantic recognition type can be accepted.

3. The method for dynamically switching between the training mode and the recognition mode based on the NLP according to claim 1, wherein the step S3 asynchronously performs the following procedures: and adding the corpus into a release queue, periodically checking the task queue by a corpus task processor, pushing the corpus data into the NLP service to perform model training when the corpus task processor has data and has idle NLP service, actively reporting the state of the NLP service, automatically pushing the data of the training result to a training result buffer area after the model training is finished, and sequentially synchronizing the data into the NLP service until all the NLP service completes the data synchronization of the latest model, and emptying the training result buffer area.

4. A method for dynamically switching between a training mode and an identification mode based on NLP according to claim 3, wherein the synchronization process of NLP is specifically: if the idle NLP service exists, starting to synchronize a single NLP service and reporting the state, reporting the state after the single NLP service is synchronized, judging whether all NLP services are synchronized, if so, emptying a training result buffer area, otherwise, starting the next synchronization process; if no NLP service exists in the idle state, the next synchronization process is started.

5. The method for dynamically switching between training mode and recognition mode based on NLP according to claim 2, wherein the NLP service registers the current state into the NLP gateway, and the NLP service actively synchronizes the own state into the NLP gateway every 60 seconds.