WO2020098100A1

WO2020098100A1 - Method and apparatus for allocating resources on basis of data analysis, and computer device

Info

Publication number: WO2020098100A1
Application number: PCT/CN2018/124400
Authority: WO
Inventors: 张垒; 邹芳; 邢艳; 李晋; 张裕生
Original assignee: 平安科技（深圳）有限公司
Priority date: 2018-11-12
Filing date: 2018-12-27
Publication date: 2020-05-22
Also published as: CN109462647A

Abstract

Provided in the present application are a method and apparatus for allocating resources on the basis of data analysis, and a computer device: receiving a speech recognition request sent by an IVR system; calculating the total time length of speech data to be recognized of each speech recognition server; comparing the total time lengths, and determining the speech recognition server having the shortest total time length to be a first speech recognition server for processing the speech recognition request; and distributing the speech recognition request to the first speech recognition server. The present application improves the processing efficiency for speech recognition requests.

Description

Resource allocation method, device and computer equipment based on data analysis

This application claims priority to Chinese patent application No. 201811340885.4, filed with the Chinese Patent Office on November 12, 2018 and entitled "Data analysis-based resource allocation method, device and computer equipment", the entire contents of which are incorporated in this application.

Technical field

This application relates to the field of speech recognition, and in particular to a resource allocation method, device and computer equipment based on data analysis.

Background

Voice recognition technology refers to the technology that allows the machine to convert the voice signal into the corresponding text or command through recognition and understanding, that is, to let the machine understand human speech.

A load balancer distributes received voice recognition requests to voice recognition servers, and the voice recognition servers process the requests they receive. In the prior art, after receiving a voice recognition request issued by an IVR (Interactive Voice Response) system, the load balancer calls the voice recognition servers by polling, that is, it selects one of the multiple voice recognition servers to process the received voice recognition request, in order to balance the load across the servers. However, polling allocates requests only by the number of interactions, and the length of each interaction may vary greatly. Because the load balancer does not select a voice recognition server based on the actual resource usage of each server, resources are easily allocated unevenly across the voice recognition servers, and voice recognition requests are processed inefficiently.

Therefore, the existing technology still has defects and needs improvement.

Technical problem

The main purpose of the present application is to provide a resource allocation method, device and computer equipment based on data analysis, aiming to solve the existing technical problem of uneven resource allocation to the voice recognition server and low processing efficiency of voice recognition requests.

Technical solution

In order to achieve the above application purpose, this application proposes a resource allocation method based on data analysis, including:

Receiving a voice recognition request sent by an IVR (Interactive Voice Response) system;

Separately calculating the total duration of the voice data currently to be recognized by each voice recognition server;

Comparing the total durations, and determining the voice recognition server with the smallest total duration of voice data to be recognized as the first voice recognition server for processing the voice recognition request;

Distributing the voice recognition request to the first voice recognition server.

This application also provides a resource allocation device based on data analysis, including:

The first receiving module is used to receive the voice recognition request sent by the IVR system;

The first calculation module is used to separately calculate the total duration of the current voice data to be recognized by each voice recognition server;

The first comparison module is used to compare the total durations, and to determine the voice recognition server with the minimum total duration of voice data to be recognized as the first voice recognition server for processing the voice recognition request;

The first distribution module is used to distribute the voice recognition request to the first voice recognition server.

The present application also provides a computer device, including a memory and a processor, where the memory stores computer-readable instructions, and when the processor executes the computer-readable instructions, the steps of the foregoing method are implemented.

The present application also provides a computer-readable storage medium on which computer-readable instructions are stored, and when the computer-readable instructions are executed by a processor, the steps of the above method are implemented.

Beneficial effect

The resource allocation method, device, computer equipment and storage medium based on data analysis provided in this application receive a voice recognition request sent by an IVR system; separately calculate the total duration of the voice data currently to be recognized by each voice recognition server; compare the total durations and determine the voice recognition server with the minimum total duration of voice data to be recognized as the first voice recognition server for processing the voice recognition request; and distribute the voice recognition request to the first voice recognition server. By comparing the total duration of the voice data currently to be recognized by each voice recognition server, this application selects a suitable voice recognition server to process the received voice recognition request, which helps balance the use of the voice recognition servers' resources and improves the processing efficiency of voice recognition requests.

Brief description of the drawings

FIG. 1 is a schematic flowchart of a resource allocation method based on data analysis according to an embodiment of the present application;

FIG. 2 is a schematic flowchart of a resource allocation method based on data analysis according to another embodiment of the present application;

FIG. 3 is a schematic block diagram of the structure of a resource allocation device based on data analysis according to an embodiment of the present application;

FIG. 4 is a schematic block diagram of a computer device according to an embodiment of the present application.

Best Mode of the Invention

It should be understood that the specific embodiments described herein are only used to explain the present application, and are not used to limit the present application.

Referring to FIG. 1, an embodiment of the present application provides a resource allocation method based on data analysis, including the following steps:

S1: Receive a voice recognition request sent by an IVR (Interactive Voice Response) system;

S2: Separately calculate the total duration of the voice data currently to be recognized by each voice recognition server;

S3: Compare the total durations, and determine the voice recognition server with the smallest total duration of voice data to be recognized as the first voice recognition server for processing the voice recognition request;

S4: Distribute the voice recognition request to the first voice recognition server.

As described in step S1 above, the execution subject of this embodiment is a resource allocation device based on data analysis, which may specifically be a load balancer. The load balancer receives the voice recognition request sent by the IVR (Interactive Voice Response) system and distributes it to a voice recognition server, so that the voice recognition server processes the request. Before receiving the voice recognition request sent by the IVR system, the load balancer needs to establish a connection with the IVR system in order to exchange information with it. For example, the connection may be established by carrying identification resource information in the SDP message of the load balancer, where the SDP message is the message body the IVR system uses when establishing a session connection; or by configuring a load pool directly in the IVR system, so that the IVR system can establish a connection with the load balancer directly through the load pool.

As described in steps S2 to S4 above, after receiving the voice recognition request sent by the IVR system, the load balancer analyzes the actual resource utilization of each voice recognition server. The actual resource utilization is reflected by the duration of the voice data waiting to be recognized on each server, that is, by the total duration of the voice data currently to be recognized by each voice recognition server. The load balancer records data for every voice recognition request sent by the IVR system, including the duration of each piece of voice data and the voice recognition server to which it was sent. For each voice recognition server, subtracting the duration of the processed voice data already returned to the load balancer from the duration of the voice data the server was given to process yields the total duration of the voice data that server currently has left to process. Comparing these total durations identifies the voice recognition server with the minimum total duration of voice data to be recognized, which is determined as the first voice recognition server for processing the voice recognition request.

In the prior art, the resource utilization of a voice recognition server is expressed by the number of pieces of voice data. For example, voice recognition server A1 has 2 pieces of voice data to be recognized and voice recognition server A2 has 5. When a voice recognition request is received, the load balancer distributes it to A1, which has fewer pieces of voice data to be recognized, without analyzing the duration of the voice data waiting on A1 and A2. If the two pieces of voice data on A1 total 5 minutes and the five pieces on A2 total 2 minutes, the actual resource utilization of A1 is greater than that of A2. The existing allocation method is therefore unreasonable: the request should instead be distributed to A2, whose actual resource utilization is lower, which helps balance resources across the voice recognition servers. This embodiment uses the total duration of the voice data to be recognized to characterize actual resource utilization, which accurately reflects the current resource usage of each voice recognition server. Distributing each voice recognition request to the server with the lowest actual resource utilization helps balance the use of the voice recognition servers' resources and improves the processing efficiency of voice recognition requests.
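
The bookkeeping described above can be illustrated with a minimal Python sketch. The class name `DurationLedger`, its methods, and the use of seconds as the unit are assumptions made for illustration and do not appear in the application; the example values reproduce the A1/A2 case (5 minutes versus 2 minutes).

```python
class DurationLedger:
    """Per server: seconds of audio dispatched minus seconds already returned."""

    def __init__(self, servers):
        self.dispatched = {s: 0.0 for s in servers}
        self.completed = {s: 0.0 for s in servers}

    def pending(self, server):
        # Total duration of voice data the server still has to recognize.
        return self.dispatched[server] - self.completed[server]

    def pick_server(self):
        # S2-S3: compare the pending totals and take the minimum.
        return min(self.dispatched, key=self.pending)

    def dispatch(self, duration):
        # S4: send the request to the least-loaded server and record it.
        server = self.pick_server()
        self.dispatched[server] += duration
        return server

    def complete(self, server, duration):
        # Called when a recognition result comes back to the load balancer.
        self.completed[server] += duration


ledger = DurationLedger(["A1", "A2"])
ledger.dispatched["A1"] = 300.0  # 2 pieces of voice data, 5 minutes in total
ledger.dispatched["A2"] = 120.0  # 5 pieces of voice data, 2 minutes in total
chosen = ledger.dispatch(30.0)   # a new 30-second request goes to A2
```

A count-based balancer would have picked A1 here, because it holds fewer pieces; the duration-based ledger picks A2.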

Referring to FIG. 2, further, in an embodiment of the present application, after the above step S4, it includes:

S400: Determine whether there is a second voice recognition server whose total duration of the voice data to be recognized is greater than a preset duration threshold;

S401: If yes, add the second voice recognition server to a blacklist that prohibits the processing of new voice recognition requests.

In this embodiment, the preset duration threshold represents the maximum total duration of voice data that a voice recognition server can process. If the duration of the voice data a voice recognition server has to process stays above this threshold for a long period, the server is operating in an overloaded state for a long time and may be severely damaged or even rendered unusable. To preserve the service life of the voice recognition servers, the load balancer monitors the working status of each voice recognition server at all times. When it finds a second voice recognition server whose total duration of voice data to be recognized is greater than the preset duration threshold, it moves the second voice recognition server into a blacklist of servers that temporarily receive no new voice recognition requests; that is, no new voice recognition request is distributed to the second voice recognition server for a period of time. Only after the total duration of the voice data to be processed on the second voice recognition server falls below the preset duration threshold is the server removed from the blacklist. In this embodiment, when the load balancer detects that a voice recognition server is overloaded, it stops distributing new voice recognition requests to that server, which protects the server and preserves its service life.
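
A minimal sketch of the blacklist logic in S400 and S401 follows. The function names, the dictionary of pending durations, and the numeric values are illustrative assumptions, not part of the application.

```python
def update_blacklist(pending, threshold, blacklist):
    """Blacklist servers above the threshold; release those that drop below it."""
    for server, total in pending.items():
        if total > threshold:
            blacklist.add(server)        # S401: no new requests for this server
        elif total < threshold:
            blacklist.discard(server)    # load has fallen below the threshold
    return blacklist


def pick_server(pending, blacklist):
    """Choose the least-loaded server that is not blacklisted, or None."""
    candidates = {s: t for s, t in pending.items() if s not in blacklist}
    if not candidates:
        return None
    return min(candidates, key=candidates.get)


pending = {"A": 700.0, "B": 100.0, "C": 300.0}  # seconds still to recognize
blacklist = update_blacklist(pending, threshold=600.0, blacklist=set())
target = pick_server(pending, blacklist)
```

A server whose total equals the threshold exactly keeps its current status in this sketch, since the text only specifies the "greater than" and "less than" cases.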

Further, in an embodiment of the present application, after the above step S401, it includes:

S402: Calculate the average duration of the voice data to be recognized by each voice recognition server according to the total duration of the voice data to be recognized by each voice recognition server;

S403: Filter out the third voice recognition server whose total duration of the voice data to be recognized is less than the average duration;

S404: After the first preset time, determine whether the total duration of the voice data to be recognized in the second voice recognition server is greater than the preset duration threshold;

S405: If yes, allocate part of the voice data to be recognized in the second voice recognition server to the third voice recognition server, so that after the allocation the total duration of the voice data to be recognized in the second voice recognition server is less than the preset duration threshold.

As described in steps S402 to S403 above, the average duration is calculated as follows: the total durations of the voice data to be recognized by all voice recognition servers are summed, and the sum is divided by the number of voice recognition servers to obtain the average duration. The voice recognition servers whose total duration of voice data to be recognized is less than the average duration are then filtered out as third voice recognition servers. The number of voice recognition servers is determined by actual conditions.
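
As a small illustration of S402 and S403, the sketch below computes the average and filters the lightly loaded servers. The function name and the sample durations are assumptions chosen for the example.

```python
def below_average_servers(pending):
    """Return the average pending duration and the servers below it."""
    average = sum(pending.values()) / len(pending)              # S402
    light = [s for s, total in pending.items() if total < average]  # S403
    return average, light


average, light = below_average_servers({"A": 900.0, "B": 100.0, "C": 200.0})
```

With these values the average is 400 seconds, so servers B and C qualify as candidate "third" servers.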

As described in steps S404 to S405 above, the first preset time is a period within which operating in an overloaded state causes hardly any damage to a voice recognition server. In this embodiment, the second voice recognition server continues to process its voice data to be recognized. If, after the first preset time has elapsed, the total duration of the voice data remaining to be recognized on the second voice recognition server is less than the preset duration threshold, the second voice recognition server has not been overloaded for a long time. If, after the first preset time has elapsed, that total duration is still greater than the preset duration threshold, the second voice recognition server has been overloaded for a long time, which greatly affects its service life. To protect the second voice recognition server and preserve its service life, some of its unrecognized voice data must be distributed to other voice recognition servers for processing, such as the third voice recognition server, which is a voice recognition server whose total duration of voice data to be recognized is less than the average duration. The part of the voice data to be reallocated may be any portion of the voice data to be recognized on the second voice recognition server, provided that its duration is greater than or equal to a specified duration, namely the difference between the total duration of the voice data to be recognized on the second voice recognition server after the first preset time and the preset duration threshold.

The number of third voice recognition servers that receive the reallocated voice data depends on the actual situation and is determined by analyzing the actual duration of the voice data on the second voice recognition server. If that duration is not very large, part of the voice data to be recognized on the second voice recognition server may be allocated to a single designated voice recognition server whose total duration of voice data to be recognized is less than the average duration. If it is too large, part of the voice data may be allocated to multiple such designated voice recognition servers (the specific number being set according to the actual situation). In either case, the allocation must ensure that the total duration of the voice data remaining to be recognized on the second voice recognition server, after processing for the first preset time, is less than the preset duration threshold. Allocating part of the voice data to be recognized on a voice recognition server whose load exceeds the preset value to one or more designated, lightly loaded voice recognition servers preserves the service life of the heavily loaded server and further promotes balanced use of the resources of all voice recognition servers.
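
The reallocation in S405 might be sketched as follows. This is a simplified illustration under stated assumptions: each server's queue is modeled as a list of item durations in seconds, items are taken from the end of the queue (the text allows any part to be moved), and each moved item goes to whichever light server is currently least loaded. The function and variable names are hypothetical.

```python
def rebalance(queues, overloaded, light_servers, threshold):
    """Move queued items off `overloaded` until its total is below `threshold`.

    queues maps a server name to a list of pending item durations (seconds).
    Returns the list of (duration, target_server) moves that were made.
    """
    moves = []
    if not light_servers:
        return moves
    source = queues[overloaded]
    while source and sum(source) >= threshold:
        # Send the next item to the light server with the smallest backlog.
        target = min(light_servers, key=lambda s: sum(queues[s]))
        item = source.pop()
        queues[target].append(item)
        moves.append((item, target))
    return moves


queues = {"S2": [200, 200, 200, 200], "S3": [50], "S5": [100]}
moves = rebalance(queues, "S2", ["S3", "S5"], threshold=600)
```

In this example S2 starts at 800 seconds against a threshold of 600, so two items are moved, one to each light server, leaving S2 at 400 seconds. The sketch does not check whether a receiving server is itself pushed over the threshold; a fuller implementation would need that guard.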

Further, in an embodiment of the present application, after the above step S4, it includes:

S410: Determine whether the first voice recognition server is in a normal working state;

S411: If not, compare the total duration of the voice data to be recognized by other voice recognition servers except the first voice recognition server to obtain a comparison result;

S412: Determine a fourth voice recognition server for processing the recognition request according to the comparison result, where the fourth voice recognition server is in a normal working state;

S413: Distribute the voice recognition request to the fourth voice recognition server.

In this embodiment, after the load balancer decides to distribute the voice recognition request to the first voice recognition server, the first voice recognition server may turn out not to be in a normal working state. In that case it cannot respond to the voice recognition request or recognize the corresponding voice data, the recognition process cannot proceed, the IVR system cannot receive a recognition result from the first voice recognition server, and the user cannot get a timely response from the IVR system, resulting in a poor user experience. Therefore, in this embodiment it is necessary to further determine whether the first voice recognition server is in a normal working state. If it is determined that the first voice recognition server cannot work normally, it is excluded, and the actual resource utilization of the voice recognition servers other than the first voice recognition server is analyzed to determine a fourth voice recognition server for processing the voice recognition request. The total duration of the voice data currently to be recognized by the fourth voice recognition server is the smallest among those other voice recognition servers. After confirming that the fourth voice recognition server is in a normal working state, the load balancer distributes the voice recognition request to it for processing. In this embodiment, when the first voice recognition server cannot work normally, the voice recognition request is processed by a fourth voice recognition server that is in a normal working state. This keeps the recognition process running: the IVR system receives the recognition result from the fourth voice recognition server in time, and the user gets a timely response from the IVR system, which protects the user experience.

Further, in an embodiment of the present application, the above step S412 includes:

S4120: Sort the other speech recognition servers in order of the total duration of the speech data to be recognized from small to large;

S4121: Traverse the other speech recognition servers according to the sorting;

S4122: determine whether the currently accessed voice recognition server is in a normal working state;

S4123: If yes, determine the currently accessed voice recognition server as the fourth voice recognition server for processing the voice recognition request, and stop traversing.

In this embodiment, the step of determining the fourth voice recognition server for processing the voice recognition request according to the comparison result specifically includes the following. When the first voice recognition server cannot work normally, the total durations of the voice recognition servers other than the first voice recognition server are analyzed, and those servers are sorted in ascending order of the total duration of their voice data to be recognized. The other voice recognition servers are then traversed in that order. Each time a server is visited, if it is determined to be in a normal working state, it is determined as the fourth voice recognition server for processing the voice recognition request, the request is distributed to it for processing, and the traversal stops. Further, if no voice recognition server in a normal working state is found by the end of the traversal, processing failure information is returned to the IVR system, indicating that no voice recognition server is currently available to process the request. Maintenance information is also issued so that maintenance personnel can repair the voice recognition servers that cannot work normally.
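
The traversal in S4120 to S4123 can be sketched as below. The health check is passed in as a callable because the application does not say how the working state is detected; `pick_fallback` and `is_healthy` are hypothetical names.

```python
def pick_fallback(pending, failed_server, is_healthy):
    """Return the least-loaded healthy server other than `failed_server`."""
    # S4120: sort the other servers by pending duration, smallest first.
    others = sorted((s for s in pending if s != failed_server), key=pending.get)
    # S4121-S4123: visit them in order and stop at the first healthy one.
    for server in others:
        if is_healthy(server):
            return server
    # Nothing usable: the caller reports failure to the IVR system
    # and raises a maintenance alert.
    return None


pending = {"A": 10.0, "B": 20.0, "C": 30.0, "D": 40.0}
healthy = {"C", "D"}
fourth = pick_fallback(pending, "A", lambda s: s in healthy)
```

Here A is the failed first server and B is also down, so the traversal settles on C, the least-loaded server that passes the check.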

Further, in an embodiment of the present application, before the above step S1, it includes:

S100: Establish a long TCP (transmission control protocol) connection with each of the voice recognition servers, so as to perform information interaction with each of the voice recognition servers through the long TCP connection.

In this embodiment, when the load balancer is initialized, it establishes a long TCP (Transmission Control Protocol) connection with each voice recognition server. Optionally, at least one long TCP connection is established with each voice recognition server; the specific number is determined according to actual needs and may be one or more. Once the long TCP connections have been established, whenever the load balancer needs to distribute a voice recognition request sent by the IVR system to a given voice recognition server, it can use the established long TCP connection directly to exchange information with that server, which saves the time that would otherwise be spent establishing a TCP connection each time the load balancer calls a voice recognition server. For example, the load balancer establishes one long TCP connection with voice recognition server A, two with voice recognition server B, and five with voice recognition server C. The advantage of establishing multiple long TCP connections is that, when the load balancer simultaneously receives multiple voice recognition requests corresponding to multiple pieces of voice data from the IVR system and determines that they are to be processed by the same voice recognition server, it can distribute them to that server at the same time over the multiple connections. With only one long TCP connection, the requests can be distributed only one after another until all have been sent. Establishing a suitable number of long TCP connections therefore effectively improves the transmission efficiency of voice recognition requests.
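
One way to picture the long TCP connections is a small pool that opens its sockets once at start-up and reuses them for every request. The sketch below uses Python's standard `socket` and `queue` modules; the `ConnectionPool` class, its parameters, and the raw-bytes payload are assumptions for illustration, since the application does not specify a wire format.

```python
import queue
import socket


class ConnectionPool:
    """Holds a fixed number of long-lived TCP connections to one server."""

    def __init__(self, host, port, size=1):
        self._idle = queue.Queue()
        for _ in range(size):
            # Connections are opened once, at initialization, and then reused.
            self._idle.put(socket.create_connection((host, port)))

    def send(self, payload: bytes):
        conn = self._idle.get()          # blocks while every connection is busy
        try:
            conn.sendall(payload)
        finally:
            self._idle.put(conn)         # return the connection for reuse

    def close(self):
        while not self._idle.empty():
            self._idle.get().close()
```

With `size` greater than one, several threads can call `send` at the same time without waiting for each other, which corresponds to the parallel distribution described above; with `size=1` the calls are serialized.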

Further, in an embodiment of the present application, after the above step S4, it includes:

S420: Determine whether response information for the voice recognition request, fed back by the first voice recognition server, has been received;

S421: If yes, forward the response information to the IVR system;

S422: Receive voice data corresponding to the voice recognition request sent by the IVR system;

S423: Send the voice data to the first voice recognition server, so that the first voice recognition server recognizes the voice data.

In this embodiment, after the load balancer distributes the voice recognition request to the first voice recognition server, the first voice recognition server parses the grammar of the voice recognition request. If parsing succeeds, it returns response information to the load balancer indicating that the request was parsed successfully; if parsing fails, it returns an error to the load balancer. After receiving the response information, the load balancer forwards it to the IVR system. After receiving the response information, the IVR system sends the voice data corresponding to the voice recognition request to the load balancer, which forwards the voice data to the first voice recognition server. The first voice recognition server then recognizes the voice data using the language model corresponding to the grammar, obtains the recognition result, and returns it to the load balancer. The load balancer returns the recognition result to the IVR system, so that the IVR system can interact with the user according to the recognition result.
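
The message flow of S420 to S423 can be traced with in-memory stand-ins. Everything in this sketch is hypothetical: `FakeServer`, `FakeIVR`, their method names, and the dictionary-shaped request are placeholders that only mirror the order of the exchanges described above.

```python
class FakeServer:
    def handle_request(self, request):
        # Stand-in for grammar parsing: succeeds only if a grammar is named.
        return {"ok": "grammar" in request}

    def recognize(self, audio):
        return f"text for {len(audio)} bytes"


class FakeIVR:
    def __init__(self):
        self.log = []

    def receive_response(self, response):
        self.log.append(("response", response["ok"]))

    def send_audio(self, request):
        return b"\x00" * 160

    def receive_result(self, result):
        self.log.append(("result", result))


def relay_request(request, server, ivr):
    """Load balancer's view of one request after it has been distributed."""
    response = server.handle_request(request)
    if response is None:                 # S420: no response received
        return None
    ivr.receive_response(response)       # S421: forward the response
    if not response["ok"]:               # parsing failed, nothing to recognize
        return None
    audio = ivr.send_audio(request)      # S422: IVR sends the voice data
    result = server.recognize(audio)     # S423: server recognizes it
    ivr.receive_result(result)           # result is returned to the IVR system
    return result


ivr = FakeIVR()
result = relay_request({"grammar": "digits"}, FakeServer(), ivr)
```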

Further, in an embodiment of the present application, before the step of receiving the voice recognition request sent by the IVR system, the IVR system sends heartbeat information to the load balancer. After receiving the heartbeat information, the load balancer determines whether it can send feedback on the heartbeat information. If it cannot, the load balancer issues alarm information indicating that voice recognition requests sent by the IVR system are to be received through a second load balancer. In this embodiment, the load balancer is the primary load balancer, and at least one backup load balancer, namely the second load balancer, is provided for it. When the primary load balancer receives the heartbeat information from the IVR system, determines that it has malfunctioned, and cannot send feedback on the heartbeat information, it automatically issues alarm information so that it can be repaired quickly. The primary load balancer also issues alarm information indicating that voice recognition requests sent by the IVR system are to be received through the backup load balancer. Once the IVR system learns that the primary load balancer has failed, it sends voice recognition requests to the backup load balancer. The backup load balancer takes over from the primary load balancer: it receives the voice recognition requests sent by the IVR system, analyzes the actual resource utilization of each voice recognition server, determines the first voice recognition server for processing the request, and distributes the request to that server. Adding a backup load balancer effectively improves the availability of the load balancing service and keeps the service running continuously and stably. In addition, the backup load balancer can actively detect whether the primary load balancer sends feedback on the heartbeat information. When it determines that the primary load balancer has failed, that is, cannot work normally, it can directly take over from the primary load balancer and notify the IVR system to send voice recognition requests to the backup load balancer.
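
The backup load balancer's active detection could look roughly like the sketch below. The application does not say how a missing heartbeat acknowledgement is detected; the timeout used here is an assumption, as are the `HeartbeatMonitor` class and its injectable clock.

```python
import time


class HeartbeatMonitor:
    """Backup-side view: take over when the primary stops acknowledging heartbeats."""

    def __init__(self, timeout, clock=time.monotonic):
        self.timeout = timeout           # assumed: seconds of silence tolerated
        self.clock = clock
        self.last_ack = clock()
        self.active = "primary"

    def record_ack(self):
        # Called whenever the primary is seen answering a heartbeat.
        self.last_ack = self.clock()

    def check(self):
        # Promote the backup once the primary has been silent for too long.
        if self.active == "primary" and self.clock() - self.last_ack > self.timeout:
            self.active = "backup"
        return self.active


now = [0.0]                              # controllable clock for the example
monitor = HeartbeatMonitor(timeout=3.0, clock=lambda: now[0])
now[0] = 2.0
monitor.record_ack()
state_before = monitor.check()           # primary answered recently
now[0] = 6.0
state_after = monitor.check()            # 4 seconds of silence exceeds the timeout
```

Once `check` returns `"backup"`, the backup would notify the IVR system to redirect its voice recognition requests, as described above.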

The resource allocation method based on data analysis in the embodiment of the present application selects a reasonable voice recognition server to process the received voice recognition request by comparing and analyzing the total duration of the voice data currently to be recognized by each voice recognition server, which is beneficial to achieving balanced use of the resources of the voice recognition servers and improves the efficiency of processing voice recognition requests.

Referring to FIG. 3, an embodiment of the present application further provides a resource allocation device based on data analysis, including:

The first receiving module 1 is used to receive a voice recognition request sent by an IVR (Interactive Voice Response) system;

The first calculation module 2 is used to separately calculate the total duration of the voice data currently to be recognized by each voice recognition server;

The first comparison module 3 is used to compare the total durations, determine the voice recognition server with the minimum total duration of the voice data to be recognized, and determine the voice recognition server with the minimum total duration as the first voice recognition server for processing the voice recognition request;

The first distribution module 4 is configured to distribute the voice recognition request to the first voice recognition server.

In the above first receiving module, the execution subject of this embodiment is a resource allocation device based on data analysis, which may specifically be a load balancer. The load balancer is used to receive the voice recognition request sent by the IVR (Interactive Voice Response) system and to distribute the voice recognition request to a voice recognition server, so that the voice recognition server processes the voice recognition request. Before receiving the voice recognition request sent by the IVR system, the load balancer needs to establish a connection with the IVR system to realize information interaction with the IVR system. For example, the connection between the load balancer and the IVR system may be established by carrying the identification resource information of the load balancer in an SDP message, wherein the SDP message is the message body used when the IVR system establishes a session connection; or by directly configuring a load pool in the IVR system, so that the IVR system can establish a connection with the load balancer directly through the load pool.

In the foregoing first calculation module, first comparison module, and first distribution module, after receiving the voice recognition request sent by the IVR system, the load balancer analyzes the actual resource utilization rate of each voice recognition server. The actual resource utilization rate is reflected by the duration of the voice data to be recognized by each voice recognition server, that is, it is obtained by calculating the total duration of the voice data each voice recognition server currently has to recognize. The load balancer records the data information of all voice recognition requests sent by the IVR system, including the duration of each piece of voice data and the voice recognition server to which each piece of voice data was sent. Subtracting the duration of the processed voice data that has already been returned to the load balancer from the duration of the voice data that each voice recognition server has been given to process yields the total duration of the voice data currently to be processed by that voice recognition server. By comparing the total durations, the load balancer can determine the voice recognition server with the minimum total duration of voice data to be recognized and determine it as the first voice recognition server for processing the voice recognition request. In the prior art, the resource utilization rate of a voice recognition server is expressed by the number of pieces of voice data. For example, the voice recognition server A1 has 2 pieces of voice data to be recognized, and the voice recognition server A2 has 5 pieces of voice data to be recognized. When a voice recognition request is received, the load balancer distributes it to the voice recognition server A1, which has fewer pieces of voice data to be recognized, without analyzing the duration of the voice data to be recognized in the voice recognition servers A1 and A2. If the total duration of the 2 pieces of voice data to be recognized by the voice recognition server A1 is 5 minutes, and the total duration of the 5 pieces of voice data to be recognized by the voice recognition server A2 is 2 minutes, then the actual resource utilization rate of the voice recognition server A1 is greater than that of the voice recognition server A2. The existing resource allocation method for voice recognition servers is therefore unreasonable: in this case the voice recognition request should be distributed to the voice recognition server A2, which has the smaller actual resource utilization rate, which is beneficial to a balanced distribution of the resources of the voice recognition servers. This embodiment uses the total duration of the voice data to be recognized to characterize the actual resource utilization rate, which accurately reflects the current actual use of the resources of each voice recognition server. Distributing the voice recognition request, according to that current actual use, to the voice recognition server with a low actual resource utilization rate is conducive to achieving balanced use of the resources of the voice recognition servers and improves the processing efficiency of voice recognition requests.
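
As a rough illustration of the calculation described above, the following sketch derives each server's pending total duration as dispatched duration minus returned duration and then picks the server with the minimum. The function names `pending_durations` and `pick_first_server`, the record layout (pairs of server name and seconds), and the individual clip lengths are assumptions made for this example; only the A1 and A2 totals of 5 minutes and 2 minutes come from the text.

```python
from typing import Dict, Iterable, Tuple

Record = Tuple[str, float]  # (server name, duration of one piece of voice data in seconds)


def pending_durations(
    servers: Iterable[str],
    dispatched: Iterable[Record],
    completed: Iterable[Record],
) -> Dict[str, float]:
    """Total duration still to be recognized per server.

    pending = duration sent to the server - duration already returned by it.
    """
    pending = {server: 0.0 for server in servers}
    for server, seconds in dispatched:
        pending[server] += seconds
    for server, seconds in completed:
        pending[server] -= seconds
    return pending


def pick_first_server(pending: Dict[str, float]) -> str:
    """Server with the minimum pending total duration (first one wins a tie)."""
    return min(pending, key=lambda server: pending[server])


# The A1/A2 example from the text: A1 holds 2 pending pieces totalling 5 minutes,
# A2 holds 5 pending pieces totalling 2 minutes.
dispatched = [("A1", 180.0), ("A1", 120.0), ("A1", 60.0)] + [("A2", 24.0)] * 5
completed = [("A1", 60.0)]  # one A1 result has already been returned
pending = pending_durations(["A1", "A2"], dispatched, completed)
first_server = pick_first_server(pending)
```

A count-based balancer would choose A1 here because it has fewer pending pieces, whereas the duration-based comparison selects A2.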

Further, in an embodiment of the present application, the above resource allocation device based on data analysis includes:

A first judgment module, configured to judge whether there is a second voice recognition server whose total duration of the voice data to be recognized is greater than a preset duration threshold;

A joining module, configured to, if such a second voice recognition server exists, add the second voice recognition server to a blacklist that prohibits the processing of new voice recognition requests.

A second calculation module, configured to calculate the average duration of voice data to be recognized by each of the voice recognition servers according to the total duration of voice data to be recognized by each of the voice recognition servers;

A screening module, configured to screen out a third voice recognition server whose total duration of voice data to be recognized is less than the average duration;

A second judgment module, configured to judge whether the total duration of the voice data to be recognized in the second voice recognition server is greater than the preset duration threshold after the first preset time has passed;

A distribution module, configured to, if so, distribute part of the voice data to be recognized in the second voice recognition server to the third voice recognition server, so that the total duration of the voice data to be recognized in the second voice recognition server after the distribution is less than the preset duration threshold.

In the above second calculation module and screening module, the average duration of the voice data to be recognized by each voice recognition server is calculated from the total durations as follows: the total durations of the voice data to be recognized by all current voice recognition servers are summed, and the sum is divided by the number of voice recognition servers to obtain the average duration. The voice recognition servers whose total duration of voice data to be recognized is less than the average duration are then screened out as third voice recognition servers, where the number of voice recognition servers depends on the actual situation.
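
The averaging and screening step can be sketched as follows. The function names and the sample totals are hypothetical and serve only to make the arithmetic concrete.

```python
from typing import Dict, List


def average_pending(pending: Dict[str, float]) -> float:
    """Sum of all pending total durations divided by the number of servers."""
    return sum(pending.values()) / len(pending)


def screen_third_servers(pending: Dict[str, float]) -> List[str]:
    """Servers whose pending total duration is below the average."""
    average = average_pending(pending)
    return [server for server, total in pending.items() if total < average]


# Illustrative totals in seconds: the average is (300 + 120 + 60) / 3 = 160.
sample = {"A1": 300.0, "A2": 120.0, "A3": 60.0}
third_servers = screen_third_servers(sample)
```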

In the above second judgment module and distribution module, the first preset time is a period during which a voice recognition server can remain in an overloaded working state with almost no damage to the server itself. In this embodiment, while the second voice recognition server processes its voice data to be recognized, if the total duration of the voice data remaining to be recognized in the second voice recognition server is less than the preset duration threshold after the first preset time has passed, the second voice recognition server has not been in an overloaded working state for a long time. If the total duration of the remaining voice data to be recognized is still greater than the preset duration threshold after the first preset time has passed, the second voice recognition server has been in an overloaded working state for a long time, which will greatly affect its service life. To protect the second voice recognition server and preserve its service life, part of the unrecognized voice data in the second voice recognition server needs to be distributed to other voice recognition servers for processing, such as the third voice recognition server, which is a voice recognition server whose total duration of voice data to be recognized is less than the average duration. The part to be distributed may be any portion of the voice data to be recognized in the second voice recognition server, provided that its duration is greater than or equal to a specified duration, namely the difference between the total duration of the voice data to be recognized in the second voice recognition server after the first preset time and the preset duration threshold. The number of third voice recognition servers that receive the distributed voice data depends on the actual situation, which is determined by analyzing the actual duration of the voice data in the second voice recognition server. If the actual duration is not very large, the part of the voice data to be recognized may be allocated to only one designated voice recognition server whose total duration of voice data to be recognized is less than the average duration, ensuring that, after the allocation, the total duration of the voice data remaining to be recognized in the second voice recognition server after processing for the first preset time is less than the preset duration threshold. If the actual duration is too large, the part of the voice data to be recognized may be allocated to multiple designated voice recognition servers whose total duration of voice data to be recognized is less than the average duration (the specific number being set according to the actual situation), again ensuring that, after the allocation, the total duration of the voice data remaining to be recognized in the second voice recognition server after processing for the first preset time is less than the preset duration threshold. Allocating part of the voice data to be recognized in a voice recognition server whose load exceeds a preset value to one or more designated voice recognition servers with a small load effectively protects the service life of the heavily loaded voice recognition server and further promotes balanced use of the resources of each voice recognition server.
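
The blacklist check and the redistribution step described above might look like the following sketch. It assumes that the check after the first preset time has already found the server still above the threshold, that each server's pending work is held as a list of clip durations, and that the least-loaded target receives each moved clip; the names `overloaded_servers` and `rebalance` and these policies are assumptions, not details given in the application.

```python
from typing import Dict, List, Set, Tuple


def overloaded_servers(queues: Dict[str, List[float]], threshold: float) -> Set[str]:
    """Servers whose pending total exceeds the threshold (blacklist candidates)."""
    return {server for server, clips in queues.items() if sum(clips) > threshold}


def rebalance(
    queues: Dict[str, List[float]],
    overloaded: str,
    targets: List[str],
    threshold: float,
) -> List[Tuple[float, str]]:
    """Move clips off `overloaded` until its pending total is below `threshold`.

    Each moved clip goes to the currently least-loaded target (an assumed policy).
    Returns the list of (clip duration, receiving server) moves.
    """
    moves: List[Tuple[float, str]] = []
    if not targets:
        return moves
    while queues[overloaded] and sum(queues[overloaded]) >= threshold:
        clip = queues[overloaded].pop()  # take the most recently queued clip
        target = min(targets, key=lambda server: sum(queues[server]))
        queues[target].append(clip)
        moves.append((clip, target))
    return moves


# Illustrative state in seconds: A1 holds 330 s against a 200 s threshold.
queues = {"A1": [120.0, 90.0, 60.0, 60.0], "A2": [30.0], "A3": [20.0]}
blacklist = overloaded_servers(queues, threshold=200.0)
moves = rebalance(queues, "A1", ["A2", "A3"], threshold=200.0)
```

Here the moved clips total 210 seconds, which exceeds the 130 second difference between A1's pending total and the threshold, consistent with the requirement that the distributed duration be at least that difference.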

The third judgment module is used to judge whether the first voice recognition server is in a normal working state;

A second comparison module, configured to, if not, compare the total durations of the voice data to be recognized by the voice recognition servers other than the first voice recognition server to obtain a comparison result;

A determining module, configured to determine a fourth voice recognition server for processing the voice recognition request according to the comparison result, wherein the fourth voice recognition server is in a normal working state;

The second distribution module is configured to distribute the voice recognition request to the fourth voice recognition server.

In this embodiment, after the load balancer determines to distribute the voice recognition request to the first voice recognition server, the first voice recognition server may not be in a normal working state. In that case the first voice recognition server cannot respond to the voice recognition request or subsequently recognize the voice data corresponding to the voice recognition request, so the whole process cannot work normally: the IVR system cannot receive the recognition result of the first voice recognition server, and the user cannot get the response of the IVR system in time, resulting in a poor user experience. Therefore, in this embodiment, it is necessary to further determine whether the first voice recognition server is in a normal working state. If it is determined that the first voice recognition server cannot work normally, the first voice recognition server is excluded, and the actual resource utilization rates of the voice recognition servers other than the first voice recognition server are analyzed to determine a fourth voice recognition server for processing the voice recognition request, wherein the total duration of the voice data currently to be recognized by the fourth voice recognition server is the smallest among those other voice recognition servers. After confirming that the fourth voice recognition server is in a normal working state, the load balancer distributes the voice recognition request to the fourth voice recognition server so that the fourth voice recognition server can process it accordingly. In this embodiment, when the first voice recognition server cannot work normally, the voice recognition request is processed by calling the fourth voice recognition server, which is in a normal working state. This keeps the recognition process running normally, so the IVR system can receive the recognition result of the fourth voice recognition server in time, and the user can also get the response of the IVR system in time, ensuring a good user experience.

Further, in an embodiment of the present application, the above determination module includes:

A sorting unit, configured to sort the other voice recognition servers in the order of the total duration of the voice data to be recognized from small to large;

A traversal unit, configured to traverse the other speech recognition servers according to the order;

The judging unit is used to judge whether the currently accessed voice recognition server is in a normal working state;

A determining unit, configured to, if so, determine the currently accessed voice recognition server as the fourth voice recognition server for processing the voice recognition request, and stop traversing.
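
The sorting, traversal, judging, and determining units above can be sketched as a single function. The name `pick_fourth_server` and the `is_healthy` callback are hypothetical; how a server's working state is actually probed is not specified here.

```python
from typing import Callable, Dict, Optional


def pick_fourth_server(
    pending: Dict[str, float],
    first_server: str,
    is_healthy: Callable[[str], bool],
) -> Optional[str]:
    """Least-loaded healthy server other than `first_server`, or None."""
    others = [server for server in pending if server != first_server]
    # Sort by pending total duration from small to large, then traverse in order.
    for server in sorted(others, key=lambda name: pending[name]):
        if is_healthy(server):
            return server  # stop traversing at the first healthy server
    return None


# Illustrative state: A1 was chosen first but is down, and A2 is down as well.
totals = {"A1": 50.0, "A2": 80.0, "A3": 120.0, "A4": 200.0}
healthy = {"A3", "A4"}
fourth_server = pick_fourth_server(totals, "A1", lambda name: name in healthy)
```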

The establishment module is configured to establish a long TCP (Transmission Control Protocol) connection with each of the voice recognition servers, so as to perform information interaction with each of the voice recognition servers through the long TCP connection.

A fourth judgment module, configured to judge whether the response information for the voice recognition request fed back by the first voice recognition server is received;

A forwarding module, configured to, if so, forward the response information to the IVR system;

A second receiving module, configured to receive voice data corresponding to the voice recognition request sent by the IVR system;

The sending module is configured to send the voice data to the first voice recognition server, so that the first voice recognition server recognizes the voice data.

The resource allocation device based on data analysis in the embodiment of the present application selects a reasonable voice recognition server to process the received voice recognition request by comparing and analyzing the total duration of the voice data currently to be recognized by each voice recognition server, which is beneficial to achieving balanced use of the resources of the voice recognition servers and improves the efficiency of processing voice recognition requests.

Referring to FIG. 4, an embodiment of the present application further provides a computer device. The computer device may be a server, and its internal structure may be as shown in FIG. 4. The computer device includes a processor, a memory, a network interface, and a database connected by a system bus. The processor of the computer device is used to provide computing and control capabilities. The memory of the computer device includes a non-volatile storage medium and an internal memory. The non-volatile storage medium stores an operating system, computer-readable instructions, and a database. The internal memory provides an environment for running the operating system and the computer-readable instructions in the non-volatile storage medium. The database of the computer device is used to store data such as voice recognition requests and the durations of voice data to be recognized. The network interface of the computer device is used to communicate with external terminals through a network connection. When the computer-readable instructions are executed, the processes of the foregoing method embodiments are executed. Those skilled in the art can understand that the structure shown in FIG. 4 is only a block diagram of the part of the structure related to the solution of the present application, and does not constitute a limitation on the computer device to which the solution of the present application is applied.

An embodiment of the present application further provides a computer non-volatile readable storage medium on which computer-readable instructions are stored. When the computer-readable instructions are executed, the processes of the foregoing method embodiments are performed. The above are only the preferred embodiments of the present application and do not limit the patent scope of the present application. Any equivalent structure or equivalent process transformation made using the description and drawings of this application, whether applied directly or indirectly in other related technical fields, is likewise included in the scope of patent protection of this application.

Claims

A resource allocation method based on data analysis is characterized by including:

Receive voice recognition request sent by IVR (Interactive Voice Response) system;

Calculate the total duration of the current voice data to be recognized by each voice recognition server separately;

Comparing each of the total durations, determining the voice recognition server with the smallest total duration of the voice data to be recognized, and determining the voice recognition server with the smallest total duration as the first voice recognition server for processing the voice recognition request;

Distribute the voice recognition request to the first voice recognition server.
The resource allocation method based on data analysis according to claim 1, wherein after the step of distributing the voice recognition request to the first voice recognition server, it includes:

Judging whether there is a second voice recognition server whose total duration of the voice data to be recognized is greater than a preset duration threshold;

If yes, add the second voice recognition server to the blacklist that prohibits the processing of new voice recognition requests.
The resource allocation method based on data analysis according to claim 2, wherein the step of adding the second voice recognition server to a blacklist that prohibits the processing of new voice recognition requests includes:

Calculating the average duration of the voice data to be recognized by each voice recognition server according to the total duration of the voice data to be recognized by each voice recognition server;

Filtering out a third voice recognition server whose total duration of voice data to be recognized is less than the average duration;

After the first preset time, determine whether the total duration of the voice data to be recognized in the second voice recognition server is greater than the preset duration threshold;

If yes, allocate part of the voice data to be recognized in the second voice recognition server to the third voice recognition server, so that the total duration of the voice data to be recognized in the second voice recognition server after distribution is less than the preset duration threshold.
The resource allocation method based on data analysis according to claim 1, wherein after the step of distributing the voice recognition request to the first voice recognition server, it includes:

Determine whether the first voice recognition server is in a normal working state;

If not, compare the current total duration of voice data to be recognized by other voice recognition servers except the first voice recognition server to obtain a comparison result;

According to the comparison result, a fourth voice recognition server for processing the voice recognition request is determined, wherein the fourth voice recognition server is in a normal working state;

Distribute the voice recognition request to the fourth voice recognition server.
The resource allocation method based on data analysis according to claim 4, wherein the step of determining a fourth voice recognition server for processing the voice recognition request according to the comparison result includes:

Sort the other voice recognition servers according to the total duration of the voice data to be recognized from small to large;

Traverse the other speech recognition servers according to the ranking;

Determine whether the currently accessed voice recognition server is in a normal working state;

If yes, determine the currently accessed voice recognition server as the fourth voice recognition server for processing the voice recognition request, and stop traversing.
The resource allocation method based on data analysis according to claim 1, wherein before the step of receiving the voice recognition request sent by the IVR system, it includes:

Establishing a long TCP (transmission control protocol) connection with each of the voice recognition servers, so as to perform information interaction with each of the voice recognition servers through the long TCP connection.
A resource allocation device based on data analysis is characterized in that it includes:

The first receiving module is used to receive the voice recognition request sent by the IVR system;

The first calculation module is used to separately calculate the total duration of the current voice data to be recognized by each voice recognition server;

The first comparison module is used to compare each of the total durations to obtain the voice recognition server with the minimum total duration of the voice data to be recognized, and determine the voice recognition server with the minimum total duration as the first voice recognition server for processing the voice recognition request;

The first distribution module is configured to distribute the recognition request to the first voice recognition server.
The resource allocation device based on data analysis according to claim 7, comprising:

A first judgment module, configured to judge whether there is a second voice recognition server whose total duration of the voice data to be recognized is greater than a preset duration threshold;

A joining module, configured to, if such a second voice recognition server exists, add the second voice recognition server to a blacklist that prohibits the processing of new voice recognition requests.
The resource allocation device based on data analysis according to claim 8, comprising:

A second calculation module, configured to calculate the average duration of voice data to be recognized by each of the voice recognition servers according to the total duration of voice data to be recognized by each of the voice recognition servers;

A screening module, configured to screen out a third voice recognition server whose total duration of voice data to be recognized is less than the average duration;

A second judgment module, configured to judge whether the total duration of the voice data to be recognized in the second voice recognition server is greater than the preset duration threshold after the first preset time has passed;

A distribution module, configured to, if so, distribute part of the voice data to be recognized in the second voice recognition server to the third voice recognition server, so that the total duration of the voice data to be recognized in the second voice recognition server after the distribution is less than the preset duration threshold.
The resource allocation device based on data analysis according to claim 7, comprising:

The third judgment module is used to judge whether the first voice recognition server is in a normal working state;

A second comparison module, configured to, if not, compare the total durations of the voice data to be recognized by the voice recognition servers other than the first voice recognition server to obtain a comparison result;

A determining module, configured to determine a fourth voice recognition server for processing the voice recognition request according to the comparison result, wherein the fourth voice recognition server is in a normal working state;

The second distribution module is configured to distribute the voice recognition request to the fourth voice recognition server.
The resource allocation device based on data analysis according to claim 10, wherein the determination module comprises:

A sorting unit, configured to sort the other voice recognition servers in the order of the total duration of the voice data to be recognized from small to large;

A traversal unit, configured to traverse the other speech recognition servers according to the order;

The judging unit is used to judge whether the currently accessed voice recognition server is in a normal working state;

A determining unit, configured to, if so, determine the currently accessed voice recognition server as the fourth voice recognition server for processing the voice recognition request, and stop traversing.
The resource allocation device based on data analysis according to claim 7, comprising:

The establishment module is configured to establish a long TCP (Transmission Control Protocol) connection with each of the voice recognition servers, so as to perform information interaction with each of the voice recognition servers through the long TCP connection.
A computer device, including a memory and a processor, the memory storing computer-readable instructions, wherein the processor, when executing the computer-readable instructions, implements a resource allocation method based on data analysis, the resource allocation method based on data analysis including:

Receive voice recognition request sent by IVR (Interactive Voice Response) system;

Calculate the total duration of the current voice data to be recognized by each voice recognition server separately;

Comparing each of the total durations, determining the voice recognition server with the smallest total duration of the voice data to be recognized, and determining the voice recognition server with the smallest total duration as the first voice recognition server for processing the voice recognition request;

Distribute the voice recognition request to the first voice recognition server.
The computer device based on data analysis according to claim 13, wherein after the step of distributing the voice recognition request to the first voice recognition server, it includes:

Judging whether there is a second voice recognition server whose total duration of the voice data to be recognized is greater than a preset duration threshold;

If yes, add the second voice recognition server to the blacklist that prohibits the processing of new voice recognition requests.
The computer device based on data analysis according to claim 14, wherein the step of adding the second voice recognition server to a blacklist that prohibits the processing of new voice recognition requests includes:

Calculating the average duration of the voice data to be recognized by each voice recognition server according to the total duration of the voice data to be recognized by each voice recognition server;

Filtering out a third voice recognition server whose total duration of voice data to be recognized is less than the average duration;

After the first preset time, determine whether the total duration of the voice data to be recognized in the second voice recognition server is greater than the preset duration threshold;

If yes, allocate part of the voice data to be recognized in the second voice recognition server to the third voice recognition server, so that the total duration of the voice data to be recognized in the second voice recognition server after distribution is less than the preset duration threshold.
The computer device based on data analysis according to claim 13, wherein after the step of distributing the voice recognition request to the first voice recognition server, it includes:

Determine whether the first voice recognition server is in a normal working state;

If not, compare the current total duration of voice data to be recognized by other voice recognition servers except the first voice recognition server to obtain a comparison result;

According to the comparison result, a fourth voice recognition server for processing the voice recognition request is determined, wherein the fourth voice recognition server is in a normal working state;

Distribute the voice recognition request to the fourth voice recognition server.
A computer non-volatile readable storage medium on which computer-readable instructions are stored, characterized in that, when the computer-readable instructions are executed by a processor, a resource allocation method based on data analysis is implemented, the resource allocation method based on data analysis including:

Receive voice recognition request sent by IVR (Interactive Voice Response) system;

Calculate the total duration of the current voice data to be recognized by each voice recognition server separately;

Comparing each of the total durations, determining the voice recognition server with the smallest total duration of the voice data to be recognized, and determining the voice recognition server with the smallest total duration as the first voice recognition server for processing the voice recognition request;

Distribute the voice recognition request to the first voice recognition server.
The computer non-volatile readable storage medium according to claim 17, wherein, after the step of the processor distributing the voice recognition request to the first voice recognition server, the method includes:

Judging whether there is a second voice recognition server whose total duration of the voice data to be recognized is greater than a preset duration threshold;

If yes, add the second voice recognition server to the blacklist that prohibits the processing of new voice recognition requests.
The computer non-volatile readable storage medium according to claim 18, wherein after the processor adds the second voice recognition server to a blacklist that prohibits the processing of new voice recognition requests, it includes:

Calculating the average duration of the voice data to be recognized by each voice recognition server according to the total duration of the voice data to be recognized by each voice recognition server;

Filtering out a third voice recognition server whose total duration of voice data to be recognized is less than the average duration;

After the first preset time, determine whether the total duration of the voice data to be recognized in the second voice recognition server is greater than the preset duration threshold;

If yes, allocate part of the voice data to be recognized in the second voice recognition server to the third voice recognition server, so that the total duration of the voice data to be recognized in the second voice recognition server after distribution is less than the preset duration threshold.
The computer non-volatile readable storage medium according to claim 17, wherein, after the step of the processor distributing the voice recognition request to the first voice recognition server, the method includes:

Determine whether the first voice recognition server is in a normal working state;

If not, compare the current total duration of voice data to be recognized by other voice recognition servers except the first voice recognition server to obtain a comparison result;

According to the comparison result, a fourth voice recognition server for processing the voice recognition request is determined, wherein the fourth voice recognition server is in a normal working state;

Distribute the voice recognition request to the fourth voice recognition server.