WO2021143095A1

WO2021143095A1 - Dialing test method and apparatus, and computer device and storage medium

Info

Publication number: WO2021143095A1
Application number: PCT/CN2020/105045
Authority: WO
Inventors: 刘建华; 吕林澧; 徐从国; 叶松; 余艳萍; 樊维; 毛娟
Original assignee: 深圳壹账通智能科技有限公司
Priority date: 2020-01-14
Filing date: 2020-07-28
Publication date: 2021-07-22
Also published as: CN111225114A

Abstract

Disclosed in the present application are a dialing test method and apparatus, and a computer device and a storage medium. The method comprises: obtaining a sustained energy value fed back by a host to be tested; when the fed back sustained energy value is within a preset energy range, establishing a communication connection between a service interface to be tested and said host; monitoring, by means of an audio endpoint, whether said host issues first preset voice data in a communication connection process; when said host issues the first preset voice data, acquiring a first sampling frequency of the first preset voice data, and verifying whether the first sampling frequency reaches a standard; when the first sampling frequency reaches a standard, sending a voice reply to said host, and after obtaining second preset voice data fed back for the voice reply by said host, converting the second preset voice data into text data and then matching the text data with matching content; and when matching succeeds, completing the communication connection after determining that the second preset voice data is errorless. By means of the present application, the call service quality of the host to be tested can be effectively tested.

Description

Dial test method, device, computer equipment and storage medium

This application claims the priority of a Chinese patent application filed with the Chinese Patent Office on January 14, 2020, the application number is 202010037782.1, and the invention title is "Dial test method, device, computer equipment and storage medium", the entire content of which is incorporated by reference In this application.

To

Technical field

本申请涉及人工智能领域中的语音处理领域，尤其涉及一种拨测方法、装置、计算机设备及存储介质。This application relates to the field of voice processing in the field of artificial intelligence, and in particular to a dial test method, device, computer equipment, and storage medium.

To

背景技术Background technique

随着人工智能的发展，目前市面上出现很多智能语音机器人的产品或者系统，通过语音机器人的产品或者系统代替人工进行客服和营销等工作。但发明人意识到目前的智能语音机器人产品或者系统的服务质量是否正常是通过人工方式去进行拨测，此方式需浪费大量人力成本且耗时过多，且在人工拨测的过程中，需人工手动聚合各组件监控信息并根据聚合的监控信息的逻辑制定策略，因此将导致最终确定语音机器人的服务质量的异常的时间严重滞后，甚至根本无法察觉语音机器人的服务质量的异常。因此，寻找一种解决上述问题的技术方案成为本领域人员亟需解决的问题。With the development of artificial intelligence, there are many intelligent voice robot products or systems on the market. The voice robot products or systems replace humans for customer service and marketing. However, the inventor realizes that whether the service quality of current intelligent voice robot products or systems is normal is to be dialed manually. This method wastes a lot of labor costs and is time-consuming, and in the process of manual dialing, it is necessary Manually aggregate the monitoring information of each component and formulate strategies based on the logic of the aggregated monitoring information, which will cause a serious delay in the final determination of the abnormal service quality of the voice robot, and even the abnormal service quality of the voice robot cannot be detected at all. Therefore, finding a technical solution to solve the above-mentioned problems has become an urgent problem to be solved by those skilled in the art.

To

发明内容Summary of the invention

基于此，有必要针对上述技术问题，提供一种拨测方法、装置、计算机设备及存储介质，用于解决现有技术无法对语音机器人（也即待测主机）的通话服务质量进行有效检测的问题。Based on this, it is necessary to address the above technical problems and provide a dial test method, device, computer equipment and storage medium to solve the problem that the existing technology cannot effectively detect the call service quality of the voice robot (that is, the host under test). problem.

一种拨测方法，包括：A dial test method, including:

调用待测主机的待测服务接口，获取所述待测服务接口的唯一标识，并通过所述唯一标识向所述待测服务接口发送通信连接指令；Call the service interface to be tested of the host to be tested, obtain the unique identifier of the service interface to be tested, and send a communication connection instruction to the service interface to be tested through the unique identifier;

监听所述待测主机的主机声卡，并获取所述待测主机的主机声卡在预设时间阈值内反馈的持续能量值；Monitor the host sound card of the host under test, and obtain the continuous energy value fed back by the host sound card of the host under test within a preset time threshold;

在所述主机声卡在预设时间阈值内反馈的所述持续能量值均位于预设能量范围之内时，发送接听指令至所述待测服务接口，以通过所述待测服务接口与所述待测主机建立通信连接；When the sustained energy value fed back by the host sound card within a preset time threshold is within the preset energy range, an answering instruction is sent to the service interface under test, so as to communicate with the service interface under test through the service interface under test. The host to be tested establishes a communication connection;

通过音频端点监听所述待测主机在通信连接过程中是否发出第一预设语音数据；Monitor through the audio endpoint whether the host under test sends out the first preset voice data during the communication connection process;

在所述待测主机发出所述第一预设语音数据时，采集所述第一预设语音数据的第一采样频率，并根据预设的背景频率范围验证所述第一采样频率是否达标；When the host under test sends out the first preset voice data, collecting the first sampling frequency of the first preset voice data, and verifying whether the first sampling frequency meets the standard according to the preset background frequency range;

在所述第一采样频率达标时，按照预设模板发送与所述第一预设语音数据对应的语音回复至所述待测主机，获取所述待测主机针对所述语音回复所反馈的第二预设语音数据，将所述第二预设语音数据转换为文本数据，并将所述文本数据与所述预设模板对应的匹配内容进行匹配；When the first sampling frequency reaches the standard, send a voice response corresponding to the first preset voice data to the host under test according to a preset template, and obtain the first feedback from the host under test for the voice response 2. preset voice data, converting the second preset voice data into text data, and matching the text data with matching content corresponding to the preset template;

在所有所述文本数据与所述预设模板对应的所述匹配内容匹配成功时，确定所述待测主机反馈的所述第二预设语音数据无误。When all the text data and the matching content corresponding to the preset template are successfully matched, it is determined that the second preset voice data fed back by the host to be tested is correct.

一种拨测装置，包括：A dial test device, including:

发送模块，用于调用待测主机的待测服务接口，获取所述待测服务接口的唯一标识，并通过所述唯一标识向所述待测服务接口发送通信连接指令；A sending module, configured to call the service interface under test of the host under test, obtain the unique identifier of the service interface under test, and send a communication connection instruction to the service interface under test through the unique identifier;

获取模块，用于监听所述待测主机的主机声卡，并获取所述待测主机的主机声卡在预设时间阈值内反馈的持续能量值；An obtaining module, configured to monitor the host sound card of the host under test, and obtain the continuous energy value fed back by the host sound card of the host under test within a preset time threshold;

触发模块，用于在所述主机声卡在预设时间阈值内反馈的所述持续能量值均位于预设能量范围之内时，发送接听指令至所述待测服务接口，以通过所述待测服务接口与所述待测主机建立通信连接；The trigger module is configured to send an answering instruction to the service interface to be tested when the sustained energy value fed back by the host sound card within a preset time threshold is within the preset energy range, so as to pass the test The service interface establishes a communication connection with the host to be tested;

监听模块，用于通过音频端点监听所述待测主机在通信连接过程中是否发出第一预设语音数据；The monitoring module is configured to monitor whether the host under test sends out the first preset voice data during the communication connection process through the audio endpoint;

采集模块，用于在所述待测主机发出所述第一预设语音数据时，采集所述第一预设语音数据的第一采样频率，并根据预设的背景频率范围验证所述第一采样频率是否达标；The collection module is configured to collect the first sampling frequency of the first preset voice data when the host under test sends out the first preset voice data, and verify the first sampling frequency according to the preset background frequency range. Whether the sampling frequency meets the standard;

匹配模块，用于在所述第一采样频率达标时，按照预设模板发送与所述第一预设语音数据对应的语音回复至所述待测主机，获取所述待测主机针对所述语音回复所反馈的第二预设语音数据，将所述第二预设语音数据转换为文本数据，并将所述文本数据与所述预设模板对应的匹配内容进行匹配；The matching module is configured to send a voice response corresponding to the first preset voice data to the host under test according to a preset template when the first sampling frequency reaches the standard, and obtain the host under test for the voice Replying to the second preset voice data fed back, converting the second preset voice data into text data, and matching the text data with matching content corresponding to the preset template;

第一确定模块，用于在所有所述文本数据与所述预设模板对应的所述匹配内容匹配成功时，确定所述待测主机反馈的所述第二预设语音数据无误。The first determining module is configured to determine that the second preset voice data fed back by the host to be tested is correct when all the text data and the matching content corresponding to the preset template are successfully matched.

一种计算机设备，包括存储器、处理器以及存储在所述存储器中并可在所述处理器上运行的计算机程序，所述处理器执行所述计算机程序时实现上述拨测方法。A computer device includes a memory, a processor, and a computer program that is stored in the memory and can run on the processor, and the processor implements the above dial test method when the computer program is executed.

一种计算机可读存储介质，所述计算机可读存储介质存储有计算机程序，所述计算机程序被处理器执行时实现上述拨测方法。A computer-readable storage medium that stores a computer program that implements the above dial test method when the computer program is executed by a processor.

一种计算机设备，包括存储器、处理器以及存储在所述存储器中并可在所述处理器上运行的计算机可读指令，所述处理器执行所述计算机可读指令时实现如下步骤：A computer device includes a memory, a processor, and computer-readable instructions that are stored in the memory and can run on the processor, and the processor implements the following steps when the processor executes the computer-readable instructions:

一个或多个存储有计算机可读指令的可读存储介质，所述计算机可读指令被一个或多个处理器执行时，使得所述一个或多个处理器执行如下步骤：One or more readable storage media storing computer readable instructions, when the computer readable instructions are executed by one or more processors, the one or more processors execute the following steps:

上述拨测方法、装置、计算机设备及存储介质，通过音频端点检测持续能量值来证明主机声卡所在的待测主机是否能被呼成功，从而确保待测主机的被呼功能；通过音频端点监听待测主机（可被理解成一种虚拟的语音机器人）在通信连接过程中是否能进行正常的通语音服务，从而能确保待测主机可以进行正常的对话服务；通过预设的背景频率范围验证第一采样频率是否达标，从而能确保第一采样频率可以维持在一个相对于人类人耳可以接受的频率范围中（过高过低的第一采样频率以及背景频率的干扰都会影响到人耳的接受程度），方便在用户与待测主机进行通信过程中，可提高用户的体验效果。总的来说，本发明能确保待测主机的通话服务质量。The above dial test method, device, computer equipment and storage medium use the audio endpoint to detect the continuous energy value to prove whether the host under test where the sound card of the host is located can be called successfully, so as to ensure the called function of the host under test; Whether the test host (can be understood as a virtual voice robot) can provide normal voice services during the communication connection process, so as to ensure that the host under test can perform normal conversation services; verify the first through the preset background frequency range Whether the sampling frequency is up to standard, so as to ensure that the first sampling frequency can be maintained in a frequency range that is acceptable to the human ear (the first sampling frequency that is too high or too low and the interference of the background frequency will affect the acceptance of the human ear ), which is convenient for the user to communicate with the host under test, which can improve the user experience. In general, the present invention can ensure the call service quality of the host to be tested.

本申请的一个或多个实施例的细节在下面的附图和描述中提出，本申请的其他特征和优点将从说明书、附图以及权利要求变得明显。The details of one or more embodiments of the present application are presented in the following drawings and description, and other features and advantages of the present application will become apparent from the description, drawings and claims.

To

附图说明Description of the drawings

为了更清楚地说明本申请实施例的技术方案，下面将对本申请实施例的描述中所需要使用的附图作简单地介绍，显而易见地，下面描述中的附图仅仅是本申请的一些实施例，对于本领域普通技术人员来讲，在不付出创造性劳动性的前提下，还可以根据这些附图获得其他的附图。In order to explain the technical solutions of the embodiments of the present application more clearly, the following will briefly introduce the drawings that need to be used in the description of the embodiments of the present application. Obviously, the drawings in the following description are only some embodiments of the present application. For those of ordinary skill in the art, other drawings can be obtained based on these drawings without creative labor.

图1是本申请一实施例中拨测方法的一应用环境示意图；FIG. 1 is a schematic diagram of an application environment of the dial test method in an embodiment of the present application;

图2是本申请一实施例中拨测方法的一流程图；FIG. 2 is a flowchart of a dial test method in an embodiment of the present application;

图3是本申请一实施例中拨测装置的结构示意图；3 is a schematic diagram of the structure of a dial test device in an embodiment of the present application;

图4是本申请一实施例中计算机设备的一示意图。Fig. 4 is a schematic diagram of a computer device in an embodiment of the present application.

To

具体实施方式Detailed ways

下面将结合本申请实施例中的附图，对本申请实施例中的技术方案进行清楚、完整地描述，显然，所描述的实施例是本申请一部分实施例，而不是全部的实施例。基于本申请中的实施例，本领域普通技术人员在没有作出创造性劳动前提下所获得的所有其他实施例，都属于本申请保护的范围。The technical solutions in the embodiments of the present application will be described clearly and completely in conjunction with the accompanying drawings in the embodiments of the present application. Obviously, the described embodiments are part of the embodiments of the present application, rather than all of them. Based on the embodiments in this application, all other embodiments obtained by those of ordinary skill in the art without creative work shall fall within the protection scope of this application.

本申请提供的拨测方法，可应用在如图1的应用环境中，其中，客户端通过网络与服务器进行通信。其中，客户端可以但不限于各种个人计算机、笔记本电脑、智能手机、平板电脑和便携式可穿戴设备。服务器可以用独立的服务器或者是多个服务器组成的服务器集群来实现。The dial test method provided in this application can be applied in the application environment as shown in Fig. 1, in which the client communicates with the server through the network. Among them, the client can be, but is not limited to, various personal computers, notebook computers, smart phones, tablet computers, and portable wearable devices. The server can be implemented as an independent server or a server cluster composed of multiple servers.

在一实施例中，如图2所示，提供一种拨测方法，以该方法应用在图1中的服务器为例进行说明，包括如下步骤S10-S70：In one embodiment, as shown in FIG. 2, a dial test method is provided. The method is applied to the server in FIG. 1 as an example for description, including the following steps S10-S70:

S10S10 ，调用待测主机的待测服务接口，获取所述待测服务接口的唯一标识，并通过所述唯一标识向所述待测服务接口发送通信连接指令；Invoke the service interface under test of the host under test, obtain the unique identifier of the service interface under test, and send a communication connection instruction to the service interface under test through the unique identifier;

可理解地，待测主机具有可向外界提供一个智能语音回复的功能，且该待测主机可被作为客服机器人（也即一种虚拟的语音机器人)去完成咨询回复、营销和调查等工作；唯一标识是待测服务接口的识别码，通过该识别码可向指定的待测服务接口发送通信连接指令。在本实施例中，向待测服务接口发送通信连接指令是为了进行拨测任务，测试待测主机的待测服务接口的智能语音回复的功能。Understandably, the host under test has the function of providing an intelligent voice response to the outside world, and the host under test can be used as a customer service robot (that is, a virtual voice robot) to complete consultation responses, marketing, and surveys; The unique identifier is the identification code of the service interface to be tested, through which the communication connection instruction can be sent to the designated service interface to be tested. In this embodiment, the purpose of sending a communication connection instruction to the service interface under test is to perform a dial test task and test the intelligent voice reply function of the service interface under test of the host under test.

S20S20 ，监听所述待测主机的主机声卡，并获取所述待测主机的主机声卡在预设时间阈值内反馈的持续能量值；, Monitor the host sound card of the host under test, and obtain the continuous energy value fed back by the host sound card of the host under test within a preset time threshold;

可理解地，通过音频端点技术可以确定出静音点与声音点之间的区别，静音点是指主机声卡未反馈声音所在的端点，声音点是指主机声卡反馈的声音所在的端点，且不管是静音点还是声音点，都可以通过与音频端点技术相关联的能量组件（通过能量振幅确定出能量值）确定出能量振幅（也即确定出能量值，一个端点可代表一个能量值，且每个端点可位于其中一语音帧中，且每语音帧包含相同数量的端点，因此可计算出一语音帧的能量值），此时可获取主机声卡在预设时间阈值内反馈的持续能量值（可能为多个连续静音点组成的至少一个语音帧所对应的持续能量值，也可能为多个连续声音点组成的至少一个语音帧所对应的持续能量值）。Understandably, the difference between the mute point and the sound point can be determined through audio endpoint technology. The mute point refers to the endpoint where the host sound card does not feedback sound, and the sound point refers to the endpoint where the sound feedback from the host sound card is located, regardless of whether it is The mute point or the sound point can be determined by the energy component associated with the audio endpoint technology (the energy value is determined by the energy amplitude) (that is, the energy value is determined). An endpoint can represent an energy value, and each The endpoint can be located in one of the voice frames, and each voice frame contains the same number of endpoints, so the energy value of a voice frame can be calculated). At this time, the continuous energy value of the host sound card feedback within the preset time threshold can be obtained (possibly The sustained energy value corresponding to at least one speech frame composed of multiple consecutive silent points may also be the sustained energy value corresponding to at least one speech frame composed of multiple consecutive sound points).

S30S30 ，在所述主机声卡在预设时间阈值内反馈的所述持续能量值均位于预设能量范围之内时，发送接听指令至所述待测服务接口，以通过所述待测服务接口与所述待测主机建立通信连接；, When the sustained energy value fed back by the host sound card within the preset time threshold is within the preset energy range, the answering instruction is sent to the service interface under test, so as to communicate with the service interface under test through the service interface under test. The host to be tested establishes a communication connection;

可理解地，通过在预设时间阈值内的主机声卡反馈的声音的平均能量值和经验数据值设置一个预设能量范围（由于主机声卡在静音的时候，待测主机在与音频端点相关联的能量组件中所确定的能量值，明显小于在主机声卡在反馈声音的时候，待测主机在与音频端点相关联的能量组件中所确定的能量值，因此，预设能量范围将是以上述提到的声音点组成的至少一个语音帧对应的持续能量值而设置的一个对比标准）。可理解地，通过音频端点能确定出语音开始的前端点和语音结束的后端点，但在本实施例中无需确定语音结束的后端点，只需确定语音开始的前端点（上述提到的声音点）和设置一个预设时间阈值就能实现判断主机声卡所在的待测主机是否能实现被呼的功能。Understandably, a preset energy range is set by the average energy value of the sound feedback from the host sound card within the preset time threshold and the empirical data value (because the host sound card is muted, the host under test is The energy value determined in the energy component is significantly smaller than the energy value determined in the energy component associated with the audio endpoint when the host sound card is feeding back sound. Therefore, the preset energy range will be based on the above mentioned value. A comparison standard is set based on the sustained energy value corresponding to at least one speech frame composed of the received sound points). Understandably, the audio endpoint can determine the front-end point where the voice starts and the back-end point where the voice ends. However, in this embodiment, there is no need to determine the back-end point where the voice ends. You only need to determine the front-end point where the voice starts (the aforementioned sound Point) and setting a preset time threshold can realize the function of judging whether the host under test where the host sound card is located can be called.

在另一实施例中，在所述主机声卡在预设时间阈值内反馈的所述持续能量值小于预设能量范围时，向预设人员发送无法呼出的提示信息，并结束通信连接，此时，主机声卡所在的待测主机不能实现被呼功能。In another embodiment, when the sustained energy value fed back by the host sound card within a preset time threshold is less than the preset energy range, a reminder message that the call cannot be made is sent to the preset person, and the communication connection is terminated. , The host under test where the host sound card is located cannot realize the called function.

S40S40 ，通过音频端点监听所述待测主机在通信连接过程中是否发出第一预设语音数据；, Monitoring through the audio endpoint whether the host under test sends out the first preset voice data during the communication connection process;

可理解地，待测主机发出一句第一预设语音数据时，就能通过音频端点监听到第一预设语音数据所对应的一个瞬时能量值，第一预设语音数据可以为待测主机的反馈的一句礼貌用语，在识别到礼貌用语对应的第一预设语音数据的瞬时能量值达到预设能量阈值时，就可以确定待测主机在通信连接过程是正常。本实施例是通过音频端点来判断待测主机在通信连接过程是否能进行正常的对话服务。Understandably, when the host under test sends out a sentence of first preset voice data, an instantaneous energy value corresponding to the first preset voice data can be monitored through the audio endpoint. The first preset voice data can be the host's A feedback polite phrase, when it is recognized that the instantaneous energy value of the first preset voice data corresponding to the polite phrase reaches the preset energy threshold, it can be determined that the host under test is normal in the communication connection process. In this embodiment, the audio endpoint is used to determine whether the host under test can perform normal conversation services during the communication connection process.

进一步地，所述通过音频端点检测监听所述待测主机在通信连接过程中是否发出第一预设语音数据，包括：Further, the detecting and monitoring whether the host to be tested sends out the first preset voice data during the communication connection process through the audio endpoint detection includes:

通过音频端点检测监听所述待测主机反馈的瞬时能量值是否达到预设能量阈值；Monitoring whether the instantaneous energy value fed back by the host under test reaches a preset energy threshold through audio endpoint detection;

在反馈的所述瞬时能量值达到所述预设能量阈值时，确定所述待测主机发出所述第一预设语音数据。When the feedback instantaneous energy value reaches the preset energy threshold, it is determined that the host under test emits the first preset voice data.

在本实施例中具体的方法同步骤S30的一致，只不过在本实施例中只需确定出第一个声音点所组成的一个语音帧对应的瞬时能量值。The specific method in this embodiment is the same as that of step S30, except that in this embodiment, only the instantaneous energy value corresponding to a speech frame formed by the first sound point needs to be determined.

S50S50 ，在所述待测主机发出所述第一预设语音数据时，采集所述第一预设语音数据的第一采样频率，并根据预设的背景频率范围验证所述第一采样频率是否达标；, When the host under test sends out the first preset voice data, collect the first sampling frequency of the first preset voice data, and verify whether the first sampling frequency meets the standard according to the preset background frequency range ；

可理解地，第一采样频率是指服务器每秒钟采集多少个声音样本，是描述声音文件的音质、音调，衡量声卡、声音文件的质量标准。Understandably, the first sampling frequency refers to the number of sound samples collected by the server per second, which describes the sound quality and tone of the sound file, and measures the quality standards of the sound card and the sound file.

在本实施例中是在待测主机发出第一预设语音数据时，为了监控待测主机发出的第一预设语音数据的第一采样频率，通过预设的背景频率范围验证第一采样频率是否达标，可确定第一采样频率在根据背景频率（也即后文中提及的背景语音的第二采样频率）确定的预设的背景频率范围下，是否能排除背景频率的干扰，使得当前的第一采样频率可以维持在一个相对于人类人耳可以接受的频率范围中（过高过低的第一采样频率以及背景频率的干扰都会影响到人耳的接受程度），方便后续在用户与待测主机进行通信过程中，可提高用户的体验效果。In this embodiment, when the host under test sends out the first preset voice data, in order to monitor the first sampling frequency of the first preset voice data sent by the host under test, the first sampling frequency is verified through the preset background frequency range Whether it meets the standard, it can be determined whether the first sampling frequency can eliminate the interference of the background frequency under the preset background frequency range determined according to the background frequency (that is, the second sampling frequency of the background voice mentioned later), so that the current The first sampling frequency can be maintained in a frequency range that is acceptable to human ears (the first sampling frequency that is too high or too low and the interference of the background frequency will affect the acceptance of the human ear), which is convenient for subsequent users and waiting During the communication process of the test host, the user experience can be improved.

在另一实施例中，在所述待测主机未发出所述第一预设语音数据时，向预设人员发送无声警告的提示信息，并结束通信连接。本实施例可以说明待测主机出现问题（比如待测主机崩溃等）导致不能发声。In another embodiment, when the host under test does not send out the first preset voice data, a silent warning prompt message is sent to the preset personnel, and the communication connection is terminated. This embodiment can explain that the host to be tested has a problem (for example, the host to be tested crashes, etc.), which causes the host to be unable to make a sound.

进一步地，所述在所述待测主机发出所述第一预设语音数据时，采集所述第一预设语音数据的第一采样频率，并根据预设的背景频率范围验证所述第一采样频率是否达标，包括：Further, when the host under test sends out the first preset voice data, the first sampling frequency of the first preset voice data is collected, and the first sampling frequency is verified according to the preset background frequency range. Whether the sampling frequency meets the standard, including:

获取所述待测主机发出所述第一预设语音数据时的背景声音，采集所述背景语音的第二采样频率，通过所述第二采样频率确定所述背景频率范围。Acquire a background sound when the host under test emits the first preset voice data, collect a second sampling frequency of the background voice, and determine the background frequency range by the second sampling frequency.

在本实施例中，由于背景声音能一定程度影响到对第一采样频率是否达标的判断，且背景声音对应的第二采样频率也处于一种随时受外界因素干扰而产生变化并波动的状态，因此可通过当前采集的背景声音的第二采样频率来实时确定出背景频率范围，如此，此背景频率范围将更好地适用于对第一采样频率是否达标的判断。In this embodiment, because the background sound can affect the judgment of whether the first sampling frequency is up to the standard to a certain extent, and the second sampling frequency corresponding to the background sound is also in a state of being subject to interference from external factors at any time to change and fluctuate, Therefore, the background frequency range can be determined in real time through the second sampling frequency of the currently collected background sound. In this way, this background frequency range will be better suited for judging whether the first sampling frequency meets the standard.

S60S60 ，在所述第一采样频率达标时，按照预设模板发送与所述第一预设语音数据对应的语音回复至所述待测主机，获取所述待测主机针对所述语音回复所反馈的第二预设语音数据，将所述第二预设语音数据转换为文本数据，并将所述文本数据与所述预设模板对应的匹配内容进行匹配；When the first sampling frequency reaches the standard, send a voice response corresponding to the first preset voice data to the host under test according to a preset template, and obtain feedback from the host under test for the voice response Second preset voice data, converting the second preset voice data into text data, and matching the text data with matching content corresponding to the preset template;

可理解地，噪声数据可以背景声音或者其他杂音等；服务器可按照预设选取规则从预设模板中选出一套具体的应用场景模板（比如针对一个问题所进行的咨询），且各个预设模板都存在预设好的匹配内容（可理解为一个标准答案）；待测主机能识别和理解到服务器发送的关于第一预设语音数据对应的语音回复的内容，且待测主机识别的方法可通过语义识别模型去进行识别；待测主机针对语音回复所反馈的第二预设语音数据必须是针对于选中预设模板对应的匹配内容（若第二预设语音数据对应的文本数据与预设模板对应的匹配内容匹配失败，则说明待测主机由于问题的存在导致答非所问或者待测主机不能发出具体且完整的第二预设语音数据）。Understandably, the noise data can be background sounds or other noises; the server can select a set of specific application scenario templates from the preset templates according to preset selection rules (for example, a consultation for a problem), and each preset The templates have preset matching content (which can be understood as a standard answer); the host to be tested can recognize and understand the content of the voice response corresponding to the first preset voice data sent by the server, and the method for the host to be tested to identify The semantic recognition model can be used for recognition; the second preset voice data feedback by the host to be tested for the voice response must be for the matching content corresponding to the selected preset template (if the text data corresponding to the second preset voice data and the preset Assuming that the matching content corresponding to the template fails to match, it means that the host under test does not answer the question due to the existence of the problem or the host under test cannot send out specific and complete second preset voice data).

本实施例中是在第一采样频率达标时，为了检验待测主机是否能针对外界的问答所给出正确的反映（反馈第二预设语音数据），保证了待测主机可以进行正常的对话服务状态。In this embodiment, when the first sampling frequency reaches the standard, in order to check whether the host under test can give correct responses to the external question and answer (feedback the second preset voice data), it ensures that the host under test can conduct normal conversations. service status.

进一步地，所述获取所述待测主机针对所述语音回复所反馈的第二预设语音数据，包括：Further, the acquiring the second preset voice data fed back by the host under test for the voice response includes:

获取模拟人工组件，并以所述模拟人工组件针对所述待测主机的所述第一预设语音数据完成语音回复的时间点为所述待测主机反馈的未处理语音数据的开始时间点，以所述模拟人工组件针对所述未处理语音数据开始进行下一次语音回复的时间点为所述待测主机反馈的所述未处理语音数据的结束时间点，获取位于所述开始时间点和所述结束时间点之间的所述未处理语音数据；所述未处理语音数据包含噪声数据段；Obtain a simulated manual component, and use the time point when the simulated manual component completes a voice response to the first preset voice data of the host under test as the start time point of the unprocessed voice data fed back by the host under test, Taking the time point when the simulated manual component starts the next voice response to the unprocessed voice data as the end time point of the unprocessed voice data fed back by the host to be tested, it is acquired at the start time point and the time point. The unprocessed voice data between the end time points; the unprocessed voice data includes a noise data segment;

确定所述未处理语音数据中的所有语音段的频谱，根据所述语音段的频谱对所述未处理语音数据中的所述噪声数据段进行清除，将所述待测主机针对所述语音回复所反馈的未包含所述噪声数据段的所述未处理语音数据记录为第二预设语音数据。Determine the spectrum of all voice segments in the unprocessed voice data, clear the noise data segment in the unprocessed voice data according to the spectrum of the voice segment, and send the host under test to the voice response The feedback unprocessed voice data that does not include the noise data segment is recorded as second preset voice data.

可理解地，模拟人工组件可被服务器调用，且服务器通过该模拟人工组件可让服务器针对待测主机的第一预设语音数据完成语音回复，并且该模拟人工组件是按照预设模板进行语音回复。Understandably, the simulated manual component can be called by the server, and the server can make the server complete the voice response to the first preset voice data of the host under test through the simulated manual component, and the simulated manual component is the voice response according to the preset template .

具体地，首先通过时间点来确定出一段未处理语音数据的开始时间点和结束时间点，并将位于开始时间点和结束时间点之间的这段包含噪声数据段的未处理语音数据获取出来；然后通过同端点类似方法确定出噪声数据段（通过只包含噪声数据段的一个语音数据确定出未处理语音数据中的噪声数据段），通过傅里叶变换确定出噪声数据段的频谱和包含噪声数据段的未处理语音数据的频谱后，在频谱中用包含噪声数据段的未处理语音数据的频谱减去噪声数据段的频谱从而实现对噪声数据段的清除，并得到语音降噪后第二预设语音数据的频谱；最后通过傅里叶逆变换将第二预设语音数据所在的频域转回到时域中，得到语音降噪后的第二预设语音数据。Specifically, the start time point and the end time point of a piece of unprocessed voice data are first determined by the time point, and the segment of unprocessed voice data containing the noise data segment between the start time point and the end time point is obtained ; Then the noise data segment is determined by a similar method to the endpoint (the noise data segment in the unprocessed voice data is determined by a voice data that only contains the noise data segment), and the frequency spectrum and inclusion of the noise data segment are determined by Fourier transform After the spectrum of the unprocessed voice data in the noise data segment, the spectrum of the unprocessed voice data containing the noise data segment is subtracted from the spectrum of the noise data segment in the spectrum to realize the removal of the noise data segment, and the first voice after noise reduction is obtained. 2. The frequency spectrum of the preset voice data; finally, the frequency domain where the second preset voice data is located is converted back to the time domain through an inverse Fourier transform to obtain the second preset voice data after voice noise reduction.

在本实施例中可确定出未包含噪声数据段的第二预设语音数据，可便于服务器较好地识别第二预设语音数据，并将第二预设语音数据完整转换为文本数据。In this embodiment, the second preset voice data that does not contain the noise data segment can be determined, which can facilitate the server to better recognize the second preset voice data and completely convert the second preset voice data into text data.

进一步地，所述获取所述待测主机针对所述语音回复所反馈的第二预设语音数据之后，还包括：Further, after obtaining the second preset voice data fed back by the host under test for the voice response, the method further includes:

实时采集所述第二预设语音数据的第三采样频率，并检测所述第三采样频率是否落入所述背景频率范围；Collecting the third sampling frequency of the second preset voice data in real time, and detecting whether the third sampling frequency falls within the background frequency range;

在所述第二预设语音数据对应的所述第三采样频率未落入所述背景频率范围时，调用频率调制组件并通过所述频率调制组件对通信连接进行频率调制。When the third sampling frequency corresponding to the second preset voice data does not fall within the background frequency range, a frequency modulation component is called and the communication connection is frequency modulated through the frequency modulation component.

在本实施例中在第三采样频率未落入背景频率范围时，通过频率调制组件可以对通信连接进行频率调制，在第二预设语音数据对应的第三采样频率未落入背景频率范围时，可以将第二预设语音数据对应采集到的第三采样频率维持在背景频率范围中，保持良好的通话质量。在另一实施例中，在所述第二预设语音数据对应的所述第三采样频率落入所述背景频率范围时，无需调用频率调制组件并通过所述频率调制组件对通信连接进行频率调制。In this embodiment, when the third sampling frequency does not fall within the background frequency range, the frequency modulation component can be used to frequency modulate the communication connection. When the third sampling frequency corresponding to the second preset voice data does not fall within the background frequency range , The third sampling frequency corresponding to the second preset voice data can be maintained in the background frequency range to maintain good call quality. In another embodiment, when the third sampling frequency corresponding to the second preset voice data falls within the background frequency range, there is no need to call a frequency modulation component and use the frequency modulation component to frequency the communication connection. modulation.

S70S70 ，在所有所述文本数据与所述预设模板对应的所述匹配内容匹配成功时，确定所述待测主机反馈的所述第二预设语音数据无误。When all the text data and the matching content corresponding to the preset template are successfully matched, it is determined that the second preset voice data fed back by the host to be tested is correct.

在本实施例中可在所有文本数据与预设模板对应的匹配内容匹配成功时，实现通过待测主机的本次的拨测任务来确定待测主机未存在问题。In this embodiment, when all the text data and the matching content corresponding to the preset template are successfully matched, it is possible to determine that there is no problem with the host under test through the current dial test task of the host under test.

进一步地，所述将所述文本数据与所述预设模板对应的匹配内容进行匹配之后，还包括：Further, after the matching the text data with the matching content corresponding to the preset template, the method further includes:

在所述文本数据与所述预设模板对应的所述匹配内容匹配失败时，以预设提示方式发出第二预设语音数据有误的提示信息至预设人员终端。When the text data fails to match the matching content corresponding to the preset template, a preset prompting manner is used to send a prompt message indicating that the second preset voice data is incorrect to the preset personnel terminal.

可理解地，预设提示方式包括但不限于邮件和短信提示。在本实施例是为了给预设人员一个警惕的功能。Understandably, the preset prompt methods include, but are not limited to, email and short message prompts. In this embodiment, it is to give the preset personnel a vigilant function.

进一步地，所述在所述文本数据与所述预设模板对应的所述匹配内容匹配失败之后，还包括：Further, after the text data fails to match the matching content corresponding to the preset template, the method further includes:

根据所述文本数据确定所述待测主机的第二预设语音数据是否出现与预设丢包原因对应的丢包现象；Determining, according to the text data, whether the second preset voice data of the host to be tested has a packet loss phenomenon corresponding to a preset packet loss cause;

在出现与预设丢包原因对应的丢包现象时，对所述待测主机反馈的所述第二预设语音数据进行解码，并通过解码得到的解码信息确定出所述第二预设语音数据中发生丢包的丢包语音帧；When a packet loss phenomenon corresponding to a preset packet loss cause occurs, decode the second preset voice data fed back by the host to be tested, and determine the second preset voice based on the decoded information obtained by decoding Packet loss voice frames with packet loss in the data;

根据所述丢包语音帧的前一个所述语音帧的状态信息确定出替代语音帧，并将所述第二预设语音数据中的所述丢包语音帧替换为所述替代语音帧。Determine a replacement voice frame according to the state information of the previous voice frame of the packet loss voice frame, and replace the packet loss voice frame in the second preset voice data with the replacement voice frame.

具体地，首先分析待测主机在通信连接过程中针对语音回复所反馈的第二预设语音数据，并在第二预设语音数据不为具体且完整的语音数据时，可以确定待测主机在通信连接中出现预设丢包原因（第二预设语音数据中的部分或者全部语音帧由于外界环境的干扰导致出现丢包现象，上述外界环境的干扰包括物理线路故障、设备故障、病毒攻击、路由信息错误等干扰）；接着对第二预设语音数据进行解码流程（解码是编码的逆过程，同时去掉比特流在传播过程中混入的噪声数据，利用译码表把文字译成一组组数码或用译码表将代表某一项信息的一系列信号译成文字的过程称之为解码，且解码通常是一种较少输入变为较多输出的过程），且在解码流程中可以获取第二预设语音数据各语音帧的状态信息（状态信息包括LPC信息和解码后的残缺信号等，其中，LPC信息即线性预测编码信息），并通过计算各语音帧的比特率来确定语音帧是否丢失，将比特率丢失的语音帧作为发生丢包的丢包语音帧；最后令丢包补偿单元根据丢包的丢包语音帧的前一个语音帧的状态信息,重建出丢包语音帧的LPC信息和残缺信号，进而，丢包补偿单元再通过LPC信息和残缺信号重建出替代语音帧，此时，服务器将替代语音帧替代丢包语音帧。Specifically, firstly, analyze the second preset voice data fed back by the host under test for voice responses during the communication connection process, and when the second preset voice data is not specific and complete voice data, it can be determined that the host under test is A preset cause of packet loss occurs in the communication connection (part or all of the voice frames in the second preset voice data cause packet loss due to the interference of the external environment. The interference of the above external environment includes physical line failure, equipment failure, virus attack, Routing information error and other interference); then decode the second preset voice data (decoding is the reverse process of encoding, and at the same time remove the noise data mixed in the bit stream during the propagation process, and use the decoding table to translate the text into a group of groups The process of translating a series of signals representing a certain item of information into text by digital or using a decoding table is called decoding, and decoding is usually a process in which less input becomes more output), and it can be used in the decoding process. Acquire the state information of each voice frame of the second preset voice data (state information includes LPC information and decoded incomplete signal, etc., where LPC information is linear predictive coding information), and determine the voice by calculating the bit rate of each voice frame Whether the frame is lost, the speech frame with the bit rate loss is regarded as the lost speech frame with packet loss; finally the packet loss compensation unit is made to reconstruct the lost speech frame according to the state information of the previous speech frame of the lost packet speech frame Then, the packet loss compensation unit reconstructs the replacement voice frame through the LPC information and the torn signal. At this time, the server replaces the lost voice frame with the replacement voice frame.

在本实施例中可在出现与预设丢包原因对应的丢包现象时，对第二预设语音数据中发生丢包的丢包语音帧进行修复，保证了第二预设语音数据的完整性。In this embodiment, when the packet loss phenomenon corresponding to the preset packet loss reason occurs, the lost packet voice frame in the second preset voice data can be repaired, so as to ensure the integrity of the second preset voice data sex.

进一步地，所述根据所述丢包语音帧的前一个所述语音帧的状态信息确定出替代语音帧，并将所述第二预设语音数据中的所述丢包语音帧替换为所述替代语音帧之后，还包括：Further, the replacement voice frame is determined according to the state information of the previous voice frame of the packet loss voice frame, and the packet loss voice frame in the second preset voice data is replaced with the After replacing the voice frame, it also includes:

将包含所述替代语音帧的所述第二预设语音数据重新转换为新的文本数据，并将所述新的文本数据与所述预设模板对应的所述匹配内容进行匹配。The second preset voice data including the replacement voice frame is converted back into new text data, and the new text data is matched with the matching content corresponding to the preset template.

在本实施例中可参照步骤S60进一步地将所述新的文本数据与所述预设模板对应的所述匹配内容进行匹配，之后，可确保修复后的第二预设语音数据是否还出现预设丢包原因对应的丢包现象，在未出现预设丢包原因对应的丢包现象时，保证了该通信过程能再延续运行；在再次出现预设丢包原因对应的丢包现象时，再次以预设提示方式提示预设人员第二预设语音数据有误，以提示预设人员对该问题进行下一步处理（比如人工处理）。In this embodiment, referring to step S60, the new text data can be further matched with the matching content corresponding to the preset template. After that, it can be ensured whether the repaired second preset voice data still has a preset Set the packet loss phenomenon corresponding to the packet loss reason. When the packet loss phenomenon corresponding to the preset packet loss reason does not occur, the communication process can continue to run; when the packet loss phenomenon corresponding to the preset packet loss reason occurs again, The preset prompt method is used to prompt the preset personnel that the second preset voice data is incorrect, so as to prompt the preset personnel to perform the next step of processing the problem (such as manual processing).

进一步地，所述步骤S70之后，还包括：Further, after the step S70, it further includes:

调用预设数量的外呼组件，并通过所述外呼组件向所述待测主机的所述待测服务接口发送通信连接指令，并根据所述待测服务接口的当前通信负载信息确定所述待测服务接口当前可负荷的最大外呼数量，并根据所述最大外呼数量确定通过所有所述待测服务接口建立与所述待测主机的所述外呼组件的数量。Call a preset number of outbound call components, and send a communication connection instruction to the service interface under test of the host under test through the outbound call component, and determine the current communication load information of the service interface under test The maximum number of outbound calls currently loadable by the service interface to be tested, and the number of outbound call components established with the host under test through all the service interfaces to be tested is determined according to the maximum number of outbound calls.

可理解地，待测主机的通信负载信息是指待测主机所有的待测服务接口能同时负载多少数量外呼组件的运行。在本实施例中确定出的最大外呼数量能使待测主机维持在一个最优的运行状态，避免影响待测主机的运行效率，也可同时为最大数量的咨询用户（一个外呼组件可代表一个客户)进行更优的通信连接服务。Understandably, the communication load information of the host to be tested refers to how many outbound components can be simultaneously loaded by all the service interfaces of the host to be tested. The maximum number of outbound calls determined in this embodiment can maintain the host under test in an optimal operating state, avoid affecting the operating efficiency of the host under test, and can also serve as the largest number of consulting users at the same time (one outbound component can be On behalf of a customer) for better communication connection services.

综上所述，上述提供了一种拨测方法，通过音频端点检测持续能量值来证明主机声卡所在的待测主机是否能被呼成功，从而确保待测主机的被呼功能；通过音频端点监听待测主机（可被理解成一种虚拟的语音机器人）在通信连接过程中是否能进行正常的通语音服务，从而能确保待测主机可以进行正常的对话服务；通过预设的背景频率范围验证第一采样频率是否达标，从而能确保第一采样频率可以维持在一个相对于人类人耳可以接受的频率范围中（过高过低的第一采样频率以及背景频率的干扰都会影响到人耳的接受程度），方便在用户与待测主机进行通信过程中，可提高用户的体验效果。总的来说，本申请能确保待测主机的通话服务质量。In summary, the above provides a dial test method, which uses the audio endpoint to detect the continuous energy value to prove whether the host under test where the host sound card is located can be called successfully, so as to ensure the called function of the host under test; monitor through the audio endpoint Whether the host under test (can be understood as a virtual voice robot) can perform normal voice services during the communication connection process, so as to ensure that the host under test can perform normal conversation services; verify the first through the preset background frequency range A sampling frequency is up to the standard, so as to ensure that the first sampling frequency can be maintained in a frequency range that is acceptable to human ears (too high or low first sampling frequency and interference from background frequencies will affect the acceptance of human ears Degree), to facilitate the communication between the user and the host under test, which can improve the user experience. In general, this application can ensure the call service quality of the host under test.

应理解，上述实施例中各步骤的序号的大小并不意味着执行顺序的先后，各过程的执行顺序应以其功能和内在逻辑确定，而不应对本申请实施例的实施过程构成任何限定。It should be understood that the size of the sequence number of each step in the foregoing embodiment does not mean the order of execution. The execution sequence of each process should be determined by its function and internal logic, and should not constitute any limitation on the implementation process of the embodiment of the present application.

To

在一实施例中，提供一种拨测装置，该拨测装置与上述实施例中拨测方法一一对应。如图3所示，该拨测装置包括发送模块11、获取模块12、触发模块13、监听模块14、采集模块15、匹配模块16和第一确定模块17。各功能模块详细说明如下：In one embodiment, a dial test device is provided, and the dial test device corresponds to the dial test method in the above-mentioned embodiment one-to-one. As shown in FIG. 3, the dial test device includes a sending module 11, an acquisition module 12, a trigger module 13, a monitoring module 14, an acquisition module 15, a matching module 16 and a first determination module 17. The detailed description of each functional module is as follows:

发送模块11，用于调用待测主机的待测服务接口，获取所述待测服务接口的唯一标识，并通过所述唯一标识向所述待测服务接口发送通信连接指令；The sending module 11 is configured to call the service interface under test of the host under test, obtain the unique identifier of the service interface under test, and send a communication connection instruction to the service interface under test through the unique identifier;

获取模块12，用于监听所述待测主机的主机声卡，并获取所述待测主机的主机声卡在预设时间阈值内反馈的持续能量值；The obtaining module 12 is configured to monitor the host sound card of the host under test, and obtain the continuous energy value fed back by the host sound card of the host under test within a preset time threshold;

触发模块13，用于在所述主机声卡在预设时间阈值内反馈的所述持续能量值均位于预设能量范围之内时，发送接听指令至所述待测服务接口，以通过所述待测服务接口与所述待测主机建立通信连接；The trigger module 13 is configured to send an answering instruction to the service interface to be tested when the sustained energy value fed back by the host sound card within a preset time threshold is within the preset energy range, so as to pass the waiting The test service interface establishes a communication connection with the host to be tested;

监听模块14，用于通过音频端点监听所述待测主机在通信连接过程中是否发出第一预设语音数据；The monitoring module 14 is configured to monitor whether the host to be tested sends out the first preset voice data during the communication connection process through the audio endpoint;

采集模块15，用于在所述待测主机发出所述第一预设语音数据时，采集所述第一预设语音数据的第一采样频率，并根据预设的背景频率范围验证所述第一采样频率是否达标；The collection module 15 is configured to collect the first sampling frequency of the first preset voice data when the host to be tested emits the first preset voice data, and verify the first sampling frequency according to the preset background frequency range 1. Whether the sampling frequency meets the standard;

匹配模块16，用于在所述第一采样频率达标时，按照预设模板发送与所述第一预设语音数据对应的语音回复至所述待测主机，获取所述待测主机针对所述语音回复所反馈的第二预设语音数据，将所述第二预设语音数据转换为文本数据，并将所述文本数据与所述预设模板对应的匹配内容进行匹配；The matching module 16 is configured to send a voice response corresponding to the first preset voice data to the host under test according to a preset template when the first sampling frequency meets the standard, and obtain that the host under test responds to the Voice reply to the second preset voice data fed back, convert the second preset voice data into text data, and match the text data with matching content corresponding to the preset template;

第一确定模块17，用于在所有所述文本数据与所述预设模板对应的所述匹配内容匹配成功时，确定所述待测主机反馈的所述第二预设语音数据无误。The first determining module 17 is configured to determine that the second preset voice data fed back by the host to be tested is correct when all the text data and the matching content corresponding to the preset template are successfully matched.

进一步地，所述拨测装置还包括：Further, the dial test device further includes:

验证模块，用于获取所述待测主机发出所述第一预设语音数据时的背景声音，采集所述背景语音的第二采样频率，通过所述第二采样频率确定所述背景频率范围。The verification module is configured to obtain the background sound when the host under test emits the first preset voice data, collect the second sampling frequency of the background voice, and determine the background frequency range by the second sampling frequency.

进一步地，所述匹配模块包括：Further, the matching module includes:

匹配子模块，用于获取模拟人工组件，并以所述模拟人工组件针对所述待测主机的所述第一预设语音数据完成语音回复的时间点为所述待测主机反馈的未处理语音数据的开始时间点，以所述模拟人工组件针对所述未处理语音数据开始进行下一次语音回复的时间点为所述待测主机反馈的所述未处理语音数据的结束时间点，获取位于所述开始时间点和所述结束时间点之间的所述未处理语音数据；所述未处理语音数据包含噪声数据段；The matching sub-module is used to obtain an analog manual component, and use the time point when the analog manual component completes a voice response to the first preset voice data of the host under test as the unprocessed voice feedback from the host under test The start time point of the data, the time point at which the simulated manual component starts the next voice response for the unprocessed voice data is the end time point of the unprocessed voice data fed back by the host under test, and the location is acquired The unprocessed voice data between the start time point and the end time point; the unprocessed voice data includes a noise data segment;

记录子模块，用于确定所述未处理语音数据中的所有语音段的频谱，根据所述语音段的频谱对所述未处理语音数据中的所述噪声数据段进行清除，将所述待测主机针对所述语音回复所反馈的未包含所述噪声数据段的所述未处理语音数据记录为第二预设语音数据。The recording sub-module is used to determine the frequency spectrum of all voice segments in the unprocessed voice data, clear the noise data segment in the unprocessed voice data according to the frequency spectrum of the voice segment, and remove the to-be-tested The unprocessed voice data that does not include the noise data segment that is fed back by the host for the voice response is recorded as second preset voice data.

检测模块，用于实时采集所述第二预设语音数据的第三采样频率，并检测所述第三采样频率是否落入所述背景频率范围；The detection module is configured to collect the third sampling frequency of the second preset voice data in real time, and detect whether the third sampling frequency falls within the background frequency range;

调制模块，用于在所述第二预设语音数据对应的所述第三采样频率未落入所述背景频率范围时，调用频率调制组件并通过所述频率调制组件对通信连接进行频率调制。The modulation module is configured to call a frequency modulation component and perform frequency modulation on the communication connection through the frequency modulation component when the third sampling frequency corresponding to the second preset voice data does not fall within the background frequency range.

提示模块，用于在所述文本数据与所述预设模板对应的所述匹配内容匹配失败时，以预设提示方式发出第二预设语音数据有误的提示信息至预设人员终端。The prompt module is configured to send a prompt message indicating that the second preset voice data is incorrect in a preset prompt manner to a preset personnel terminal when the text data fails to match the matching content corresponding to the preset template.

第二确定模块，用于根据所述文本数据确定所述待测主机的第二预设语音数据是否出现与预设丢包原因对应的丢包现象；The second determining module is configured to determine, according to the text data, whether the second preset voice data of the host to be tested has a packet loss phenomenon corresponding to a preset packet loss cause;

第三确定模块，用于在出现与预设丢包原因对应的丢包现象时，对所述待测主机反馈的所述第二预设语音数据进行解码，并通过解码得到的解码信息确定出所述第二预设语音数据中发生丢包的丢包语音帧；The third determining module is configured to decode the second preset voice data fed back by the host to be tested when a packet loss phenomenon corresponding to a preset packet loss cause occurs, and determine the result from the decoding information obtained by the decoding Packet loss voice frames in the second preset voice data where packet loss occurs;

替换模块，用于根据所述丢包语音帧的前一个所述语音帧的状态信息确定出替代语音帧，并将所述第二预设语音数据中的所述丢包语音帧替换为所述替代语音帧。The replacement module is configured to determine a replacement voice frame according to the state information of the voice frame before the packet loss voice frame, and replace the packet loss voice frame in the second preset voice data with the Replace speech frame.

第四确定模块，用于调用预设数量的外呼组件，并通过所述外呼组件向所述待测主机的所述待测服务接口发送通信连接指令，并根据所述待测服务接口的当前通信负载信息确定所述待测服务接口当前可负荷的最大外呼数量，并根据所述最大外呼数量确定通过所有所述待测服务接口建立与所述待测主机的所述外呼组件的数量。The fourth determining module is configured to call a preset number of outbound call components, and send a communication connection instruction to the service interface under test of the host under test through the outbound call component, and according to the service interface under test The current communication load information determines the maximum number of outbound calls that the service interface under test can currently load, and determines, according to the maximum number of outbound calls, to establish the outbound call component with the host under test through all the service interfaces under test quantity.

关于拨测装置的具体限定可以参见上文中对于拨测方法的限定，在此不再赘述。上述拨测装置中的各个模块可全部或部分通过软件、硬件及其组合来实现。上述各模块可以硬件形式内嵌于或独立于计算机设备中的处理器中，也可以以软件形式存储于计算机设备中的存储器中，以便于处理器调用执行以上各个模块对应的操作。For the specific definition of the dial test device, please refer to the above definition of the dial test method, which will not be repeated here. Each module in the above dial test device can be implemented in whole or in part by software, hardware and a combination thereof. The above-mentioned modules may be embedded in the form of hardware or independent of the processor in the computer equipment, or may be stored in the memory of the computer equipment in the form of software, so that the processor can call and execute the operations corresponding to the above-mentioned modules.

To

在一个实施例中，提供了一种计算机设备，该计算机设备可以是服务器，其内部结构图可以如图4所示。该计算机设备包括通过系统总线连接的处理器、存储器、网络接口和数据库。其中，该计算机设备的处理器用于提供计算和控制能力。该计算机设备的存储器包括计算机可读指令、内存储器。该计算机可读指令存储有操作系统、计算机可读指令和数据库。该内存储器为计算机可读指令中的操作系统和计算机可读指令的运行提供环境。该计算机设备的数据库用于存储多条历史测试数据，每条历史测试数据对应有测试问题记录。该计算机设备的网络接口用于与外部的终端通过网络连接通信。该计算机可读指令被处理器执行时以实现一种拨测方法。理方法。本实施例所提供的可读存储介质包括非易失性可读存储介质和易失性可读存储介质。In one embodiment, a computer device is provided. The computer device may be a server, and its internal structure diagram may be as shown in FIG. 4. The computer equipment includes a processor, a memory, a network interface, and a database connected through a system bus. Among them, the processor of the computer device is used to provide calculation and control capabilities. The memory of the computer device includes computer readable instructions and internal memory. The computer readable instructions are stored with an operating system, computer readable instructions and a database. The internal memory provides an environment for the operation of the operating system and the computer-readable instructions in the computer-readable instructions. The database of the computer equipment is used to store multiple pieces of historical test data, and each piece of historical test data corresponds to a test problem record. The network interface of the computer device is used to communicate with an external terminal through a network connection. The computer readable instruction is executed by the processor to realize a dial test method.理method. The readable storage medium provided in this embodiment includes a non-volatile readable storage medium and a volatile readable storage medium.

在一个实施例中，提供了一种计算机设备，包括存储器、处理器及存储在存储器上并可在处理器上运行的计算机可读指令，处理器执行计算机可读指令时实现上述实施例中所述拨测方法。In one embodiment, a computer device is provided, including a memory, a processor, and computer-readable instructions stored on the memory and capable of running on the processor. The processor executes the computer-readable instructions to implement the above-mentioned embodiments. State the dial test method.

在一个实施例中，提供了一个或多个存储有计算机可读指令的可读存储介质，本实施例所提供的可读存储介质包括非易失性可读存储介质和易失性可读存储介质；该可读存储介质上存储有计算机可读指令，该计算机可读指令被一个或多个处理器执行时，使得一个或多个处理器实现上述实施例中所述拨测方法。In one embodiment, one or more readable storage media storing computer readable instructions are provided. The readable storage media provided in this embodiment include non-volatile readable storage media and volatile readable storage. Medium; the readable storage medium stores computer readable instructions, and when the computer readable instructions are executed by one or more processors, the one or more processors implement the dial test method described in the above embodiments.

本领域普通技术人员可以理解实现上述实施例方法中的全部或部分流程，是可以通过计算机可读指令来指令相关的硬件来完成，所述的计算机可读指令可存储于一非易失性计算机可读取存储介质或易失性可读存储介质中，该计算机可读指令在执行时，可包括如上述各方法的实施例的流程。其中，本申请所提供的各实施例中所使用的对存储器、存储、数据库或其它介质的任何引用，均可包括非易失性和/或易失性存储器。非易失性存储器可包括只读存储器（ROM）、可编程ROM（PROM）、电可编程ROM（EPROM）、电可擦除可编程ROM（EEPROM）或闪存。易失性存储器可包括随机存取存储器（RAM）或者外部高速缓冲存储器。作为说明而非局限，RAM以多种形式可得，诸如静态RAM（SRAM）、动态RAM（DRAM）、同步DRAM（SDRAM）、双数据率SDRAM（DDRSDRAM）、增强型SDRAM（ESDRAM）、同步链路（Synchlink） DRAM（SLDRAM）、存储器总线（Rambus）直接RAM（RDRAM）、直接存储器总线动态RAM（DRDRAM）、以及存储器总线动态RAM（RDRAM）等。A person of ordinary skill in the art can understand that all or part of the processes in the methods of the foregoing embodiments can be implemented by computer-readable instructions to instruct relevant hardware. The computer-readable instructions can be stored in a non-volatile computer. In a readable storage medium or a volatile readable storage medium, when the computer readable instruction is executed, it may include the processes of the above-mentioned method embodiments. Wherein, any reference to memory, storage, database, or other media used in the embodiments provided in this application may include non-volatile and/or volatile memory. Non-volatile memory may include read-only memory (ROM), programmable ROM (PROM), electrically programmable ROM (EPROM), electrically erasable programmable ROM (EEPROM), or flash memory. Volatile memory may include random access memory (RAM) or external cache memory. As an illustration and not a limitation, RAM is available in many forms, such as static RAM (SRAM), dynamic RAM (DRAM), synchronous DRAM (SDRAM), double data rate SDRAM (DDRSDRAM), enhanced SDRAM (ESDRAM), synchronous chain Channel (Synchlink) DRAM (SLDRAM), memory bus (Rambus) direct RAM (RDRAM), direct memory bus dynamic RAM (DRDRAM), and memory bus dynamic RAM (RDRAM), etc.

所属领域的技术人员可以清楚地了解到，为了描述的方便和简洁，仅以上述各功能单元、模块的划分进行举例说明，实际应用中，可以根据需要而将上述功能分配由不同的功能单元、模块完成，即将所述装置的内部结构划分成不同的功能单元或模块，以完成以上描述的全部或者部分功能。Those skilled in the art can clearly understand that, for the convenience and conciseness of description, only the division of the above functional units and modules is used as an example. In practical applications, the above functions can be allocated to different functional units and modules as needed. Module completion, that is, the internal structure of the device is divided into different functional units or modules to complete all or part of the functions described above.

The above-mentioned embodiments are only used to illustrate the technical solutions of the present application, not to limit them; although the present application has been described in detail with reference to the foregoing embodiments, those of ordinary skill in the art should understand that they can still implement the foregoing The technical solutions recorded in the examples are modified, or some of the technical features are equivalently replaced; and these modifications or replacements do not cause the essence of the corresponding technical solutions to deviate from the spirit and scope of the technical solutions of the embodiments of the application, and should be included in Within the scope of protection of this application .

Claims

A dial test method, which includes:

Call the service interface to be tested of the host to be tested, obtain the unique identifier of the service interface to be tested, and send a communication connection instruction to the service interface to be tested through the unique identifier;

Monitor the host sound card of the host under test, and obtain the continuous energy value fed back by the host sound card of the host under test within a preset time threshold;

When the sustained energy value fed back by the host sound card within a preset time threshold is within the preset energy range, an answering instruction is sent to the service interface under test, so as to communicate with the service interface under test through the service interface under test. The host to be tested establishes a communication connection;

Monitor through the audio endpoint whether the host under test sends out the first preset voice data during the communication connection process;

When the host under test sends out the first preset voice data, collecting the first sampling frequency of the first preset voice data, and verifying whether the first sampling frequency meets the standard according to the preset background frequency range;

When the first sampling frequency reaches the standard, send a voice response corresponding to the first preset voice data to the host under test according to a preset template, and obtain the first feedback from the host under test for the voice response 2. preset voice data, converting the second preset voice data into text data, and matching the text data with matching content corresponding to the preset template;

When all the text data and the matching content corresponding to the preset template are successfully matched, it is determined that the second preset voice data fed back by the host to be tested is correct.
The dial test method according to claim 1, wherein, after the host under test sends out the first preset voice data, the method further comprises:

Acquire a background sound when the host under test emits the first preset voice data, collect a second sampling frequency of the background voice, and determine the background frequency range by the second sampling frequency.
The dial test method according to claim 1, wherein the obtaining the second preset voice data fed back by the host under test for the voice response comprises:

Obtain a simulated manual component, and use the time point when the simulated manual component completes a voice response to the first preset voice data of the host under test as the start time point of the unprocessed voice data fed back by the host under test, Taking the time point when the simulated manual component starts the next voice response to the unprocessed voice data as the end time point of the unprocessed voice data fed back by the host to be tested, it is acquired at the start time point and the time point. The unprocessed voice data between the end time points; the unprocessed voice data includes a noise data segment;

Determine the spectrum of all voice segments in the unprocessed voice data, clear the noise data segment in the unprocessed voice data according to the spectrum of the voice segment, and send the host under test to the voice response The feedback unprocessed voice data that does not include the noise data segment is recorded as second preset voice data.
The dial test method according to claim 1, wherein after obtaining the second preset voice data fed back by the host under test for the voice response, the method further comprises:

Collecting the third sampling frequency of the second preset voice data in real time, and detecting whether the third sampling frequency falls within the background frequency range;

When the third sampling frequency corresponding to the second preset voice data does not fall within the background frequency range, a frequency modulation component is called and the communication connection is frequency modulated through the frequency modulation component.
The dial test method according to claim 1, wherein after the matching the text data with the matching content corresponding to the preset template, the method further comprises:

When the text data fails to match the matching content corresponding to the preset template, a preset prompt manner is used to send a prompt message indicating that the second preset voice data is incorrect to the preset personnel terminal.
The dial test method according to claim 5, wherein after the matching content corresponding to the text data and the preset template fails to match, the method further comprises:

Determining, according to the text data, whether the second preset voice data of the host to be tested has a packet loss phenomenon corresponding to a preset packet loss cause;

When a packet loss phenomenon corresponding to a preset packet loss cause occurs, decode the second preset voice data fed back by the host to be tested, and determine the second preset voice based on the decoded information obtained by decoding Packet loss voice frames with packet loss in the data;

Determine a replacement voice frame according to the state information of the previous voice frame of the packet loss voice frame, and replace the packet loss voice frame in the second preset voice data with the replacement voice frame.
The dial test method according to claim 1, wherein after determining that the second preset voice data fed back by the host to be tested is correct, the method further comprises:

Call a preset number of outbound call components, and send a communication connection instruction to the service interface under test of the host under test through the outbound call component, and determine the current communication load information of the service interface under test The maximum number of outbound calls currently loadable by the service interface to be tested, and the number of outbound call components established with the host under test through all the service interfaces to be tested is determined according to the maximum number of outbound calls.
A dial test device, which includes:

A sending module, configured to call the service interface under test of the host under test, obtain the unique identifier of the service interface under test, and send a communication connection instruction to the service interface under test through the unique identifier;

An obtaining module, configured to monitor the host sound card of the host under test, and obtain the continuous energy value fed back by the host sound card of the host under test within a preset time threshold;

The trigger module is configured to send an answering instruction to the service interface under test when the sustained energy value fed back by the host sound card within a preset time threshold remains within the preset energy range, so as to pass the test The service interface establishes a communication connection with the host to be tested;

The monitoring module is configured to monitor whether the host under test sends out the first preset voice data during the communication connection process through the audio endpoint;

The collection module is configured to collect the first sampling frequency of the first preset voice data when the host under test sends out the first preset voice data, and verify the first sampling frequency according to the preset background frequency range. Whether the sampling frequency meets the standard;

The matching module is configured to send a voice response corresponding to the first preset voice data to the host under test according to a preset template when the first sampling frequency reaches the standard, and obtain the host under test for the voice Replying to the second preset voice data fed back, converting the second preset voice data into text data, and matching the text data with matching content corresponding to the preset template;

The first determining module is configured to determine that the second preset voice data fed back by the host to be tested is correct when all the text data and the matching content corresponding to the preset template are successfully matched.
A computer device includes a memory, a processor, and computer-readable instructions that are stored in the memory and can run on the processor, wherein the processor implements the following steps when the processor executes the computer-readable instructions:

Call the service interface to be tested of the host to be tested, obtain the unique identifier of the service interface to be tested, and send a communication connection instruction to the service interface to be tested through the unique identifier;

Monitor the host sound card of the host under test, and obtain the continuous energy value fed back by the host sound card of the host under test within a preset time threshold;

When the sustained energy value fed back by the host sound card within a preset time threshold is within the preset energy range, an answering instruction is sent to the service interface under test, so as to communicate with the service interface under test through the service interface under test. The host to be tested establishes a communication connection;

Monitor through the audio endpoint whether the host under test sends out the first preset voice data during the communication connection process;

When the host under test sends out the first preset voice data, collecting the first sampling frequency of the first preset voice data, and verifying whether the first sampling frequency meets the standard according to the preset background frequency range;

When the first sampling frequency reaches the standard, send a voice response corresponding to the first preset voice data to the host under test according to a preset template, and obtain the first feedback from the host under test for the voice response 2. preset voice data, converting the second preset voice data into text data, and matching the text data with matching content corresponding to the preset template;

When all the text data and the matching content corresponding to the preset template are successfully matched, it is determined that the second preset voice data fed back by the host to be tested is correct.
9. The computer device according to claim 9, wherein the following steps are further implemented when the processor executes the computer-readable instruction after the host under test sends out the first preset voice data:

Acquire a background sound when the host under test emits the first preset voice data, collect a second sampling frequency of the background voice, and determine the background frequency range by the second sampling frequency.
9. The computer device according to claim 9, wherein said acquiring the second preset voice data fed back by said host under test for said voice response comprises:

Obtain a simulated manual component, and use the time point when the simulated manual component completes a voice response to the first preset voice data of the host under test as the start time point of the unprocessed voice data fed back by the host under test, Taking the time point when the simulated manual component starts the next voice response to the unprocessed voice data as the end time point of the unprocessed voice data fed back by the host to be tested, it is acquired at the start time point and the time point. The unprocessed voice data between the end time points; the unprocessed voice data includes a noise data segment;

Determine the spectrum of all voice segments in the unprocessed voice data, clear the noise data segment in the unprocessed voice data according to the spectrum of the voice segment, and send the host under test to the voice response The feedback unprocessed voice data that does not include the noise data segment is recorded as second preset voice data.
The computer device according to claim 9, wherein, after the acquisition of the second preset voice data fed back by the host under test for the voice response, when the processor executes the computer-readable instruction, it also implements The following steps:

Collecting the third sampling frequency of the second preset voice data in real time, and detecting whether the third sampling frequency falls within the background frequency range;

When the third sampling frequency corresponding to the second preset voice data does not fall within the background frequency range, a frequency modulation component is called and the communication connection is frequency modulated through the frequency modulation component.
9. The computer device according to claim 9, wherein, after the matching content corresponding to the preset template is matched between the text data, the processor further implements the following steps when executing the computer-readable instruction:

When the text data fails to match the matching content corresponding to the preset template, a preset prompt manner is used to send a prompt message indicating that the second preset voice data is incorrect to the preset personnel terminal.
15. The computer device according to claim 13, wherein, after the text data fails to match the matching content corresponding to the preset template, the processor further implements the following steps when executing the computer readable instruction:

Determining, according to the text data, whether the second preset voice data of the host to be tested has a packet loss phenomenon corresponding to a preset packet loss cause;

When a packet loss phenomenon corresponding to a preset packet loss cause occurs, decode the second preset voice data fed back by the host to be tested, and determine the second preset voice based on the decoded information obtained by decoding Packet loss voice frames with packet loss in the data;

Determine a replacement voice frame according to the state information of the previous voice frame of the packet loss voice frame, and replace the packet loss voice frame in the second preset voice data with the replacement voice frame.
One or more readable storage media storing computer readable instructions, where when the computer readable instructions are executed by one or more processors, the one or more processors execute the following steps:

Call the service interface to be tested of the host to be tested, obtain the unique identifier of the service interface to be tested, and send a communication connection instruction to the service interface to be tested through the unique identifier;

Monitor the host sound card of the host under test, and obtain the continuous energy value fed back by the host sound card of the host under test within a preset time threshold;

When the sustained energy value fed back by the host sound card within a preset time threshold is within the preset energy range, an answering instruction is sent to the service interface under test, so as to communicate with the service interface under test through the service interface under test. The host to be tested establishes a communication connection;

Monitor through the audio endpoint whether the host under test sends out the first preset voice data during the communication connection process;

When the host under test sends out the first preset voice data, collecting the first sampling frequency of the first preset voice data, and verifying whether the first sampling frequency meets the standard according to the preset background frequency range;

When the first sampling frequency reaches the standard, send a voice response corresponding to the first preset voice data to the host under test according to a preset template, and obtain the first feedback from the host under test for the voice response 2. preset voice data, converting the second preset voice data into text data, and matching the text data with matching content corresponding to the preset template;

When all the text data and the matching content corresponding to the preset template are successfully matched, it is determined that the second preset voice data fed back by the host to be tested is correct.
The readable storage medium according to claim 15, wherein, after the computer-readable instruction is executed by one or more processors after the host under test sends out the first preset voice data, So that the one or more processors further execute the following steps:

Acquire a background sound when the host under test emits the first preset voice data, collect a second sampling frequency of the background voice, and determine the background frequency range by the second sampling frequency.
15. The readable storage medium according to claim 15, wherein said acquiring the second preset voice data fed back by said host under test for said voice response comprises:

Obtain a simulated manual component, and use the time point when the simulated manual component completes a voice response to the first preset voice data of the host under test as the start time point of the unprocessed voice data fed back by the host under test, Taking the time point when the simulated manual component starts the next voice response to the unprocessed voice data as the end time point of the unprocessed voice data fed back by the host to be tested, it is acquired at the start time point and the time point. The unprocessed voice data between the end time points; the unprocessed voice data includes a noise data segment;

Determine the spectrum of all voice segments in the unprocessed voice data, clear the noise data segment in the unprocessed voice data according to the spectrum of the voice segment, and send the host under test to the voice response The feedback unprocessed voice data that does not include the noise data segment is recorded as second preset voice data.
The readable storage medium according to claim 15, wherein, after obtaining the second preset voice data fed back by the host under test for the voice response, the computer-readable instruction is processed by one or more When the processor executes, the one or more processors further execute the following steps:

Collecting the third sampling frequency of the second preset voice data in real time, and detecting whether the third sampling frequency falls within the background frequency range;

When the third sampling frequency corresponding to the second preset voice data does not fall within the background frequency range, a frequency modulation component is called and the communication connection is frequency modulated through the frequency modulation component.
The readable storage medium according to claim 15, wherein, after the matching content corresponding to the preset template is matched with the text data, when the computer-readable instructions are executed by one or more processors , So that the one or more processors further execute the following steps:

When the text data fails to match the matching content corresponding to the preset template, a preset prompt manner is used to send a prompt message indicating that the second preset voice data is incorrect to the preset personnel terminal.
The readable storage medium according to claim 19, wherein the computer-readable instructions are executed by one or more processors after the text data fails to match the matching content corresponding to the preset template. , So that the one or more processors further execute the following steps:

Determining, according to the text data, whether the second preset voice data of the host to be tested has a packet loss phenomenon corresponding to a preset packet loss cause;

When a packet loss phenomenon corresponding to a preset packet loss cause occurs, decode the second preset voice data fed back by the host to be tested, and determine the second preset voice based on the decoded information obtained by decoding Packet loss voice frames with packet loss in the data;

Determine a replacement voice frame according to the state information of the previous voice frame of the packet loss voice frame, and replace the packet loss voice frame in the second preset voice data with the replacement voice frame.