WO2014173342A1 - Method and device for identifying automatic response of calling system - Google Patents

Method and device for identifying automatic response of calling system Download PDF

Info

Publication number
WO2014173342A1
WO2014173342A1 PCT/CN2014/077731 CN2014077731W WO2014173342A1 WO 2014173342 A1 WO2014173342 A1 WO 2014173342A1 CN 2014077731 W CN2014077731 W CN 2014077731W WO 2014173342 A1 WO2014173342 A1 WO 2014173342A1
Authority
WO
WIPO (PCT)
Prior art keywords
response
data packet
audio data
preset
automatic
Prior art date
Application number
PCT/CN2014/077731
Other languages
French (fr)
Chinese (zh)
Inventor
张伟
刘澍
张武雄
Original Assignee
中兴通讯股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 中兴通讯股份有限公司 filed Critical 中兴通讯股份有限公司
Publication of WO2014173342A1 publication Critical patent/WO2014173342A1/en

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M3/00Automatic or semi-automatic exchanges
    • H04M3/22Arrangements for supervision, monitoring or testing
    • H04M3/26Arrangements for supervision, monitoring or testing with means for applying test signals or for measuring
    • H04M3/28Automatic routine testing ; Fault testing; Installation testing; Test methods, test equipment or test arrangements therefor
    • H04M3/30Automatic routine testing ; Fault testing; Installation testing; Test methods, test equipment or test arrangements therefor for subscriber's lines, for the local loop

Definitions

  • the present invention relates to the field of communications, and in particular, to an identification method and apparatus for automatic answering of a call system. Background technique
  • the NGCC (Next Generation Call Center) call system has broad market prospects in the outsourcing industry and overseas.
  • the call system generally responds with automatic voice response through the answering device.
  • the call system includes a control device that controls a business process in the call system, the call system further includes an identification server that provides media processing functions in the basic and enhanced services for all audio and video related media processing, including video and Audio RTP (Real Time Transport Protocol) The conversion of data streams to video and audio files.
  • an identification server that provides media processing functions in the basic and enhanced services for all audio and video related media processing, including video and Audio RTP (Real Time Transport Protocol) The conversion of data streams to video and audio files.
  • DTMF Dual Tone Multi-Frequency
  • It has a SIP (Text-Based Protocol) protocol and MSML (Media Session Markup Language and Media Object Markup Language) MOML capabilities that enable it to interact with users throughout the session process under the control of the application server APP.
  • the identification server can recognize the called response, the called busy, and no response.
  • automatic answering devices including, for example, modems, faxes, telephone messages, voice mail, and secretarial desks that need to recognize the automatic answering device response.
  • existing identification servers are unable to identify the portion of the called response that is automatically answered. Summary of the invention
  • the embodiments of the present invention mainly provide a method and device for identifying an automatic response of a call system.
  • An embodiment of the present invention provides a method for identifying an automatic response of a call system, the method comprising: when receiving a response identification request sent by a response device in a call system, the identification server establishes an audio communication channel with the response device;
  • the recognition server receives the response audio data packet sent by the response device based on the established audio communication channel
  • the identification server analyzes the response audio data packet to determine whether the response audio data packet satisfies a preset automatic response parameter, and when determining that the response audio data packet satisfies a preset automatic response parameter, the identification server determines the response The voice response type corresponding to the audio data packet is an automatic response.
  • the present invention also provides an identification device for automatically answering a call system, the device comprising: a data transmission module configured to: when receiving a response identification request sent by a response device in the call system, the identification server establishes with the response device Audio communication channel; and
  • An identification module configured to analyze the response audio data packet, to determine whether the response audio data packet meets a preset automatic response parameter, and when determining that the response audio data packet meets a preset automatic response parameter, determining the The voice response type corresponding to the response audio data packet is an automatic response.
  • the identification server receives the response audio data packet sent by the response device by establishing an audio communication channel with the answering device in the calling system; and the identification server analyzes the response audio data packet to determine Whether the response audio data packet satisfies a preset automatic response parameter, and determines that the response audio data packet satisfies a preset automatic response When the parameter is, the identification server determines that the voice response type corresponding to the response audio data packet is an automatic response.
  • the realization realizes the automatic response in the called response, so that the control device in the calling system controls the business process of the calling system according to the recognition result.
  • FIG. 1 is a specific flowchart of a first embodiment of a method for identifying an automatic response of a call system according to the present invention
  • FIG. 2 is a specific flowchart of a second embodiment of a method for identifying an automatic response of a call system according to the present invention
  • FIG. 3 is a specific flowchart of a third embodiment of a method for identifying an automatic response of a call system according to the present invention.
  • FIG. 4 is a specific flowchart of a fourth embodiment of a method for identifying an automatic response of a call system according to the present invention.
  • FIG. 5 is a specific flowchart of a first embodiment of an apparatus for identifying an automatic answering system of a call system according to the present invention
  • Fig. 6 is a detailed flow chart showing a second embodiment of the apparatus for automatically answering the call system of the present invention.
  • FIG. 1 it is a specific flowchart of the first embodiment of the method for identifying the automatic response of the call system of the present invention.
  • the identification server receives, by the answering device, a response identification request sent by the answering device, the identification server establishes an audio communication channel with the answering device; and based on the established audio communication channel, the identifying server receives the answering audio data packet sent by the answering device The identification server analyzes the response audio data packet to determine whether the response audio data packet satisfies a preset automatic response parameter, and when determining that the response audio data packet satisfies a preset automatic response parameter, the identification server determines the The voice response type corresponding to the response audio data packet is an automatic response.
  • Step S11 When receiving the response identification request sent by the answering device in the calling system, the identification server establishes an audio communication channel with the answering device.
  • the call flow of the call system is started, the answering device in the call system receives the media from the phone user, and sends a called response audio data packet based on the received media, and the answering device sends a response identification request to the identification server, when Upon receiving the response identification request, the identification server establishes an audio communication channel with the answering device.
  • the answering device in the calling system includes at least one communication channel for receiving media from a telephone user; the communication channel for receiving media from the telephone user corresponds to a communication channel provided with a response audio data packet; Establishing an audio communication channel with the response device; for example, when the response device receives the media of the telephone user through the A communication channel, the response device transmits the called response audio data packet based on the received media,
  • the communication channel sent by the response audio data packet is a B communication channel set with the A communication channel mapping; the response device sends a response identification request to the identification server, and when receiving the response identification request, the identification server establishes the response device Audio communication channel of the B communication channel; as described above, the identification server is responsive to the number of communication channels that the response device answers to the audio data packet - corresponding to the establishment of the response device Audio communication channel.
  • Step S12 Based on the established audio communication channel, the identification server receives the response audio data packet sent by the response device.
  • Step S13 The identification server analyzes the response audio data packet to determine whether the response audio data packet satisfies a preset automatic response parameter.
  • Step S14 When it is determined that the response audio data packet meets the preset automatic response parameter, the identification server determines that the voice response type corresponding to the response audio data packet is an automatic response.
  • the identification server receives the response audio data packet sent by the response device, and the identification server analyzes the response audio data packet to determine the response. Whether the audio data packet meets the preset automatic response parameter, and when determining that the response audio data packet meets the preset automatic response parameter, the identification server determines that the voice response type corresponding to the response audio data packet is an automatic response; When the response audio data packet does not satisfy the preset automatic response parameter, the identification server determines that the voice response type corresponding to the response audio data packet is a manual response.
  • the preset automatic response parameter may be a preset mute reference ratio, or may be a preset continuous mute reference duration or a preset continuous speech reference duration, and the like. In the other embodiments of the present invention, the preset automatic response parameter may also be a mute reference ratio or a continuous mute reference duration or a preset time period.
  • the continuous voice reference duration, etc., the preset time may be an applicable duration set by a user such as 30s or 40s in advance.
  • an audio communication channel C is established between the identification server and the response communication channel B of the response device, and the identification server receives the response audio data packet D from the response communication channel B in the C audio communication channel receiving call system, the pre- The set auto-answer parameter takes the preset mute reference ratio as an example.
  • the preset mute reference ratio is 25%.
  • the recognition server analyzes the response audio data packet D to obtain the mute ratio in the response audio data packet D. If the obtained mute ratio in the response audio packet D is 20%, the obtained mute ratio is 20%.
  • the recognition server determines that the response audio data packet D satisfies the preset automatic response parameter, that is, the recognition server determines that the voice response type corresponding to the response audio data packet of the silence ratio is 20% is automatic.
  • the identification server determines that the response audio data packet D does not satisfy the preset automatic answering parameters, i.e., identifying the other server 1 J determines the ratio of 30% mute audio packet corresponding to the response voice response type manual answer.
  • the identification server sends the recognition result of the voice response type to the automatic response to the control device in the call system, so that the control device controls the business process of the call system according to the recognition result.
  • the identification server receives the response audio data packet sent by the response device by establishing an audio communication channel with the answering device in the calling system; the identification server analyzes the response audio data packet to determine whether the response audio data packet satisfies
  • the automatic answering parameter is configured to: when determining that the response audio data packet meets the preset automatic response parameter, the identification server determines that the voice response type corresponding to the response audio data packet is an automatic response.
  • the realization recognizes the automatic response in the called response, so that the control device in the calling system controls the business process of the calling system according to the recognition result.
  • FIG. 2 it is a specific flowchart of the second embodiment of the method for identifying the automatic response of the call system of the present invention.
  • step S14 the method further includes:
  • Step S15 The identification server closes the audio communication channel, and performs deletion detection on the saved data, and deletes data that meets the preset deletion condition.
  • the identification server closes the response device.
  • the established audio communication channel, and the deleted data is deleted and detected, and the data that meets the preset deletion condition is deleted.
  • the response audio data packet D transmitted by the response device is received through the audio communication channel C, and the response audio data packet D is determined.
  • the identification server closes the audio channel channel C;
  • the preset deletion condition may be a reference storage duration for saving data, or may be other parameters for saving data set by the user in advance, the preset The deletion condition is taken as an example of the storage duration of the saved data. If the storage duration of the saved data is detected, if the storage duration of the saved data is greater than the reference storage duration, the found saved data is deleted, and the reference is deleted.
  • the save time can be 10 days or 15 days, or it can be other reference save time set by the user in advance.
  • the identification server when the preset time is reached, the identification server performs deletion detection on the saved data, and deletes data that meets the preset deletion condition, and is not limited to when the communication channel is closed.
  • the server performs the deletion detection on the saved data, and the preset time may be 1 day or 5 days, or may be any other time interval or time point set by the user in advance.
  • the established communication channel After receiving and recognizing the response audio data packet sent by the response device, the established communication channel is closed, P strives to lower the running load of the identification server, improves the running speed, and deletes the saved data of the identification server, and utilizes the identification reasonably
  • the storage space of the server increases the processing speed.
  • FIG. 3 it is a specific flowchart of the third embodiment of the method for identifying the automatic response of the call system of the present invention.
  • step S13 further includes:
  • Step S16 Identify the continuous mute duration of the audio obtained by the server from the response audio data packet.
  • Step S17 When the acquired continuous mute duration is less than the preset continuous mute reference duration, the identification server determines that the response audio data packet satisfies the preset auto-answer parameter.
  • the preset automatic response parameter is a preset continuous silent reference duration
  • the preset continuous mute reference duration can be set to 0.7s or 1.2s, or it can be the time interval value obtained by any other user through actual detection. Take the preset continuous mute reference duration of 0.7s as an example, identify the server analysis. Receiving the response audio data packet sent by the response device, and obtaining the longest continuous silent duration from the received response audio data packet, and if the longest continuous silent duration obtained is 0.6s, the longest acquisition time is obtained.
  • the continuous mute duration is 0.6s less than the preset continuous mute reference duration of 0.7s
  • the recognition server determines that the longest continuous mute duration of 0.6s response audio data packet satisfies the preset auto-answer parameter, that is, the identification server determines the longest continuous
  • the voice response type corresponding to the response audio packet with a silence duration of 0.6s is an automatic response. If the longest continuous silence duration is 1.6s, the obtained continuous silence duration is 1.6s longer than the continuous silent reference duration of 0.7s.
  • the server determines that the response audio packet with the longest continuous silent duration of 1.6s does not satisfy the preset automatic response parameter, that is, the identification service Determines the length of the audio 1.6s response packet corresponding to the type of artificial response voice response longest continuous silence.
  • the preset automatic response parameter may also be a preset continuous mute reference duration within a preset time period.
  • the data packet satisfies the preset automatic response parameter, and determines that the voice response type corresponding to the response audio data packet is an automatic response.
  • the realization recognizes the automatic response in the called response, so that the control device in the calling system controls the business process of the calling system according to the recognition result.
  • FIG. 4 it is a specific flowchart of a fourth embodiment of the method for identifying an automatic answering system of the present invention.
  • step S13 further includes:
  • Step S18 Identify a continuous voice duration that the server obtains audio from the response audio data packet;
  • Step S19 When the acquired continuous voice duration is greater than the preset continuous voice reference duration, the identification server determines that the response audio data packet meets the preset automatic response parameter.
  • the preset automatic response parameter is a preset continuous voice reference duration
  • the preset continuous voice reference duration may be set to 3.0s or 4.0s, or may be obtained by any other user through actual detection.
  • the identifier of the preset continuous voice reference duration is 3.0s
  • the identification server analyzes the received response audio data packet sent by the answering device in the calling system, and obtains the received response audio data packet from the received response audio data packet.
  • the longest continuous speech duration if the longest continuous speech duration is 3.6 s, the longest continuous speech duration obtained is 3.6 s greater than the preset continuous speech reference duration of 3.0 s, and the recognition server determines the longest continuous
  • the response audio data packet with a voice duration of 3.6 s satisfies the preset automatic response parameter, that is, the recognition server determines that the voice response type corresponding to the longest continuous voice duration of 3.6 s is an automatic response; if the longest acquisition is obtained
  • the continuous speech duration is 1.6s, and the continuous speech duration is 1.6s less than the continuous speech reference duration of 3.0s.
  • the device determines that the response audio packet with the longest continuous speech duration of 1.6s does not satisfy the preset automatic response parameter, that is, the recognition server determines that the voice response type corresponding to the longest continuous speech duration of 1.6s is artificial. Answer.
  • the preset automatic response parameter may also be a continuous voice reference duration within a preset time period.
  • the data packet satisfies the preset automatic response parameter, and determines that the voice response type corresponding to the response audio data packet is an automatic response.
  • the realization recognizes the automatic response in the called response, so that the control device in the calling system controls the business process of the calling system according to the recognition result.
  • FIG. 5 it is a specific architectural diagram of a first embodiment of an identification device for automatically answering a call system of the present invention.
  • the device is disposed in the identification server, and includes: the data transmission module 10 and the identification Module 20, wherein
  • the data transmission module 10 generally refers to a communication interface of the identification server, configured to establish an audio communication channel with the response device when receiving a response identification request sent by the answering device in the call system;
  • the call flow of the call system is started, the answering device in the call system receives the media from the phone user, and sends the called response audio data packet based on the received media, and the answering device sends the response identification to the data transmitting module 10
  • the request when receiving the response identification request, the data transmitting module 10 establishes an audio communication channel with the answering device.
  • the response device includes at least one communication channel for receiving media from a telephone user; the communication channel for receiving media from the telephone user corresponds to a communication channel provided with a response audio data packet; and the data transmission module 10 respectively Establishing an audio communication channel with the response device; for example, when the response device receives the media of the telephone user through the A communication channel, the response device transmits the called response audio data packet based on the received media,
  • the communication channel sent by the response audio data packet is a B communication channel set with the A communication channel mapping; the response device sends a response identification request to the data transmission module 10, and when receiving the response identification request, the data transmission module 10 Establishing an audio communication channel with the B communication channel of the answering device; as described above, the data transmitting module 10 responds to the number of communication channels of the answering device in response to the audio data packet - correspondingly establishing audio with the answering device Communication channel.
  • the identification module 20 is configured to analyze the response audio data packet to determine whether the response audio data packet satisfies a preset automatic response parameter;
  • the voice response type corresponding to the response audio data packet is an automatic response.
  • the data transmission module 10 receives the response audio data packet sent by the response device, and the identification module 20 analyzes the response audio data packet to determine Whether the response audio data packet meets the preset automatic response parameter, and when determining that the response audio data packet meets the preset automatic response parameter, the identification module 20 determines that the voice response type corresponding to the response audio data packet is an automatic response; When it is determined that the response audio data packet does not satisfy the preset automatic response parameter, the identification server determines that the voice response type corresponding to the response audio data packet is a manual response.
  • the preset automatic response parameter may be a preset mute reference ratio, or may be a preset continuous mute reference duration or a preset continuous speech reference duration, and the like. In the other embodiments of the present invention, the preset automatic response parameter may also be a mute reference ratio or a continuous mute reference duration or a preset time period.
  • the continuous voice reference duration, etc., the preset time may be an applicable duration set by a user such as 30s or 40s in advance.
  • an audio communication channel C is established between the data transmitting module 10 and the answering communication channel B of the answering device, and the data transmitting module 10 receives the answering audio data from the answering communication channel B in the C audio communication channel receiving call system.
  • the preset automatic response parameter takes a preset mute reference ratio as an example, and the preset mute reference ratio is 25% as an example, and the identification module 20 analyzes the response audio data packet D to obtain the response audio data.
  • the mute ratio in the packet D if the obtained mute ratio in the response audio data packet D is 20%, the mute ratio 20% obtained by the recognition module 20 is less than 25% of the preset mute reference ratio, and the identification module 20 determines the response.
  • the audio data packet D satisfies the preset automatic response parameter, that is, the identification module 20 determines that the voice response type corresponding to the response audio data packet of the silence ratio is 20% is an automatic response; if the acquired silence ratio in the response audio data packet D 30%, the acquired mute ratio is 30% greater than the preset mute reference ratio of 25%, and the identification module 20 determines that the response audio data packet D does not satisfy the preset automatic response parameter, ie, Module 20 determines the ratio of 30% mute audio packet corresponding to the response voice response to manual answer type.
  • the data transmission module 10 sets the voice response type to automatic
  • the identification result of the answer is sent to the control device in the call system, so that the control device controls the business process of the call system according to the recognition result.
  • the data transmitting module 10 receives the response audio data packet sent by the answering device through the established audio communication channel with the answering device in the calling system; the identifying module 20 analyzes the answering audio data packet to determine the answering audio data. Whether the packet satisfies the preset automatic response parameter, when determining that the response audio data packet satisfies the preset automatic response parameter, the identification module 20 determines that the voice response type corresponding to the response audio data packet is an automatic response.
  • the realization recognizes the automatic response in the called response, so that the control device in the calling system controls the business process of the calling system according to the recognition result.
  • FIG. 6 it is a specific architecture diagram of a second embodiment of the identification device for automatically answering the call system of the present invention.
  • the device includes a processing module 30,
  • the processing module 30 is configured to close the audio communication channel, perform deletion detection on the saved data, and delete data that meets the preset deletion condition.
  • the processing module 30 closes the response and the response.
  • the audio communication channel established between the devices, and the deleted data is deleted and detected, and the data that meets the preset deletion condition is deleted.
  • the processing module 30 closes the audio channel channel C;
  • the preset deletion condition may be a reference storage duration for saving data, or may be a user.
  • the other parameters for saving the data set in advance, the preset deletion condition is an example of the storage duration of the saved data
  • the processing module 30 detects the storage duration of the saved data, and if the saved data is found to be longer than the reference save time
  • the processing module 30 deletes the found saved data, and the reference save duration may be 10 Days or 15 days, it can also be other reference save durations set by the user in advance.
  • the processing module 30 when the preset time is reached, deletes and deletes the saved data, and deletes the data that meets the preset deletion condition, and is not limited to when a communication channel is closed.
  • the processing module 30 performs the deletion detection on the saved data, and the preset time may be 1 day or 5 days, or may be any other time interval or time point set by the user in advance.
  • the processing module 30 After receiving the response audio data packet sent by the answering device by the data transmitting module 10 and identifying by the identification module, the processing module 30 closes the established communication channel, and strives to lower the running load of the identifying device, thereby increasing the running speed, and By deleting the saved data of the identification device, the storage space of the identification device is utilized reasonably, and the processing speed is improved.
  • the identification module 20 is further configured to obtain a continuous silent duration of the audio from the response audio data packet;
  • the preset automatic response parameter is a preset continuous mute reference duration
  • the preset continuous mute reference duration may be set to 0.7s or 1.2s, or may be obtained by any other user through actual detection.
  • the identifier module 20 analyzes the received response audio data packet sent by the response device, and obtains the longest response voice packet from the received response audio data packet.
  • the continuous mute duration if the longest continuous mute duration is 0.6s, the longest continuous mute duration is 0.6s less than the preset continuous mute reference duration of 0.7s, and the recognition module 20 determines the longest continuous mute.
  • the response audio data packet with a duration of 0.6 s satisfies the preset automatic response parameter, that is, the identification module 20 determines that the voice response type corresponding to the longest continuous silent silence duration of 0.6 s is an automatic response, if the longest acquisition is obtained.
  • the continuous mute duration is 1.6s
  • the obtained continuous mute duration is 1.6s longer than the continuous mute reference duration of 0.7s
  • the identification module 20 determines the longest continuous mute duration.
  • the response audio data packet for 1.6s does not satisfy the preset automatic response parameter, that is, the identification module 20 determines that the longest continuous silent duration is
  • the response voice type of the 1.6s response audio packet is a manual response.
  • the preset automatic response parameter may also be a preset continuous mute reference duration within a preset time period.
  • the recognition audio data packet is analyzed by the identification module 20, and the longest continuous silent duration is obtained from the received response audio data packet.
  • the identification module 20 determines The response audio data packet satisfies a preset automatic response parameter, and determines that the voice response type corresponding to the response audio data packet is an automatic response.
  • the automatic response in the called response is recognized so that the control device in the calling system controls the business process of the calling system according to the recognition result.
  • the identification module 20 is further configured to acquire a continuous voice duration of audio from the response audio data packet;
  • the preset automatic response parameter is a preset continuous voice reference duration
  • the preset continuous voice reference duration may be set to 3.0s or 4.0s, or may be obtained by any other user through actual detection.
  • the identification module 20 analyzes the received response audio data packet sent by the answering device in the call system, and receives the response audio data packet from the received response audio data packet. Obtaining the longest continuous speech duration, if the longest continuous speech duration is 3.6 s, the acquired continuous speech duration is 3.6 s greater than the preset continuous speech reference duration of 3.0 s, and the recognition module 20 determines the longest continuous speech duration.
  • the response audio data packet with a duration of 3.6 s satisfies the preset automatic response parameter, that is, the recognition module 20 determines that the voice response type corresponding to the longest continuous voice duration of 3.6 s is an automatic response; if the longest acquisition is obtained
  • the continuous speech duration is 1.6s
  • the obtained continuous speech duration is 1.6s less than the continuous speech reference duration of 3.0s
  • the recognition module 20 determines the longest continuous When sound when the audio data packet length of the response does not satisfy the preset 1.6s automatic answering parameters, i.e. the identification module 20 determines the longest continuous speech
  • the voice response type corresponding to the 1.6S response audio packet is a manual response.
  • the preset automatic response parameter may also be a preset continuous voice reference duration within a preset time period.
  • the recognition audio data packet is analyzed by the identification module 20, and the longest continuous speech duration is obtained from the received response audio data packet.
  • the identification module 20 determines The response audio data packet satisfies a preset automatic response parameter, and determines that the voice response type corresponding to the response audio data packet is an automatic response.
  • the automatic response in the called response is recognized so that the control device in the calling system controls the business process of the calling system according to the recognition result.

Abstract

Disclosed are a method and device for identifying an automatic response of a calling system. In the present invention, the identification server receives a response audio data packet transmitted by a response device of a calling system by means of an audio communication channel established with the response device of the calling system; the identification server analyzes the response audio data packet so as to determine whether the response audio data packet satisfies a pre-set automatic response parameter; when it is determined that the response audio data packet satisfies the pre-set automatic response parameter, the identification server determines that the voice response type corresponding to the response audio data packet is automatic response.

Description

呼叫系统自动应答的识别方法及装置 技术领域  Calling system automatic answering identification method and device
本发明涉及到通信领域, 特别涉及到一种呼叫系统自动应答的识别方 法及装置。 背景技术  The present invention relates to the field of communications, and in particular, to an identification method and apparatus for automatic answering of a call system. Background technique
NGCC (下一代呼叫中心)呼叫系统在外包行业和海外具有广泛的市场 前景, 该呼叫系统一般情况下通过应答设备进行自动语音进行应答, 对于 应答。  The NGCC (Next Generation Call Center) call system has broad market prospects in the outsourcing industry and overseas. The call system generally responds with automatic voice response through the answering device.
呼叫系统包括控制设备, 该控制设备控制呼叫系统中的业务流程, 该 呼叫系统还包括识别服务器, 提供基本和增强业务中的媒体处理功能, 用 于所有与音视频相关的媒体处理, 包括视频和音频 RTP (实时传输协议) 数据流到视音频文件的相互转换。同时,也负责接收用户通过终端的 DTMF (双音多频)输入、 播放业务的引导语音、 显示动态的引导画面。 它具有 的 SIP (基于文本的协议 )协议和 MSML (媒体会话标记语言和媒体对象标 记语言) MOML能力使得其能在应用服务器 APP的控制下完成整个会话 过程与用户的交互。  The call system includes a control device that controls a business process in the call system, the call system further includes an identification server that provides media processing functions in the basic and enhanced services for all audio and video related media processing, including video and Audio RTP (Real Time Transport Protocol) The conversion of data streams to video and audio files. At the same time, it is also responsible for receiving the user's DTMF (Dual Tone Multi-Frequency) input through the terminal, guiding the voice of the broadcast service, and displaying a dynamic boot screen. It has a SIP (Text-Based Protocol) protocol and MSML (Media Session Markup Language and Media Object Markup Language) MOML capabilities that enable it to interact with users throughout the session process under the control of the application server APP.
该识别服务器能识别被叫应答、 被叫忙和无应答。 对于被叫应答中, 有一定的比例是自动应答设备, 例如包括 Modem (调制解调器)、传真、 电 话留言、 语音信箱和秘书台等需要将自动应答设备应答识别出来。 然而, 现有的识别服务器无法将被叫应答中自动应答的部分识别出来。 发明内容 The identification server can recognize the called response, the called busy, and no response. For the called response, there is a certain proportion of automatic answering devices, including, for example, modems, faxes, telephone messages, voice mail, and secretarial desks that need to recognize the automatic answering device response. However, existing identification servers are unable to identify the portion of the called response that is automatically answered. Summary of the invention
为解决现有存在的技术问题, 本发明实施例主要提供一种呼叫系统自 动应答的识别方法及装置。  In order to solve the existing technical problems, the embodiments of the present invention mainly provide a method and device for identifying an automatic response of a call system.
本发明实施例提出一种呼叫系统自动应答的识别方法, 该方法包括: 当接收到呼叫系统中应答设备发送来的应答识别请求时, 识别服务器 建立与所述应答设备的音频通信信道;  An embodiment of the present invention provides a method for identifying an automatic response of a call system, the method comprising: when receiving a response identification request sent by a response device in a call system, the identification server establishes an audio communication channel with the response device;
基于建立的音频通信信道, 识别服务器接收所述应答设备发送过来的 应答音频数据包;  And the recognition server receives the response audio data packet sent by the response device based on the established audio communication channel;
识别服务器分析所述应答音频数据包 , 以确定所述应答音频数据包是 否满足预设的自动应答参数, 在确定所述应答音频数据包满足预设的自动 应答参数时, 识别服务器确定所述应答音频数据包对应的语音应答类型为 自动应答。  The identification server analyzes the response audio data packet to determine whether the response audio data packet satisfies a preset automatic response parameter, and when determining that the response audio data packet satisfies a preset automatic response parameter, the identification server determines the response The voice response type corresponding to the audio data packet is an automatic response.
本发明还提出一种呼叫系统自动应答的识别装置, 该装置包括: 数据接发模块, 配置为当当接收到呼叫系统中应答设备发送来的应答 识别请求时, 识别服务器建立与所述应答设备的音频通信信道; 及  The present invention also provides an identification device for automatically answering a call system, the device comprising: a data transmission module configured to: when receiving a response identification request sent by a response device in the call system, the identification server establishes with the response device Audio communication channel; and
基于建立的音频通信信道, 接收所述应答设备发送过来的应答音频数 据包;  Receiving a response audio data packet sent by the response device based on the established audio communication channel;
识别模块, 配置为分析所述应答音频数据包, 以确定所述应答音频数 据包是否满足预设的自动应答参数, 在确定所述应答音频数据包满足预设 的自动应答参数时, 确定所述应答音频数据包对应的语音应答类型为自动 应答。  An identification module, configured to analyze the response audio data packet, to determine whether the response audio data packet meets a preset automatic response parameter, and when determining that the response audio data packet meets a preset automatic response parameter, determining the The voice response type corresponding to the response audio data packet is an automatic response.
相对现有技术, 本发明实施例通过建立的与呼叫系统中应答设备的音 频通信信道, 识别服务器接收所述应答设备发送过来的应答音频数据包; 识别服务器分析所述应答音频数据包 , 以确定所述应答音频数据包是否满 足预设的自动应答参数, 在确定所述应答音频数据包满足预设的自动应答 参数时, 识别服务器确定所述应答音频数据包对应的语音应答类型为自动 应答。 实现将被叫应答中的自动应答识别出来, 以使呼叫系统中控制设备 根据识别结果来控制呼叫系统的业务流程。 附图说明 Compared with the prior art, in the embodiment of the present invention, the identification server receives the response audio data packet sent by the response device by establishing an audio communication channel with the answering device in the calling system; and the identification server analyzes the response audio data packet to determine Whether the response audio data packet satisfies a preset automatic response parameter, and determines that the response audio data packet satisfies a preset automatic response When the parameter is, the identification server determines that the voice response type corresponding to the response audio data packet is an automatic response. The realization realizes the automatic response in the called response, so that the control device in the calling system controls the business process of the calling system according to the recognition result. DRAWINGS
图 1 为本发明呼叫系统自动应答的识别方法的第一实施例的具体流程 图;  1 is a specific flowchart of a first embodiment of a method for identifying an automatic response of a call system according to the present invention;
图 2为本发明呼叫系统自动应答的识别方法的第二实施例的具体流程 图;  2 is a specific flowchart of a second embodiment of a method for identifying an automatic response of a call system according to the present invention;
图 3 为本发明呼叫系统自动应答的识别方法的第三实施例的具体流程 图;  3 is a specific flowchart of a third embodiment of a method for identifying an automatic response of a call system according to the present invention;
图 4为本发明呼叫系统自动应答的识别方法的第四实施例的具体流程 图;  4 is a specific flowchart of a fourth embodiment of a method for identifying an automatic response of a call system according to the present invention;
图 5 为本发明呼叫系统自动应答的识别装置的第一实施例的具体流程 图;  FIG. 5 is a specific flowchart of a first embodiment of an apparatus for identifying an automatic answering system of a call system according to the present invention; FIG.
图 6为本发明呼叫系统自动应答的识别装置的第二实施例的具体流程 图。  Fig. 6 is a detailed flow chart showing a second embodiment of the apparatus for automatically answering the call system of the present invention.
本发明目的的实现、 功能特点及优点将结合实施例, 参照附图做进一 步说明。 具体实施方式  The implementation, functional features and advantages of the objects of the present invention will be further described in conjunction with the embodiments herein. detailed description
应当理解, 此处所描述的具体实施例仅仅用以解释本发明, 并不用于 限定本发明。  It is understood that the specific embodiments described herein are merely illustrative of the invention and are not intended to limit the invention.
如图 1 所示, 为本发明呼叫系统自动应答的识别方法的第一实施例的 具体流程图。  As shown in FIG. 1, it is a specific flowchart of the first embodiment of the method for identifying the automatic response of the call system of the present invention.
需要强调的是: 图 1 所示流程图仅为一个较佳实施例, 本领域的技术 人员当知, 任何围绕本发明思想构建的实施例都不应脱离于如下技术方案 涵盖的范围: It should be emphasized that the flowchart shown in FIG. 1 is only a preferred embodiment, and the technology in the field It is to be understood that any embodiment constructed around the inventive concept should not depart from the scope of the following technical solutions:
当接收到呼叫系统中应答设备发送来的应答识别请求时, 识别服务器 建立与所述应答设备的音频通信信道; 基于建立的音频通信信道, 识别服 务器接收所述应答设备发送过来的应答音频数据包; 识别服务器分析所述 应答音频数据包, 以确定所述应答音频数据包是否满足预设的自动应答参 数, 在确定所述应答音频数据包满足预设的自动应答参数时, 识别服务器 确定所述应答音频数据包对应的语音应答类型为自动应答。  Receiving, by the answering device, a response identification request sent by the answering device, the identification server establishes an audio communication channel with the answering device; and based on the established audio communication channel, the identifying server receives the answering audio data packet sent by the answering device The identification server analyzes the response audio data packet to determine whether the response audio data packet satisfies a preset automatic response parameter, and when determining that the response audio data packet satisfies a preset automatic response parameter, the identification server determines the The voice response type corresponding to the response audio data packet is an automatic response.
以下是本实施例逐步实现识别出呼叫系统中应答设备的自动应答的具 体步骤:  The following is a specific step in the embodiment to gradually recognize the automatic response of the answering device in the calling system:
步骤 S11 , 当接收到呼叫系统中应答设备发送来的应答识别请求时,识 别服务器建立与所述应答设备的音频通信信道。  Step S11: When receiving the response identification request sent by the answering device in the calling system, the identification server establishes an audio communication channel with the answering device.
具体的, 启动呼叫系统的呼叫流程, 呼叫系统中应答设备接收来自电 话用户的媒体, 并基于所述接收的媒体发送被叫应答音频数据包, 所述应 答设备向识别服务器发送应答识别请求, 当接收到应答识别请求时, 识别 服务器建立与所述应答设备的音频通信信道。 所述呼叫系统中应答设备包 括至少一个通信信道用于接收来自电话用户的媒体; 所述用于接收来自电 话用户的媒体的通信信道对应设置有应答音频数据包的通信信道; 所述识 别服务器分别建立与所述应答设备之间的音频通信信道; 例如, 当所述应 答设备通过 A通信信道接收到电话用户的媒体时, 所述应答设备基于所述 接收的媒体发送被叫应答音频数据包, 所述应答音频数据包发送的通信信 道为与 A通信信道映射设置的 B通信信道; 所述应答设备向识别服务器发 送应答识别请求, 当接收到应答识别请求时, 识别服务器建立与所述应答 设备的 B通信信道的音频通信信道; 如上所述, 识别服务器根据所述应答 设备应答音频数据包的通信信道的数量来——对应建立与所述应答设备的 音频通信信道。 Specifically, the call flow of the call system is started, the answering device in the call system receives the media from the phone user, and sends a called response audio data packet based on the received media, and the answering device sends a response identification request to the identification server, when Upon receiving the response identification request, the identification server establishes an audio communication channel with the answering device. The answering device in the calling system includes at least one communication channel for receiving media from a telephone user; the communication channel for receiving media from the telephone user corresponds to a communication channel provided with a response audio data packet; Establishing an audio communication channel with the response device; for example, when the response device receives the media of the telephone user through the A communication channel, the response device transmits the called response audio data packet based on the received media, The communication channel sent by the response audio data packet is a B communication channel set with the A communication channel mapping; the response device sends a response identification request to the identification server, and when receiving the response identification request, the identification server establishes the response device Audio communication channel of the B communication channel; as described above, the identification server is responsive to the number of communication channels that the response device answers to the audio data packet - corresponding to the establishment of the response device Audio communication channel.
步骤 S12,基于建立的音频通信信道,识别服务器接收所述应答设备发 送过来的应答音频数据包。  Step S12: Based on the established audio communication channel, the identification server receives the response audio data packet sent by the response device.
步骤 S13 ,识别服务器分析所述应答音频数据包, 以确定所述应答音频 数据包是否满足预设的自动应答参数。  Step S13: The identification server analyzes the response audio data packet to determine whether the response audio data packet satisfies a preset automatic response parameter.
步骤 S14,在确定所述应答音频数据包满足预设的自动应答参数时,识 别服务器确定所述应答音频数据包对应的语音应答类型为自动应答。  Step S14: When it is determined that the response audio data packet meets the preset automatic response parameter, the identification server determines that the voice response type corresponding to the response audio data packet is an automatic response.
具体的, 基于建立的识别服务器与所述应答设备之间的音频通信信道, 识别服务器接收所述应答设备发送过来的应答音频数据包, 识别服务器分 析所述应答音频数据包, 以确定所述应答音频数据包是否满足预设的自动 应答参数, 在确定所述应答音频数据包满足预设的自动应答参数时, 识别 服务器确定所述应答音频数据包对应的语音应答类型为自动应答; 在确定 所述应答音频数据包不满足预设的自动应答参数时, 识别服务器确定所述 应答音频数据包对应的语音应答类型为人工应答。 在本实施例中, 所述预 设的自动应答参数可以是预设的静音参考比例, 也还可以是预设的连续静 音参考时长或预设的连续语音参考时长等其他任意用于提前设置的适用的 能识别出自动应答的预设的自动应答参数; 在本发明其他实施例中, 所述 预设的自动应答参数还可以是在预设时间段内的静音参考比例或连续静音 参考时长或连续语音参考时长等,所述预设时间可以是 30s或 40s等用户提 前设置的适用的时长。例如,识别服务器与所述应答设备的应答通信信道 B 之间建立了音频通信信道 C, 识别服务器从 C音频通信信道接收呼叫系统 中的应答通信信道 B接收到应答音频数据包 D, 所述预设的自动应答参数 以预设的静音参考比例为例, 预设的静音参考比例以 25%为例, 识别服务 器分析所述应答音频数据包 D, 获取该应答音频数据包 D中的静音比例, 若获取的该应答音频数据包 D中的静音比例为 20% , 获取的静音比例 20% 小于预设的静音参考比例 25%, 识别服务器确定所述应答音频数据包 D满 足预设的自动应答参数, 即识别服务器确定所述静音比例 20%的应答音频 数据包对应的语音应答类型为自动应答; 若获取的该应答音频数据包 D中 的静音比例为 30%, 获取的静音比例 30%大于预设的静音参考比例 25%, 识别服务器确定所述应答音频数据包 D不满足预设的自动应答参数, 即识 另1 J服务器确定所述静音比例 30%的应答音频数据包对应的语音应答类型为 人工应答。 识别服务器将语音应答类型为自动应答的识别结果发送给呼叫 系统中控制设备, 以使所述控制设备根据所述识别结果来控制呼叫系统的 业务流程。 Specifically, based on an audio communication channel between the established identification server and the response device, the identification server receives the response audio data packet sent by the response device, and the identification server analyzes the response audio data packet to determine the response. Whether the audio data packet meets the preset automatic response parameter, and when determining that the response audio data packet meets the preset automatic response parameter, the identification server determines that the voice response type corresponding to the response audio data packet is an automatic response; When the response audio data packet does not satisfy the preset automatic response parameter, the identification server determines that the voice response type corresponding to the response audio data packet is a manual response. In this embodiment, the preset automatic response parameter may be a preset mute reference ratio, or may be a preset continuous mute reference duration or a preset continuous speech reference duration, and the like. In the other embodiments of the present invention, the preset automatic response parameter may also be a mute reference ratio or a continuous mute reference duration or a preset time period. The continuous voice reference duration, etc., the preset time may be an applicable duration set by a user such as 30s or 40s in advance. For example, an audio communication channel C is established between the identification server and the response communication channel B of the response device, and the identification server receives the response audio data packet D from the response communication channel B in the C audio communication channel receiving call system, the pre- The set auto-answer parameter takes the preset mute reference ratio as an example. The preset mute reference ratio is 25%. The recognition server analyzes the response audio data packet D to obtain the mute ratio in the response audio data packet D. If the obtained mute ratio in the response audio packet D is 20%, the obtained mute ratio is 20%. The recognition server determines that the response audio data packet D satisfies the preset automatic response parameter, that is, the recognition server determines that the voice response type corresponding to the response audio data packet of the silence ratio is 20% is automatic. Answering; if the obtained mute ratio in the response audio data packet D is 30%, the obtained mute ratio 30% is greater than the preset mute reference ratio 25%, and the identification server determines that the response audio data packet D does not satisfy the preset automatic answering parameters, i.e., identifying the other server 1 J determines the ratio of 30% mute audio packet corresponding to the response voice response type manual answer. The identification server sends the recognition result of the voice response type to the automatic response to the control device in the call system, so that the control device controls the business process of the call system according to the recognition result.
通过建立的与呼叫系统中应答设备的音频通信信道, 识别服务器接收 所述应答设备发送过来的应答音频数据包; 识别服务器分析所述应答音频 数据包, 以确定所述应答音频数据包是否满足预设的自动应答参数, 在确 定所述应答音频数据包满足预设的自动应答参数时, 识别服务器确定所述 应答音频数据包对应的语音应答类型为自动应答。 实现将被叫应答中的自 动应答识别出来, 以使呼叫系统中控制设备根据识别结果来控制呼叫系统 的业务流程。  The identification server receives the response audio data packet sent by the response device by establishing an audio communication channel with the answering device in the calling system; the identification server analyzes the response audio data packet to determine whether the response audio data packet satisfies The automatic answering parameter is configured to: when determining that the response audio data packet meets the preset automatic response parameter, the identification server determines that the voice response type corresponding to the response audio data packet is an automatic response. The realization recognizes the automatic response in the called response, so that the control device in the calling system controls the business process of the calling system according to the recognition result.
如图 2所示, 为本发明呼叫系统自动应答的识别方法的第二实施例的 具体流程图。  As shown in FIG. 2, it is a specific flowchart of the second embodiment of the method for identifying the automatic response of the call system of the present invention.
基于上述第一实施例, 在步骤 S14之后, 还包括:  Based on the first embodiment, after step S14, the method further includes:
步骤 S15 ,识别服务器关闭所述音频通信信道,并对保存的数据进行删 除检测, 将满足预设的删除条件的数据删除。  Step S15: The identification server closes the audio communication channel, and performs deletion detection on the saved data, and deletes data that meets the preset deletion condition.
具体的, 在确定所述应答音频数据包满足预设的自动应答参数时, 识 别服务器确定所述应答音频数据包对应的语音应答类型为自动应答之后, 识别服务器关闭所述与所述应答设备之间建立的音频通信信道, 并对保存 的数据进行删除检测, 将满足预设的删除条件的数据删除。 以识别服务器 与所述应答设备的应答通信信道 B之间建立的音频通信信道 C为例, 在通 过音频通信信道 C接收所述应答设备发送过来的应答音频数据包 D, 并确 定所述应答音频数据包 D满足自动应答参数之后, 识别服务器关闭所述音 频通道信道 C; 所述预设的删除条件可以是保存数据的参考保存时长, 也 还可以是用户提前设置的其他保存数据的参数, 所述预设的删除条件以保 存数据的参考保存时长为例, 对保存数据的保存时长的检测, 若找出有保 存数据的保存时长大于参考保存时长, 则将所述找出的保存数据删除, 所 述参考保存时长可以是 10天或 15天, 也还可以是用户提前设置的其他参 考保存时长。 在本发明其他实施例中, 还可以是在预设时间达到时, 识别 服务器对保存的数据进行删除检测, 将满足预设的删除条件的数据删除, 并不局限在关闭一个通信信道时, 识别服务器才对保存的数据进行删除检 测, 所述预设时间可以是 1天或 5天, 也还可以是用户提前设置的其他任 意时间间隔或时间点。 Specifically, after determining that the response audio data packet meets the preset automatic response parameter, after the identification server determines that the voice response type corresponding to the response audio data packet is an automatic response, the identification server closes the response device. The established audio communication channel, and the deleted data is deleted and detected, and the data that meets the preset deletion condition is deleted. To identify the server Taking the audio communication channel C established between the response communication channel B of the response device as an example, the response audio data packet D transmitted by the response device is received through the audio communication channel C, and the response audio data packet D is determined. After the automatic response parameter is met, the identification server closes the audio channel channel C; the preset deletion condition may be a reference storage duration for saving data, or may be other parameters for saving data set by the user in advance, the preset The deletion condition is taken as an example of the storage duration of the saved data. If the storage duration of the saved data is detected, if the storage duration of the saved data is greater than the reference storage duration, the found saved data is deleted, and the reference is deleted. The save time can be 10 days or 15 days, or it can be other reference save time set by the user in advance. In another embodiment of the present invention, when the preset time is reached, the identification server performs deletion detection on the saved data, and deletes data that meets the preset deletion condition, and is not limited to when the communication channel is closed. The server performs the deletion detection on the saved data, and the preset time may be 1 day or 5 days, or may be any other time interval or time point set by the user in advance.
通过在接收并识别所述应答设备发送的应答音频数据包后, 关闭建立 的通信信道, P争低识别服务器的运行负载, 提高运行速度, 并通过对识别 服务器的保存数据进行删除, 合理利用识别服务器的存储空间, 提高处理 速度。  After receiving and recognizing the response audio data packet sent by the response device, the established communication channel is closed, P strives to lower the running load of the identification server, improves the running speed, and deletes the saved data of the identification server, and utilizes the identification reasonably The storage space of the server increases the processing speed.
如图 3 所示, 为本发明呼叫系统自动应答的识别方法的第三实施例的 具体流程图。  As shown in FIG. 3, it is a specific flowchart of the third embodiment of the method for identifying the automatic response of the call system of the present invention.
基于上述第一实施例, 所述预设的自动应答参数为预设的连续静音参 考时长时, 步骤 S13还包括:  Based on the foregoing first embodiment, when the preset automatic response parameter is the preset continuous mute reference duration, step S13 further includes:
步骤 S16, 识别服务器从应答音频数据包中获取音频的连续静音时长。 步骤 S17 ,在获取的连续静音时长小于预设的连续静音参考时长时,识 别服务器确定所述应答音频数据包满足预设的自动应答参数。  Step S16: Identify the continuous mute duration of the audio obtained by the server from the response audio data packet. Step S17: When the acquired continuous mute duration is less than the preset continuous mute reference duration, the identification server determines that the response audio data packet satisfies the preset auto-answer parameter.
具体的, 所述预设的自动应答参数为预设的连续静音参考时长, 所述 预设的连续静音参考时长可以设置为 0.7s或 1.2s , 也还可以是其他任意用 户通过实际检测得出的时间间隔值; 以预设的连续静音参考时长为 0.7s为 例, 识别服务器分析接收的所述应答设备发送过来的应答音频数据包, 从 所述接收的应答音频数据包中获取最长的连续静音时长, 若获取的最长的 连续静音时长为 0.6s, 则获取的最长连续静音时长为 0.6s小于预设的连续 静音参考时长 0.7s, 识别服务器确定所述最长连续静音时长为 0.6s的应答 音频数据包满足预设的自动应答参数, 即识别服务器确定最长连续静音时 长为 0.6s的应答音频数据包对应的语音应答类型为自动应答, 若获取的最 长的连续静音时长为 1.6s , 则获取的连续静音时长为 1.6s大于连续静音参 考时长 0.7s , 识别服务器确定所述最长连续静音时长为 1.6s的应答音频数 据包不满足预设的自动应答参数, 即识别服务器确定最长连续静音时长为 1.6s的应答音频数据包对应的语音应答类型为人工应答。在本发明其他实施 例中, 所述预设的自动应答参数还可以是在预设时间段内的预设的连续静 音参考时长。 Specifically, the preset automatic response parameter is a preset continuous silent reference duration, The preset continuous mute reference duration can be set to 0.7s or 1.2s, or it can be the time interval value obtained by any other user through actual detection. Take the preset continuous mute reference duration of 0.7s as an example, identify the server analysis. Receiving the response audio data packet sent by the response device, and obtaining the longest continuous silent duration from the received response audio data packet, and if the longest continuous silent duration obtained is 0.6s, the longest acquisition time is obtained. The continuous mute duration is 0.6s less than the preset continuous mute reference duration of 0.7s, and the recognition server determines that the longest continuous mute duration of 0.6s response audio data packet satisfies the preset auto-answer parameter, that is, the identification server determines the longest continuous The voice response type corresponding to the response audio packet with a silence duration of 0.6s is an automatic response. If the longest continuous silence duration is 1.6s, the obtained continuous silence duration is 1.6s longer than the continuous silent reference duration of 0.7s. The server determines that the response audio packet with the longest continuous silent duration of 1.6s does not satisfy the preset automatic response parameter, that is, the identification service Determines the length of the audio 1.6s response packet corresponding to the type of artificial response voice response longest continuous silence. In other embodiments of the present invention, the preset automatic response parameter may also be a preset continuous mute reference duration within a preset time period.
通过分析所述应答音频数据包, 从所述接收的应答音频数据包中获取 最长的连续静音时长, 在获取的连续静音时长小于预设的连续静音参考时 长时, 识别服务器确定所述应答音频数据包满足预设的自动应答参数, 并 确定所述应答音频数据包对应的语音应答类型为自动应答。 实现将被叫应 答中的自动应答识别出来, 以使呼叫系统中控制设备根据识别结果来控制 呼叫系统的业务流程。  Obtaining, by analyzing the response audio data packet, a longest continuous silent duration from the received response audio data packet, where the recognition server determines the response audio when the acquired continuous silent duration is less than a preset continuous silent reference duration The data packet satisfies the preset automatic response parameter, and determines that the voice response type corresponding to the response audio data packet is an automatic response. The realization recognizes the automatic response in the called response, so that the control device in the calling system controls the business process of the calling system according to the recognition result.
如图 4所示, 为本发明呼叫系统自动应答的识别方法的第四实施例的 具体流程图。  As shown in FIG. 4, it is a specific flowchart of a fourth embodiment of the method for identifying an automatic answering system of the present invention.
基于上述第一实施例, 所述自动应答参数为连续语音参考时长时, 步 骤 S13还包括:  Based on the foregoing first embodiment, when the automatic response parameter is a continuous voice reference duration, step S13 further includes:
步骤 S 18 , 识别服务器从应答音频数据包中获取音频的连续语音时长; 步骤 S19 ,在获取的连续语音时长大于预设的连续语音参考时长时,识 别服务器确定所述应答音频数据包满足预设的自动应答参数。 Step S18: Identify a continuous voice duration that the server obtains audio from the response audio data packet; Step S19: When the acquired continuous voice duration is greater than the preset continuous voice reference duration, the identification server determines that the response audio data packet meets the preset automatic response parameter.
具体的, 所述预设的自动应答参数为预设的连续语音参考时长, 所述 预设的连续语音参考时长可以设置为 3.0s或 4.0s , 也还可以是其他任意用 户通过实际检测得出的时间间隔值; 以预设的连续语音参考时长为 3.0s为 例, 识别服务器分析接收的所述呼叫系统中应答设备发送过来的应答音频 数据包, 从所述接收的应答音频数据包中获取最长的连续语音时长, 若获 取的最长的连续语音时长为 3.6s , 则获取的最长连续语音时长为 3.6s大于 预设的连续语音参考时长 3.0s , 识别服务器确定所述最长连续语音时长为 3.6s的应答音频数据包满足预设的自动应答参数,即识别服务器确定最长连 续语音时长为 3.6s的应答音频数据包对应的语音应答类型为自动应答; 若 获取的最长的连续语音时长为 1.6s, 则获取的连续语音时长为 1.6s小于连 续语音参考时长 3.0s, 识别服务器确定所述最长连续语音时长为 1.6s的应 答音频数据包不满足预设的自动应答参数, 即识别服务器确定最长连续语 音时长为 1.6s的应答音频数据包对应的语音应答类型为人工应答。 在本发 明其他实施例中, 所述预设的自动应答参数还可以是在预设时间段内的连 续语音参考时长。  Specifically, the preset automatic response parameter is a preset continuous voice reference duration, and the preset continuous voice reference duration may be set to 3.0s or 4.0s, or may be obtained by any other user through actual detection. For example, the identifier of the preset continuous voice reference duration is 3.0s, and the identification server analyzes the received response audio data packet sent by the answering device in the calling system, and obtains the received response audio data packet from the received response audio data packet. The longest continuous speech duration, if the longest continuous speech duration is 3.6 s, the longest continuous speech duration obtained is 3.6 s greater than the preset continuous speech reference duration of 3.0 s, and the recognition server determines the longest continuous The response audio data packet with a voice duration of 3.6 s satisfies the preset automatic response parameter, that is, the recognition server determines that the voice response type corresponding to the longest continuous voice duration of 3.6 s is an automatic response; if the longest acquisition is obtained The continuous speech duration is 1.6s, and the continuous speech duration is 1.6s less than the continuous speech reference duration of 3.0s. The device determines that the response audio packet with the longest continuous speech duration of 1.6s does not satisfy the preset automatic response parameter, that is, the recognition server determines that the voice response type corresponding to the longest continuous speech duration of 1.6s is artificial. Answer. In other embodiments of the present invention, the preset automatic response parameter may also be a continuous voice reference duration within a preset time period.
通过分析所述应答音频数据包 , 从所述接收的应答音频数据包中获取 最长的连续语音时长, 在获取的连续语音时长大于预设的连续语音参考时 长时, 识别服务器确定所述应答音频数据包满足预设的自动应答参数, 并 确定所述应答音频数据包对应的语音应答类型为自动应答。 实现将被叫应 答中的自动应答识别出来, 以使呼叫系统中控制设备根据识别结果来控制 呼叫系统的业务流程。  Obtaining, by analyzing the response audio data packet, a longest continuous voice duration from the received response audio data packet, where the recognition server determines the response audio when the acquired continuous voice duration is greater than a preset continuous voice reference duration The data packet satisfies the preset automatic response parameter, and determines that the voice response type corresponding to the response audio data packet is an automatic response. The realization recognizes the automatic response in the called response, so that the control device in the calling system controls the business process of the calling system according to the recognition result.
如图 5 所示, 为本发明呼叫系统自动应答的识别装置的第一实施例的 具体架构图。 该装置设置于识别服务器中, 包括: 数据接发模块 10及识别 模块 20, 其中, As shown in FIG. 5, it is a specific architectural diagram of a first embodiment of an identification device for automatically answering a call system of the present invention. The device is disposed in the identification server, and includes: the data transmission module 10 and the identification Module 20, wherein
所述数据接发模块 10, —般是指识别服务器的通信接口, 配置为当接 收到呼叫系统中应答设备发送来的应答识别请求时, 建立与所述应答设备 的音频通信信道; 及  The data transmission module 10 generally refers to a communication interface of the identification server, configured to establish an audio communication channel with the response device when receiving a response identification request sent by the answering device in the call system;
基于建立的音频通信信道, 接收所述应答设备发送过来的应答音频数 据包。  Receiving a response audio data packet sent by the response device based on the established audio communication channel.
具体的, 启动呼叫系统的呼叫流程, 呼叫系统中应答设备接收来自电 话用户的媒体, 并基于所述接收的媒体发送被叫应答音频数据包, 所述应 答设备向数据接发模块 10发送应答识别请求, 当接收到应答识别请求时, 数据接发模块 10建立与所述应答设备的音频通信信道。 所述应答设备包括 至少一个通信信道用于接收来自电话用户的媒体; 所述用于接收来自电话 用户的媒体的通信信道对应设置有应答音频数据包的通信信道; 所述数据 接发模块 10分别建立与所述应答设备之间的音频通信信道; 例如, 当所述 应答设备通过 A通信信道接收到电话用户的媒体时, 所述应答设备基于所 述接收的媒体发送被叫应答音频数据包 , 所述应答音频数据包发送的通信 信道为与 A通信信道映射设置的 B通信信道; 所述应答设备向数据接发模 块 10发送应答识别请求, 当接收到应答识别请求时, 数据接发模块 10建 立与所述应答设备的 B通信信道的音频通信信道; 如上所述, 数据接发模 块 10根据所述应答设备应答音频数据包的通信信道的数量来——对应建立 与所述应答设备的音频通信信道。  Specifically, the call flow of the call system is started, the answering device in the call system receives the media from the phone user, and sends the called response audio data packet based on the received media, and the answering device sends the response identification to the data transmitting module 10 The request, when receiving the response identification request, the data transmitting module 10 establishes an audio communication channel with the answering device. The response device includes at least one communication channel for receiving media from a telephone user; the communication channel for receiving media from the telephone user corresponds to a communication channel provided with a response audio data packet; and the data transmission module 10 respectively Establishing an audio communication channel with the response device; for example, when the response device receives the media of the telephone user through the A communication channel, the response device transmits the called response audio data packet based on the received media, The communication channel sent by the response audio data packet is a B communication channel set with the A communication channel mapping; the response device sends a response identification request to the data transmission module 10, and when receiving the response identification request, the data transmission module 10 Establishing an audio communication channel with the B communication channel of the answering device; as described above, the data transmitting module 10 responds to the number of communication channels of the answering device in response to the audio data packet - correspondingly establishing audio with the answering device Communication channel.
所述识别模块 20, —般是指识别服务器的处理器, 配置为分析所述应 答音频数据包, 以确定所述应答音频数据包是否满足预设的自动应答参数; 及  The identification module 20, generally referred to as a processor of the identification server, is configured to analyze the response audio data packet to determine whether the response audio data packet satisfies a preset automatic response parameter;
在确定所述应答音频数据包满足预设的自动应答参数时, 确定所述应 答音频数据包对应的语音应答类型为自动应答。 具体的, 基于建立的与所述应答设备之间的音频通信信道, 数据接发 模块 10接收所述应答设备发送过来的应答音频数据包, 识别模块 20分析 所述应答音频数据包, 以确定所述应答音频数据包是否满足预设的自动应 答参数, 在确定所述应答音频数据包满足预设的自动应答参数时, 识别模 块 20确定所述应答音频数据包对应的语音应答类型为自动应答; 在确定所 述应答音频数据包不满足预设的自动应答参数时, 识别服务器确定所述应 答音频数据包对应的语音应答类型为人工应答。 在本实施例中, 所述预设 的自动应答参数可以是预设的静音参考比例, 也还可以是预设的连续静音 参考时长或预设的连续语音参考时长等其他任意用于提前设置的适用的能 识别出自动应答的预设的自动应答参数; 在本发明其他实施例中, 所述预 设的自动应答参数还可以是在预设时间段内的静音参考比例或连续静音参 考时长或连续语音参考时长等,所述预设时间可以是 30s或 40s等用户提前 设置的适用的时长。 例如, 数据接发模块 10与所述应答设备的应答通信信 道 B之间建立了音频通信信道 C,数据接发模块 10从 C音频通信信道接收 呼叫系统中的应答通信信道 B接收到应答音频数据包 D, 所述预设的自动 应答参数以预设的静音参考比例为例, 预设的静音参考比例以 25%为例, 识别模块 20分析所述应答音频数据包 D,获取该应答音频数据包 D中的静 音比例, 若获取的该应答音频数据包 D中的静音比例为 20%, 识别模块 20 获取的静音比例 20%小于预设的静音参考比例 25%,识别模块 20确定所述 应答音频数据包 D满足预设的自动应答参数,即识别模块 20确定所述静音 比例 20%的应答音频数据包对应的语音应答类型为自动应答; 若获取的该 应答音频数据包 D中的静音比例为 30%, 获取的静音比例 30%大于预设的 静音参考比例 25%,识别模块 20确定所述应答音频数据包 D不满足预设的 自动应答参数, 即识别模块 20确定所述静音比例 30%的应答音频数据包对 应的语音应答类型为人工应答。 数据接发模块 10将语音应答类型为自动应 答的识别结果发送给呼叫系统中控制设备, 以使所述控制设备根据所述识 别结果来控制呼叫系统的业务流程。 When it is determined that the response audio data packet meets the preset automatic response parameter, it is determined that the voice response type corresponding to the response audio data packet is an automatic response. Specifically, based on the established audio communication channel with the response device, the data transmission module 10 receives the response audio data packet sent by the response device, and the identification module 20 analyzes the response audio data packet to determine Whether the response audio data packet meets the preset automatic response parameter, and when determining that the response audio data packet meets the preset automatic response parameter, the identification module 20 determines that the voice response type corresponding to the response audio data packet is an automatic response; When it is determined that the response audio data packet does not satisfy the preset automatic response parameter, the identification server determines that the voice response type corresponding to the response audio data packet is a manual response. In this embodiment, the preset automatic response parameter may be a preset mute reference ratio, or may be a preset continuous mute reference duration or a preset continuous speech reference duration, and the like. In the other embodiments of the present invention, the preset automatic response parameter may also be a mute reference ratio or a continuous mute reference duration or a preset time period. The continuous voice reference duration, etc., the preset time may be an applicable duration set by a user such as 30s or 40s in advance. For example, an audio communication channel C is established between the data transmitting module 10 and the answering communication channel B of the answering device, and the data transmitting module 10 receives the answering audio data from the answering communication channel B in the C audio communication channel receiving call system. In the package D, the preset automatic response parameter takes a preset mute reference ratio as an example, and the preset mute reference ratio is 25% as an example, and the identification module 20 analyzes the response audio data packet D to obtain the response audio data. The mute ratio in the packet D, if the obtained mute ratio in the response audio data packet D is 20%, the mute ratio 20% obtained by the recognition module 20 is less than 25% of the preset mute reference ratio, and the identification module 20 determines the response. The audio data packet D satisfies the preset automatic response parameter, that is, the identification module 20 determines that the voice response type corresponding to the response audio data packet of the silence ratio is 20% is an automatic response; if the acquired silence ratio in the response audio data packet D 30%, the acquired mute ratio is 30% greater than the preset mute reference ratio of 25%, and the identification module 20 determines that the response audio data packet D does not satisfy the preset automatic response parameter, ie, Module 20 determines the ratio of 30% mute audio packet corresponding to the response voice response to manual answer type. The data transmission module 10 sets the voice response type to automatic The identification result of the answer is sent to the control device in the call system, so that the control device controls the business process of the call system according to the recognition result.
通过建立的与呼叫系统中应答设备的音频通信信道, 数据接发模块 10 接收所述应答设备发送过来的应答音频数据包; 识别模块 20分析所述应答 音频数据包, 以确定所述应答音频数据包是否满足预设的自动应答参数, 在确定所述应答音频数据包满足预设的自动应答参数时, 识别模块 20确定 所述应答音频数据包对应的语音应答类型为自动应答。 实现将被叫应答中 的自动应答识别出来, 以使呼叫系统中控制设备根据识别结果来控制呼叫 系统的业务流程。  The data transmitting module 10 receives the response audio data packet sent by the answering device through the established audio communication channel with the answering device in the calling system; the identifying module 20 analyzes the answering audio data packet to determine the answering audio data. Whether the packet satisfies the preset automatic response parameter, when determining that the response audio data packet satisfies the preset automatic response parameter, the identification module 20 determines that the voice response type corresponding to the response audio data packet is an automatic response. The realization recognizes the automatic response in the called response, so that the control device in the calling system controls the business process of the calling system according to the recognition result.
如图 6所示, 为本发明呼叫系统自动应答的识别装置的第二实施例的 具体架构图。 该装置包括处理模块 30,  As shown in FIG. 6, it is a specific architecture diagram of a second embodiment of the identification device for automatically answering the call system of the present invention. The device includes a processing module 30,
所述处理模块 30, 配置为关闭所述音频通信信道, 并对保存的数据进 行删除检测, 并将满足预设的删除条件的数据删除。  The processing module 30 is configured to close the audio communication channel, perform deletion detection on the saved data, and delete data that meets the preset deletion condition.
具体的, 在确定所述应答音频数据包满足预设的自动应答参数时, 识 别模块 20确定所述应答音频数据包对应的语音应答类型为自动应答之后, 处理模块 30关闭所述与所述应答设备之间建立的音频通信信道, 并对保存 的数据进行删除检测, 将满足预设的删除条件的数据删除。 以数据接发模 块 10建立的与所述应答设备的应答通信信道 B之间建立的音频通信信道 C 为例, 在通过音频通信信道 C接收都所述应答设备发送过来的应答音频数 据包 D,并识别模块 20确定所述应答音频数据包 D满足自动应答参数之后 , 处理模块 30关闭所述音频通道信道 C; 所述预设的删除条件可以是保存数 据的参考保存时长, 也还可以是用户提前设置的其他保存数据的参数, 所 述预设的删除条件以保存数据的参考保存时长为例, 处理模块 30对保存数 据的保存时长的检测 , 若找出有保存数据的保存时长大于参考保存时长, 则处理模块 30将所述找出的保存数据删除, 所述参考保存时长可以是 10 天或 15天, 也还可以是用户提前设置的其他参考保存时长。 在本发明其他 实施例中, 还可以是在预设时间达到时, 处理模块 30对保存的数据进行删 除检测, 将满足预设的删除条件的数据删除, 并不局限在关闭一个通信信 道时, 处理模块 30才对保存的数据进行删除检测, 所述预设时间可以是 1 天或 5天, 也还可以是用户提前设置的其他任意时间间隔或时间点。 Specifically, after determining that the response audio data packet meets the preset automatic response parameter, after the identification module 20 determines that the voice response type corresponding to the response audio data packet is an automatic response, the processing module 30 closes the response and the response. The audio communication channel established between the devices, and the deleted data is deleted and detected, and the data that meets the preset deletion condition is deleted. Taking the audio communication channel C established between the data transmission module 10 and the response communication channel B of the response device as an example, receiving the response audio data packet D sent by the response device through the audio communication channel C, After the identification module 20 determines that the response audio data packet D satisfies the automatic response parameter, the processing module 30 closes the audio channel channel C; the preset deletion condition may be a reference storage duration for saving data, or may be a user. The other parameters for saving the data set in advance, the preset deletion condition is an example of the storage duration of the saved data, and the processing module 30 detects the storage duration of the saved data, and if the saved data is found to be longer than the reference save time The processing module 30 deletes the found saved data, and the reference save duration may be 10 Days or 15 days, it can also be other reference save durations set by the user in advance. In another embodiment of the present invention, when the preset time is reached, the processing module 30 deletes and deletes the saved data, and deletes the data that meets the preset deletion condition, and is not limited to when a communication channel is closed. The processing module 30 performs the deletion detection on the saved data, and the preset time may be 1 day or 5 days, or may be any other time interval or time point set by the user in advance.
通过在数据接发模块 10接收并通过识别模块识别所述应答设备发送的 应答音频数据包后, 处理模块 30关闭建立的通信信道, P争低所述识别装置 的运行负载, 提高运行速度, 并通过对识别装置的保存数据进行删除, 合 理利用所述识别装置的存储空间 , 提高处理速度。  After receiving the response audio data packet sent by the answering device by the data transmitting module 10 and identifying by the identification module, the processing module 30 closes the established communication channel, and strives to lower the running load of the identifying device, thereby increasing the running speed, and By deleting the saved data of the identification device, the storage space of the identification device is utilized reasonably, and the processing speed is improved.
优选地, 所述识别模块 20, 还配置为从应答音频数据包中获取音频的 连续静音时长; 及  Preferably, the identification module 20 is further configured to obtain a continuous silent duration of the audio from the response audio data packet;
在获取的连续静音时长小于预设的连续静音参考时长时, 确定所述应 答音频数据包满足预设的自动应答参数。  When the obtained continuous mute duration is less than the preset continuous mute reference duration, it is determined that the answer audio data packet satisfies the preset auto answer parameter.
具体的, 所述预设的自动应答参数为预设的连续静音参考时长, 所述 预设的连续静音参考时长可以设置为 0.7s或 1.2s, 也还可以是其他任意用 户通过实际检测得出的时间间隔值; 以预设的连续静音参考时长为 0.7s为 例, 识别模块 20分析接收的所述应答设备发送过来的应答音频数据包, 从 所述接收的应答音频数据包中获取最长的连续静音时长, 若获取的最长的 连续静音时长为 0.6s, 则获取的最长连续静音时长为 0.6s小于预设的连续 静音参考时长 0.7s, 识别模块 20确定所述最长连续静音时长为 0.6s的应答 音频数据包满足预设的自动应答参数, 即识别模块 20确定最长连续静音时 长为 0.6s的应答音频数据包对应的语音应答类型为自动应答, 若获取的最 长的连续静音时长为 1.6s, 则获取的连续静音时长为 1.6s大于连续静音参 考时长 0.7s, 识别模块 20确定所述最长连续静音时长为 1.6s的应答音频数 据包不满足预设的自动应答参数, 即识别模块 20确定最长连续静音时长为 1.6s的应答音频数据包对应的语音应答类型为人工应答。在本发明其他实施 例中, 所述预设的自动应答参数还可以是在预设时间段内的预设的连续静 音参考时长。 Specifically, the preset automatic response parameter is a preset continuous mute reference duration, and the preset continuous mute reference duration may be set to 0.7s or 1.2s, or may be obtained by any other user through actual detection. For example, the identifier module 20 analyzes the received response audio data packet sent by the response device, and obtains the longest response voice packet from the received response audio data packet. The continuous mute duration, if the longest continuous mute duration is 0.6s, the longest continuous mute duration is 0.6s less than the preset continuous mute reference duration of 0.7s, and the recognition module 20 determines the longest continuous mute. The response audio data packet with a duration of 0.6 s satisfies the preset automatic response parameter, that is, the identification module 20 determines that the voice response type corresponding to the longest continuous silent silence duration of 0.6 s is an automatic response, if the longest acquisition is obtained. The continuous mute duration is 1.6s, and the obtained continuous mute duration is 1.6s longer than the continuous mute reference duration of 0.7s, and the identification module 20 determines the longest continuous mute duration. The response audio data packet for 1.6s does not satisfy the preset automatic response parameter, that is, the identification module 20 determines that the longest continuous silent duration is The response voice type of the 1.6s response audio packet is a manual response. In other embodiments of the present invention, the preset automatic response parameter may also be a preset continuous mute reference duration within a preset time period.
通过识别模块 20分析所述应答音频数据包, 从所述接收的应答音频数 据包中获取最长的连续静音时长, 在获取的连续静音时长小于预设的连续 静音参考时长时, 识别模块 20确定所述应答音频数据包满足预设的自动应 答参数, 并确定所述应答音频数据包对应的语音应答类型为自动应答。 实 现将被叫应答中的自动应答识别出来, 以使呼叫系统中控制设备根据识别 结果来控制呼叫系统的业务流程。  The recognition audio data packet is analyzed by the identification module 20, and the longest continuous silent duration is obtained from the received response audio data packet. When the acquired continuous silent duration is less than the preset continuous silent reference duration, the identification module 20 determines The response audio data packet satisfies a preset automatic response parameter, and determines that the voice response type corresponding to the response audio data packet is an automatic response. The automatic response in the called response is recognized so that the control device in the calling system controls the business process of the calling system according to the recognition result.
优选地, 所述识别模块 20, 还配置为从应答音频数据包中获取音频的 连续语音时长; 及  Preferably, the identification module 20 is further configured to acquire a continuous voice duration of audio from the response audio data packet; and
在获取的连续语音时长大于预设的连续语音参考时长时, 确定所述应 答音频数据包满足预设的自动应答参数。  When the acquired continuous voice duration is greater than the preset continuous voice reference duration, it is determined that the response audio packet satisfies the preset automatic response parameter.
具体的, 所述预设的自动应答参数为预设的连续语音参考时长, 所述 预设的连续语音参考时长可以设置为 3.0s或 4.0s , 也还可以是其他任意用 户通过实际检测得出的时间间隔值; 以预设的连续语音参考时长为 3.0s为 例, 识别模块 20分析接收的所述呼叫系统中应答设备发送过来的应答音频 数据包, 从所述接收的应答音频数据包中获取最长的连续语音时长, 若获 取的最长的连续语音时长为 3.6s , 获取的连续语音时长为 3.6s大于预设的 连续语音参考时长 3.0s , 识别模块 20确定所述最长连续语音时长为 3.6s的 应答音频数据包满足预设的自动应答参数, 即识别模块 20确定最长连续语 音时长为 3.6s的应答音频数据包对应的语音应答类型为自动应答; 若获取 的最长的连续语音时长为 1.6s , 则获取的连续语音时长为 1.6s小于连续语 音参考时长 3.0s , 识别模块 20确定所述最长连续语音时长为 1.6s的应答音 频数据包不满足预设的自动应答参数, 即识别模块 20确定最长连续语音时 长为 1.6S的应答音频数据包对应的语音应答类型为人工应答。 在本发明其 他实施例中, 所述预设的自动应答参数还可以是在预设时间段内的预设的 连续语音参考时长。 Specifically, the preset automatic response parameter is a preset continuous voice reference duration, and the preset continuous voice reference duration may be set to 3.0s or 4.0s, or may be obtained by any other user through actual detection. For example, the identification module 20 analyzes the received response audio data packet sent by the answering device in the call system, and receives the response audio data packet from the received response audio data packet. Obtaining the longest continuous speech duration, if the longest continuous speech duration is 3.6 s, the acquired continuous speech duration is 3.6 s greater than the preset continuous speech reference duration of 3.0 s, and the recognition module 20 determines the longest continuous speech duration. The response audio data packet with a duration of 3.6 s satisfies the preset automatic response parameter, that is, the recognition module 20 determines that the voice response type corresponding to the longest continuous voice duration of 3.6 s is an automatic response; if the longest acquisition is obtained The continuous speech duration is 1.6s, and the obtained continuous speech duration is 1.6s less than the continuous speech reference duration of 3.0s, and the recognition module 20 determines the longest continuous When sound when the audio data packet length of the response does not satisfy the preset 1.6s automatic answering parameters, i.e. the identification module 20 determines the longest continuous speech The voice response type corresponding to the 1.6S response audio packet is a manual response. In other embodiments of the present invention, the preset automatic response parameter may also be a preset continuous voice reference duration within a preset time period.
通过识别模块 20分析所述应答音频数据包, 从所述接收的应答音频数 据包中获取最长的连续语音时长, 在获取的连续语音时长大于预设的连续 语音参考时长时, 识别模块 20确定所述应答音频数据包满足预设的自动应 答参数, 并确定所述应答音频数据包对应的语音应答类型为自动应答。 实 现将被叫应答中的自动应答识别出来, 以使呼叫系统中控制设备根据识别 结果来控制呼叫系统的业务流程。  The recognition audio data packet is analyzed by the identification module 20, and the longest continuous speech duration is obtained from the received response audio data packet. When the acquired continuous speech duration is greater than the preset continuous speech reference duration, the identification module 20 determines The response audio data packet satisfies a preset automatic response parameter, and determines that the voice response type corresponding to the response audio data packet is an automatic response. The automatic response in the called response is recognized so that the control device in the calling system controls the business process of the calling system according to the recognition result.
以上所述仅为本发明的优选实施例, 并非因此限制本发明的专利范围, 凡是利用本发明说明书及附图内容所作的等效结构或等效流程变换, 或直 接或间接运用在其他相关的技术领域, 均同理包括在本发明的专利保护范 围内。  The above description is only the preferred embodiment of the present invention, and is not intended to limit the scope of the invention, and the equivalent structure or equivalent flow transformation made by the specification and the drawings of the present invention may be directly or indirectly applied to other related The technical field is equally included in the scope of patent protection of the present invention.

Claims

权利要求书 claims
1、 一种呼叫系统自动应答的识别方法, 该方法包括: 1. An identification method for automatic answering of a call system. The method includes:
当接收到呼叫系统中应答设备发送来的应答识别请求时, 识别服务器 建立与所述应答设备的音频通信信道; When receiving a response identification request sent from the answering device in the calling system, the identification server establishes an audio communication channel with the answering device;
基于建立的音频通信信道, 识别服务器接收所述应答设备发送过来的 应答音频数据包; Based on the established audio communication channel, the identification server receives the response audio data packet sent by the response device;
识别服务器分析所述应答音频数据包 , 以确定所述应答音频数据包是 否满足预设的自动应答参数, 在确定所述应答音频数据包满足预设的自动 应答参数时, 识别服务器确定所述应答音频数据包对应的语音应答类型为 自动应答。 The recognition server analyzes the response audio data packet to determine whether the response audio data packet satisfies the preset automatic response parameters. When it is determined that the response audio data packet satisfies the preset automatic response parameters, the recognition server determines the response. The voice response type corresponding to the audio data packet is automatic response.
2、 根据权利要求 1所述的呼叫系统自动应答的识别方法, 其中, 在所 述在确定所述应答音频数据包满足预设的自动应答参数时, 识别服务器确 定所述应答音频数据包对应的语音应答类型为自动应答的步骤之后, 该方 法还包括: 2. The method for identifying automatic responses of a calling system according to claim 1, wherein when it is determined that the response audio data packet satisfies the preset automatic response parameters, the recognition server determines that the response audio data packet corresponds to After the step of setting the voice response type to automatic response, the method also includes:
识别服务器将语音应答类型为自动应答的识别结果发送给呼叫系统中 控制设备, 以使所述控制设备根据所述识别结果来控制呼叫系统的业务流 程。 The recognition server sends the recognition result that the voice response type is automatic response to the control device in the calling system, so that the control device controls the business process of the calling system based on the recognition result.
3、 根据权利要求 1所述的呼叫系统自动应答的识别方法, 其中, 在所 述在确定所述应答音频数据包满足预设的自动应答参数时, 识别服务器确 定所述应答音频数据包对应的语音应答类型为自动应答的步骤之后, 该方 法还包括: 3. The method for identifying automatic responses of a calling system according to claim 1, wherein when it is determined that the response audio data packet satisfies the preset automatic response parameters, the recognition server determines that the response audio data packet corresponds to After the step of setting the voice response type to automatic response, the method also includes:
识别服务器关闭所述音频通信信道, 并对保存的数据进行删除检测 , 将满足预设的删除条件的数据删除。 The recognition server closes the audio communication channel, performs deletion detection on the saved data, and deletes data that meets the preset deletion conditions.
4、 根据权利要求 1或 2所述的呼叫系统自动应答的识别方法, 其中, 所述预设的自动应答参数为预设的连续静音参考时长, 所述识别服务器分 析所述应答音频数据包, 以确定所述应答音频数据包是否满足自动应答参 数的步骤包括: 4. The method for identifying automatic responses of a calling system according to claim 1 or 2, wherein the preset automatic response parameter is a preset continuous mute reference duration, and the identification server divides The step of analyzing the response audio data packet to determine whether the response audio data packet meets the automatic response parameters includes:
识别服务器从应答音频数据包中获取音频的连续静音时长; The identification server obtains the continuous audio silence duration from the response audio data packet;
在获取的连续静音时长小于预设的连续静音参考时长时, 识别服务器 确定所述应答音频数据包满足预设的自动应答参数。 When the obtained continuous silence duration is less than the preset continuous silence reference duration, the recognition server determines that the response audio data packet meets the preset automatic response parameters.
5、 根据权利要求 1或 2所述的呼叫系统自动应答的识别方法, 其中, 所述预设的自动应答参数为预设的连续语音参考时长时, 所述识别服务器 分析所述应答音频数据包 , 以确定所述应答音频数据包是否满足自动应答 参数的步骤包括: 5. The method for identifying automatic responses of a calling system according to claim 1 or 2, wherein when the preset automatic response parameter is a preset continuous voice reference duration, the identification server analyzes the response audio data packet. , the steps to determine whether the response audio data packet meets the automatic response parameters include:
识别服务器从应答音频数据包中获取音频的连续语音时长; The recognition server obtains the continuous voice duration of the audio from the response audio data packet;
在获取的连续语音时长大于预设的连续语音参考时长时, 识别服务器 确定所述应答音频数据包满足预设的自动应答参数。 When the obtained continuous voice duration is greater than the preset continuous voice reference duration, the recognition server determines that the response audio data packet meets the preset automatic response parameters.
6、 一种呼叫系统自动应答的识别的装置, 该装置包括: 6. A device for identifying the automatic response of a call system. The device includes:
数据接发模块, 配置为当接收到呼叫系统中应答设备发送来的应答识 别请求时, 建立与所述应答设备的音频通信信道; 及 The data receiving and receiving module is configured to establish an audio communication channel with the answering device when receiving a response identification request sent by the answering device in the calling system; and
基于建立的音频通信信道, 接收所述应答设备发送过来的应答音频数 据包; Based on the established audio communication channel, receive the response audio data packet sent by the response device;
识别模块, 配置为分析所述应答音频数据包, 以确定所述应答音频数 据包是否满足预设的自动应答参数, 在确定所述应答音频数据包满足预设 的自动应答参数时, 确定所述应答音频数据包对应的语音应答类型为自动 应答。 The identification module is configured to analyze the response audio data packet to determine whether the response audio data packet satisfies the preset automatic response parameters; when it is determined that the response audio data packet satisfies the preset automatic response parameters, determine the The voice response type corresponding to the response audio data packet is automatic response.
7、 根据权利要求 6所述的呼叫系统自动应答的识别装置, 其中, 所述数据接发模块, 还配置为将语音应答类型为自动应答的识别结果 发送给呼叫系统中控制设备, 以使所述控制设备根据所述识别结果来控制 呼叫系统的业务流程。 7. The device for identifying automatic responses in a calling system according to claim 6, wherein the data receiving and receiving module is further configured to send the recognition result that the voice response type is automatic response to the control device in the calling system, so that all The control device controls the business process of the calling system according to the identification result.
8、 根据权利要求 6所述的呼叫系统自动应答的识别装置, 其中, 该装 置还包括处理模块, 8. The identification device for automatic answering of a call system according to claim 6, wherein the device further includes a processing module,
所述处理模块, 配置为关闭所述音频通信信道, 并对保存的数据进行 删除检测, 并将满足预设的删除条件的数据删除。 The processing module is configured to close the audio communication channel, perform deletion detection on the saved data, and delete data that meets preset deletion conditions.
9、 根据权利要求 6或 7所述的呼叫系统自动应答的识别装置, 其中, 所述预设的自动应答参数为预设的连续静音参考时长时, 9. The identification device for automatic answering of a calling system according to claim 6 or 7, wherein the preset automatic answering parameter is a preset continuous mute reference duration,
所述识别模块 , 还配置为从应答音频数据包中获取音频的连续静音时 长; 及 The identification module is also configured to obtain the continuous mute duration of the audio from the response audio data packet; and
在获取的连续静音时长小于预设的连续静音参考时长时, 确定所述应 答音频数据包满足预设的自动应答参数。 When the obtained continuous silence duration is less than the preset continuous silence reference duration, it is determined that the response audio data packet meets the preset automatic response parameters.
10、 根据权利要求 6或 7所述的呼叫系统自动应答的识别装置, 其中, 所述预设的自动应答参数为预设的连续语音参考时长时, 10. The identification device for automatic answering of a calling system according to claim 6 or 7, wherein when the preset automatic answering parameter is a preset continuous voice reference duration,
所述识别模块 , 还配置为从应答音频数据包中获取音频的连续语音时 长; 及 The identification module is also configured to obtain the continuous voice duration of the audio from the response audio data packet; and
在获取的连续语音时长大于预设的连续语音参考时长时, 确定所述应 答音频数据包满足预设的自动应答参数。 When the obtained continuous voice duration is greater than the preset continuous voice reference duration, it is determined that the response audio data packet meets the preset automatic response parameters.
PCT/CN2014/077731 2013-08-30 2014-05-16 Method and device for identifying automatic response of calling system WO2014173342A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201310390449.9A CN104427076A (en) 2013-08-30 2013-08-30 Recognition method and recognition device for automatic answering of calling system
CN201310390449.9 2013-08-30

Publications (1)

Publication Number Publication Date
WO2014173342A1 true WO2014173342A1 (en) 2014-10-30

Family

ID=51791072

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2014/077731 WO2014173342A1 (en) 2013-08-30 2014-05-16 Method and device for identifying automatic response of calling system

Country Status (2)

Country Link
CN (1) CN104427076A (en)
WO (1) WO2014173342A1 (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107068154A (en) * 2017-03-13 2017-08-18 平安科技(深圳)有限公司 The method and system of authentication based on Application on Voiceprint Recognition

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0688486A1 (en) * 1993-01-14 1995-12-27 Mci Communications Corporation Telephone network performance monitoring method and system
CN1832515A (en) * 2005-03-08 2006-09-13 华为技术有限公司 Authorization system of automatic answering equipment and its authorization method
CN102082806A (en) * 2009-11-26 2011-06-01 上海拜翰网络科技有限公司 Method and system for pushing website navigation service through calling

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0688486A1 (en) * 1993-01-14 1995-12-27 Mci Communications Corporation Telephone network performance monitoring method and system
CN1832515A (en) * 2005-03-08 2006-09-13 华为技术有限公司 Authorization system of automatic answering equipment and its authorization method
CN102082806A (en) * 2009-11-26 2011-06-01 上海拜翰网络科技有限公司 Method and system for pushing website navigation service through calling

Also Published As

Publication number Publication date
CN104427076A (en) 2015-03-18

Similar Documents

Publication Publication Date Title
US8970662B2 (en) Output management for electronic communications
US20070263604A1 (en) Ring back notification system and method therefor
US8868136B2 (en) Handling a voice communication request
CN101909192B (en) Television terminal and communication method thereof
CN112953925B (en) Real-time audio and video communication system and method based on SIP (Session initiation protocol) and RTC (real time communication) network
EP2523474A1 (en) Method and equipment for realizing concurrency of voice and data
CN112887194B (en) Interactive method, device, terminal and storage medium for realizing communication of hearing-impaired people
US10313502B2 (en) Automatically delaying playback of a message
TWI378684B (en) Communication method and system of internet
WO2010102469A1 (en) Mobile terminal and method for performing call during cell phone television service
CN104618593B (en) A kind of operating method when mobile terminal and its third party's incoming call
WO2017166642A1 (en) Mobile terminal redial method, device, and mobile terminal
WO2014173342A1 (en) Method and device for identifying automatic response of calling system
CN112839192A (en) Audio and video communication system and method based on browser
WO2010000120A1 (en) Voice-video mailbox device and implementation method
CN108419124B (en) Audio processing method
TWI232431B (en) Method of speech transformation
US20160036864A1 (en) Providing external application services with an existing private branch exchange media server
WO2014190816A1 (en) Method and residential gateway for realizing voice message function
US10818295B1 (en) Maintaining network connections
US9398150B2 (en) Method of setting detection parameters in an apparatus for on hold music detection
CN103595885B (en) Domestic affection phone system
CN113612759A (en) High-performance high-concurrency intelligent broadcasting system based on SIP protocol and implementation method
WO2023088371A1 (en) Call method and system, electronic device and computer-readable storage medium
KR102014817B1 (en) System for providing communication services in device things using id

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 14788353

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 14788353

Country of ref document: EP

Kind code of ref document: A1