CN113518153B - Method and device for identifying call response state of user and electronic equipment - Google Patents

Method and device for identifying call response state of user and electronic equipment Download PDF

Info

Publication number
CN113518153B
CN113518153B CN202110447521.1A CN202110447521A CN113518153B CN 113518153 B CN113518153 B CN 113518153B CN 202110447521 A CN202110447521 A CN 202110447521A CN 113518153 B CN113518153 B CN 113518153B
Authority
CN
China
Prior art keywords
call
state
tasks
task
user
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202110447521.1A
Other languages
Chinese (zh)
Other versions
CN113518153A (en
Inventor
郑渊中
张彤彤
钟雨崎
朱小波
孔醍
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shanghai Qiyue Information Technology Co Ltd
Original Assignee
Shanghai Qiyue Information Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shanghai Qiyue Information Technology Co Ltd filed Critical Shanghai Qiyue Information Technology Co Ltd
Priority to CN202110447521.1A priority Critical patent/CN113518153B/en
Publication of CN113518153A publication Critical patent/CN113518153A/en
Application granted granted Critical
Publication of CN113518153B publication Critical patent/CN113518153B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M3/00Automatic or semi-automatic exchanges
    • H04M3/42Systems providing special services or facilities to subscribers
    • H04M3/436Arrangements for screening incoming calls, i.e. evaluating the characteristics of a call before deciding whether to answer it
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/27Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the analysis technique
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/48Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
    • G10L25/51Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M3/00Automatic or semi-automatic exchanges
    • H04M3/42Systems providing special services or facilities to subscribers
    • H04M3/487Arrangements for providing information services, e.g. recorded voice services or time announcements
    • H04M3/493Interactive information services, e.g. directory enquiries ; Arrangements therefor, e.g. interactive voice response [IVR] systems or voice portals
    • H04M3/4936Speech interaction details
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M3/00Automatic or semi-automatic exchanges
    • H04M3/42Systems providing special services or facilities to subscribers
    • H04M3/50Centralised arrangements for answering calls; Centralised arrangements for recording messages for absent or busy subscribers ; Centralised arrangements for recording messages
    • H04M3/527Centralised call answering arrangements not requiring operator intervention

Abstract

The embodiment of the specification provides a method for identifying a call response state of a user, which comprises the steps of constructing a call state identification model with a plurality of tasks, wherein common fragments are arranged among the plurality of tasks, different tasks are aimed at different call response states, initiating a call to the user, acquiring sound information, calling the call state identification model to identify the call response state of the user according to the sound information, processing the sound information by the common fragments of the plurality of tasks, transmitting processing results of the common fragments to special fragments of each task for identification, and judging the call response state of the user according to identification results of the plurality of tasks. By constructing a call state identification model with a plurality of tasks, the same part of calculation processes in calculation processes of different states are prevented from being repeatedly calculated due to the fact that a common segment is arranged among the plurality of tasks, and therefore the speed of identifying response states of users during call is improved.

Description

Method and device for identifying call response state of user and electronic equipment
Technical Field
The present invention relates to the field of computers, and in particular, to a method, an apparatus, and an electronic device for identifying a call response status of a user.
Background
When the artificial intelligent robot customer service performs conversation, the response speed of the robot is extremely high, and in order to avoid mechanical conversation, real-time identification of the response state of a user to the event of dialing is often required, so that judgment of the robot can be facilitated, and conversation can be smoothly performed. Currently, the industry recognizes response states, one model is used to determine whether a call is dialed, and the second model is used to determine whether a call is in person. Since the robot needs to wait for the calculation of the response state of the two models to be completed at one time, the efficiency of the method needs to be improved, and a new method is needed to be provided to improve the recognition speed of the response state of the user during the call.
Disclosure of Invention
The embodiment of the specification provides a method, a device and electronic equipment for identifying a user call response state, which are used for improving the speed of identifying the user response state during call.
The embodiment of the specification provides a method for identifying a call response state of a user, which comprises the following steps:
constructing a call state identification model with a plurality of tasks, wherein a common segment is arranged among the plurality of tasks, and different tasks are aimed at different call response states;
initiating a call to a user and acquiring sound information, and calling the call state identification model to identify a call response state of the user according to the sound information, wherein the call response state comprises the following steps:
and the common segment of the plurality of tasks processes the sound information, transmits the processing result of the common segment to the special segment of each task for identification, and judges the call response state of the user according to the identification result of the plurality of tasks.
Optionally, the delivering the processing result of the common segment to the proprietary segment of each task for identification includes:
each proprietary fragment calculates the probability of the corresponding target of each task according to the processing result of the shared fragment;
the judging the call response state of the user according to the identification results of the tasks comprises the following steps:
and judging the call response state of the user according to the calculated probability of the task and the target of each task.
Optionally, the building a call state identification model with a plurality of tasks further includes:
and compressing and deploying the constructed call state recognition model.
Optionally, the targets of the plurality of tasks are one of a non-connected, a noise call, and a voice call, respectively.
Optionally, the building a call state identification model with a plurality of tasks includes:
acquiring call response information samples, and performing label setting on the call response information samples according to call response states of the samples;
and constructing a call state identification model with a multitasking structure, and training the call state identification model by using the call response information sample provided with the label.
Optionally, the method further comprises:
and selecting the pushing time of the dialogue content according to the real-time identified dialogue response state.
Optionally, the method further comprises:
and screening the target user according to the call response state to perform call redial.
Optionally, the method further comprises:
predicting the probability of refusing the call initiating number by the target user, and eliminating the predicted refused user from the target user.
Optionally, the method further comprises:
and carrying out guest group division and priority sorting on the users according to the identified call response state, and releasing service resources according to the priority order.
The embodiment of the specification also provides a device for identifying the call response state of the user, which comprises:
the multi-task model module is used for constructing a call state identification model with a plurality of tasks, wherein a shared segment is arranged among the tasks, and different tasks are targeted to different call response states;
the identification module initiates a call to a user and acquires sound information, invokes the call state identification model to identify a call response state of the user according to the sound information, and comprises the following steps:
and the common segment of the plurality of tasks processes the sound information, transmits the processing result of the common segment to the special segment of each task for identification, and judges the call response state of the user according to the identification result of the plurality of tasks.
Optionally, the delivering the processing result of the common segment to the proprietary segment of each task for identification includes:
each proprietary fragment calculates the probability of the corresponding target of each task according to the processing result of the shared fragment;
the judging the call response state of the user according to the identification results of the tasks comprises the following steps:
and judging the call response state of the user according to the calculated probability of the task and the target of each task.
Optionally, the building a call state identification model with a plurality of tasks further includes:
and compressing and deploying the constructed call state recognition model.
Optionally, the targets of the plurality of tasks are one of a non-connected, a noise call, and a voice call, respectively.
Optionally, the building a call state identification model with a plurality of tasks includes:
acquiring call response information samples, and performing label setting on the call response information samples according to call response states of the samples;
and constructing a call state identification model with a multitasking structure, and training the call state identification model by using the call response information sample provided with the label.
Optionally, the method further comprises:
and selecting the pushing time of the dialogue content according to the real-time identified dialogue response state.
Optionally, the method further comprises:
and screening the target user according to the call response state to perform call redial.
Optionally, the method further comprises:
predicting the probability of refusing the call initiating number by the target user, and eliminating the predicted refused user from the target user.
Optionally, the method further comprises:
and carrying out guest group division and priority sorting on the users according to the identified call response state, and releasing service resources according to the priority order.
The embodiment of the specification also provides an electronic device, wherein the electronic device comprises:
a processor; the method comprises the steps of,
a memory storing computer executable instructions that, when executed, cause the processor to perform any of the methods described above.
The present description also provides a computer-readable storage medium storing one or more programs that, when executed by a processor, implement any of the methods described above.
According to the various technical schemes provided by the embodiment of the specification, a conversation state identification model with a plurality of tasks is constructed, common fragments are arranged among the tasks, different tasks are aimed at different conversation response states, conversation is initiated to a user, sound information is acquired, the conversation state identification model is called to identify the conversation response state of the user according to the sound information, the common fragments of the tasks process the sound information, processing results of the common fragments are transmitted to proprietary fragments of the tasks to identify, and the conversation response state of the user is judged according to the identification results of the tasks. By constructing a call state identification model with a plurality of tasks, the same part of calculation processes in calculation processes of different states are prevented from being repeatedly calculated due to the fact that a common segment is arranged among the plurality of tasks, and therefore the speed of identifying response states of users during call is improved.
Drawings
The accompanying drawings, which are included to provide a further understanding of the application and are incorporated in and constitute a part of this application, illustrate embodiments of the application and together with the description serve to explain the application and do not constitute an undue limitation to the application. In the drawings:
fig. 1 is a schematic diagram of a method for identifying a call response status of a user according to an embodiment of the present disclosure;
fig. 2 is a schematic structural diagram of an apparatus for identifying a call response status of a user according to an embodiment of the present disclosure;
fig. 3 is a schematic structural diagram of an electronic device according to an embodiment of the present disclosure;
fig. 4 is a schematic diagram of a computer readable medium according to an embodiment of the present disclosure.
Detailed Description
Exemplary embodiments of the present invention will now be described more fully with reference to the accompanying drawings. However, the exemplary embodiments can be embodied in many different forms and should not be construed as limited to the embodiments set forth herein. Rather, these exemplary embodiments are provided so that this disclosure will be thorough and complete, and will fully convey the concept of the invention to those skilled in the art. The same reference numerals in the drawings denote the same or similar elements, components or portions, and thus a repetitive description thereof will be omitted.
The features, structures, characteristics or other details described in a particular embodiment do not exclude that may be combined in one or more other embodiments in a suitable manner, without departing from the technical idea of the invention.
In the description of specific embodiments, features, structures, characteristics, or other details described in the present invention are provided to enable one skilled in the art to fully understand the embodiments. However, it is not excluded that one skilled in the art may practice the present invention without one or more of the specific features, structures, characteristics, or other details.
The flow diagrams depicted in the figures are exemplary only, and do not necessarily include all of the elements and operations/steps, nor must they be performed in the order described. For example, some operations/steps may be decomposed, and some operations/steps may be combined or partially combined, so that the order of actual execution may be changed according to actual situations.
The block diagrams depicted in the figures are merely functional entities and do not necessarily correspond to physically separate entities. That is, the functional entities may be implemented in software, or in one or more hardware modules or integrated circuits, or in different networks and/or processor devices and/or microcontroller devices.
The term "and/or" and/or "includes all combinations of any one or more of the associated listed items.
Fig. 1 is a schematic diagram of a method for identifying a call response status of a user according to an embodiment of the present disclosure, where the method may include:
s101: and constructing a call state identification model with a plurality of tasks, wherein the plurality of tasks are provided with shared fragments, and different tasks aim at different call response states.
In order to enable the model to finally obtain a response state from a plurality of possible states, a plurality of tasks can be constructed in the model, and corresponding response state feature matching rules are respectively constructed for each task, so that different tasks can calculate the probability of the response state to be identified by the task, and finally, the response states can be finally judged and obtained by comparing the probabilities calculated by the tasks.
In order to reduce the calculation amount, a plurality of tasks share part of task fragments, and three tasks are calculated in parallel, so that the calculation efficiency is improved.
In the embodiment of the present disclosure, when a specific training model is referred to, the constructing a call state recognition model with multiple tasks may include:
acquiring call response information samples, and performing label setting on the call response information samples according to call response states of the samples;
and constructing a call state identification model with a multitasking structure, and training the call state identification model by using the call response information sample provided with the label.
Wherein the targets of the plurality of tasks are one of a non-connected, a noise call, and a voice call, respectively.
The task configuration matching rule corresponding to each state characteristic matching rule of the tasks can enable the task in the unopened state to be configured with a rule matching the fixed sound, enable the task in the noise call state to be matched with the noise sent by the opposite terminal, and enable the task in the voice call state to be matched with the human voice sent by the opposite terminal.
Of course, when the task of recognizing the voice call state is matched, human voice with noise mixed therein can be matched because the noise can indicate that the counterpart has made a call.
The fixed tone may be a notification tone of a call not being connected, such as a beep, a notification that the other party has been turned off, a music, etc., which are not specifically described and limited herein.
In order to achieve the light weight of the model and increase the calculation speed, in the embodiment of the present disclosure, the constructing a call state recognition model with a plurality of tasks may further include:
and compressing and deploying the constructed call state recognition model.
S102: initiating a call to a user and acquiring voice information, and calling the call state identification model to identify a call response state of the user according to the voice information can comprise:
and the common segment of the plurality of tasks processes the sound information, transmits the processing result of the common segment to the special segment of each task for identification, and judges the call response state of the user according to the identification result of the plurality of tasks.
According to the method, a call state identification model with a plurality of tasks is constructed, common fragments are arranged among the tasks, different tasks are aimed at different call response states, calls are initiated to a user, sound information is acquired, the call state identification model is called to identify the call response state of the user according to the sound information, the common fragments of the tasks process the sound information, processing results of the common fragments are transmitted to special fragments of each task to be identified, and the call response state of the user is judged according to the identification results of the tasks. By constructing a call state identification model with a plurality of tasks, the same part of calculation processes in calculation processes of different states are prevented from being repeatedly calculated due to the fact that a common segment is arranged among the plurality of tasks, and therefore the speed of identifying response states of users during call is improved.
The sound information is sound played by the collected robot terminal.
In the embodiment of the present disclosure, the delivering the processing result of the common segment to the proprietary segment of each task to identify may include:
each proprietary fragment calculates the probability of the corresponding target of each task according to the processing result of the shared fragment;
the distinguishing the call response state of the user according to the recognition results of the plurality of tasks may include:
and judging the call response state of the user according to the calculated probability of the task and the target of each task.
The special segment of the task may include a calculation function for matching the processing result of the common segment, and the matching degree between the special segment and the target is obtained by calculating the probability, and the target corresponding to the task with the probability at the first position is determined to be the finally identified call response state.
In an embodiment of the present disclosure, the method may further include:
and selecting the pushing time of the dialogue content according to the real-time identified dialogue response state.
Specifically, it may be that different users may be identified as not being connected, a noise call and a voice call, for the noise call and the voice call, the former is not voice by the user, so that even if the robot initiates a topic, the robot will not feel objectionable, while for the latter, the robot needs to wait for a moment, so as to avoid the situation of breaking the topic of the user.
In the embodiment of the specification, considering different response states of a user to dialing, the requirement of the user on the robot service can be reflected to a certain degree, if the user is not connected for the first time, it is indicated that at least the user is not disliked, if the user is connected but is not speaking, it is indicated that the user is connected by accident, but is relatively disliked to the robot, the possibility that the person is taken as a target user is relatively small, and therefore, for the situation that the dialing effect represented by the first two states is poor, the user can dial again.
Thus, in embodiments of the present description, the method may further comprise:
and screening the target user according to the call response state to perform call redial.
Thus, because the response state of the users is considered in the redialing, the possibility of providing service hits for the target users is high, and the expected dialing effect is facilitated.
In an embodiment of the present disclosure, the method may further include:
predicting the probability of refusing the call initiating number by the target user, and eliminating the predicted refused user from the target user.
The method can be expressed in an actual application scene, and by identifying and eliminating the users shielding the robot numbers, the users can know the future refusal events in advance and dial no more to the users, so that the resource utilization rate can be improved, and the resource waste is avoided.
In an embodiment of the present disclosure, the method may further include:
and carrying out guest group division and priority sorting on the users according to the identified call response state, and releasing service resources according to the priority order.
Fig. 2 is a schematic structural diagram of an apparatus for identifying a call response status of a user according to an embodiment of the present disclosure, where the apparatus may include:
the multi-task model module 201 constructs a call state identification model with a plurality of tasks, wherein the tasks have shared fragments, and different tasks target different call response states;
the identification module 202 initiates a call to a user and acquires voice information, invokes the call state identification model to identify a call response state of the user according to the voice information, and includes:
and the common segment of the plurality of tasks processes the sound information, transmits the processing result of the common segment to the special segment of each task for identification, and judges the call response state of the user according to the identification result of the plurality of tasks.
In order to enable the model to finally obtain a response state from a plurality of possible states, a plurality of tasks can be constructed in the model, and corresponding response state feature matching rules are respectively constructed for each task, so that different tasks can calculate the probability of the response state to be identified by the task, and finally, the response states can be finally judged and obtained by comparing the probabilities calculated by the tasks.
In order to reduce the calculation amount, a plurality of tasks share part of task fragments, and three tasks are calculated in parallel, so that the calculation efficiency is improved.
In this embodiment of the present disclosure, the delivering the processing result of the common segment to the proprietary segment of each task to identify includes:
each proprietary fragment calculates the probability of the corresponding target of each task according to the processing result of the shared fragment;
the judging the call response state of the user according to the identification results of the tasks comprises the following steps:
and judging the call response state of the user according to the calculated probability of the task and the target of each task.
In an embodiment of the present disclosure, the building a call state recognition model with a plurality of tasks further includes:
and compressing and deploying the constructed call state recognition model.
In the embodiment of the present specification, the targets of the plurality of tasks are one of a non-connection, a noise call, and a voice call, respectively.
In an embodiment of the present disclosure, the building a call state recognition model with a plurality of tasks includes:
acquiring call response information samples, and performing label setting on the call response information samples according to call response states of the samples;
and constructing a call state identification model with a multitasking structure, and training the call state identification model by using the call response information sample provided with the label.
In this embodiment of the present specification, further includes:
and selecting the pushing time of the dialogue content according to the real-time identified dialogue response state.
In this embodiment of the present specification, further includes:
and screening the target user according to the call response state to perform call redial.
In this embodiment of the present specification, further includes:
predicting the probability of refusing the call initiating number by the target user, and eliminating the predicted refused user from the target user.
In this embodiment of the present specification, further includes:
and carrying out guest group division and priority sorting on the users according to the identified call response state, and releasing service resources according to the priority order.
The device constructs a call state identification model with a plurality of tasks, wherein the plurality of tasks are provided with shared fragments, different tasks are aimed at different call response states, calls the call state identification model to identify the call response state of a user according to the voice information, and comprises the steps that the shared fragments of the plurality of tasks process the voice information, transmit the processing results of the shared fragments to the special fragments of each task to identify, and judge the call response state of the user according to the identification results of the plurality of tasks. By constructing a call state identification model with a plurality of tasks, the same part of calculation processes in calculation processes of different states are prevented from being repeatedly calculated due to the fact that a common segment is arranged among the plurality of tasks, and therefore the speed of identifying response states of users during call is improved.
Based on the same inventive concept, the embodiments of the present specification also provide an electronic device.
The following describes an embodiment of an electronic device according to the present invention, which may be regarded as a specific physical implementation of the above-described embodiment of the method and apparatus according to the present invention. Details described in relation to the embodiments of the electronic device of the present invention should be considered as additions to the embodiments of the method or apparatus described above; for details not disclosed in the embodiments of the electronic device of the present invention, reference may be made to the above-described method or apparatus embodiments.
Fig. 3 is a schematic structural diagram of an electronic device according to an embodiment of the present disclosure. An electronic device 300 according to this embodiment of the present invention is described below with reference to fig. 3. The electronic device 300 shown in fig. 3 is merely an example and should not be construed as limiting the functionality and scope of use of embodiments of the present invention.
As shown in fig. 3, the electronic device 300 is embodied in the form of a general purpose computing device. Components of electronic device 300 may include, but are not limited to: at least one processing unit 310, at least one memory unit 320, a bus 330 connecting the different system components (including the memory unit 320 and the processing unit 310), a display unit 340, and the like.
Wherein the storage unit stores program code that is executable by the processing unit 310 such that the processing unit 310 performs the steps according to various exemplary embodiments of the invention described in the above processing method section of the present specification. For example, the processing unit 310 may perform the steps shown in fig. 1.
The memory unit 320 may include readable media in the form of volatile memory units, such as Random Access Memory (RAM) 3201 and/or cache memory 3202, and may further include Read Only Memory (ROM) 3203.
The storage unit 320 may also include a program/utility 3204 having a set (at least one) of program modules 3205, such program modules 3205 including, but not limited to: an operating system, one or more application programs, other program modules, and program data, each or some combination of which may include an implementation of a network environment.
Bus 330 may be one or more of several types of bus structures including a memory unit bus or memory unit controller, a peripheral bus, an accelerated graphics port, a processing unit, or a local bus using any of a variety of bus architectures.
The electronic device 300 may also communicate with one or more external devices 400 (e.g., keyboard, pointing device, bluetooth device, etc.), one or more devices that enable a user to interact with the electronic device 300, and/or any device (e.g., router, modem, etc.) that enables the electronic device 300 to communicate with one or more other computing devices. Such communication may occur through an input/output (I/O) interface 350. Also, electronic device 300 may communicate with one or more networks such as a Local Area Network (LAN), a Wide Area Network (WAN), and/or a public network, such as the Internet, through network adapter 360. The network adapter 360 may communicate with other modules of the electronic device 300 via the bus 330. It should be appreciated that although not shown in fig. 3, other hardware and/or software modules may be used in connection with electronic device 300, including, but not limited to: microcode, device drivers, redundant processing units, external disk drive arrays, raid systems, tape drives, data backup storage systems, and the like.
From the above description of embodiments, those skilled in the art will readily appreciate that the exemplary embodiments described herein may be implemented in software, or may be implemented in software in combination with necessary hardware. Thus, the technical solution according to the embodiments of the present invention may be embodied in the form of a software product, which may be stored in a computer readable storage medium (may be a CD-ROM, a usb disk, a mobile hard disk, etc.) or on a network, and includes several instructions to cause a computing device (may be a personal computer, a server, or a network device, etc.) to perform the above-mentioned method according to the present invention. The computer program, when executed by a data processing device, enables the computer readable medium to carry out the above-described method of the present invention, namely: such as the method shown in fig. 1.
Fig. 4 is a schematic diagram of a computer readable medium according to an embodiment of the present disclosure.
A computer program implementing the method shown in fig. 1 may be stored on one or more computer readable media. The computer readable medium may be a readable signal medium or a readable storage medium. The readable storage medium can be, for example, but is not limited to, an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device, or a combination of any of the foregoing. More specific examples (a non-exhaustive list) of the readable storage medium would include the following: an electrical connection having one or more wires, a portable disk, a hard disk, random Access Memory (RAM), read-only memory (ROM), erasable programmable read-only memory (EPROM or flash memory), optical fiber, portable compact disk read-only memory (CD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the foregoing.
The computer readable storage medium may include a data signal propagated in baseband or as part of a carrier wave, with readable program code embodied therein. Such a propagated data signal may take any of a variety of forms, including, but not limited to, electro-magnetic, optical, or any suitable combination of the foregoing. A readable storage medium may also be any readable medium that can communicate, propagate, or transport a program for use by or in connection with an instruction execution system, apparatus, or device. Program code embodied on a readable storage medium may be transmitted using any appropriate medium, including but not limited to wireless, wireline, optical fiber cable, RF, etc., or any suitable combination of the foregoing.
Program code for carrying out operations of the present invention may be written in any combination of one or more programming languages, including an object oriented programming language such as Java, C++ or the like and conventional procedural programming languages, such as the "C" programming language or similar programming languages. The program code may execute entirely on the user's computing device, partly on the user's device, as a stand-alone software package, partly on the user's computing device, partly on a remote computing device, or entirely on the remote computing device or server. In the case of remote computing devices, the remote computing device may be connected to the user computing device through any kind of network, including a Local Area Network (LAN) or a Wide Area Network (WAN), or may be connected to an external computing device (e.g., connected via the Internet using an Internet service provider).
In summary, the invention may be implemented in hardware, or in software modules running on one or more processors, or in a combination thereof. Those skilled in the art will appreciate that some or all of the functionality of some or all of the components in accordance with embodiments of the present invention may be implemented in practice using a general purpose data processing device such as a microprocessor or Digital Signal Processor (DSP). The present invention can also be implemented as an apparatus or device program (e.g., a computer program and a computer program product) for performing a portion or all of the methods described herein. Such a program embodying the present invention may be stored on a computer readable medium, or may have the form of one or more signals. Such signals may be downloaded from an internet website, provided on a carrier signal, or provided in any other form.
The above-described specific embodiments further describe the objects, technical solutions and advantageous effects of the present invention in detail, and it should be understood that the present invention is not inherently related to any particular computer, virtual device or electronic apparatus, and various general-purpose devices may also implement the present invention. The foregoing description of the embodiments of the invention is not intended to be limiting, but rather is intended to cover all modifications, equivalents, alternatives, and improvements that fall within the spirit and scope of the invention.
In this specification, each embodiment is described in a progressive manner, and identical and similar parts of each embodiment are all referred to each other, and each embodiment mainly describes differences from other embodiments. The foregoing is merely exemplary of the present application and is not intended to limit the present application. Various modifications and changes may be made to the present application by those skilled in the art. Any modifications, equivalent substitutions, improvements, etc. which are within the spirit and principles of the present application are intended to be included within the scope of the claims of the present application.

Claims (10)

1. A method for identifying a user call response status, comprising:
constructing a call state identification model with a plurality of tasks, wherein each task respectively constructs a corresponding response state characteristic matching rule, the plurality of tasks are provided with shared fragments, and different tasks are targeted to different call response states; the call response state is one of a non-connected state, a noise call and a voice call, wherein a task for identifying the non-connected state is configured with a rule for matching fixed tones, the task for identifying the noise call state is matched with the noise sent by the opposite terminal, and the task for identifying the voice call state is matched with human voice sent by the opposite terminal;
initiating a call to a user and acquiring sound information, and calling the call state identification model to identify a call response state of the user according to the sound information, wherein the call response state comprises the following steps:
the common segment of the plurality of tasks processes the sound information, transmits the processing result of the common segment to the special segment of each task for recognition, and judges the call response state of the user according to the recognition results of the plurality of tasks, wherein each special segment calculates the probability of the corresponding target of each task in parallel according to the processing result of the common segment; and judging the call response state of the user according to the calculated probability of the task and the target of each task.
2. The method of claim 1, wherein constructing a call state identification model having a plurality of tasks further comprises:
and compressing and deploying the constructed call state recognition model.
3. The method of claim 1, wherein constructing a call state identification model having a plurality of tasks comprises:
acquiring call response information samples, and performing label setting on the call response information samples according to call response states of the samples;
and constructing a call state identification model with a multitasking structure, and training the call state identification model by using the call response information sample provided with the label.
4. The method as recited in claim 1, further comprising:
and selecting the pushing time of the dialogue content according to the real-time identified dialogue response state.
5. The method as recited in claim 1, further comprising:
and screening the target user according to the call response state to perform call redial.
6. The method as recited in claim 5, further comprising:
predicting the probability of refusing the call initiating number by the target user, and eliminating the predicted refused user from the target user.
7. The method as recited in claim 1, further comprising:
and carrying out guest group division and priority sorting on the users according to the identified call response state, and releasing service resources according to the priority order.
8. An apparatus for identifying a user call response status, comprising:
the system comprises a multi-task model module, a call state recognition module and a call state recognition module, wherein the multi-task model module is used for constructing a call state recognition model with a plurality of tasks, each task is used for constructing a corresponding response state characteristic matching rule respectively, a common segment is arranged among the plurality of tasks, and targets of different tasks are different call response states; the call response state is one of a non-connected state, a noise call and a voice call, wherein a task for identifying the non-connected state is configured with a rule for matching fixed tones, the task for identifying the noise call state is matched with the noise sent by the opposite terminal, and the task for identifying the voice call state is matched with human voice sent by the opposite terminal;
the identification module initiates a call to a user and acquires sound information, invokes the call state identification model to identify a call response state of the user according to the sound information, and comprises the following steps:
the common segment of the plurality of tasks processes the sound information, transmits the processing result of the common segment to the special segment of each task for recognition, and judges the call response state of the user according to the recognition results of the plurality of tasks, wherein each special segment calculates the probability of the corresponding target of each task in parallel according to the processing result of the common segment; and judging the call response state of the user according to the calculated probability of the task and the target of each task.
9. An electronic device, wherein the electronic device comprises:
a processor; and a memory storing computer executable instructions that, when executed, cause the processor to perform the method of any of claims 1-7.
10. A computer readable storage medium, wherein the computer readable storage medium stores one or more programs which, when executed by a processor, implement the method of any of claims 1-7.
CN202110447521.1A 2021-04-25 2021-04-25 Method and device for identifying call response state of user and electronic equipment Active CN113518153B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110447521.1A CN113518153B (en) 2021-04-25 2021-04-25 Method and device for identifying call response state of user and electronic equipment

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110447521.1A CN113518153B (en) 2021-04-25 2021-04-25 Method and device for identifying call response state of user and electronic equipment

Publications (2)

Publication Number Publication Date
CN113518153A CN113518153A (en) 2021-10-19
CN113518153B true CN113518153B (en) 2023-07-04

Family

ID=78061438

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110447521.1A Active CN113518153B (en) 2021-04-25 2021-04-25 Method and device for identifying call response state of user and electronic equipment

Country Status (1)

Country Link
CN (1) CN113518153B (en)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH07162553A (en) * 1993-12-07 1995-06-23 Brother Ind Ltd Call signal identification device, facsimile equipment and telephone set
JP2004198831A (en) * 2002-12-19 2004-07-15 Sony Corp Method, program, and recording medium for speech recognition
JP2006171185A (en) * 2004-12-14 2006-06-29 Asahi Kasei Corp Speech recognition device and method
JP2015055653A (en) * 2013-09-10 2015-03-23 セイコーエプソン株式会社 Speech recognition device and method and electronic apparatus

Family Cites Families (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7653547B2 (en) * 2005-03-31 2010-01-26 Microsoft Corporation Method for testing a speech server
US9026444B2 (en) * 2009-09-16 2015-05-05 At&T Intellectual Property I, L.P. System and method for personalization of acoustic models for automatic speech recognition
CN109961776A (en) * 2017-12-18 2019-07-02 上海智臻智能网络科技股份有限公司 Speech information processing apparatus
CN109327621A (en) * 2018-11-21 2019-02-12 Oppo广东移动通信有限公司 Method, apparatus, system, equipment and the storage medium of call reminding
CN111212190B (en) * 2019-12-20 2022-05-17 北京淇瑀信息科技有限公司 Conversation management method, device and system based on conversation strategy management
CN111491058A (en) * 2020-03-31 2020-08-04 维沃移动通信有限公司 Method for controlling operation mode, electronic device, and storage medium
CN111916071A (en) * 2020-06-16 2020-11-10 深圳追一科技有限公司 Call calling method, device, computer equipment and storage medium
CN112017645B (en) * 2020-08-31 2024-04-26 广州市百果园信息技术有限公司 Voice recognition method and device
CN112202974B (en) * 2020-12-03 2021-04-02 安徽信晨通信技术有限公司 Method, device and system for automatically judging telephone answering state

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH07162553A (en) * 1993-12-07 1995-06-23 Brother Ind Ltd Call signal identification device, facsimile equipment and telephone set
JP2004198831A (en) * 2002-12-19 2004-07-15 Sony Corp Method, program, and recording medium for speech recognition
JP2006171185A (en) * 2004-12-14 2006-06-29 Asahi Kasei Corp Speech recognition device and method
JP2015055653A (en) * 2013-09-10 2015-03-23 セイコーエプソン株式会社 Speech recognition device and method and electronic apparatus

Also Published As

Publication number Publication date
CN113518153A (en) 2021-10-19

Similar Documents

Publication Publication Date Title
US9043208B2 (en) System, method and program product for providing automatic speech recognition (ASR) in a shared resource environment
CN1722230A (en) Allocation of speech recognition tasks and combination of results thereof
CN111737022A (en) Interface calling method, system, equipment and medium based on micro-service
CN112528179A (en) Two-dimensional code processing method and system, electronic device and storage medium
CN113518153B (en) Method and device for identifying call response state of user and electronic equipment
US10623569B2 (en) Document detection and analysis-based routing
US10984229B2 (en) Interactive sign language response system and method
CN110970017B (en) Man-machine interaction method and system and computer system
CN109040490B (en) Incoming call distribution method, system, device and storage medium
CN110740212A (en) Call answering method and device based on intelligent voice technology and electronic equipment
AU2017353879B2 (en) System and method for parameterization of speech recognition grammar specification (SRGS) grammars
CN112133306B (en) Response method and device based on express delivery user and computer equipment
CN113141613B (en) Communication channel detection method and device and electronic equipment
US11405506B2 (en) Prompt feature to leave voicemail for appropriate attribute-based call back to customers
CN111194026B (en) Information sending method and device and electronic equipment
CN111813407B (en) Game development method, game running device and electronic equipment
CN110365664B (en) Mobile phone number registration method and device based on intelligent voice outbound call and electronic equipment
CN114257641A (en) Gesture-based call center agent state change control
CN106131310A (en) A kind of method that mobile terminal processes information with Telephone set
CN109788128A (en) A kind of income prompting method, incoming call prompting device and terminal device
CN112929675B (en) Image resource compression method and device and electronic equipment
CN112309399B (en) Method and device for executing task based on voice and electronic equipment
US11019178B1 (en) Deployment router based on channel traffic
CN114286343B (en) Multi-way outbound system, risk identification method, equipment, medium and product
CN114221940B (en) Audio data processing method, system, device, equipment and storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant