CN113518153B

CN113518153B - Method and device for identifying call response state of user and electronic equipment

Info

Publication number: CN113518153B
Application number: CN202110447521.1A
Authority: CN
Inventors: 郑渊中; 张彤彤; 钟雨崎; 朱小波; 孔醍
Original assignee: Shanghai Qiyue Information Technology Co Ltd
Current assignee: Shanghai Qiyue Information Technology Co Ltd
Priority date: 2021-04-25
Filing date: 2021-04-25
Publication date: 2023-07-04
Anticipated expiration: 2041-04-25
Also published as: CN113518153A

Abstract

The embodiment of the specification provides a method for identifying a call response state of a user, which comprises the steps of constructing a call state identification model with a plurality of tasks, wherein common fragments are arranged among the plurality of tasks, different tasks are aimed at different call response states, initiating a call to the user, acquiring sound information, calling the call state identification model to identify the call response state of the user according to the sound information, processing the sound information by the common fragments of the plurality of tasks, transmitting processing results of the common fragments to special fragments of each task for identification, and judging the call response state of the user according to identification results of the plurality of tasks. By constructing a call state identification model with a plurality of tasks, the same part of calculation processes in calculation processes of different states are prevented from being repeatedly calculated due to the fact that a common segment is arranged among the plurality of tasks, and therefore the speed of identifying response states of users during call is improved.

Description

Method and device for identifying call response state of user and electronic equipment

Technical Field

The present invention relates to the field of computers, and in particular, to a method, an apparatus, and an electronic device for identifying a call response status of a user.

Background

When the artificial intelligent robot customer service performs conversation, the response speed of the robot is extremely high, and in order to avoid mechanical conversation, real-time identification of the response state of a user to the event of dialing is often required, so that judgment of the robot can be facilitated, and conversation can be smoothly performed. Currently, the industry recognizes response states, one model is used to determine whether a call is dialed, and the second model is used to determine whether a call is in person. Since the robot needs to wait for the calculation of the response state of the two models to be completed at one time, the efficiency of the method needs to be improved, and a new method is needed to be provided to improve the recognition speed of the response state of the user during the call.

Disclosure of Invention

The embodiment of the specification provides a method, a device and electronic equipment for identifying a user call response state, which are used for improving the speed of identifying the user response state during call.

The embodiment of the specification provides a method for identifying a call response state of a user, which comprises the following steps:

constructing a call state identification model with a plurality of tasks, wherein a common segment is arranged among the plurality of tasks, and different tasks are aimed at different call response states;

initiating a call to a user and acquiring sound information, and calling the call state identification model to identify a call response state of the user according to the sound information, wherein the call response state comprises the following steps:

and the common segment of the plurality of tasks processes the sound information, transmits the processing result of the common segment to the special segment of each task for identification, and judges the call response state of the user according to the identification result of the plurality of tasks.

Optionally, the delivering the processing result of the common segment to the proprietary segment of each task for identification includes:

each proprietary fragment calculates the probability of the corresponding target of each task according to the processing result of the shared fragment;

the judging the call response state of the user according to the identification results of the tasks comprises the following steps:

and judging the call response state of the user according to the calculated probability of the task and the target of each task.

Optionally, the building a call state identification model with a plurality of tasks further includes:

and compressing and deploying the constructed call state recognition model.

Optionally, the targets of the plurality of tasks are one of a non-connected, a noise call, and a voice call, respectively.

Optionally, the building a call state identification model with a plurality of tasks includes:

acquiring call response information samples, and performing label setting on the call response information samples according to call response states of the samples;

and constructing a call state identification model with a multitasking structure, and training the call state identification model by using the call response information sample provided with the label.

Optionally, the method further comprises:

and selecting the pushing time of the dialogue content according to the real-time identified dialogue response state.

Optionally, the method further comprises:

and screening the target user according to the call response state to perform call redial.

Optionally, the method further comprises:

predicting the probability of refusing the call initiating number by the target user, and eliminating the predicted refused user from the target user.

Optionally, the method further comprises:

and carrying out guest group division and priority sorting on the users according to the identified call response state, and releasing service resources according to the priority order.

The embodiment of the specification also provides a device for identifying the call response state of the user, which comprises:

the multi-task model module is used for constructing a call state identification model with a plurality of tasks, wherein a shared segment is arranged among the tasks, and different tasks are targeted to different call response states;

the identification module initiates a call to a user and acquires sound information, invokes the call state identification model to identify a call response state of the user according to the sound information, and comprises the following steps:

and compressing and deploying the constructed call state recognition model.

Optionally, the method further comprises:

The embodiment of the specification also provides an electronic device, wherein the electronic device comprises:

a processor; the method comprises the steps of,

a memory storing computer executable instructions that, when executed, cause the processor to perform any of the methods described above.

The present description also provides a computer-readable storage medium storing one or more programs that, when executed by a processor, implement any of the methods described above.

According to the various technical schemes provided by the embodiment of the specification, a conversation state identification model with a plurality of tasks is constructed, common fragments are arranged among the tasks, different tasks are aimed at different conversation response states, conversation is initiated to a user, sound information is acquired, the conversation state identification model is called to identify the conversation response state of the user according to the sound information, the common fragments of the tasks process the sound information, processing results of the common fragments are transmitted to proprietary fragments of the tasks to identify, and the conversation response state of the user is judged according to the identification results of the tasks. By constructing a call state identification model with a plurality of tasks, the same part of calculation processes in calculation processes of different states are prevented from being repeatedly calculated due to the fact that a common segment is arranged among the plurality of tasks, and therefore the speed of identifying response states of users during call is improved.

Drawings

The accompanying drawings, which are included to provide a further understanding of the application and are incorporated in and constitute a part of this application, illustrate embodiments of the application and together with the description serve to explain the application and do not constitute an undue limitation to the application. In the drawings:

fig. 1 is a schematic diagram of a method for identifying a call response status of a user according to an embodiment of the present disclosure;

fig. 2 is a schematic structural diagram of an apparatus for identifying a call response status of a user according to an embodiment of the present disclosure;

fig. 3 is a schematic structural diagram of an electronic device according to an embodiment of the present disclosure;

fig. 4 is a schematic diagram of a computer readable medium according to an embodiment of the present disclosure.

Detailed Description

Exemplary embodiments of the present invention will now be described more fully with reference to the accompanying drawings. However, the exemplary embodiments can be embodied in many different forms and should not be construed as limited to the embodiments set forth herein. Rather, these exemplary embodiments are provided so that this disclosure will be thorough and complete, and will fully convey the concept of the invention to those skilled in the art. The same reference numerals in the drawings denote the same or similar elements, components or portions, and thus a repetitive description thereof will be omitted.

The features, structures, characteristics or other details described in a particular embodiment do not exclude that may be combined in one or more other embodiments in a suitable manner, without departing from the technical idea of the invention.

In the description of specific embodiments, features, structures, characteristics, or other details described in the present invention are provided to enable one skilled in the art to fully understand the embodiments. However, it is not excluded that one skilled in the art may practice the present invention without one or more of the specific features, structures, characteristics, or other details.

The flow diagrams depicted in the figures are exemplary only, and do not necessarily include all of the elements and operations/steps, nor must they be performed in the order described. For example, some operations/steps may be decomposed, and some operations/steps may be combined or partially combined, so that the order of actual execution may be changed according to actual situations.

The block diagrams depicted in the figures are merely functional entities and do not necessarily correspond to physically separate entities. That is, the functional entities may be implemented in software, or in one or more hardware modules or integrated circuits, or in different networks and/or processor devices and/or microcontroller devices.

The term "and/or" and/or "includes all combinations of any one or more of the associated listed items.

Fig. 1 is a schematic diagram of a method for identifying a call response status of a user according to an embodiment of the present disclosure, where the method may include:

s101: and constructing a call state identification model with a plurality of tasks, wherein the plurality of tasks are provided with shared fragments, and different tasks aim at different call response states.

In order to enable the model to finally obtain a response state from a plurality of possible states, a plurality of tasks can be constructed in the model, and corresponding response state feature matching rules are respectively constructed for each task, so that different tasks can calculate the probability of the response state to be identified by the task, and finally, the response states can be finally judged and obtained by comparing the probabilities calculated by the tasks.

In order to reduce the calculation amount, a plurality of tasks share part of task fragments, and three tasks are calculated in parallel, so that the calculation efficiency is improved.

In the embodiment of the present disclosure, when a specific training model is referred to, the constructing a call state recognition model with multiple tasks may include:

Wherein the targets of the plurality of tasks are one of a non-connected, a noise call, and a voice call, respectively.

The task configuration matching rule corresponding to each state characteristic matching rule of the tasks can enable the task in the unopened state to be configured with a rule matching the fixed sound, enable the task in the noise call state to be matched with the noise sent by the opposite terminal, and enable the task in the voice call state to be matched with the human voice sent by the opposite terminal.

Of course, when the task of recognizing the voice call state is matched, human voice with noise mixed therein can be matched because the noise can indicate that the counterpart has made a call.

The fixed tone may be a notification tone of a call not being connected, such as a beep, a notification that the other party has been turned off, a music, etc., which are not specifically described and limited herein.

In order to achieve the light weight of the model and increase the calculation speed, in the embodiment of the present disclosure, the constructing a call state recognition model with a plurality of tasks may further include:

and compressing and deploying the constructed call state recognition model.

S102: initiating a call to a user and acquiring voice information, and calling the call state identification model to identify a call response state of the user according to the voice information can comprise:

According to the method, a call state identification model with a plurality of tasks is constructed, common fragments are arranged among the tasks, different tasks are aimed at different call response states, calls are initiated to a user, sound information is acquired, the call state identification model is called to identify the call response state of the user according to the sound information, the common fragments of the tasks process the sound information, processing results of the common fragments are transmitted to special fragments of each task to be identified, and the call response state of the user is judged according to the identification results of the tasks. By constructing a call state identification model with a plurality of tasks, the same part of calculation processes in calculation processes of different states are prevented from being repeatedly calculated due to the fact that a common segment is arranged among the plurality of tasks, and therefore the speed of identifying response states of users during call is improved.

The sound information is sound played by the collected robot terminal.

In the embodiment of the present disclosure, the delivering the processing result of the common segment to the proprietary segment of each task to identify may include:

the distinguishing the call response state of the user according to the recognition results of the plurality of tasks may include:

The special segment of the task may include a calculation function for matching the processing result of the common segment, and the matching degree between the special segment and the target is obtained by calculating the probability, and the target corresponding to the task with the probability at the first position is determined to be the finally identified call response state.

In an embodiment of the present disclosure, the method may further include:

Specifically, it may be that different users may be identified as not being connected, a noise call and a voice call, for the noise call and the voice call, the former is not voice by the user, so that even if the robot initiates a topic, the robot will not feel objectionable, while for the latter, the robot needs to wait for a moment, so as to avoid the situation of breaking the topic of the user.

In the embodiment of the specification, considering different response states of a user to dialing, the requirement of the user on the robot service can be reflected to a certain degree, if the user is not connected for the first time, it is indicated that at least the user is not disliked, if the user is connected but is not speaking, it is indicated that the user is connected by accident, but is relatively disliked to the robot, the possibility that the person is taken as a target user is relatively small, and therefore, for the situation that the dialing effect represented by the first two states is poor, the user can dial again.

Thus, in embodiments of the present description, the method may further comprise:

Thus, because the response state of the users is considered in the redialing, the possibility of providing service hits for the target users is high, and the expected dialing effect is facilitated.

In an embodiment of the present disclosure, the method may further include:

The method can be expressed in an actual application scene, and by identifying and eliminating the users shielding the robot numbers, the users can know the future refusal events in advance and dial no more to the users, so that the resource utilization rate can be improved, and the resource waste is avoided.

In an embodiment of the present disclosure, the method may further include:

Fig. 2 is a schematic structural diagram of an apparatus for identifying a call response status of a user according to an embodiment of the present disclosure, where the apparatus may include:

the multi-task model module 201 constructs a call state identification model with a plurality of tasks, wherein the tasks have shared fragments, and different tasks target different call response states;

the identification module 202 initiates a call to a user and acquires voice information, invokes the call state identification model to identify a call response state of the user according to the voice information, and includes:

In this embodiment of the present disclosure, the delivering the processing result of the common segment to the proprietary segment of each task to identify includes:

In an embodiment of the present disclosure, the building a call state recognition model with a plurality of tasks further includes:

and compressing and deploying the constructed call state recognition model.

In the embodiment of the present specification, the targets of the plurality of tasks are one of a non-connection, a noise call, and a voice call, respectively.

In an embodiment of the present disclosure, the building a call state recognition model with a plurality of tasks includes:

In this embodiment of the present specification, further includes:

The device constructs a call state identification model with a plurality of tasks, wherein the plurality of tasks are provided with shared fragments, different tasks are aimed at different call response states, calls the call state identification model to identify the call response state of a user according to the voice information, and comprises the steps that the shared fragments of the plurality of tasks process the voice information, transmit the processing results of the shared fragments to the special fragments of each task to identify, and judge the call response state of the user according to the identification results of the plurality of tasks. By constructing a call state identification model with a plurality of tasks, the same part of calculation processes in calculation processes of different states are prevented from being repeatedly calculated due to the fact that a common segment is arranged among the plurality of tasks, and therefore the speed of identifying response states of users during call is improved.

Based on the same inventive concept, the embodiments of the present specification also provide an electronic device.

The following describes an embodiment of an electronic device according to the present invention, which may be regarded as a specific physical implementation of the above-described embodiment of the method and apparatus according to the present invention. Details described in relation to the embodiments of the electronic device of the present invention should be considered as additions to the embodiments of the method or apparatus described above; for details not disclosed in the embodiments of the electronic device of the present invention, reference may be made to the above-described method or apparatus embodiments.

Fig. 3 is a schematic structural diagram of an electronic device according to an embodiment of the present disclosure. An electronic device 300 according to this embodiment of the present invention is described below with reference to fig. 3. The electronic device 300 shown in fig. 3 is merely an example and should not be construed as limiting the functionality and scope of use of embodiments of the present invention.

As shown in fig. 3, the electronic device 300 is embodied in the form of a general purpose computing device. Components of electronic device 300 may include, but are not limited to: at least one processing unit 310, at least one memory unit 320, a bus 330 connecting the different system components (including the memory unit 320 and the processing unit 310), a display unit 340, and the like.

Wherein the storage unit stores program code that is executable by the processing unit 310 such that the processing unit 310 performs the steps according to various exemplary embodiments of the invention described in the above processing method section of the present specification. For example, the processing unit 310 may perform the steps shown in fig. 1.

The memory unit 320 may include readable media in the form of volatile memory units, such as Random Access Memory (RAM) 3201 and/or cache memory 3202, and may further include Read Only Memory (ROM) 3203.

The storage unit 320 may also include a program/utility 3204 having a set (at least one) of program modules 3205, such program modules 3205 including, but not limited to: an operating system, one or more application programs, other program modules, and program data, each or some combination of which may include an implementation of a network environment.

Bus 330 may be one or more of several types of bus structures including a memory unit bus or memory unit controller, a peripheral bus, an accelerated graphics port, a processing unit, or a local bus using any of a variety of bus architectures.

The electronic device 300 may also communicate with one or more external devices 400 (e.g., keyboard, pointing device, bluetooth device, etc.), one or more devices that enable a user to interact with the electronic device 300, and/or any device (e.g., router, modem, etc.) that enables the electronic device 300 to communicate with one or more other computing devices. Such communication may occur through an input/output (I/O) interface 350. Also, electronic device 300 may communicate with one or more networks such as a Local Area Network (LAN), a Wide Area Network (WAN), and/or a public network, such as the Internet, through network adapter 360. The network adapter 360 may communicate with other modules of the electronic device 300 via the bus 330. It should be appreciated that although not shown in fig. 3, other hardware and/or software modules may be used in connection with electronic device 300, including, but not limited to: microcode, device drivers, redundant processing units, external disk drive arrays, raid systems, tape drives, data backup storage systems, and the like.

From the above description of embodiments, those skilled in the art will readily appreciate that the exemplary embodiments described herein may be implemented in software, or may be implemented in software in combination with necessary hardware. Thus, the technical solution according to the embodiments of the present invention may be embodied in the form of a software product, which may be stored in a computer readable storage medium (may be a CD-ROM, a usb disk, a mobile hard disk, etc.) or on a network, and includes several instructions to cause a computing device (may be a personal computer, a server, or a network device, etc.) to perform the above-mentioned method according to the present invention. The computer program, when executed by a data processing device, enables the computer readable medium to carry out the above-described method of the present invention, namely: such as the method shown in fig. 1.

A computer program implementing the method shown in fig. 1 may be stored on one or more computer readable media. The computer readable medium may be a readable signal medium or a readable storage medium. The readable storage medium can be, for example, but is not limited to, an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device, or a combination of any of the foregoing. More specific examples (a non-exhaustive list) of the readable storage medium would include the following: an electrical connection having one or more wires, a portable disk, a hard disk, random Access Memory (RAM), read-only memory (ROM), erasable programmable read-only memory (EPROM or flash memory), optical fiber, portable compact disk read-only memory (CD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the foregoing.

The computer readable storage medium may include a data signal propagated in baseband or as part of a carrier wave, with readable program code embodied therein. Such a propagated data signal may take any of a variety of forms, including, but not limited to, electro-magnetic, optical, or any suitable combination of the foregoing. A readable storage medium may also be any readable medium that can communicate, propagate, or transport a program for use by or in connection with an instruction execution system, apparatus, or device. Program code embodied on a readable storage medium may be transmitted using any appropriate medium, including but not limited to wireless, wireline, optical fiber cable, RF, etc., or any suitable combination of the foregoing.

Program code for carrying out operations of the present invention may be written in any combination of one or more programming languages, including an object oriented programming language such as Java, C++ or the like and conventional procedural programming languages, such as the "C" programming language or similar programming languages. The program code may execute entirely on the user's computing device, partly on the user's device, as a stand-alone software package, partly on the user's computing device, partly on a remote computing device, or entirely on the remote computing device or server. In the case of remote computing devices, the remote computing device may be connected to the user computing device through any kind of network, including a Local Area Network (LAN) or a Wide Area Network (WAN), or may be connected to an external computing device (e.g., connected via the Internet using an Internet service provider).

In summary, the invention may be implemented in hardware, or in software modules running on one or more processors, or in a combination thereof. Those skilled in the art will appreciate that some or all of the functionality of some or all of the components in accordance with embodiments of the present invention may be implemented in practice using a general purpose data processing device such as a microprocessor or Digital Signal Processor (DSP). The present invention can also be implemented as an apparatus or device program (e.g., a computer program and a computer program product) for performing a portion or all of the methods described herein. Such a program embodying the present invention may be stored on a computer readable medium, or may have the form of one or more signals. Such signals may be downloaded from an internet website, provided on a carrier signal, or provided in any other form.

The above-described specific embodiments further describe the objects, technical solutions and advantageous effects of the present invention in detail, and it should be understood that the present invention is not inherently related to any particular computer, virtual device or electronic apparatus, and various general-purpose devices may also implement the present invention. The foregoing description of the embodiments of the invention is not intended to be limiting, but rather is intended to cover all modifications, equivalents, alternatives, and improvements that fall within the spirit and scope of the invention.

In this specification, each embodiment is described in a progressive manner, and identical and similar parts of each embodiment are all referred to each other, and each embodiment mainly describes differences from other embodiments. The foregoing is merely exemplary of the present application and is not intended to limit the present application. Various modifications and changes may be made to the present application by those skilled in the art. Any modifications, equivalent substitutions, improvements, etc. which are within the spirit and principles of the present application are intended to be included within the scope of the claims of the present application.

Claims

1. A method for identifying a user call response status, comprising:

constructing a call state identification model with a plurality of tasks, wherein each task respectively constructs a corresponding response state characteristic matching rule, the plurality of tasks are provided with shared fragments, and different tasks are targeted to different call response states; the call response state is one of a non-connected state, a noise call and a voice call, wherein a task for identifying the non-connected state is configured with a rule for matching fixed tones, the task for identifying the noise call state is matched with the noise sent by the opposite terminal, and the task for identifying the voice call state is matched with human voice sent by the opposite terminal;

the common segment of the plurality of tasks processes the sound information, transmits the processing result of the common segment to the special segment of each task for recognition, and judges the call response state of the user according to the recognition results of the plurality of tasks, wherein each special segment calculates the probability of the corresponding target of each task in parallel according to the processing result of the common segment; and judging the call response state of the user according to the calculated probability of the task and the target of each task.

2. The method of claim 1, wherein constructing a call state identification model having a plurality of tasks further comprises:

and compressing and deploying the constructed call state recognition model.

3. The method of claim 1, wherein constructing a call state identification model having a plurality of tasks comprises:

4. The method as recited in claim 1, further comprising:

5. The method as recited in claim 1, further comprising:

6. The method as recited in claim 5, further comprising:

7. The method as recited in claim 1, further comprising:

8. An apparatus for identifying a user call response status, comprising:

the system comprises a multi-task model module, a call state recognition module and a call state recognition module, wherein the multi-task model module is used for constructing a call state recognition model with a plurality of tasks, each task is used for constructing a corresponding response state characteristic matching rule respectively, a common segment is arranged among the plurality of tasks, and targets of different tasks are different call response states; the call response state is one of a non-connected state, a noise call and a voice call, wherein a task for identifying the non-connected state is configured with a rule for matching fixed tones, the task for identifying the noise call state is matched with the noise sent by the opposite terminal, and the task for identifying the voice call state is matched with human voice sent by the opposite terminal;

9. An electronic device, wherein the electronic device comprises:

a processor; and a memory storing computer executable instructions that, when executed, cause the processor to perform the method of any of claims 1-7.

10. A computer readable storage medium, wherein the computer readable storage medium stores one or more programs which, when executed by a processor, implement the method of any of claims 1-7.