CN113518153A - Method and device for identifying user call response state and electronic equipment - Google Patents

Method and device for identifying user call response state and electronic equipment Download PDF

Info

Publication number
CN113518153A
CN113518153A CN202110447521.1A CN202110447521A CN113518153A CN 113518153 A CN113518153 A CN 113518153A CN 202110447521 A CN202110447521 A CN 202110447521A CN 113518153 A CN113518153 A CN 113518153A
Authority
CN
China
Prior art keywords
call
tasks
state
user
call response
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202110447521.1A
Other languages
Chinese (zh)
Other versions
CN113518153B (en
Inventor
郑渊中
张彤彤
钟雨崎
朱小波
孔醍
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shanghai Qiyue Information Technology Co Ltd
Original Assignee
Shanghai Qiyue Information Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shanghai Qiyue Information Technology Co Ltd filed Critical Shanghai Qiyue Information Technology Co Ltd
Priority to CN202110447521.1A priority Critical patent/CN113518153B/en
Publication of CN113518153A publication Critical patent/CN113518153A/en
Application granted granted Critical
Publication of CN113518153B publication Critical patent/CN113518153B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M3/00Automatic or semi-automatic exchanges
    • H04M3/42Systems providing special services or facilities to subscribers
    • H04M3/436Arrangements for screening incoming calls, i.e. evaluating the characteristics of a call before deciding whether to answer it
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/27Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the analysis technique
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/48Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
    • G10L25/51Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M3/00Automatic or semi-automatic exchanges
    • H04M3/42Systems providing special services or facilities to subscribers
    • H04M3/487Arrangements for providing information services, e.g. recorded voice services or time announcements
    • H04M3/493Interactive information services, e.g. directory enquiries ; Arrangements therefor, e.g. interactive voice response [IVR] systems or voice portals
    • H04M3/4936Speech interaction details
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M3/00Automatic or semi-automatic exchanges
    • H04M3/42Systems providing special services or facilities to subscribers
    • H04M3/50Centralised arrangements for answering calls; Centralised arrangements for recording messages for absent or busy subscribers ; Centralised arrangements for recording messages
    • H04M3/527Centralised call answering arrangements not requiring operator intervention

Landscapes

  • Engineering & Computer Science (AREA)
  • Signal Processing (AREA)
  • Human Computer Interaction (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Telephonic Communication Services (AREA)

Abstract

The embodiment of the specification provides a method for identifying a call response state of a user, which comprises the steps of constructing a call state identification model with a plurality of tasks, wherein a shared segment is arranged among the tasks, the targets of different tasks are different call response states, initiating a call to the user and acquiring voice information, calling the call state identification model to identify the call response state of the user according to the voice information, and the method comprises the steps of processing the voice information by the shared segment of the tasks, transmitting the processing result of the shared segment to a special segment of each task for identification, and judging the call response state of the user according to the identification result of the tasks. By constructing the call state identification model with a plurality of tasks, because the plurality of tasks have a common segment, repeated calculation is avoided for the same part of calculation processes in the calculation processes of different states, and the speed of identifying the response state of the user during call is improved.

Description

Method and device for identifying user call response state and electronic equipment
Technical Field
The present application relates to the field of computers, and in particular, to a method and an apparatus for identifying a user call response status, and an electronic device.
Background
When the artificial intelligent robot serves to perform conversation, the requirement on the response speed of the robot is extremely high, and in order to avoid mechanical conversation, the response state of a user to the event of dialing is often required to be recognized in real time, so that the artificial intelligent robot can be helped to judge so as to smoothly perform conversation. In the identification of response status in the industry, one model is used to determine whether a call is made, and the second model is used to determine whether a call is made. Since the robot needs to wait for the two models to be calculated at one time, the efficiency of this method is to be improved, and it is necessary to provide a new method to increase the speed of recognizing the response state of the user during the call.
Disclosure of Invention
The embodiment of the specification provides a method and a device for identifying a user call response state and electronic equipment, which are used for improving the speed of identifying the user response state during call.
An embodiment of the present specification provides a method for identifying a user call response state, including:
establishing a call state identification model with a plurality of tasks, wherein the tasks have a common segment, and the targets of different tasks are different call response states;
initiating a call to a user, acquiring voice information, calling the call state identification model to identify the call response state of the user according to the voice information, and comprising the following steps:
the common segment of the tasks processes the voice information, transmits the processing result of the common segment to the special segment of each task for identification, and judges the call response state of the user according to the identification results of the tasks.
Optionally, the passing the processing result of the common segment to the proprietary segment of each task for identification includes:
calculating the probability of the corresponding target of each task by each special segment according to the processing result of the common segment;
the judging the call response state of the user according to the recognition results of the tasks comprises the following steps:
and judging the call response state of the user according to the probability calculated by the tasks and the target of each task.
Optionally, the constructing a call state recognition model with multiple tasks further includes:
and compressing and deploying the constructed call state identification model.
Optionally, the plurality of tasks are each targeted to one of an off, a murky call, and a voice call.
Optionally, the constructing a call state recognition model with multiple tasks includes:
acquiring call response information samples, and performing label setting on the call response information samples according to the call response states of the samples;
and constructing a call state recognition model with a multitask structure, and training the call state recognition model by using the call response information sample provided with the label.
Optionally, the method further comprises:
and selecting the push opportunity of the conversation content according to the real-time recognized conversation response state.
Optionally, the method further comprises:
and screening the target user according to the call response state to redial the call.
Optionally, the method further comprises:
and predicting the probability of refusing the number initiating the call by the target user, and removing the predicted refusing user from the target user.
Optionally, the method further comprises:
and carrying out guest group division and priority sequencing on the users according to the identified call response state, and releasing the service resources according to the priority sequence.
An embodiment of the present specification further provides an apparatus for identifying a user call response state, including:
the multi-task model module is used for constructing a call state identification model with a plurality of tasks, wherein the tasks have a common segment, and the targets of different tasks are different call response states;
the identification module initiates a call to a user, acquires voice information, and calls the call state identification module to identify the call response state of the user according to the voice information, and comprises:
the common segment of the tasks processes the voice information, transmits the processing result of the common segment to the special segment of each task for identification, and judges the call response state of the user according to the identification results of the tasks.
Optionally, the passing the processing result of the common segment to the proprietary segment of each task for identification includes:
calculating the probability of the corresponding target of each task by each special segment according to the processing result of the common segment;
the judging the call response state of the user according to the recognition results of the tasks comprises the following steps:
and judging the call response state of the user according to the probability calculated by the tasks and the target of each task.
Optionally, the constructing a call state recognition model with multiple tasks further includes:
and compressing and deploying the constructed call state identification model.
Optionally, the plurality of tasks are each targeted to one of an off, a murky call, and a voice call.
Optionally, the constructing a call state recognition model with multiple tasks includes:
acquiring call response information samples, and performing label setting on the call response information samples according to the call response states of the samples;
and constructing a call state recognition model with a multitask structure, and training the call state recognition model by using the call response information sample provided with the label.
Optionally, the method further comprises:
and selecting the push opportunity of the conversation content according to the real-time recognized conversation response state.
Optionally, the method further comprises:
and screening the target user according to the call response state to redial the call.
Optionally, the method further comprises:
and predicting the probability of refusing the number initiating the call by the target user, and removing the predicted refusing user from the target user.
Optionally, the method further comprises:
and carrying out guest group division and priority sequencing on the users according to the identified call response state, and releasing the service resources according to the priority sequence.
An embodiment of the present specification further provides an electronic device, where the electronic device includes:
a processor; and the number of the first and second groups,
a memory storing computer-executable instructions that, when executed, cause the processor to perform any of the methods described above.
The present specification also provides a computer readable storage medium, wherein the computer readable storage medium stores one or more programs which, when executed by a processor, implement any of the above methods.
In various technical solutions provided in this specification, a call state recognition model having a plurality of tasks is constructed, the plurality of tasks have a common segment therebetween, different tasks are targeted to different call response states, a call is initiated to a user and voice information is acquired, the call state recognition model is invoked to recognize the call response state of the user according to the voice information, including that the common segment of the plurality of tasks processes the voice information, a processing result of the common segment is transmitted to a dedicated segment of each task to be recognized, and the call response state of the user is determined according to a recognition result of the plurality of tasks. By constructing the call state identification model with a plurality of tasks, because the plurality of tasks have a common segment, repeated calculation is avoided for the same part of calculation processes in the calculation processes of different states, and the speed of identifying the response state of the user during call is improved.
Drawings
The accompanying drawings, which are included to provide a further understanding of the application and are incorporated in and constitute a part of this application, illustrate embodiment(s) of the application and together with the description serve to explain the application and not to limit the application. In the drawings:
fig. 1 is a schematic diagram illustrating a method for identifying a call response status of a user according to an embodiment of the present disclosure;
fig. 2 is a schematic structural diagram of an apparatus for identifying a user call response state according to an embodiment of the present disclosure;
fig. 3 is a schematic structural diagram of an electronic device provided in an embodiment of the present disclosure;
fig. 4 is a schematic diagram of a computer-readable medium provided in an embodiment of the present specification.
Detailed Description
Exemplary embodiments of the present invention will now be described more fully with reference to the accompanying drawings. The exemplary embodiments, however, may be embodied in many different forms and should not be construed as limited to the embodiments set forth herein. Rather, these exemplary embodiments are provided so that this disclosure will be thorough and complete, and will fully convey the concept of the invention to those skilled in the art. The same reference numerals denote the same or similar elements, components, or parts in the drawings, and thus their repetitive description will be omitted.
Features, structures, characteristics or other details described in a particular embodiment do not preclude the fact that the features, structures, characteristics or other details may be combined in a suitable manner in one or more other embodiments in accordance with the technical idea of the invention.
In describing particular embodiments, the present invention has been described with reference to features, structures, characteristics or other details that are within the purview of one skilled in the art to provide a thorough understanding of the embodiments. One skilled in the relevant art will recognize, however, that the invention may be practiced without one or more of the specific features, structures, characteristics, or other details.
The flow charts shown in the drawings are merely illustrative and do not necessarily include all of the contents and operations/steps, nor do they necessarily have to be performed in the order described. For example, some operations/steps may be decomposed, and some operations/steps may be combined or partially combined, so that the actual execution sequence may be changed according to the actual situation.
The block diagrams shown in the figures are functional entities only and do not necessarily correspond to physically separate entities. I.e. these functional entities may be implemented in the form of software, or in one or more hardware modules or integrated circuits, or in different networks and/or processor means and/or microcontroller means.
The term "and/or" and/or "includes all combinations of any one or more of the associated listed items.
Fig. 1 is a schematic diagram of a method for identifying a call response status of a user according to an embodiment of the present disclosure, where the method may include:
s101: and constructing a call state identification model with a plurality of tasks, wherein the tasks have a common segment, and different tasks are targeted to different call response states.
In order to enable the model to finally obtain a response state from a plurality of possible states, a plurality of tasks can be built in the model, and corresponding response state feature matching rules are respectively built for each task, so that the probabilities of the response states to be identified by the tasks can be calculated by different tasks, and finally, one response state can be finally judged and obtained by comparing the probabilities calculated by the tasks.
In order to reduce the calculation amount, a plurality of tasks share part of task segments, and the three tasks are calculated in parallel to improve the calculation efficiency.
In this embodiment of the present specification, when a specific training model is involved, the constructing a call state recognition model with multiple tasks may include:
acquiring call response information samples, and performing label setting on the call response information samples according to the call response states of the samples;
and constructing a call state recognition model with a multitask structure, and training the call state recognition model by using the call response information sample provided with the label.
Wherein the plurality of tasks are each targeted to one of an unplanned, a noise call, and a voice call.
The task for identifying the unconnected state can be configured with a rule matched with a fixed tone corresponding to the state feature matching rule of each of the plurality of tasks, the task for identifying the noise conversation state can be matched with the noise sent by the opposite terminal, and the task for identifying the voice conversation state can be matched with the human voice sent by the opposite terminal.
Of course, when the task of recognizing the voice call state is matched, the human voice accompanied with noise can be matched because the noise can indicate that the other party gets through the phone.
The fixed tone may be a sound indicating that the call is not connected, such as beep sound, the other party is off, music, etc., which are not specifically illustrated and limited herein.
In order to achieve model lightweight and increase computation speed, in an embodiment of the present specification, the constructing a call state identification model having multiple tasks may further include:
and compressing and deploying the constructed call state identification model.
S102: initiating a call to a user and acquiring voice information, and calling the call state identification model to identify the call response state of the user according to the voice information, which may include:
the common segment of the tasks processes the voice information, transmits the processing result of the common segment to the special segment of each task for identification, and judges the call response state of the user according to the identification results of the tasks.
The method comprises the steps of establishing a call state identification model with a plurality of tasks, wherein a common segment is arranged among the tasks, the targets of different tasks are different call response states, initiating a call to a user and acquiring voice information, calling the call state identification model to identify the call response state of the user according to the voice information, wherein the common segment of the tasks processes the voice information, transmits the processing result of the common segment to a special segment of each task for identification, and judges the call response state of the user according to the identification result of the tasks. By constructing the call state identification model with a plurality of tasks, because the plurality of tasks have a common segment, repeated calculation is avoided for the same part of calculation processes in the calculation processes of different states, and the speed of identifying the response state of the user during call is improved.
Wherein, the sound information is the sound played by the collected robot terminal.
In this embodiment of the present specification, the passing the processing result of the common segment to the dedicated segment of each task for identification may include:
calculating the probability of the corresponding target of each task by each special segment according to the processing result of the common segment;
the determining the call response state of the user according to the recognition results of the plurality of tasks may include:
and judging the call response state of the user according to the probability calculated by the tasks and the target of each task.
The special segment of the task can comprise a calculation function for matching the processing result of the common segment, the matching degree between the special segment and the target is obtained by calculating the probability, and the target corresponding to the task with the probability at the head position is judged as the finally identified call response state.
In an embodiment of the present specification, the method may further include:
and selecting the push opportunity of the conversation content according to the real-time recognized conversation response state.
Specifically, different users may be identified as not connected, a noise call and a voice call, the former may not generate voice for the noise call and the voice call, and therefore the robot may not cause discomfort even though initiating a topic, and the latter may need to wait for a moment to avoid interruption of the topic of the user.
In the embodiment of the specification, different response states generated by a user for dialing are considered, the strong and weak demand of the user for robot service can be reflected to a certain extent actually, if the user is not connected during dialing for the first time, at least the user is not disliked, if the user is connected with a telephone but does not speak, the user is only about to connect the telephone externally, but actually is disliked for the robot, the possibility that the person is taken as a target user is low, and therefore, the user can redial under the condition that the dialing effect reflected by the former two states is poor.
Therefore, in the embodiment of the present specification, the method may further include:
and screening the target user according to the call response state to redial the call.
In this way, since the response status of the user is considered when re-dialing, the probability of providing service hits to these target users is high, which is helpful for generating the expected dialing effect.
In an embodiment of the present specification, the method may further include:
and predicting the probability of refusing the number initiating the call by the target user, and removing the predicted refusing user from the target user.
This can be shown in the practical application scenario, by identifying and eliminating the users shielding the robot numbers, we can know the future rejection event in advance, and no longer dial the users, thus improving the resource utilization rate and avoiding the resource waste.
In an embodiment of the present specification, the method may further include:
and carrying out guest group division and priority sequencing on the users according to the identified call response state, and releasing the service resources according to the priority sequence.
Fig. 2 is a schematic structural diagram of an apparatus for identifying a user call response state according to an embodiment of the present disclosure, where the apparatus may include:
the multi-task model module 201 is used for constructing a call state identification model with a plurality of tasks, wherein the tasks have a common segment, and the targets of different tasks are different call response states;
the identification module 202 initiates a call to a user, acquires voice information, and calls the call state identification module to identify a call response state of the user according to the voice information, including:
the common segment of the tasks processes the voice information, transmits the processing result of the common segment to the special segment of each task for identification, and judges the call response state of the user according to the identification results of the tasks.
In order to enable the model to finally obtain a response state from a plurality of possible states, a plurality of tasks can be built in the model, and corresponding response state feature matching rules are respectively built for each task, so that the probabilities of the response states to be identified by the tasks can be calculated by different tasks, and finally, one response state can be finally judged and obtained by comparing the probabilities calculated by the tasks.
In order to reduce the calculation amount, a plurality of tasks share part of task segments, and the three tasks are calculated in parallel to improve the calculation efficiency.
In this embodiment of the present specification, the passing the processing result of the common segment to the proprietary segment of each task for identification includes:
calculating the probability of the corresponding target of each task by each special segment according to the processing result of the common segment;
the judging the call response state of the user according to the recognition results of the tasks comprises the following steps:
and judging the call response state of the user according to the probability calculated by the tasks and the target of each task.
In an embodiment of the present specification, the constructing a call state recognition model with multiple tasks further includes:
and compressing and deploying the constructed call state identification model.
In the embodiment of the present specification, the plurality of tasks are each targeted to one of an off call, a noise call, and a voice call.
In an embodiment of the present specification, the building a call state recognition model with multiple tasks includes:
acquiring call response information samples, and performing label setting on the call response information samples according to the call response states of the samples;
and constructing a call state recognition model with a multitask structure, and training the call state recognition model by using the call response information sample provided with the label.
In the embodiment of this specification, still include:
and selecting the push opportunity of the conversation content according to the real-time recognized conversation response state.
In the embodiment of this specification, still include:
and screening the target user according to the call response state to redial the call.
In the embodiment of this specification, still include:
and predicting the probability of refusing the number initiating the call by the target user, and removing the predicted refusing user from the target user.
In the embodiment of this specification, still include:
and carrying out guest group division and priority sequencing on the users according to the identified call response state, and releasing the service resources according to the priority sequence.
The device constructs a call state identification model with a plurality of tasks, wherein a common segment is arranged among the tasks, the targets of different tasks are different call response states, a call is initiated to a user and voice information is acquired, the call state identification model is called to identify the call response state of the user according to the voice information, the common segment of the tasks processes the voice information, the processing result of the common segment is transmitted to a special segment of each task to be identified, and the call response state of the user is judged according to the identification result of the tasks. By constructing the call state identification model with a plurality of tasks, because the plurality of tasks have a common segment, repeated calculation is avoided for the same part of calculation processes in the calculation processes of different states, and the speed of identifying the response state of the user during call is improved.
Based on the same inventive concept, the embodiment of the specification further provides the electronic equipment.
In the following, embodiments of the electronic device of the present invention are described, which may be regarded as specific physical implementations for the above-described embodiments of the method and apparatus of the present invention. Details described in the embodiments of the electronic device of the invention should be considered supplementary to the embodiments of the method or apparatus described above; for details which are not disclosed in embodiments of the electronic device of the invention, reference may be made to the above-described embodiments of the method or the apparatus.
Fig. 3 is a schematic structural diagram of an electronic device provided in an embodiment of the present disclosure. An electronic device 300 according to this embodiment of the invention is described below with reference to fig. 3. The electronic device 300 shown in fig. 3 is only an example, and should not bring any limitation to the functions and the scope of use of the embodiments of the present invention.
As shown in fig. 3, electronic device 300 is embodied in the form of a general purpose computing device. The components of electronic device 300 may include, but are not limited to: at least one processing unit 310, at least one memory unit 320, a bus 330 connecting the various system components (including the memory unit 320 and the processing unit 310), a display unit 340, and the like.
Wherein the storage unit stores program code executable by the processing unit 310 to cause the processing unit 310 to perform the steps according to various exemplary embodiments of the present invention described in the above-mentioned processing method section of the present specification. For example, the processing unit 310 may perform the steps as shown in fig. 1.
The storage unit 320 may include readable media in the form of volatile storage units, such as a random access memory unit (RAM)3201 and/or a cache storage unit 3202, and may further include a read only memory unit (ROM) 3203.
The storage unit 320 may also include a program/utility 3204 having a set (at least one) of program modules 3205, such program modules 3205 including, but not limited to: an operating system, one or more application programs, other program modules, and program data, each of which, or some combination thereof, may comprise an implementation of a network environment.
Bus 330 may be one or more of several types of bus structures, including a memory unit bus or memory unit controller, a peripheral bus, an accelerated graphics port, a processing unit, or a local bus using any of a variety of bus architectures.
The electronic device 300 may also communicate with one or more external devices 400 (e.g., keyboard, pointing device, bluetooth device, etc.), with one or more devices that enable a user to interact with the electronic device 300, and/or with any devices (e.g., router, modem, etc.) that enable the electronic device 300 to communicate with one or more other computing devices. Such communication may occur via an input/output (I/O) interface 350. Also, the electronic device 300 may communicate with one or more networks (e.g., a Local Area Network (LAN), a Wide Area Network (WAN), and/or a public network, such as the internet) via the network adapter 360. Network adapter 360 may communicate with other modules of electronic device 300 via bus 330. It should be appreciated that although not shown in FIG. 3, other hardware and/or software modules may be used in conjunction with electronic device 300, including but not limited to: microcode, device drivers, redundant processing units, external disk drive arrays, RAI D systems, tape drives, and data backup storage systems, etc.
Through the above description of the embodiments, those skilled in the art will readily understand that the exemplary embodiments of the present invention described herein may be implemented by software, or by software in combination with necessary hardware. Therefore, the technical solution according to the embodiment of the present invention can be embodied in the form of a software product, which can be stored in a computer-readable storage medium (which can be a CD-ROM, a usb disk, a removable hard disk, etc.) or on a network, and includes several instructions to make a computing device (which can be a personal computer, a server, or a network device, etc.) execute the above-mentioned method according to the present invention. The computer program, when executed by a data processing apparatus, enables the computer readable medium to implement the above-described method of the invention, namely: such as the method shown in fig. 1.
Fig. 4 is a schematic diagram of a computer-readable medium provided in an embodiment of the present specification.
A computer program implementing the method shown in fig. 1 may be stored on one or more computer readable media. The computer readable medium may be a readable signal medium or a readable storage medium. A readable storage medium may be, for example, but not limited to, an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device, or any combination of the foregoing. More specific examples (a non-exhaustive list) of the readable storage medium include: an electrical connection having one or more wires, a portable disk, a hard disk, a Random Access Memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or flash memory), an optical fiber, a portable compact disc read-only memory (CD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the foregoing.
The computer readable storage medium may include a propagated data signal with readable program code embodied therein, for example, in baseband or as part of a carrier wave. Such a propagated data signal may take many forms, including, but not limited to, electro-magnetic, optical, or any suitable combination thereof. A readable storage medium may also be any readable medium that is not a readable storage medium and that can communicate, propagate, or transport a program for use by or in connection with an instruction execution system, apparatus, or device. Program code embodied on a readable storage medium may be transmitted using any appropriate medium, including but not limited to wireless, wireline, optical fiber cable, RF, etc., or any suitable combination of the foregoing.
Program code for carrying out operations for aspects of the present invention may be written in any combination of one or more programming languages, including an object oriented programming language such as Java, C + + or the like and conventional procedural programming languages, such as the "C" programming language or similar programming languages. The program code may execute entirely on the user's computing device, partly on the user's device, as a stand-alone software package, partly on the user's computing device and partly on a remote computing device, or entirely on the remote computing device or server. In the case of a remote computing device, the remote computing device may be connected to the user computing device through any kind of network, including a Local Area Network (LAN) or a Wide Area Network (WAN), or may be connected to an external computing device (e.g., through the internet using an internet service provider).
In summary, the invention may be implemented in hardware, or in software modules running on one or more processors, or in a combination thereof. Those skilled in the art will appreciate that some or all of the functionality of some or all of the components in embodiments in accordance with the invention may be implemented in practice using a general purpose data processing device such as a microprocessor or a Digital Signal Processor (DSP). The present invention may also be embodied as apparatus or device programs (e.g., computer programs and computer program products) for performing a portion or all of the methods described herein. Such programs implementing the present invention may be stored on computer-readable media or may be in the form of one or more signals. Such a signal may be downloaded from an internet website or provided on a carrier signal or in any other form.
While the foregoing embodiments have described the objects, aspects and advantages of the present invention in further detail, it should be understood that the present invention is not inherently related to any particular computer, virtual machine or electronic device, and various general-purpose machines may be used to implement the present invention. The invention is not to be considered as limited to the specific embodiments thereof, but is to be understood as being modified in all respects, all changes and equivalents that come within the spirit and scope of the invention.
The embodiments in the present specification are described in a progressive manner, and the same and similar parts among the embodiments are referred to each other, and each embodiment focuses on the differences from the other embodiments. The above description is only an example of the present application and is not intended to limit the present application. Various modifications and changes may occur to those skilled in the art. Any modification, equivalent replacement, improvement, etc. made within the spirit and principle of the present application should be included in the scope of the claims of the present application.

Claims (12)

1. A method for identifying a call response status of a user, comprising:
establishing a call state identification model with a plurality of tasks, wherein the tasks have a common segment, and the targets of different tasks are different call response states;
initiating a call to a user, acquiring voice information, calling the call state identification model to identify the call response state of the user according to the voice information, and comprising the following steps:
the common segment of the tasks processes the voice information, transmits the processing result of the common segment to the special segment of each task for identification, and judges the call response state of the user according to the identification results of the tasks.
2. The method of claim 1, wherein the passing the processing results of the common segment to the proprietary segments of each task for identification comprises:
calculating the probability of the corresponding target of each task by each special segment according to the processing result of the common segment;
the judging the call response state of the user according to the recognition results of the tasks comprises the following steps:
and judging the call response state of the user according to the probability calculated by the tasks and the target of each task.
3. The method of any one of claims 1-2, wherein the constructing a call state recognition model having a plurality of tasks further comprises:
and compressing and deploying the constructed call state identification model.
4. The method of any one of claims 1-3, wherein each of the plurality of tasks is targeted to one of an off, a murky call, and a voice call.
5. The method according to any one of claims 1-4, wherein the constructing a call state recognition model having a plurality of tasks comprises:
acquiring call response information samples, and performing label setting on the call response information samples according to the call response states of the samples;
and constructing a call state recognition model with a multitask structure, and training the call state recognition model by using the call response information sample provided with the label.
6. The method according to any one of claims 1-5, further comprising:
and selecting the push opportunity of the conversation content according to the real-time recognized conversation response state.
7. The method according to any one of claims 1-6, further comprising:
and screening the target user according to the call response state to redial the call.
8. The method according to any one of claims 1-7, further comprising:
and predicting the probability of refusing the number initiating the call by the target user, and removing the predicted refusing user from the target user.
9. The method according to any one of claims 1-8, further comprising:
and carrying out guest group division and priority sequencing on the users according to the identified call response state, and releasing the service resources according to the priority sequence.
10. An apparatus for identifying a user call response status, comprising:
the multi-task model module is used for constructing a call state identification model with a plurality of tasks, wherein the tasks have a common segment, and the targets of different tasks are different call response states;
the identification module initiates a call to a user, acquires voice information, and calls the call state identification module to identify the call response state of the user according to the voice information, and comprises:
the common segment of the tasks processes the voice information, transmits the processing result of the common segment to the special segment of each task for identification, and judges the call response state of the user according to the identification results of the tasks.
11. An electronic device, wherein the electronic device comprises:
a processor; and a memory storing computer-executable instructions that, when executed, cause the processor to perform the method of any of claims 1-9.
12. A computer readable storage medium, wherein the computer readable storage medium stores one or more programs which, when executed by a processor, implement the method of any of claims 1-9.
CN202110447521.1A 2021-04-25 2021-04-25 Method and device for identifying call response state of user and electronic equipment Active CN113518153B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110447521.1A CN113518153B (en) 2021-04-25 2021-04-25 Method and device for identifying call response state of user and electronic equipment

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110447521.1A CN113518153B (en) 2021-04-25 2021-04-25 Method and device for identifying call response state of user and electronic equipment

Publications (2)

Publication Number Publication Date
CN113518153A true CN113518153A (en) 2021-10-19
CN113518153B CN113518153B (en) 2023-07-04

Family

ID=78061438

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110447521.1A Active CN113518153B (en) 2021-04-25 2021-04-25 Method and device for identifying call response state of user and electronic equipment

Country Status (1)

Country Link
CN (1) CN113518153B (en)

Citations (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH07162553A (en) * 1993-12-07 1995-06-23 Brother Ind Ltd Call signal identification device, facsimile equipment and telephone set
JP2004198831A (en) * 2002-12-19 2004-07-15 Sony Corp Method, program, and recording medium for speech recognition
JP2006171185A (en) * 2004-12-14 2006-06-29 Asahi Kasei Corp Speech recognition device and method
US20060224392A1 (en) * 2005-03-31 2006-10-05 Microsoft Corporation Test harness for a speech server
US20110066433A1 (en) * 2009-09-16 2011-03-17 At&T Intellectual Property I, L.P. System and method for personalization of acoustic models for automatic speech recognition
JP2015055653A (en) * 2013-09-10 2015-03-23 セイコーエプソン株式会社 Speech recognition device and method and electronic apparatus
CN109327621A (en) * 2018-11-21 2019-02-12 Oppo广东移动通信有限公司 Method, apparatus, system, equipment and the storage medium of call reminding
CN109961776A (en) * 2017-12-18 2019-07-02 上海智臻智能网络科技股份有限公司 Speech information processing apparatus
CN111212190A (en) * 2019-12-20 2020-05-29 北京淇瑀信息科技有限公司 Conversation management method, device and system based on conversation strategy management
CN111491058A (en) * 2020-03-31 2020-08-04 维沃移动通信有限公司 Method for controlling operation mode, electronic device, and storage medium
CN111916071A (en) * 2020-06-16 2020-11-10 深圳追一科技有限公司 Call calling method, device, computer equipment and storage medium
CN112017645A (en) * 2020-08-31 2020-12-01 广州市百果园信息技术有限公司 Voice recognition method and device
CN112202974A (en) * 2020-12-03 2021-01-08 零犀(北京)科技有限公司 Method, device and system for automatically judging telephone answering state

Patent Citations (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH07162553A (en) * 1993-12-07 1995-06-23 Brother Ind Ltd Call signal identification device, facsimile equipment and telephone set
JP2004198831A (en) * 2002-12-19 2004-07-15 Sony Corp Method, program, and recording medium for speech recognition
JP2006171185A (en) * 2004-12-14 2006-06-29 Asahi Kasei Corp Speech recognition device and method
US20060224392A1 (en) * 2005-03-31 2006-10-05 Microsoft Corporation Test harness for a speech server
US20110066433A1 (en) * 2009-09-16 2011-03-17 At&T Intellectual Property I, L.P. System and method for personalization of acoustic models for automatic speech recognition
JP2015055653A (en) * 2013-09-10 2015-03-23 セイコーエプソン株式会社 Speech recognition device and method and electronic apparatus
CN109961776A (en) * 2017-12-18 2019-07-02 上海智臻智能网络科技股份有限公司 Speech information processing apparatus
CN109327621A (en) * 2018-11-21 2019-02-12 Oppo广东移动通信有限公司 Method, apparatus, system, equipment and the storage medium of call reminding
CN111212190A (en) * 2019-12-20 2020-05-29 北京淇瑀信息科技有限公司 Conversation management method, device and system based on conversation strategy management
CN111491058A (en) * 2020-03-31 2020-08-04 维沃移动通信有限公司 Method for controlling operation mode, electronic device, and storage medium
CN111916071A (en) * 2020-06-16 2020-11-10 深圳追一科技有限公司 Call calling method, device, computer equipment and storage medium
CN112017645A (en) * 2020-08-31 2020-12-01 广州市百果园信息技术有限公司 Voice recognition method and device
CN112202974A (en) * 2020-12-03 2021-01-08 零犀(北京)科技有限公司 Method, device and system for automatically judging telephone answering state

Also Published As

Publication number Publication date
CN113518153B (en) 2023-07-04

Similar Documents

Publication Publication Date Title
CN109514586B (en) Method and system for realizing intelligent customer service robot
CN108764487A (en) For generating the method and apparatus of model, the method and apparatus of information for identification
CN101341532A (en) Sharing voice application processing via markup
CN112100339A (en) User intention recognition method and device for intelligent voice robot and electronic equipment
WO2021047209A1 (en) Optimization for a call that waits in queue
CN110782128B (en) User occupation label generation method and device and electronic equipment
CN110223694B (en) Voice processing method, system and device
CN112087726A (en) Method and system for identifying polyphonic ringtone, electronic equipment and storage medium
CN113518153A (en) Method and device for identifying user call response state and electronic equipment
CN110740212A (en) Call answering method and device based on intelligent voice technology and electronic equipment
US11985097B2 (en) Multi-agent chatbot with multi-intent recognition
CN114067842B (en) Customer satisfaction degree identification method and device, storage medium and electronic equipment
CN111949776B (en) User tag evaluation method and device and electronic equipment
CN112765022B (en) Webshell static detection method based on data stream and electronic equipment
CN111194026B (en) Information sending method and device and electronic equipment
US11738268B2 (en) Game development method and apparatus, game running method and apparatus, and electronic device
CN111131354B (en) Method and apparatus for generating information
CN110489124B (en) Source code execution method, source code execution device, storage medium and computer equipment
CN111292766B (en) Method, apparatus, electronic device and medium for generating voice samples
CN110489738B (en) Information processing method, electronic equipment and computer storage medium
CN109788128A (en) A kind of income prompting method, incoming call prompting device and terminal device
CN112711654B (en) Chinese character interpretation technique generation method, system, equipment and medium for voice robot
US20240127790A1 (en) Systems and methods for reconstructing voice packets using natural language generation during signal loss
CN111680032B (en) Method and device for processing information sending task and electronic equipment
CN112309399A (en) Method and device for executing task based on voice and electronic equipment

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant