CN109743436B - Communication compensation method, device, equipment and storage medium for voice conversation - Google Patents

Publication number
CN109743436B
Authority
CN
China
Prior art keywords
recording
real
analysis result
communication
time
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201811637588.6A
Other languages
Chinese (zh)
Other versions
CN109743436A (en
Inventor
许璐强
杨云飞
金丽丽
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Sipic Technology Co Ltd
Original Assignee
AI Speech Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by AI Speech Ltd
Priority to CN201811637588.6A
Publication of CN109743436A
Application granted
Publication of CN109743436B

Landscapes

  • Telephonic Communication Services (AREA)

Abstract

The invention discloses a communication compensation method and a communication compensation device for voice conversation. The communication compensation method comprises the following steps: in response to a user starting a voice conversation analysis task during communication, connecting to an analysis server to analyze the user's voice conversation in real time and acquiring a first real-time analysis result; acquiring the communication connection state during the current communication; if the communication connection is disconnected, switching from the real-time analysis task to a recording task to record the user's audio while the connection is down; if the communication connection is restored during recording, switching from the recording task back to the real-time analysis task and acquiring a second real-time analysis result; uploading and analyzing the recorded audio to generate a first recording analysis result; and combining the first real-time analysis result, the first recording analysis result and the second real-time analysis result to obtain a complete analysis result. The user therefore need not worry about an incomplete voice conversation analysis result under any communication condition.

Description

Communication compensation method, device, equipment and storage medium for voice conversation
Technical Field
The invention belongs to the technical field of voice data processing, and particularly relates to a communication compensation method and device for voice conversation.
Background
In the related art, real-time analysis of voice conversations on the market requires the communication connection to be maintained; after a communication interruption, analysis of the current voice conversation can continue only once the connection has been re-established.
Currently, the speech dialogue parsing in the market is roughly divided into off-line parsing and real-time parsing.
The inventors have found that the prior art solutions suffer from at least the following problems: offline analysis can only be carried out after the complete audio has been recorded, which is time-consuming and does not allow the correctness of the analysis to be confirmed in real time; real-time analysis depends heavily on the stability of the communication connection, and when communication is interrupted, voice conversation analysis stops immediately and cannot continue.
Disclosure of Invention
Embodiments of the present invention provide a communication compensation method and apparatus for voice dialog, so as to solve at least one of the above technical problems.
In a first aspect, an embodiment of the present invention provides a communication compensation method for a voice conversation, including: responding to a voice conversation analysis task started in a communication process by a user, connecting an analysis server to analyze the voice conversation of the user in real time and acquiring a first real-time analysis result; acquiring a communication connection state in the current communication process; if the connection is disconnected in the communication process, switching from the real-time analysis task to a recording task to record the recording audio of the user during the disconnection of the communication connection; if communication connection recovery occurs during recording, switching from the recording task to a real-time analysis task and acquiring a second real-time analysis result; uploading and analyzing the recording audio to generate a first recording analysis result; and combining the first real-time analysis result, the first recording analysis result and the second real-time analysis result to obtain a complete analysis result.
In a second aspect, an embodiment of the present invention provides a communication compensation apparatus for voice conversation, including: a first analysis module configured to, in response to a user starting a voice conversation analysis task during communication, connect to an analysis server to analyze the user's voice conversation in real time and obtain a first real-time analysis result; a connection state acquisition module configured to acquire the communication connection state during the current communication; a recording module configured to switch from the real-time analysis task to the recording task if the communication connection is disconnected, so as to record the user's audio while the connection is down; a second analysis module configured to switch from the recording task to the real-time analysis task and acquire a second real-time analysis result if the communication connection is restored during recording; a recording analysis module configured to upload and analyze the recorded audio to generate a first recording analysis result; and a merging module configured to merge the first real-time analysis result, the first recording analysis result and the second real-time analysis result to obtain a complete analysis result.
In a third aspect, an electronic device is provided, comprising: at least one processor and a memory communicatively connected to the at least one processor, wherein the memory stores instructions executable by the at least one processor, and the instructions are executed by the at least one processor so as to enable the at least one processor to execute the steps of the communication compensation method for voice conversation according to any embodiment of the invention.
In a fourth aspect, the present invention further provides a computer program product, where the computer program product includes a computer program stored on a non-volatile computer-readable storage medium, where the computer program includes program instructions, and when the program instructions are executed by a computer, the computer executes the steps of the communication compensation method for voice conversation according to any embodiment of the present invention.
The method and the device ensure that the user need not worry about an incomplete voice conversation analysis result under any communication condition, and can greatly reduce the time needed to upload and analyze recorded audio even when the communication connection is poor.
Drawings
In order to illustrate the technical solutions of the embodiments of the present invention more clearly, the drawings needed in the description of the embodiments are briefly introduced below. The drawings in the following description show some embodiments of the present invention, and those skilled in the art can obtain other drawings from them without creative effort.
Fig. 1 is a flowchart illustrating a communication compensation method for voice conversation according to an embodiment of the present invention;
fig. 2 is a flowchart of another communication compensation method for voice conversation according to an embodiment of the present invention;
fig. 3 is a flowchart illustrating a communication compensation method for voice conversation according to another embodiment of the present invention;
fig. 4 is a flowchart illustrating a communication compensation method for voice conversation according to another embodiment of the present invention;
fig. 5 is a flowchart illustrating a communication compensation method for a voice conversation according to an embodiment of the present invention;
fig. 6 is a block diagram of another communication compensation apparatus for voice conversation according to an embodiment of the present invention;
fig. 7 is a schematic structural diagram of an electronic device according to an embodiment of the present invention.
Detailed Description
In order to make the objects, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are some, but not all, embodiments of the present invention. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
Referring to fig. 1, which shows a flowchart of an embodiment of a communication compensation method for voice dialog according to the present application, the communication compensation method for voice dialog according to the present embodiment may be applied to a terminal having a communication or real-time voice dialog function, such as a smart phone, a tablet, a computer, and the like.
As shown in fig. 1, in step 101, in response to a user starting a voice conversation parsing task during communication, a parsing server is connected to parse the user's voice conversation in real time and a first real-time parsing result is obtained;
in step 102, a communication connection state in the current communication process is acquired;
in step 103, if the connection is disconnected in the communication process, switching from the real-time analysis task to the recording task to record the recording audio of the user during the disconnection of the communication;
in step 104, if communication connection is recovered during recording, switching from the recording task to a real-time analysis task and acquiring a second real-time analysis result;
in step 105, uploading and parsing the recording audio to generate a first recording parsing result;
in step 106, the first real-time parsing result, the first recording parsing result, and the second real-time parsing result are combined to obtain a complete parsing result.
In this embodiment, for step 101, after the user starts the voice conversation parsing task during communication, the communication compensation device connects to the parsing server to parse the user's voice conversation in real time and obtains a first real-time parsing result. The voice conversation parsing task comprises a real-time parsing task and a recording task. For step 102, the communication compensation device continuously monitors the communication connection state during the current communication to determine whether the connection is normal or disconnected. For step 103, if the connection is disconnected during communication, the device switches from the real-time parsing task to the recording task and records the user's audio while the connection is down, so that the portion that cannot be parsed in real time over the network is recorded and can be uploaded for parsing later. For step 104, if the communication connection is restored during recording, the device switches from the recording task back to the real-time parsing task and obtains a second real-time parsing result; that is, real-time parsing resumes once the connection is restored. For step 105, the communication compensation device uploads the audio recorded while real-time parsing was interrupted and has it parsed to generate a first recording parsing result. Finally, for step 106, the communication compensation device combines the first real-time parsing result, the first recording parsing result and the second real-time parsing result to obtain a complete parsing result. In practical applications multiple interruptions may occur, so there may be multiple first recording parsing results and multiple second real-time parsing results; the application is not limited in this respect. In that case all the results are spliced together in chronological order to form the complete parsing result.
In this method, audio is recorded while communication is interrupted and is then uploaded and parsed, and the recording parsing result is spliced with the real-time parsing results to form a complete parsing result. A complete voice parsing result can therefore be obtained even if the user's communication connection is unstable, which improves the user experience.
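The chronological splicing described above can be sketched as follows. This is only a minimal illustration, not the patented implementation: the `Segment` structure, its `start` timestamp field and the `merge_results` function are hypothetical names introduced here, and the text of each segment is invented sample data.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Segment:
    """One partial parsing result (hypothetical structure, not from the patent)."""
    start: float   # seconds since the parsing task started
    source: str    # "realtime" or "recording"
    text: str


def merge_results(segments: List[Segment]) -> str:
    """Splice all partial results together in chronological order."""
    ordered = sorted(segments, key=lambda s: s.start)
    return " ".join(s.text for s in ordered)


segments = [
    Segment(0.0, "realtime", "hello this is"),             # first real-time result
    Segment(21.0, "realtime", "see you tomorrow"),         # second real-time result
    Segment(12.5, "recording", "the part said offline"),   # arrives last, after upload
]
print(merge_results(segments))
```

The recording result arrives after the second real-time result because it has to be uploaded first, which is why the sketch orders by start time rather than by arrival order.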
Referring further to fig. 2, a flowchart of another communication compensation method for voice conversation according to an embodiment of the present application is shown. This flowchart mainly shows steps that further define and supplement the flow of fig. 1. In this embodiment, the first real-time parsing result and the second real-time parsing result are stored in a parsing result set.
As shown in fig. 2, in step 201, every time communication connection recovery occurs, a corresponding communication recovery flag is added to the parsing result set;
in step 202, the first sound recording parsing result is used to replace the corresponding communication recovery flag to form a complete parsing result set.
In this embodiment, for step 201, the communication compensation apparatus needs to add a corresponding communication recovery flag to the parsing result set every time the communication connection is recovered, where each communication recovery has a corresponding communication recovery flag, and the communication recovery flag is named by using, for example, an identifier of the entire voice parsing result plus time. Then, in step 202, the corresponding communication recovery flag can be replaced by the recording analysis result corresponding to the recording before communication recovery, so as to form a complete analysis result set.
The method of the embodiment marks the time point of communication connection recovery by adopting the communication recovery mark, and then can form a complete analysis result set by replacing the mark with the recording analysis result before the mark, so that the method is simple and easy to operate and does not occupy too much space.
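A minimal sketch of the mark-and-replace idea, under stated assumptions: the parsing result set is modelled as a plain list of strings, and the mark format (`RECOVERY_MARK:` plus a task identifier plus a time) as well as the names `make_mark` and `replace_marks` are hypothetical choices made for illustration only.

```python
from typing import Dict, List

MARK_PREFIX = "RECOVERY_MARK:"  # hypothetical mark format


def make_mark(task_id: str, recovered_at: int) -> str:
    """Name the mark with the task identifier plus a time, as the text suggests."""
    return f"{MARK_PREFIX}{task_id}:{recovered_at}"


def replace_marks(result_set: List[str], recording_results: Dict[str, str]) -> List[str]:
    """Return a new result set in which each known mark is replaced by its recording result.

    Marks whose recording has not been parsed yet are left in place.
    """
    return [recording_results.get(item, item) for item in result_set]


mark = make_mark("task42", 1700000000)
result_set = ["first real-time result", mark, "second real-time result"]
complete = replace_marks(result_set, {mark: "recording result"})
print(complete)
```

Because the mark occupies the position of the interruption, replacing it keeps the results in order without shifting or re-sorting the other entries.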
With further reference to fig. 3, a flowchart of another communication compensation method for voice conversation according to an embodiment of the present application is shown. The flow chart is primarily a flow chart of further defined steps for the case "connection has not been restored after disconnection of communication".
As shown in fig. 3, in step 301, if the communication connection is not recovered during recording, the recorded audio is saved after the voice dialog parsing task is finished;
in step 302, a second recording analysis result of the recording audio uploaded after the communication connection is restored is obtained, and the second recording analysis result is inserted into the analysis result set.
In this embodiment, for step 301, if the communication compensation device detects that the connection has not been restored after the disconnection, the recorded audio is saved after the voice conversation parsing task ends. Then, for step 302, because communication was never restored, there is only one recording and no communication recovery flag, so the parsing result of the recorded audio is appended directly after the earlier real-time parsing result. The method of this embodiment is therefore simple to operate and does not occupy many resources.
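The "never reconnected" case can be sketched like this. The `ParsingTask` class and its methods are hypothetical names, and the `parse` argument is a stand-in callable for "upload the audio and let the server parse it"; none of these come from the patent.

```python
from typing import Callable, List, Optional


class ParsingTask:
    """Hypothetical container for one voice conversation parsing task."""

    def __init__(self) -> None:
        self.result_set: List[str] = []
        self.saved_recording: Optional[bytes] = None

    def finish_offline(self, recording: bytes) -> None:
        """The task ended while still disconnected: keep the audio for later."""
        self.saved_recording = recording

    def upload_saved(self, parse: Callable[[bytes], str]) -> bool:
        """Once a connection is available, parse the saved audio and append the result."""
        if self.saved_recording is None:
            return False
        # No recovery flag exists in this case, so the result is simply appended.
        self.result_set.append(parse(self.saved_recording))
        self.saved_recording = None
        return True


task = ParsingTask()
task.result_set.append("first real-time result")
task.finish_offline(b"raw-audio")
task.upload_saved(lambda audio: f"parsed {len(audio)} bytes of recording")
print(task.result_set)
```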
Please refer to fig. 4, which is a flowchart illustrating another communication compensation method for a voice conversation according to an embodiment of the present application. The flow chart is mainly a flow chart of steps defined further for the case of "communication connection is relatively stable".
As shown in fig. 4, in step 401, if the current communication connection is available, a long connection is established with the resolution server;
in step 402, obtaining a real-time parsing result returned by the parsing server;
in step 403, the real-time parsing result is stored into the parsing result set.
In this embodiment, for step 401, if it is determined that the current communication connection is available, a long connection is established with the parsing server, so that connections need not be repeatedly re-established, which would waste network bandwidth and system resources. Then, in step 402, the real-time parsing result returned by the parsing server is obtained, and in step 403, the real-time parsing result is stored in the corresponding parsing result set.
By establishing a long connection with the parsing server while the connection is available, the method of this embodiment reuses the TCP connection and so avoids the time cost of repeated TCP three-way handshakes. As long as the network connection is uninterrupted, it maintains a relatively stable and fast network connection state and data transmission environment, which benefits the transmission of real-time parsing results.
In some optional embodiments, the method further comprises: re-establishing a long connection with the parsing server each time the communication connection is restored. This reduces unnecessary handshake time and maintains a faster transmission speed through the long connection after each recovery.
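To illustrate why a long connection saves handshakes, here is a small sketch that counts how often a connection is established. It uses a `FakeSocket` stand-in so that it runs without a network; `FakeSocket`, `LongConnection`, `handshakes` and `on_disconnect` are all hypothetical names, and a real client would wrap an actual TCP or WebSocket connection instead.

```python
from typing import Callable, List, Optional


class FakeSocket:
    """Stand-in for a TCP socket so the sketch runs without a network."""

    def __init__(self) -> None:
        self.sent: List[bytes] = []

    def send(self, chunk: bytes) -> None:
        self.sent.append(chunk)


class LongConnection:
    """Reuses one connection for many sends; reconnects only after a disconnect."""

    def __init__(self, connect: Callable[[], FakeSocket]) -> None:
        self._connect = connect
        self._sock: Optional[FakeSocket] = None
        self.handshakes = 0  # how many times a connection was established

    def send(self, chunk: bytes) -> None:
        if self._sock is None:        # connect lazily, only when no connection exists
            self._sock = self._connect()
            self.handshakes += 1
        self._sock.send(chunk)

    def on_disconnect(self) -> None:
        self._sock = None             # the next send re-establishes the connection


conn = LongConnection(FakeSocket)
for chunk in (b"a", b"b", b"c"):
    conn.send(chunk)
print(conn.handshakes)   # one handshake for three chunks
conn.on_disconnect()
conn.send(b"d")
print(conn.handshakes)   # a second handshake only after the disconnect
```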
It should be noted that the above method steps are not intended to limit the execution order of the steps, and in fact, some steps may be executed simultaneously or in the reverse order of the steps, which is not limited herein.
At present, no technical solution on the market keeps recording a voice conversation while communication is interrupted, resumes parsing of the current voice conversation after communication is restored, and at the same time parses the voice conversation recorded during the interruption.
The following description is provided to enable those skilled in the art to better understand the present disclosure by describing some of the problems encountered by the inventors in implementing the present disclosure and by describing one particular embodiment of the finally identified solution.
The inventor finds that the defects in the prior art are mainly caused by the following reasons in the process of implementing the application: the voice conversation is too dependent on real-time analysis, and has no support for offline analysis under the condition of communication interruption.
The inventor also finds that the current products on the market have single functions and high requirements on communication connection, and the reasons are as follows:
First, products on the market are aimed at specific user groups, and their audience is relatively narrow.
Second, the logic of real-time voice conversation parsing that fuses both functions is complex, and many abnormal situations are difficult to handle.
Finally, real-time voice conversation parsing that fuses both functions puts greater pressure on the server side: long connections must be established for real-time parsing, and uploading and parsing of large recording files must also be supported, so software and hardware costs are higher.
The scheme of the application is mainly designed and optimized from the following aspects:
1. During real-time parsing of the voice conversation, if the communication connection is interrupted, the client automatically switches to a recording state and records a recording file.
2. When the communication connection is restored, the client switches back to the real-time parsing state. At the same time, the recording file is uploaded to the server side for voice conversation parsing, and the parsing result is inserted after the results obtained before the communication connection was restored.
3. If the communication connection has still not been restored when the voice conversation ends, the recording file is saved. The user can manually select and upload the recording file once a connection is available, and the parsing result is saved to the corresponding voice conversation parsing file.
Referring to fig. 5, a detailed flow chart of an aspect of the present application is shown.
As shown in fig. 5, the scheme of the present application mainly includes the following steps:
The user first needs to create a voice conversation parsing task, which triggers the processing flow.
The processing flow is shown in fig. 5:
Step 1: The client starts a voice conversation parsing task.
Step 2: Determine the current communication connection state.
Step 3: The communication connection is available.
a. The client connects to the voice conversation parsing service and obtains results in real time.
b. Real-time voice conversation parsing is performed; if the communication becomes abnormal, return to step 1.
Step 4: The communication connection is disconnected.
a. Start recording and save the recording file.
b. The communication connection is restored.
a) Add a mark to the parsing result set.
b) Upload the recording for voice conversation parsing, and replace the mark in the result set with the parsing result.
c) Establish a connection with the real-time parsing service, obtain parsing results and add them to the result set.
c. The communication connection is not restored before the voice conversation parsing task ends.
a) Save the recording file made after the communication connection was interrupted.
b) Once the communication connection is confirmed to be available, select the recording file at the client and upload it for voice conversation parsing.
c) The parsing result is appended to the selected voice conversation parsing result set.
Step 5: Obtain the complete voice conversation parsing result.
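The flow above can be read as a small two-state client driven by connectivity events. The sketch below is an illustration under simplifying assumptions, not the patented implementation: the `Client` and `State` names are hypothetical, audio chunks are plain strings, real-time and recording parsing are faked with `rt(...)` and `rec(...)` labels, and the upload on reconnection is treated as synchronous, so the "add a mark, then replace it" pair collapses into a single append.

```python
from enum import Enum
from typing import List


class State(Enum):
    REALTIME = "realtime"
    RECORDING = "recording"


class Client:
    """Hypothetical client that switches between real-time parsing and recording."""

    def __init__(self, connected: bool) -> None:
        # Step 2: choose the initial state from the current connection state.
        self.state = State.REALTIME if connected else State.RECORDING
        self.result_set: List[str] = []
        self.recording: List[str] = []          # audio captured while offline
        self.saved_files: List[List[str]] = []  # recordings kept for manual upload

    def on_audio(self, chunk: str) -> None:
        if self.state is State.REALTIME:
            self.result_set.append(f"rt({chunk})")   # step 3: real-time parsing
        else:
            self.recording.append(chunk)             # step 4a: record locally

    def on_disconnect(self) -> None:
        self.state = State.RECORDING

    def on_reconnect(self) -> None:
        # Step 4b: parse the recording, insert its result, resume real-time parsing.
        if self.recording:
            self.result_set.append(f"rec({'+'.join(self.recording)})")
            self.recording = []
        self.state = State.REALTIME

    def finish(self) -> List[str]:
        # Step 4c: still offline at the end, so keep the file for a later manual upload.
        if self.recording:
            self.saved_files.append(self.recording)
            self.recording = []
        return self.result_set


client = Client(connected=True)
client.on_audio("a")
client.on_disconnect()
client.on_audio("b")
client.on_audio("c")
client.on_reconnect()
client.on_audio("d")
print(client.finish())
```

With an asynchronous upload, the append in `on_reconnect` would instead insert a mark that is replaced once the server returns the recording result.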
In the process of implementing the present application, the inventors also tried the following scheme: when communication conditions are poor, switching to a mobile phone to capture the voice conversation. The mobile phone network is highly stable, which reduces the server-side resource consumption of parsing recording files and effectively reduces the amount of complex logic to be coded. The drawback is equally obvious: pickup by a mobile phone is sensitive to distance, and when the speakers are far from the phone, the accuracy of the parsing result is low and the effect is poor.
After weighing how the schemes perform in actual application scenarios, the current scheme was adopted.
With the method and the device, the user need not worry about an incomplete voice conversation parsing result under any communication condition, and the time needed to upload and parse recorded audio can be greatly shortened even when the communication connection is poor.
Referring to fig. 6, a block diagram of a communication compensation apparatus for voice conversation according to an embodiment of the present invention is shown.
As shown in fig. 6, the communication compensation apparatus 600 for voice dialog includes a first parsing module 610, a connection status obtaining module 620, a recording module 630, a second parsing module 640, a recording parsing module 650, and a merging module 660.
The first analysis module 610 is configured to respond to a user starting a voice conversation analysis task in a communication process, connect an analysis server to perform real-time analysis on a voice conversation of the user and acquire a first real-time analysis result; a connection state obtaining module 620 configured to obtain a communication connection state in a current communication process; the recording module 630 is configured to switch from the real-time parsing task to the recording task to record the recording audio of the user during the communication disconnection if the communication disconnection occurs; the second analysis module 640 is configured to switch from the recording task to the real-time analysis task and obtain a second real-time analysis result if the communication connection is recovered during recording; a recording parsing module 650 configured to upload and parse a recording audio to generate a first recording parsing result; and a merging module 660 configured to merge the first real-time parsing result, the first recording parsing result, and the second real-time parsing result to obtain a complete parsing result.
In some optional embodiments, the first real-time parsing result and the second real-time parsing result are stored in a parsing result set, and the communication compensation apparatus 600 for voice conversation further includes: a recovery mark adding module (not shown in the figure) configured to add a corresponding communication recovery mark in the analysis result set each time communication connection recovery occurs; and a replacing module (not shown) configured to replace the corresponding communication recovery mark with the first sound recording parsing result to form a complete parsing result set.
In other optional embodiments, the communication compensation apparatus 600 for voice conversation described above further includes: a saving module (not shown in the figure) configured to save the recorded audio after the voice conversation parsing task is finished if the communication connection is not recovered during recording; and a recording insertion module (not shown in the figure) configured to obtain a second recording analysis result of the recording audio uploaded after the communication connection is restored, and insert the second recording analysis result into the analysis result set.
It should be understood that the modules recited in fig. 6 correspond to various steps in the methods described with reference to fig. 1, 2, 3, and 4. Thus, the operations and features described above for the method and the corresponding technical effects are also applicable to the modules in fig. 6, and are not described again here.
It should be noted that the modules in the embodiments of the present disclosure are not intended to limit the scheme of the present disclosure; for example, the determining module may be described as a module that determines whether the interaction state is a playing scene when the device is in the interaction state. In addition, the related functional modules may also be implemented by a hardware processor; for example, the determining module may be implemented by a processor, which is not described again here.
In other embodiments, the present invention further provides a non-volatile computer storage medium, where the computer storage medium stores computer-executable instructions, where the computer-executable instructions may perform the communication compensation method for voice conversation in any of the above method embodiments;
as one embodiment, a non-volatile computer storage medium of the present invention stores computer-executable instructions configured to:
responding to a voice conversation analysis task started in a communication process by a user, connecting an analysis server to analyze the voice conversation of the user in real time and acquiring a first real-time analysis result;
acquiring a communication connection state in the current communication process;
if the connection is disconnected in the communication process, switching from the real-time analysis task to a recording task to record the recording audio of the user during the disconnection of the communication connection;
if communication connection recovery occurs during recording, switching from the recording task to a real-time analysis task and acquiring a second real-time analysis result;
uploading and analyzing the recording audio to generate a first recording analysis result;
and combining the first real-time analysis result, the first recording analysis result and the second real-time analysis result to obtain a complete analysis result.
The non-volatile computer-readable storage medium may include a storage program area and a storage data area, wherein the storage program area may store an operating system, an application program required for at least one function; the storage data area may store data created according to use of the communication compensation apparatus for voice conversation, and the like. Further, the non-volatile computer-readable storage medium may include high speed random access memory, and may also include non-volatile memory, such as at least one magnetic disk storage device, flash memory device, or other non-volatile solid state storage device. In some embodiments, the non-transitory computer readable storage medium optionally includes memory located remotely from the processor, which may be connected to a communication compensation device for voice conversations over a network. Examples of such networks include, but are not limited to, the internet, intranets, local area networks, mobile communication networks, and combinations thereof.
Embodiments of the present invention also provide a computer program product comprising a computer program stored on a non-transitory computer-readable storage medium, the computer program comprising program instructions that, when executed by a computer, cause the computer to perform any of the above communication compensation methods for voice conversations.
Fig. 7 is a schematic structural diagram of an electronic device according to an embodiment of the present invention, and as shown in fig. 7, the electronic device includes: one or more processors 710 and a memory 720, one processor 710 being illustrated in fig. 7. The apparatus for the communication compensation method of the voice conversation may further include: an input device 730 and an output device 740. The processor 710, the memory 720, the input device 730, and the output device 740 may be connected by a bus or other means, such as the bus connection in fig. 7. The memory 720 is a non-volatile computer-readable storage medium as described above. The processor 710 executes various functional applications of the server and data processing by executing nonvolatile software programs, instructions and modules stored in the memory 720, namely, implements the communication compensation method for voice conversation of the above-described method embodiment. The input device 730 may receive input numeric or character information and generate key signal inputs related to user settings and function control of the communication compensation device. The output device 740 may include a display device such as a display screen.
The product can execute the method provided by the embodiment of the invention, and has corresponding functional modules and beneficial effects of the execution method. For technical details that are not described in detail in this embodiment, reference may be made to the method provided by the embodiment of the present invention.
In one embodiment, the electronic device is applied to a client-side communication compensation device for voice conversation, and the electronic device includes: at least one processor; and a memory communicatively coupled to the at least one processor; wherein the memory stores instructions executable by the at least one processor to cause the at least one processor to perform the following steps:
in response to a voice conversation analysis task started by a user during a communication process, connecting to an analysis server to analyze the user's voice conversation in real time and acquiring a first real-time analysis result;
acquiring the communication connection state during the current communication process;
if the communication connection is lost during the communication process, switching from the real-time analysis task to a recording task to record the user's recording audio while the connection is down;
if the communication connection is recovered during recording, switching from the recording task back to the real-time analysis task and acquiring a second real-time analysis result;
uploading and analyzing the recording audio to generate a first recording analysis result;
and merging the first real-time analysis result, the first recording analysis result and the second real-time analysis result to obtain a complete analysis result.
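The task switching in the steps above can be illustrated with a minimal Python sketch. It is not the patented implementation: the `Session` class, the `Task` enum, the `analyze` callable standing in for the analysis server, and the per-chunk `connected` flag are all hypothetical names introduced for illustration. The sketch also simplifies by analyzing the recording synchronously at the moment the connection recovers, whereas the text above leaves the timing of the upload open.

```python
from dataclasses import dataclass, field
from enum import Enum, auto
from typing import Callable, List


class Task(Enum):
    REAL_TIME = auto()   # audio is sent to the analysis server as it arrives
    RECORDING = auto()   # audio is buffered locally while the connection is down


@dataclass
class Session:
    # Hypothetical stand-in for a request to the analysis server.
    analyze: Callable[[bytes], str]
    task: Task = Task.REAL_TIME
    segments: List[str] = field(default_factory=list)   # results in spoken order
    buffer: List[bytes] = field(default_factory=list)   # audio recorded offline

    def on_audio(self, chunk: bytes, connected: bool) -> None:
        """Handle one audio chunk together with the current connection state."""
        if connected and self.task is Task.RECORDING:
            # Connection recovered: analyze the recording, then resume real time.
            self._flush_recording()
            self.task = Task.REAL_TIME
        elif not connected and self.task is Task.REAL_TIME:
            # Connection lost: switch to the recording task.
            self.task = Task.RECORDING

        if self.task is Task.REAL_TIME:
            self.segments.append(self.analyze(chunk))
        else:
            self.buffer.append(chunk)

    def _flush_recording(self) -> None:
        """Analyze the buffered recording as one unit and clear the buffer."""
        if self.buffer:
            self.segments.append(self.analyze(b"".join(self.buffer)))
            self.buffer.clear()

    def complete_result(self) -> str:
        """Merge real-time and recording results into one transcript."""
        return " ".join(self.segments)


# Usage with a fake analyzer that simply decodes the bytes it is given.
session = Session(analyze=lambda audio: audio.decode())
for chunk, connected in [(b"hello", True), (b"are", False),
                         (b"you", False), (b"there", True)]:
    session.on_audio(chunk, connected)
transcript = session.complete_result()
```

Because the recording result is appended before real-time analysis resumes, the merged transcript keeps the order in which the user spoke.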
The electronic device of the embodiments of the present application exists in various forms, including but not limited to:
(1) A mobile communication device: such devices are characterized by mobile communication capabilities and are primarily aimed at providing voice and data communication. Such terminals include smart phones (e.g., iPhone), multimedia phones, feature phones, and low-end phones, among others.
(2) An ultra-mobile personal computer device: such devices belong to the category of personal computers, have computing and processing functions, and generally also support mobile internet access. Such terminals include PDA, MID, and UMPC devices, such as the iPad.
(3) A portable entertainment device: such devices can display and play multimedia content. They include audio and video players (e.g., iPod), handheld game consoles, e-book readers, smart toys, and portable car navigation devices.
(4) A server: its architecture is similar to that of a general-purpose computer, but because it must provide highly reliable services, it has higher requirements for processing capability, stability, reliability, security, scalability, manageability, and the like.
(5) Other electronic devices with data interaction functions.
The above-described embodiments of the apparatus are merely illustrative, and the units described as separate parts may or may not be physically separate, and the parts displayed as units may or may not be physical units, may be located in one place, or may be distributed on a plurality of network units. Some or all of the modules may be selected according to actual needs to achieve the purpose of the solution of the present embodiment. One of ordinary skill in the art can understand and implement it without inventive effort.
Through the above description of the embodiments, those skilled in the art will clearly understand that each embodiment can be implemented by software plus a necessary general hardware platform, and certainly can also be implemented by hardware. With this understanding in mind, the above-described technical solutions may be embodied in the form of a software product, which can be stored in a computer-readable storage medium, such as ROM/RAM, magnetic disk, optical disk, etc., and includes instructions for causing a computer device (which may be a personal computer, a server, or a network device, etc.) to execute the methods of the various embodiments or some parts of the embodiments.
Finally, it should be noted that: the above examples are only intended to illustrate the technical solution of the present invention, but not to limit it; although the present invention has been described in detail with reference to the foregoing embodiments, it will be understood by those of ordinary skill in the art that: the technical solutions described in the foregoing embodiments may still be modified, or some technical features may be equivalently replaced; and such modifications or substitutions do not depart from the spirit and scope of the corresponding technical solutions of the embodiments of the present invention.

Claims (10)

1. A communication compensation method for voice conversations, comprising:
responding to a voice conversation analysis task started in a communication process by a user, connecting an analysis server to analyze the voice conversation of the user in real time and acquiring a first real-time analysis result;
acquiring a communication connection state in the current communication process;
if the connection is disconnected in the communication process, switching from the real-time analysis task to a recording task to record the recording audio of the user during the disconnection of the communication connection;
if communication connection recovery occurs during recording, switching from the recording task to a real-time analysis task and acquiring a second real-time analysis result;
uploading and analyzing the recording audio to generate a first recording analysis result;
and combining the first real-time analysis result, the first recording analysis result and the second real-time analysis result to obtain a complete analysis result.
2. The method of claim 1, wherein the first real-time analysis result and the second real-time analysis result are stored in an analysis result set, the method further comprising:
adding a corresponding communication recovery mark to the analysis result set each time communication connection recovery occurs;
and replacing the corresponding communication recovery mark with the first recording analysis result to form a complete analysis result set.
3. The method of claim 2, wherein after switching from the real-time analysis task to the recording task to record the recording audio of the user during the communication disconnection if the communication disconnection occurs, the method further comprises:
if the communication connection is not recovered during recording, storing the recording audio after the voice conversation analysis task is finished;
and acquiring a second recording analysis result of the recording audio uploaded after the communication connection is recovered, and inserting the second recording analysis result into the analysis result set.
4. The method of claim 1, wherein after acquiring the communication connection state in the current communication process, the method further comprises:
if the current communication connection is available, establishing a long connection with the analysis server;
acquiring a real-time analysis result returned by the analysis server;
and storing the real-time analysis result into an analysis result set.
5. The method of claim 4, wherein the method further comprises:
each time the communication connection is recovered, re-establishing the long connection with the analysis server.
6. A communication compensation apparatus for voice conversation, comprising:
the first analysis module is configured to respond to the voice conversation analysis task started in the communication process by the user, and is connected with the analysis server to analyze the voice conversation of the user in real time and obtain a first real-time analysis result;
the connection state acquisition module is configured to acquire a communication connection state in the current communication process;
the recording module is configured to switch from the real-time analysis task to the recording task to record the recording audio of the user during the communication disconnection if the communication disconnection occurs;
the second analysis module is configured to switch from the recording task to the real-time analysis task and acquire a second real-time analysis result if communication connection is recovered during recording;
the recording analysis module is configured to upload and analyze the recording audio to generate a first recording analysis result;
and the merging module is configured to merge the first real-time analysis result, the first recording analysis result and the second real-time analysis result to obtain a complete analysis result.
7. The apparatus of claim 6, wherein the first real-time analysis result and the second real-time analysis result are stored in an analysis result set, the apparatus further comprising:
a recovery mark adding module configured to add a corresponding communication recovery mark in the analysis result set every time communication connection recovery occurs;
and the replacing module is configured to replace the corresponding communication recovery mark with the first recording analysis result to form a complete analysis result set.
8. The apparatus of claim 7, further comprising:
the storage module is configured to store the recording audio after the voice conversation analysis task is finished if communication connection recovery does not occur during recording;
and the recording insertion module is configured to acquire a second recording analysis result of the recording audio uploaded after the communication connection is recovered, and insert the second recording analysis result into the analysis result set.
9. An electronic device, comprising: at least one processor, and a memory communicatively coupled to the at least one processor, wherein the memory stores instructions executable by the at least one processor to enable the at least one processor to perform the steps of the method of any one of claims 1 to 5.
10. A storage medium having stored thereon a computer program, characterized in that the program, when being executed by a processor, is adapted to carry out the steps of the method of any one of claims 1 to 5.
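The result-set handling recited in claim 2 can be illustrated with a short Python sketch. It is only one possible reading under stated assumptions: the `RecoveryMark` class, its `mark_id` field, and the `replace_marks` function are hypothetical names, and the claims do not specify how a mark is matched to its recording analysis result. Here an integer id is assumed for that purpose.

```python
from dataclasses import dataclass
from typing import Dict, List, Union


@dataclass(frozen=True)
class RecoveryMark:
    """Placeholder added to the result set when the connection recovers."""
    mark_id: int  # assumed key linking the mark to one offline recording


Entry = Union[str, RecoveryMark]


def replace_marks(result_set: List[Entry],
                  recording_results: Dict[int, str]) -> List[Entry]:
    """Return a copy of the result set with resolved marks replaced.

    A mark whose recording has not been analyzed yet is kept in place,
    so the function can be called again when more results arrive.
    """
    completed: List[Entry] = []
    for entry in result_set:
        if isinstance(entry, RecoveryMark) and entry.mark_id in recording_results:
            completed.append(recording_results[entry.mark_id])
        else:
            completed.append(entry)
    return completed


# Usage: one outage between two stretches of real-time analysis.
result_set: List[Entry] = [
    "first real-time result",
    RecoveryMark(0),
    "second real-time result",
]
completed = replace_marks(result_set, {0: "first recording result"})
```

Keeping the mark at the position where the outage happened means the recording analysis result can arrive at any later time and still land in the correct place in the complete analysis result set.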
CN201811637588.6A 2018-12-29 2018-12-29 Communication compensation method, device, equipment and storage medium for voice conversation Active CN109743436B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201811637588.6A CN109743436B (en) 2018-12-29 2018-12-29 Communication compensation method, device, equipment and storage medium for voice conversation

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201811637588.6A CN109743436B (en) 2018-12-29 2018-12-29 Communication compensation method, device, equipment and storage medium for voice conversation

Publications (2)

Publication Number Publication Date
CN109743436A CN109743436A (en) 2019-05-10
CN109743436B true CN109743436B (en) 2020-08-28

Family

ID=66362336

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201811637588.6A Active CN109743436B (en) 2018-12-29 2018-12-29 Communication compensation method, device, equipment and storage medium for voice conversation

Country Status (1)

Country Link
CN (1) CN109743436B (en)

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2002215584A (en) * 2001-01-22 2002-08-02 Omron Corp Device, method, and program for voice response, and computer-readable recording medium where the same is recorded
CN1677997A (en) * 2004-03-31 2005-10-05 日本电气株式会社 Call interruption compensation system
US7881234B2 (en) * 2006-10-19 2011-02-01 International Business Machines Corporation Detecting interruptions in audio conversations and conferences, and using a conversation marker indicative of the interrupted conversation
CN106469558A (en) * 2015-08-21 2017-03-01 中兴通讯股份有限公司 Audio recognition method and equipment
US9659564B2 (en) * 2014-10-24 2017-05-23 Sestek Ses Ve Iletisim Bilgisayar Teknolojileri Sanayi Ticaret Anonim Sirketi Speaker verification based on acoustic behavioral characteristics of the speaker

Family Cites Families (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8396192B2 (en) * 2010-03-03 2013-03-12 Calabrio, Inc. Desktop recording architecture for recording call sessions over a telephony network
CN103096186B (en) * 2011-10-28 2016-06-29 上海博泰悦臻网络技术服务有限公司 The continuous even method of mobile unit, the talkback unit of vehicle and off-line thereof
CN103369094B (en) * 2013-07-15 2015-09-02 北京邮电大学 The successional mobile terminal of communication process is not affected when conversing and being interrupted
US9911415B2 (en) * 2014-12-19 2018-03-06 Lenovo (Singapore) Pte. Ltd. Executing a voice command during voice input
CN104702791A (en) * 2015-03-13 2015-06-10 安徽声讯信息技术有限公司 Smart phone recording sound for a long time and synchronously transliterating text, information processing method thereof
CN205943456U (en) * 2016-08-24 2017-02-08 安徽咪鼠科技有限公司 Pronunciation are gathered and preprocessing device based on intelligence pronunciation mouse
US20180166073A1 (en) * 2016-12-13 2018-06-14 Ford Global Technologies, Llc Speech Recognition Without Interrupting The Playback Audio

Also Published As

Publication number Publication date
CN109743436A (en) 2019-05-10

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant
CP01 Change in the name or title of a patent holder

Address after: 215123 14 Tengfei Innovation Park, 388 Xinping street, Suzhou Industrial Park, Suzhou, Jiangsu.

Patentee after: Sipic Technology Co.,Ltd.

Address before: 215123 14 Tengfei Innovation Park, 388 Xinping street, Suzhou Industrial Park, Suzhou, Jiangsu.

Patentee before: AI SPEECH Ltd.