CN110265061B - Method and equipment for translating call voice in real time

Info

Publication number: CN110265061B
Application number: CN201910559564.1A
Authority: CN
Inventors: 陈景郁; 成荣飞
Original assignee: Samsung Guangzhou Mobile R&D Center; Samsung Electronics Co Ltd
Current assignee: Samsung Guangzhou Mobile R&D Center; Samsung Electronics Co Ltd
Priority date: 2019-06-26
Filing date: 2019-06-26
Publication date: 2021-08-20
Anticipated expiration: 2039-06-26
Also published as: CN110265061A

Abstract

A method and device for real-time translation of call voice are provided. The method comprises: when an electronic terminal needs to translate call voice in real time, detecting whether a preset condition is met; when the preset condition is met, sending the collected call voice to a translation server that translates the call voice; when the preset condition is not met, performing sound quality preprocessing on the collected call voice and sending the processed call voice to the translation server; and receiving, from the translation server, a translation result corresponding to the sent call voice. The method and device can improve the real-time performance of the call voice translation function.

Description

Method and equipment for translating call voice in real time

Technical Field

The present invention relates generally to the field of electronic technology, and more particularly, to a method and apparatus for real-time translation of call speech.

Background

With globalization, cross-regional communication has become more frequent. Translation software lets people communicate smoothly despite language barriers. During a voice call, real-time translation of the call voice allows the two parties to talk without obstruction even when they speak different languages. However, current call voice translation functions have a large translation delay, so the translation is far from real time and the user experience suffers.

Disclosure of Invention

Exemplary embodiments of the present invention provide a method and an apparatus for translating call voice in real time, which can solve the problem of the poor real-time performance of call voice translation.

According to an exemplary embodiment of the present invention, a method for real-time translation of call voice is provided, wherein the method comprises: when an electronic terminal needs to translate call voice in real time, detecting whether a preset condition is met; when the preset condition is met, sending the collected call voice to a translation server that translates the call voice; when the preset condition is not met, performing sound quality preprocessing on the collected call voice and sending the processed call voice to the translation server; and receiving, from the translation server, a translation result corresponding to the sent call voice.

Optionally, the method further comprises: sending the translation result received from the translation server to a base station, to be forwarded by the base station to another electronic terminal that is in a voice call with the electronic terminal.

Optionally, the step of detecting whether the preset condition is met includes: periodically detecting whether the preset condition is met; or detecting in real time whether the preset condition is met.

Optionally, the preset condition includes: the voice quality of the collected call voice meets a specific condition, and/or the translation server can perform sound quality preprocessing on the received call voice to be translated.

Optionally, the sound quality preprocessing comprises: noise reduction processing and/or echo cancellation processing.

Optionally, the step of sending the translation result received from the translation server to the base station includes: performing sound quality post-processing on the translation result received from the translation server, and sending the processed translation result to the base station, wherein the translation result is in speech form.

Optionally, the specific condition is: the signal-to-noise ratio is higher than a preset threshold.

Optionally, the sound quality post-processing includes: filtering and/or gain setting.

According to another exemplary embodiment of the present invention, a method for real-time translation of call voice is provided, wherein the method comprises: when an electronic terminal needs to translate call voice in real time, detecting whether a preset condition is met; when the preset condition is met, sending the call voice received from a base station to a translation server that translates the call voice; when the preset condition is not met, performing sound quality preprocessing on the call voice received from the base station and sending the processed call voice to the translation server; and receiving, from the translation server, a translation result corresponding to the sent call voice.

Optionally, the method further comprises: outputting the translation result received from the translation server.

Optionally, the preset condition includes: the voice quality of the call voice received from the base station meets a specific condition, and/or the translation server performs sound quality preprocessing on the received call voice to be translated.

Optionally, the step of outputting the translation result received from the translation server includes: performing sound quality post-processing on the translation result received from the translation server, and outputting the processed translation result, wherein the translation result is in speech form.

According to another exemplary embodiment of the present invention, there is provided an apparatus for translating call voice in real time, wherein the apparatus includes: a sound quality detection unit that detects whether a preset condition is met when an electronic terminal needs to translate call voice in real time; a voice quality processing unit that performs sound quality preprocessing on the collected call voice when the preset condition is not met; a transmitting unit that transmits the collected call voice to a translation server that translates the call voice when the preset condition is met, and transmits the call voice processed by the voice quality processing unit to the translation server when the preset condition is not met; and a translation result receiving unit that receives, from the translation server, a translation result corresponding to the transmitted call voice.

Optionally, the transmitting unit further transmits the translation result received from the translation server to a base station, to be forwarded by the base station to another electronic terminal that is in a voice call with the electronic terminal.

Optionally, the sound quality detection unit periodically detects whether the preset condition is met; or the sound quality detection unit detects in real time whether the preset condition is met.

Optionally, the voice quality processing unit performs sound quality post-processing on the translation result received from the translation server, and the transmitting unit transmits the translation result processed by the voice quality processing unit to the base station, wherein the translation result is in speech form.

According to another exemplary embodiment of the present invention, there is provided an apparatus for translating call voice in real time, wherein the apparatus includes: a sound quality detection unit that detects whether a preset condition is met when an electronic terminal needs to translate call voice in real time; a voice quality processing unit that performs sound quality preprocessing on the call voice received from a base station when the preset condition is not met; a transmitting unit that transmits the call voice received from the base station to a translation server that translates the call voice when the preset condition is met, and transmits the call voice processed by the voice quality processing unit to the translation server when the preset condition is not met; and a translation result receiving unit that receives, from the translation server, a translation result corresponding to the transmitted call voice.

Optionally, the apparatus further comprises: an output unit that outputs the translation result received from the translation server.

Optionally, the voice quality processing unit performs sound quality post-processing on the translation result received from the translation server, and the output unit outputs the processed translation result, wherein the translation result is in speech form.

According to another exemplary embodiment of the present invention, a computer-readable storage medium is provided, in which a computer program is stored, which, when being executed by a processor, implements the method for real-time translation of call speech as described above.

According to another exemplary embodiment of the present invention, there is provided an electronic terminal, wherein the electronic terminal includes: a processor; a memory storing a computer program which, when executed by the processor, implements the method of real-time translation of call speech as described above.

The method and device for real-time translation of call voice according to exemplary embodiments of the present invention can effectively reduce the time consumed by the call voice translation process and shorten the time needed to obtain the translation result, thereby improving both the real-time performance of the call voice translation function and the user experience.

Additional aspects and/or advantages of the present general inventive concept will be set forth in part in the description which follows and, in part, will be obvious from the description, or may be learned by practice of the general inventive concept.

Drawings

The above and other objects and features of exemplary embodiments of the present invention will become more apparent from the following description taken in conjunction with the accompanying drawings which illustrate exemplary embodiments, wherein:

Fig. 1 shows a flowchart of a method of real-time translation of call speech according to a first exemplary embodiment of the present invention;

Fig. 2 shows a flowchart of a method of real-time translation of call speech according to a second exemplary embodiment of the present invention;

Fig. 3 is a block diagram illustrating an apparatus for real-time translation of call voice according to a first exemplary embodiment of the present invention;

Fig. 4 is a block diagram illustrating an apparatus for real-time translation of call voice according to a second exemplary embodiment of the present invention.

Detailed Description

Reference will now be made in detail to the embodiments of the present invention, examples of which are illustrated in the accompanying drawings, wherein like reference numerals refer to like elements throughout. The embodiments are described below with reference to the figures in order to explain the present invention.

Fig. 1 shows a flowchart of a method for real-time translation of call speech according to a first exemplary embodiment of the present invention. The method may be implemented by a computer program. For example, the method may be performed by a call voice translation application installed in the electronic terminal or by a function program implemented in an operating system of the electronic terminal. As an example, the electronic terminal may be a mobile communication terminal (e.g., a smartphone), a smart wearable device (e.g., a smart watch), or the like capable of voice call.

Referring to Fig. 1, in step S10, when the electronic terminal needs to translate the call voice in real time, it is detected whether a first preset condition is satisfied.

As an example, when the electronic terminal is in a voice call state and a call voice real-time translation function is turned on, it may be determined that the electronic terminal needs to translate the call voice in real time.

As an example, it may be periodically detected whether a first preset condition is satisfied.

As another example, whether the first preset condition is satisfied may be detected in real time.
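The two detection schedules can be sketched as follows. This is only an illustration under stated assumptions, not the implementation described in this document: the class name `ConditionChecker`, the parameter `period_frames`, and the idea of counting audio frames are all hypothetical, and "real time" is modeled simply as re-checking on every frame (`period_frames=1`).

```python
from typing import Callable, List

Frame = List[float]  # hypothetical representation of one audio frame


class ConditionChecker:
    """Re-evaluates a preset condition every `period_frames` frames.

    period_frames=1 models real-time detection; larger values model
    periodic detection, reusing the last result in between.
    """

    def __init__(self, check: Callable[[Frame], bool], period_frames: int = 1):
        if period_frames < 1:
            raise ValueError("period_frames must be at least 1")
        self._check = check
        self._period = period_frames
        self._count = 0
        self._last = False

    def is_met(self, frame: Frame) -> bool:
        if self._count % self._period == 0:
            self._last = self._check(frame)  # fresh evaluation
        self._count += 1
        return self._last  # cached result between evaluations
```

A longer period lowers the cost of detection on the terminal, at the price of reacting more slowly when the call conditions change.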

As an example, the first preset condition may include: the voice quality of the collected call voice meets a first specific condition, and/or the translation server for translating the call voice performs sound quality preprocessing on the received call voice to be translated.

As an example, the collected call voice may be a call voice collected through a microphone of the electronic terminal.

As an example, the first specific condition may be: the signal-to-noise ratio is higher than a preset threshold. For example, the smaller the ambient noise, the higher the signal-to-noise ratio; the smaller the echo signal, the higher the signal-to-noise ratio.

It should be understood that the first specific condition may also be any other condition for determining whether the voice quality of the call voice is good enough that sound quality preprocessing is unnecessary.
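A signal-to-noise check of this kind might look like the sketch below. The document does not say how signal and noise power are estimated, so this example assumes that a separate noise-only frame (for instance, from a pause in speech) is available. The function names and the 15 dB default threshold are placeholders, not values taken from the document.

```python
import math
from typing import Sequence


def mean_power(samples: Sequence[float]) -> float:
    """Mean of squared sample values; 0.0 for an empty frame."""
    return sum(s * s for s in samples) / len(samples) if samples else 0.0


def snr_db(signal_power: float, noise_power: float) -> float:
    """Signal-to-noise ratio in decibels from two power estimates."""
    if signal_power <= 0:
        return float("-inf")
    if noise_power <= 0:
        return float("inf")
    return 10.0 * math.log10(signal_power / noise_power)


def meets_specific_condition(
    speech: Sequence[float],
    noise: Sequence[float],
    threshold_db: float = 15.0,  # placeholder threshold, an assumption
) -> bool:
    """True when the estimated SNR is higher than the preset threshold."""
    return snr_db(mean_power(speech), mean_power(noise)) > threshold_db
```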

It should be appreciated that whether the translation server performs sound quality preprocessing on received call voice before translating it may be determined in any suitable manner. For example, the translation server handling the current call may be queried as to whether it performs sound quality preprocessing on the received call voice to be translated. Alternatively, this may be confirmed through a corresponding database that records, for different translation servers, whether each server performs sound quality preprocessing on the call voice to be translated.
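The two lookup options above (querying the server, or consulting a database) could be combined as in the following sketch. The function name, the `capability_db` mapping, and the `ask_server` callback are hypothetical. Treating an unknown server as one that does not preprocess is a design assumption made here, so that the terminal falls back to preprocessing on its own.

```python
from typing import Callable, Mapping, Optional


def server_preprocesses(
    server_id: str,
    capability_db: Mapping[str, bool],
    ask_server: Optional[Callable[[str], Optional[bool]]] = None,
) -> bool:
    """Decide whether the given translation server preprocesses call voice.

    Prefers a live answer from the server when one is available, then
    falls back to the recorded capability; unknown servers count as False.
    """
    if ask_server is not None:
        answer = ask_server(server_id)
        if answer is not None:  # the server gave a definite answer
            return answer
    return capability_db.get(server_id, False)
```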

In step S20, when it is detected that the first preset condition is satisfied, the collected call voice is directly sent to a translation server for translating the call voice without performing tone quality preprocessing on the collected call voice.

In step S30, when it is detected that the first preset condition is not satisfied, tone quality preprocessing is performed on the collected call voice, and the processed call voice is sent to the translation server.

It should be understood that the sound quality preprocessing may include various suitable sound quality processing approaches. As an example, the sound quality preprocessing may include: noise reduction processing and/or echo cancellation processing.

In step S40, a translation result corresponding to the transmitted call voice is received from the translation server.
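Steps S10 to S40 can be summarized in a short sketch. The condition check, the preprocessing, and the network exchange with the translation server are passed in as callables because the document leaves them open. All names here, including `translate_call_voice` and `send_and_receive`, are hypothetical.

```python
from typing import Callable, List

Audio = List[float]  # hypothetical stand-in for a chunk of call voice


def translate_call_voice(
    collected: Audio,
    condition_met: Callable[[Audio], bool],
    preprocess: Callable[[Audio], Audio],
    send_and_receive: Callable[[Audio], str],
) -> str:
    """Send call voice for translation, preprocessing only when needed."""
    if condition_met(collected):          # S10: check the preset condition
        outgoing = collected              # S20: send the voice unprocessed
    else:
        outgoing = preprocess(collected)  # S30: preprocess before sending
    return send_and_receive(outgoing)     # S40: receive the translation
```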

In the prior art, call voice is always sent to the translation server only after sound quality preprocessing. When network transmission quality is poor or the translation server is heavily loaded, this leads to a large translation delay. When the translation server itself also preprocesses the received call voice, the preprocessing is performed twice, which wastes time and computing resources.

According to the exemplary embodiment of the present invention, three cases are possible. First, the voice quality of the call voice may be detected, and sound quality preprocessing is performed before sending only when the voice quality is poor. Second, it may be determined whether the translation server preprocesses received call voice before translating it; the terminal preprocesses the call voice only when the server does not, and otherwise sends the unprocessed call voice directly, so that the two sides do not both preprocess it. Third, the two checks may be combined: the terminal preprocesses the call voice only when the server does not preprocess received call voice and the voice quality is poor; otherwise the unprocessed call voice is sent directly.

In this way, the translation server can obtain and process the call voice to be translated as early as possible, which speeds up the call voice translation process and improves its real-time performance. It also reduces the computational load that sound quality preprocessing places on the electronic terminal.
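The three cases described above differ only in how the preset condition is assembled from the two checks. The sketch below makes that explicit. The `Mode` enum and its member names are introduced here for illustration and do not appear in the document.

```python
from enum import Enum


class Mode(Enum):
    QUALITY_ONLY = "quality"  # only the voice quality is checked
    SERVER_ONLY = "server"    # only the server capability is checked
    EITHER = "either"         # both are checked; either one suffices


def preset_condition_met(
    mode: Mode, quality_good: bool, server_preprocesses: bool
) -> bool:
    """True means the call voice is sent without local preprocessing."""
    if mode is Mode.QUALITY_ONLY:
        return quality_good
    if mode is Mode.SERVER_ONLY:
        return server_preprocesses
    # Combined case: preprocess locally only when the voice quality is
    # poor AND the server does not preprocess.
    return quality_good or server_preprocesses
```

In the combined case, local preprocessing happens for exactly one of the four input combinations, which is why it saves the most work on the terminal.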

As an example, the method for translating call voice in real time according to the first exemplary embodiment of the present invention may further include: sending the translation result received from the translation server to the base station, to be forwarded by the base station to another electronic terminal that is in a voice call with the electronic terminal.

As an example, the translation result may be a translation result in a speech form and/or a text form.

As an example, when the translation result is in speech form, sound quality post-processing may be performed on the translation result received from the translation server, and the processed translation result may be sent to the base station.

It should be understood that the sound quality post-processing may include various suitable sound quality processing approaches. As an example, the sound quality post-processing may include: filtering and/or gain setting.
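The document names filtering and gain setting without specifying either one. As a rough illustration only, the sketch below uses a decibel gain with clipping and a moving-average low-pass filter as stand-ins. Both choices, the function names, and the assumption that samples are floats in the range -1.0 to 1.0 are made here, not taken from the document.

```python
from typing import List, Sequence


def apply_gain(samples: Sequence[float], gain_db: float) -> List[float]:
    """Scale samples by a gain in dB, clipping the result to [-1.0, 1.0]."""
    factor = 10 ** (gain_db / 20)
    return [max(-1.0, min(1.0, s * factor)) for s in samples]


def moving_average(samples: Sequence[float], window: int = 3) -> List[float]:
    """Simple low-pass filter: mean of up to `window` most recent samples."""
    if window < 1:
        raise ValueError("window must be at least 1")
    out: List[float] = []
    acc = 0.0
    for i, s in enumerate(samples):
        acc += s
        if i >= window:
            acc -= samples[i - window]  # drop the sample leaving the window
        out.append(acc / min(i + 1, window))
    return out
```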

Fig. 2 shows a flowchart of a method for real-time translation of call speech according to a second exemplary embodiment of the present invention.

Referring to Fig. 2, in step S50, when the electronic terminal needs to translate the call voice in real time, it is detected whether a second preset condition is satisfied.

As an example, it may be periodically detected whether the second preset condition is satisfied.

As another example, whether the second preset condition is satisfied may be detected in real time.

As an example, the second preset condition may include: the voice quality of the call voice received from the base station satisfies a second specific condition, and/or the translation server for translating the call voice performs sound quality preprocessing on the received call voice to be translated.

As an example, the second specific condition may be: the signal-to-noise ratio is higher than a preset threshold.

It should be understood that the second specific condition may also be any other condition for determining whether the voice quality of the call voice is good enough that sound quality preprocessing is unnecessary.

In step S60, when it is detected that the second preset condition is satisfied, the call voice received from the base station is directly transmitted to the translation server for translating the call voice without performing the voice quality preprocessing on the call voice received from the base station.

In step S70, when it is detected that the second preset condition is not satisfied, voice quality preprocessing is performed on the call voice received from the base station, and the processed call voice is transmitted to the translation server.

In step S80, a translation result corresponding to the transmitted call voice is received from the translation server.

As an example, the method for real-time translation of call voice according to the second exemplary embodiment of the present invention may further include: outputting the translation result received from the translation server.

By way of example, the translation results received from the translation server may be output in a variety of suitable ways. For example, the translation result may be output in the form of voice and/or text.

As an example, when the translation result is in speech form, sound quality post-processing may be performed on the translation result received from the translation server, and the processed translation result may be output.

Fig. 3 illustrates a block diagram of an apparatus for real-time translation of call voice according to a first exemplary embodiment of the present invention.

As shown in Fig. 3, the apparatus for translating call voice in real time according to the first exemplary embodiment of the present invention includes: a sound quality detection unit 10, a voice quality processing unit 20, a transmitting unit 30, and a translation result receiving unit 40.

Specifically, the sound quality detection unit 10 is configured to detect whether a first preset condition is satisfied when the electronic terminal needs to translate the call voice in real time.

As an example, the sound quality detection unit 10 may determine that the electronic terminal needs to translate the call voice in real time when the electronic terminal is in a voice call state and the call voice real-time translation function is turned on.

As an example, the sound quality detection unit 10 may periodically detect whether the first preset condition is satisfied.

As another example, the sound quality detection unit 10 may detect whether the first preset condition is satisfied in real time.

The voice quality processing unit 20 is configured to perform voice quality preprocessing on the collected call voice when it is detected that the first preset condition is not satisfied.

The transmitting unit 30 is configured to transmit the collected call voice to a translation server for translating the call voice when it is detected that the first preset condition is satisfied, and to transmit the call voice processed by the voice quality processing unit 20 to the translation server when it is detected that the first preset condition is not satisfied.

Specifically, when it is detected that the first preset condition is satisfied, the voice quality processing unit 20 does not perform voice quality preprocessing on the collected call voice, and the transmitting unit 30 directly transmits the collected call voice to a translation server for translating the call voice; when detecting that the first preset condition is not satisfied, the voice quality processing unit 20 performs voice quality preprocessing on the collected call voice, and the sending unit 30 sends the processed call voice to the translation server.

The translation result receiving unit 40 is configured to receive a translation result corresponding to the transmitted call voice from the translation server.

As an example, the transmitting unit 30 may also transmit the translation result received from the translation server to the base station to be forwarded by the base station to another electronic terminal that performs a voice call with the electronic terminal.

As an example, the voice quality processing unit 20 may perform voice quality post-processing on the translation result received from the translation server, and the transmitting unit 30 may transmit the processed translation result to the base station, wherein the translation result is a translation result in a speech form.

As shown in Fig. 4, the apparatus for translating call voice in real time according to the second exemplary embodiment of the present invention includes: a sound quality detection unit 50, a voice quality processing unit 60, a transmitting unit 70, and a translation result receiving unit 80.

Specifically, the sound quality detection unit 50 is configured to detect whether the second preset condition is satisfied when the electronic terminal needs to translate the call voice in real time.

As an example, the sound quality detection unit 50 may determine that the electronic terminal needs to translate the call voice in real time when the electronic terminal is in a voice call state and the call voice real-time translation function is turned on.

As an example, the sound quality detection unit 50 may periodically detect whether the second preset condition is satisfied.

As another example, the sound quality detection unit 50 may detect whether the second preset condition is satisfied in real time.

The voice quality processing unit 60 is configured to perform voice quality preprocessing on the call voice received from the base station when it is detected that the second preset condition is not satisfied.

The transmitting unit 70 is configured to transmit the call voice received from the base station to a translation server for translating the call voice when it is detected that the second preset condition is satisfied, and to transmit the call voice processed by the voice quality processing unit 60 to the translation server when it is detected that the second preset condition is not satisfied.

Specifically, when it is detected that the second preset condition is satisfied, the voice quality processing unit 60 does not perform voice quality preprocessing on the call voice received from the base station, and the transmitting unit 70 directly transmits the call voice received from the base station to a translation server for translating the call voice; when it is detected that the second preset condition is not satisfied, the voice quality processing unit 60 performs voice quality preprocessing on the call voice received from the base station, and the transmitting unit 70 transmits the processed call voice to the translation server.

The translation result receiving unit 80 is configured to receive a translation result corresponding to the transmitted call voice from the translation server.

As an example, the apparatus for translating call voice in real time according to the second exemplary embodiment of the present invention may further include: an output unit (not shown) for outputting the translation result received from the translation server.

As an example, the output unit may output the translation result received from the translation server in various appropriate manners. For example, the output unit may output the translation result in the form of voice and/or text.

As an example, the voice quality processing unit 60 may perform voice quality post-processing on the translation result received from the translation server, and the output unit outputs the processed translation result, wherein the translation result is a translation result in a speech form.

It should be understood that the apparatus for real-time translation of call voice according to the first exemplary embodiment of the present invention may perform the method described with reference to Fig. 1, and the apparatus according to the second exemplary embodiment may perform the method described with reference to Fig. 2. To avoid repetition, the details are not described again here.

Further, it should be understood that each unit in the apparatus for real-time translation of call voice according to the first exemplary embodiment of the present invention may be implemented as a hardware component and/or a software component. The individual units may be implemented, for example, using Field Programmable Gate Arrays (FPGAs) or Application Specific Integrated Circuits (ASICs), depending on the processing performed by the individual units as defined by the skilled person.

A computer-readable storage medium according to an exemplary embodiment of the present invention stores a computer program that, when executed by a processor, causes the processor to perform the method of real-time translation of call voice of the first exemplary embodiment. The computer readable storage medium is any data storage device that can store data which can be read by a computer system. Examples of computer-readable storage media include: read-only memory, random access memory, read-only optical disks, magnetic tapes, floppy disks, optical data storage devices, and carrier waves (such as data transmission through the internet via wired or wireless transmission paths).

An electronic terminal according to an exemplary embodiment of the present invention includes: a processor (not shown) and a memory (not shown), wherein the memory stores a computer program which, when executed by the processor, implements the method of real-time translation of call speech as in the first exemplary embodiment.

Further, it should be understood that each unit in the apparatus for real-time translation of call voice according to the second exemplary embodiment of the present invention may be implemented as a hardware component and/or a software component. The individual units may be implemented, for example, using Field Programmable Gate Arrays (FPGAs) or Application Specific Integrated Circuits (ASICs), depending on the processing performed by the individual units as defined by the skilled person.

A computer-readable storage medium according to an exemplary embodiment of the present invention stores a computer program that, when executed by a processor, causes the processor to perform the method of real-time translation of call voice of the second exemplary embodiment. The computer readable storage medium is any data storage device that can store data which can be read by a computer system. Examples of computer-readable storage media include: read-only memory, random access memory, read-only optical disks, magnetic tapes, floppy disks, optical data storage devices, and carrier waves (such as data transmission through the internet via wired or wireless transmission paths).

An electronic terminal according to an exemplary embodiment of the present invention includes: a processor (not shown) and a memory (not shown), wherein the memory stores a computer program which, when executed by the processor, implements the method of real-time translation of call speech as in the second exemplary embodiment.

Although a few exemplary embodiments of the present invention have been shown and described, it would be appreciated by those skilled in the art that changes may be made in these embodiments without departing from the principles and spirit of the invention, the scope of which is defined in the claims and their equivalents.

Claims

1. A method for real-time translation of call speech, wherein the method comprises:

when the electronic terminal needs to translate call voice in real time, detecting whether a preset condition is met;

when the preset condition is detected to be met, the collected call voice is sent to a translation server for translating the call voice;

when detecting that the preset condition is not met, carrying out tone quality preprocessing on the collected call voice, and sending the processed call voice to a translation server;

receiving a translation result corresponding to the transmitted call voice from the translation server,

wherein the preset condition includes: the translation server can carry out tone quality preprocessing on the received call voice to be translated.

2. The method of claim 1, wherein the method further comprises: transmitting the translation result received from the translation server to the base station to be forwarded by the base station to another electronic terminal having a voice call with the electronic terminal;

and/or, the step of detecting whether the preset condition is met comprises: periodically detecting whether the preset condition is met; or detecting in real time whether the preset condition is met;

and/or, the tone quality preprocessing comprises: noise reduction processing or echo cancellation processing.

3. The method of claim 2, wherein the transmitting the translation result received from the translation server to the base station comprises: performing sound quality post-processing on the translation result received from the translation server, and sending the processed translation result to the base station, wherein the translation result is a translation result in a voice form.

4. The method of claim 3, wherein the sound quality post-processing comprises: filtering processes and/or gain settings.

5. A method for real-time translation of call speech, wherein the method comprises:

when detecting that a preset condition is met, sending the call voice received from the base station to a translation server for translating the call voice;

when detecting that the preset condition is not met, carrying out tone quality preprocessing on the call voice received from the base station, and sending the processed call voice to a translation server;

6. The method of claim 5, wherein the method further comprises: outputting the translation result received from the translation server;

7. The method of claim 6, wherein the outputting of the translation result received from the translation server comprises: performing sound quality post-processing on the translation result received from the translation server, and outputting the processed translation result, wherein the translation result is a translation result in a voice form.

8. The method of claim 7, wherein the sound quality post-processing comprises: filtering processes and/or gain settings.

9. An apparatus for real-time translation of call speech, wherein the apparatus comprises:

a tone quality detection unit configured to detect whether a preset condition is met when the electronic terminal needs to translate the call voice in real time;

a tone quality processing unit configured to perform tone quality preprocessing on the collected call voice when it is detected that the preset condition is not met;

the transmitting unit is used for transmitting the collected call voice to a translation server for translating the call voice when the preset condition is detected to be met; when detecting that the preset condition is not met, sending the call voice processed by the tone quality processing unit to a translation server;

a translation result receiving unit that receives a translation result corresponding to the transmitted call voice from the translation server,

10. The apparatus of claim 9, wherein the transmitting unit further transmits the translation result received from the translation server to the base station to be forwarded by the base station to another electronic terminal performing a voice call with the electronic terminal;

and/or the tone quality detection unit periodically detects whether a preset condition is met; or, the tone quality detection unit detects whether preset conditions are met in real time;

11. The apparatus of claim 10, wherein the tone quality processing unit performs voice quality post-processing on the translation result received from the translation server, wherein the transmitting unit transmits the translation result processed by the tone quality processing unit to the base station, wherein the translation result is a translation result in a voice form.

12. The apparatus of claim 11, wherein the voice quality post-processing comprises: filtering processes and/or gain settings.

13. An apparatus for real-time translation of call speech, wherein the apparatus comprises:

the tone quality processing unit is used for preprocessing the tone quality of the call voice received from the base station when detecting that the preset condition is not met;

a transmitting unit which transmits the call voice received from the base station to a translation server for translating the call voice when it is detected that a preset condition is satisfied; when detecting that the preset condition is not met, sending the call voice processed by the tone quality processing unit to a translation server;

14. The apparatus of claim 13, wherein the apparatus further comprises: an output unit that outputs the translation result received from the translation server;

15. The apparatus of claim 14, wherein the tone quality processing unit performs voice quality post-processing on the translation result received from the translation server, wherein the output unit outputs the processed translation result, wherein the translation result is a translation result in a voice form.

16. The apparatus of claim 15, wherein the voice quality post-processing comprises: filtering processes and/or gain settings.

17. A computer-readable storage medium, in which a computer program is stored, which, when being executed by a processor, implements a method for real-time translation of call speech according to any one of claims 1 to 4 and/or a method for real-time translation of call speech according to any one of claims 5 to 8.

18. An electronic terminal, wherein the electronic terminal comprises:

a processor;

a memory storing a computer program which, when executed by the processor, implements the method of real-time translation of call speech according to any one of claims 1 to 4 and/or the method of real-time translation of call speech according to any one of claims 5 to 8.
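The post-processing recited in claims 4, 8, 12 and 16 (filtering processes and/or gain settings) can be illustrated with a minimal sketch. The claims do not specify which filter or gain law is used, so the moving-average filter, the clipping range of [-1.0, 1.0], and all function names below are assumptions chosen only for illustration.

```python
from typing import List, Sequence


def moving_average_filter(samples: Sequence[float], window: int = 3) -> List[float]:
    """Smooth samples with a trailing moving average (a simple low-pass filter).

    The filter type is an assumption; the claims only say "filtering".
    """
    if window < 1:
        raise ValueError("window must be >= 1")
    out: List[float] = []
    running = 0.0
    for i, x in enumerate(samples):
        running += x
        if i >= window:
            # Drop the sample that just left the window.
            running -= samples[i - window]
        out.append(running / min(i + 1, window))
    return out


def apply_gain(samples: Sequence[float], gain: float) -> List[float]:
    """Scale samples by a gain factor and clip to the assumed range [-1.0, 1.0]."""
    return [max(-1.0, min(1.0, s * gain)) for s in samples]


def post_process(samples: Sequence[float], window: int = 3, gain: float = 1.0) -> List[float]:
    """Filter first, then apply gain, to a translation result in voice form."""
    return apply_gain(moving_average_filter(samples, window), gain)
```

Because the claims say "and/or", either step could also be applied on its own; the sketch chains both only to show one possible ordering.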