WO2014177209A1 - An apparatus for answering a phone call when a recipient of the phone call decides that it is inappropriate to talk, and related method - Google Patents

An apparatus for answering a phone call when a recipient of the phone call decides that it is inappropriate to talk, and related method Download PDF

Info

Publication number
WO2014177209A1
WO2014177209A1 PCT/EP2013/059083 EP2013059083W WO2014177209A1 WO 2014177209 A1 WO2014177209 A1 WO 2014177209A1 EP 2013059083 W EP2013059083 W EP 2013059083W WO 2014177209 A1 WO2014177209 A1 WO 2014177209A1
Authority
WO
WIPO (PCT)
Prior art keywords
speech
message
text
recipient
voice
Prior art date
Application number
PCT/EP2013/059083
Other languages
French (fr)
Inventor
Carolina T. DE CARNEY
Original Assignee
Saronikos Trading And Services, Unipessoal Lda
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Saronikos Trading And Services, Unipessoal Lda filed Critical Saronikos Trading And Services, Unipessoal Lda
Priority to JP2016510947A priority Critical patent/JP6165321B2/en
Priority to EP13721550.5A priority patent/EP2992666B1/en
Priority to ES13721550T priority patent/ES2786079T3/en
Priority to KR1020157034137A priority patent/KR102038827B1/en
Priority to PCT/EP2013/059083 priority patent/WO2014177209A1/en
Priority to US14/787,539 priority patent/US9924012B2/en
Priority to CN201380076175.4A priority patent/CN105210355B/en
Publication of WO2014177209A1 publication Critical patent/WO2014177209A1/en

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M1/00Substation equipment, e.g. for use by subscribers
    • H04M1/64Automatic arrangements for answering calls; Automatic arrangements for recording messages for absent subscribers; Arrangements for recording conversations
    • H04M1/642Automatic arrangements for answering calls; Automatic arrangements for recording messages for absent subscribers; Arrangements for recording conversations storing speech in digital form
    • H04M1/645Automatic arrangements for answering calls; Automatic arrangements for recording messages for absent subscribers; Arrangements for recording conversations storing speech in digital form with speech synthesis
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/274Converting codes to words; Guess-ahead of partial word inputs
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/30Semantic analysis
    • G06F40/35Discourse or dialogue representation
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/18Speech classification or search using natural language modelling
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M1/00Substation equipment, e.g. for use by subscribers
    • H04M1/64Automatic arrangements for answering calls; Automatic arrangements for recording messages for absent subscribers; Arrangements for recording conversations
    • H04M1/642Automatic arrangements for answering calls; Automatic arrangements for recording messages for absent subscribers; Arrangements for recording conversations storing speech in digital form
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M1/00Substation equipment, e.g. for use by subscribers
    • H04M1/72Mobile telephones; Cordless telephones, i.e. devices for establishing wireless links to base stations without route selection
    • H04M1/724User interfaces specially adapted for cordless or mobile telephones
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M1/00Substation equipment, e.g. for use by subscribers
    • H04M1/72Mobile telephones; Cordless telephones, i.e. devices for establishing wireless links to base stations without route selection
    • H04M1/724User interfaces specially adapted for cordless or mobile telephones
    • H04M1/72403User interfaces specially adapted for cordless or mobile telephones with means for local support of applications that increase the functionality
    • H04M1/7243User interfaces specially adapted for cordless or mobile telephones with means for local support of applications that increase the functionality with interactive means for internal management of messages
    • H04M1/72433User interfaces specially adapted for cordless or mobile telephones with means for local support of applications that increase the functionality with interactive means for internal management of messages for voice messaging, e.g. dictaphones
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04WWIRELESS COMMUNICATION NETWORKS
    • H04W4/00Services specially adapted for wireless communication networks; Facilities therefor
    • H04W4/12Messaging; Mailboxes; Announcements
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00Speech synthesis; Text to speech systems
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/18Speech classification or search using natural language modelling
    • G10L15/1822Parsing for meaning understanding
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/26Speech to text systems
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M1/00Substation equipment, e.g. for use by subscribers
    • H04M1/66Substation equipment, e.g. for use by subscribers with means for preventing unauthorised or fraudulent calling
    • H04M1/663Preventing unauthorised calls to a telephone set
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04WWIRELESS COMMUNICATION NETWORKS
    • H04W88/00Devices specially adapted for wireless communication networks, e.g. terminals, base stations or access point devices
    • H04W88/02Terminal devices
    • H04W88/06Terminal devices adapted for operation in multiple networks or having at least two operational modes, e.g. multi-mode terminals

Definitions

  • the present invention relates to an apparatus for answering a phone call when a recipient of the phone call decides that it is inappropriate to talk, as well as the related method.
  • a recipient of the phone call may decide that it is inappropriate to talk in order to answer an incoming call.
  • Said situation may arise from the actual location of the recipient, for example when he/she is using a means of public transport where he/she wishes to maintain confidential the content of the call, or in a business meeting or conference where it is inappropriate to start a telephone conversation, even if the subject matter of the call is important or urgent.
  • an apparatus and a method for answering a phone call when a recipient of the phone call decides that it is inappropriate to talk, the apparatus comprising control means, in particular a key, for sending a dedicated command, opening a voice conversation with a caller, putting the apparatus in a message mode and thus answering the phone call; a microphone that is muted after sending the dedicated command for the entire period during which the apparatus is kept in the message mode; the control means, in particular a keyboard, are adapted to type a text message by the recipient of the apparatus; an earphone device for listening to the caller; a text-to-speech injection apparatus comprising a text-to-speech translation apparatus and a database of words for synthesizing the text message into a speech message and transmitting the speech message to the caller during the voice conversation; and a voice message injection module for sending an alert voice message to the caller, the alert voice message saying that the recipient is not talking, but is substituted by the text-to-speech injection apparatus.
  • control means in particular a key
  • FIG. 1 shows an apparatus according to the present invention
  • FIG. 2 shows a scenario according to the present invention.
  • a first apparatus la can be a smartphone, a traditional mobile phone, a tablet and the like.
  • the first apparatus 1 a comprises first control means, in particular a first dedicated key 3a and a first keyboard 4a.
  • the first apparatus la also comprises a first microphone 5a.
  • said dedicated key 3a and said first keyboard 4a can be a part of a modern touch screen device, generally provided in a smartphone.
  • the first dedicated key 3a allows a user to send a dedicated command for putting the first apparatus la in a "message mode" and, substantially at the same time, answering an incoming phone call opening a voice conversation with a caller.
  • the first dedicated key 3 a also produces the muting of signals coming from the microphone 5a of the first apparatus 1 a for the whole period during which the first apparatus 1 a is kept in such "message mode".
  • the function described above namely the "message mode” can also be enabled by an opening key of a phone call (for example the conventional green key to answer an incoming call), however in this case the first apparatus la must be previously set in a silent/vibration mode. Then, to disable the "message mode", the recipient can press the first dedicated key 3 a or exit from the silent/vibration mode.
  • a phone call for example the conventional green key to answer an incoming call
  • the first apparatus la If the first apparatus la is not in the silent/vibration mode and the recipient presses the opening key to open the incoming call, then the first apparatus la establishes a normal voice communication between caller and recipient. Therefore the first apparatus la is configured in such a way that the first dedicated key 3 a and the opening key of an incoming call operate different functions depending on the current mode thereof (silent/vibration mode or not).
  • this "message mode" function and carrying out said function help the recipient of a phone call to immediately answer without talking, only sending a command to the first apparatus la, in particular pressing the first dedicated key 3a or the opening key (for example the conventional green key) when the apparatus la is in the silent/vibration mode.
  • the first apparatus la allows the recipient to send text messages to the caller via a telecommunications network. The result is the recipient can send and input data in the first apparatus la in a very rapid manner.
  • the first keyboard 4a is useful for typing a text message by the recipient.
  • the first keyboard 4a can be a soft or hard keyboard.
  • Soft keyboard means that the first apparatus la comprises a screen (not shown in Fig. 1) allowing the display of keys of the keyboard.
  • Hard keyboard means that the keyboard comprises physical keys.
  • the first apparatus la also comprises a first earphone device 7a that can be internal or external to the first apparatus la.
  • a first earphone device 7a that can be internal or external to the first apparatus la.
  • Fig. 1 it is shown the first earphone device 7a external to the first apparatus la, but it represents only one non-limiting embodiment thereof.
  • the first earphone device 7a is connected to the first apparatus 1 a through a first wireless or cable connection 6a.
  • the first apparatus la comprises a text-to-speech injection apparatus 20 and a voice message injection module 9. This one is connected (not being shown in figure 1 for the sake of simplicity) to the rest of the apparatus (a smartphone, a traditional mobile phone, a tablet and the like) for sending voice answers, as the voice would arrive from the microphone 5 a.
  • the text-to-speech injection apparatus 20 comprises a text-to-speech translation apparatus 11, which comprises a database 13. Moreover, the text-to-speech injection apparatus 20 comprises a speech recognition and understanding apparatus 15.
  • the scenario 10 comprises the first apparatus la as described above and a second apparatus lb. It is assumed that a first user of the first apparatus la is a recipient of a phone call and a second user of the second apparatus lb is a caller of the phone call.
  • the first apparatus la and the second apparatus lb are in communication via a telecommunication network 23.
  • the second apparatus lb is similar to the first apparatus la, indeed the second apparatus lb may also comprise second control means, in particular a second dedicated key 3b, a second keyboard 4b. It also comprises a second microphone 5b and a second earphone device 7b. Similarly, the second earphone device 7b can be connected to the second apparatus lb through a second wireless or cable connection 6b.
  • the text-to-speech translation apparatus 11 is responsible for capturing a text message and for transforming or translating this text message, sent from a user, for example from the recipient through the first apparatus la, into a speech message.
  • the database 13 contains a vocabulary useful for the translation, in particular it allows an association between text strings, i.e. words or phrases, and speech signals. Each text string is associated with a speech signal.
  • the speech signal represents the message or a part thereof.
  • the text-to-speech injection apparatus 20 is associated to a voice/text conversation, established on a channel through the telecommunication network 23, between the recipient of the first apparatus la and the caller of the second apparatus lb.
  • the recipient when the recipient activates the "message mode", he/she automatically sends, through the voice message injection module 9, an alert voice message (memorized into the apparatus la) to the caller, in particular a message saying that the recipient is not talking, but it is substituted by a text-to-speech injection apparatus 20.
  • the recipient can write a text message using a keyboard 4a of the first apparatus la in reply to the questions posed by the caller.
  • Said text message is synthesized into a speech message and transmitted to the caller via the telecommunication network 23, through the remaining part of the apparatus la.
  • the voice message injection module 9 also analyzes the ongoing voice conversation and detects periods of silence of the caller, during which such speech message is injected into the voice conversation.
  • the voice message injection module 9 injects the speech message into the same audio channel as the voice conversation so as that the recipient of the second apparatus lb can hear the former text message created by the caller of the first apparatus la.
  • the text message is synthesized into the speech message using the database 13 contained by a text-to-speech translation apparatus 11 and speech synthesis software nowadays available on the market.
  • the recipient of the first apparatus la can send the text message to the text-to-speech translation apparatus 11 using a format similar to SMS ("Short Message Service"), or similar to an IM Service ("Instant Messaging Service”) format, in particular "WhatsApp", “Google Talk”, “Skype”, “Viber” and so on.
  • SMS Short Message Service
  • IM Service Intelligent Messaging Service
  • What is important is that, when a certain part of the phrase introduced by the recipient satisfies the need to answer to a question, even if the entire phrase is not yet finished, the recipient can send this text to be converted into a speech with a simple command, such as return or OK.
  • a simple command such as return or OK.
  • the text-to-speech translation apparatus 11 is ready again to prepare other phrases or part of them to be translated into speech.
  • the speech recognition and understanding apparatus 15 receives the voice statements from the caller and analyzes phrases formulated by the caller, extracts some words from these phrases and assigns them a meaning related to the usual phrases utilized in a phone conversation.
  • the speech recognition and understanding apparatus 15 also stores in the database 13 completed phrases useful for example to answer a question posed by the caller, and it extracts and proposes them to the recipient in accordance to the actual question posed by the caller. For doing this, the speech recognition and understanding apparatus 15 analyzes the meaning of the phrase, in particular a question posed by the caller, and looks for a series of answers in the database 13 that may be appropriate to that question. The answers are stored in the database 13 before or during use.
  • the speech recognition and understanding apparatus 15 stores in the database 13 both questions and answers. In this way, each question is associated with at least one answer that can be proposed to the recipient.
  • the caller can talk with the recipient, even if he/she doesn't say word.
  • the latter is analyzed through the speech recognition and understanding apparatus 15 that converts a voice signal of the question into a text format (it performs a speech-to-text translation); then it compares said text format with phrases stored in the database 13. It is important to specify that such comparison is performed as syntactic or semantic way.
  • the speech recognition and understanding apparatus 15 recovers at least one predefined answer stored associated with such question and proposes it to the recipient in text format, i.e. it facilitates the recipient in typing the answer.
  • the recipient will see the answer in text format on a screen of the first apparatus la and he/she can select immediately the answer that he/she prefers, if there are a plurality of answers proposed.
  • a typical phrase/question stored in the database 13 can be "How are you ?”; then associated answers can be “I'm fine.” or “Very fine, thank you, and you ?” or “Not too bad, thanks!” and so on.
  • Another typical question could be "Are you busy?”; then associated answers can be "Yes sorry, call me later” or “No, don't worry, what's up?” and so on. It is clear that a huge amount of questions and answers can be stored in the database 13.
  • the feature just described allows to help a user, in particular the recipient, to respond quickly to a question through a text message without having to entirely type the answer on the first keyboard 4a.
  • the text-to-speech translation apparatus 11 also works in a "learning mode".
  • learning mode can be activated by a user, for example by the recipient of the phone call, it provides to recognize the voice of the user when the first apparatus la is not in the "message mode", i.e. during a conversation, and it distinguishes, word by word, words that the user is pronouncing.
  • the text-to-speech translation apparatus 11 also stores the recognized words in an automatic or manual manner.
  • the manual manner provides that the user can validate the storing of the recognized words through his/her apparatus la sending an information to the text-to-speech injection apparatus 20 that performs the validation and confirms the storing of the single word, i.e. the vocal acoustic signal of the single word.
  • the text-to-speech injection apparatus 20 substitutes the validated and recognized words, i.e. the acoustic speech signal of the word, in the place of the synthesized voice words made on the basis of a predefined male or female voice.
  • the male or female voice is nevertheless used by default.
  • the male or female voice is selectable by the user, for example the recipient, through a menu system of the first apparatus la.
  • the caller of the conversation then can hear the real voice of the recipient and not the predefined male or female voice.
  • the text-to-speech translation apparatus 11 recognizes each single word of that phrase and stores in an automatic o manual manner the acoustic speech signal of each word in the database 13. So, in the database 13, for example, to the word "later” corresponding to the predefined male or female voice is substituted by the acoustic speech signals of the recipient's voice relating to the first apparatus la. Each word of the phrase is then recognized and the respective acoustic speech signal is stored in the database 13.
  • the text-to-speech translation apparatus 11 substitutes, i.e. chooses, the validated and recognized words, i.e. the acoustic speech signal of the word, in the place of the synthesized voice words made on the basis of the predefined male or female voice.
  • the result of the text-to-speech translation is a speech message that comprises acoustic speech signals of the real voice of the recipient.
  • the voice messages sent to the caller will be made entirely by the acoustic speech of the real voice of the user.
  • the first apparatus la identifies and extracts text words, used for creating the speech message, from the database 13 by considering single letters of a word that a user, for example the recipient, is inputting on the keyboard 4a, and it suggests to the user the complete word before it is completely inputted.
  • the first apparatus la identifies text words when a user types a word on the keyboard 4a, then the first apparatus la sends at least one letter of said word to the text-to-speech translation apparatus 11 that queries the database 13 for extracting at least a word that contains those letters. For example, if the user begins to type "He", these letters are sent to the text-to-speech translation apparatus 11 that queries the database 13 for extracting at least a word that contains those letters. So, the database 13 returns at least a word from its vocabulary, i.e. "Hello", "Head", "Hence", "Help” and so on. The text-to-speech translation apparatus 11 then returns these words to the first apparatus la that will display them on the screen thereof.
  • the user can select immediately by the keyboard 4a the word he/she prefers without spending time to type the whole word.
  • This feature is particularly efficient is apparatuses like smart phones having a screen, where it is possible to show on multiple lines, placed one above the others, all the possible words that can satisfy the appropriate message to be sent, so that the user can simply choose the most appropriate one, scrolling them vertically.
  • the text-to-speech translation apparatus 11 provides for storing all past text messages into the database 13, converted into the speech messages, and suggesting entire phrases to the user, for example the recipient, when at least two consecutive words have been already used in said past text messages.
  • the caller can be the first user of the first apparatus 1 a and the recipient can be the second user of the second apparatus lb and vice versa.
  • the second apparatus lb can comprise all the elements described for the first apparatus la, namely also a text-to-speech injection apparatus and a voice message injection module as described above.
  • the apparatus according to the present invention can be implemented through a computer product which can be loaded into a memory of the first and/or second apparatus la, lb and which comprises portions of software code adapted to implement the method by using existing hardware.
  • a first advantage of the apparatus and method according to the present invention is that it overcomes the drawbacks of the prior art.
  • a second advantage of the apparatus and method according to the present invention is substituting the voice of the recipient of a phone call, when he/she decides that is inappropriate to talk, with something that is as similar as far as possible to a normal conversation when the recipient of a phone call answers talking with the caller.
  • a third advantage of the apparatus and method according to the present invention is helping the recipient of a phone call to answer such call without talking, but by inputting data in the apparatus in a very rapid manner.
  • the apparatus and method for answering a phone call when a recipient of the phone call decides that it is inappropriate to talk may be subject to many possible variations without departing from the novelty spirit of the inventive idea; it is also clear that in the practical implementation of the invention the illustrated details may have different shapes or be replaced with other technically equivalent elements.
  • the present invention is not limited to a apparatus and method for answering a phone call when a recipient of the phone call decides that it is inappropriate to talk, but may be subject to many modifications, improvements or replacements of equivalent parts and elements without departing from the novelty spirit of the inventive idea, as clearly specified in the following claims.

Landscapes

  • Engineering & Computer Science (AREA)
  • Signal Processing (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Human Computer Interaction (AREA)
  • Theoretical Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Physics & Mathematics (AREA)
  • Artificial Intelligence (AREA)
  • General Health & Medical Sciences (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Business, Economics & Management (AREA)
  • General Business, Economics & Management (AREA)
  • Telephonic Communication Services (AREA)
  • Telephone Function (AREA)

Abstract

An apparatus (1a; 1b) is described for answering a phone call when a recipient of the phone call decides that it is inappropriate to talk, the apparatus (1a; 1b) comprising control means, in particular a key (3a; 3b), for sending a dedicated command, opening a voice conversation with a caller, putting the apparatus (1a; 1b) in a message mode and so answering the phone call; a microphone (5a; 5b) that is muted after sending the dedicated command for the whole period during which the apparatus (1a; 1b) is kept in the message mode; the control means, in particular a keyboard (4a; 4b), are adapted to type a text message by the recipient of the apparatus (1a; 1b); an earphone device (7a; 7b) for listening to the caller; a text-to-speech injection apparatus (20) comprising a text-to-speech translation apparatus (11) and a database (13) of words for synthesizing the text message into a speech message and transmitting the speech message to the caller during the voice conversation; and a voice message injection module (9) for sending an alert voice message to the caller, the alert voice message saying that the recipient is not talking, but is substituted by the text-to-speech injection apparatus (20).

Description

A APPARATUS FOR ANSWERING A PHONE CALL WHEN A RECIPIENT OF THE PHONE CALL DECIDES THAT IT IS INAPPROPRIATE TO TALK, AND RELATED METHOD.
DESCRIPTION
The present invention relates to an apparatus for answering a phone call when a recipient of the phone call decides that it is inappropriate to talk, as well as the related method.
It is known that a recipient of the phone call may decide that it is inappropriate to talk in order to answer an incoming call.
Said situation may arise from the actual location of the recipient, for example when he/she is using a means of public transport where he/she wishes to maintain confidential the content of the call, or in a business meeting or conference where it is inappropriate to start a telephone conversation, even if the subject matter of the call is important or urgent.
Another situation, where it is practically impossible to talk and therefore answering a phone call, takes place when the recipient is located in a place with a lot of noise (such as a sports stadium) and even shouting into the microphone of the apparatus it is not enough for the caller to distinguish what the recipient is saying in respect to noise that is surrounding him or her.
Normally when the recipient of a phone call is in one of the above said situations, he/she prefers not to answer the incoming phone call and starts messaging the caller using one of the facilities of the phone set, for instance the SMS facility.
However answering a phone call in this way means losing the benefits associated with a voice call, due to the fact that first of all the user has to exit from the phone call and also compel the caller to use the same messaging facility. Moreover all known systems of text messaging require a given amount of time to input the words and the immediacy of a telephone conversation with questions and immediate response is lost. This becomes more evident in connection to the fact that in order to send an SMS it is necessary to finish the entire phrase or message before sending it.
It is therefore one object of the present invention to provide an apparatus and a method for answering a phone call when a recipient of the phone call decides that it is inappropriate to talk, that overcomes the drawbacks of the prior art.
It is another object of the present invention to provide an apparatus and a method for helping the recipient of a phone call answer a call without talking, but by inputting data in the apparatus in a very rapid manner.
It is a further object of the present invention to provide an apparatus and a method for substituting the voice of the recipient of a phone call, when he/she decides that is inappropriate to talk, with something that is as similar as possible to a normal conversation when the recipient of a phone call answers talking with the caller.
These and other objects of the invention are achieved through an apparatus and a method for answering a phone call when a recipient of the phone call decides that it is inappropriate to talk, as claimed in the appended claims, which are an integral part of the present description.
In short, an apparatus and a method are described for answering a phone call when a recipient of the phone call decides that it is inappropriate to talk, the apparatus comprising control means, in particular a key, for sending a dedicated command, opening a voice conversation with a caller, putting the apparatus in a message mode and thus answering the phone call; a microphone that is muted after sending the dedicated command for the entire period during which the apparatus is kept in the message mode; the control means, in particular a keyboard, are adapted to type a text message by the recipient of the apparatus; an earphone device for listening to the caller; a text-to-speech injection apparatus comprising a text-to-speech translation apparatus and a database of words for synthesizing the text message into a speech message and transmitting the speech message to the caller during the voice conversation; and a voice message injection module for sending an alert voice message to the caller, the alert voice message saying that the recipient is not talking, but is substituted by the text-to-speech injection apparatus.
Further features of the invention are set out in the appended claims, which are intended to be an integral part of the present description.
The above objects will become more apparent from the following detailed description of an apparatus and a method for answering a phone call when a recipient of the phone call decides that it is inappropriate to talk, according to the present invention, with particular reference to the annexed drawings, wherein:
- Figure 1 shows an apparatus according to the present invention;
- Figure 2 shows a scenario according to the present invention.
With reference to Fig. 1, it is shown a part of a first apparatus la according to the present invention, such first apparatus la can be a smartphone, a traditional mobile phone, a tablet and the like.
The first apparatus 1 a comprises first control means, in particular a first dedicated key 3a and a first keyboard 4a. The first apparatus la also comprises a first microphone 5a. It is clear that said dedicated key 3a and said first keyboard 4a can be a part of a modern touch screen device, generally provided in a smartphone. The first dedicated key 3a allows a user to send a dedicated command for putting the first apparatus la in a "message mode" and, substantially at the same time, answering an incoming phone call opening a voice conversation with a caller. The first dedicated key 3 a also produces the muting of signals coming from the microphone 5a of the first apparatus 1 a for the whole period during which the first apparatus 1 a is kept in such "message mode".
The function described above, namely the "message mode", can also be enabled by an opening key of a phone call (for example the conventional green key to answer an incoming call), however in this case the first apparatus la must be previously set in a silent/vibration mode. Then, to disable the "message mode", the recipient can press the first dedicated key 3 a or exit from the silent/vibration mode.
If the first apparatus la is not in the silent/vibration mode and the recipient presses the opening key to open the incoming call, then the first apparatus la establishes a normal voice communication between caller and recipient. Therefore the first apparatus la is configured in such a way that the first dedicated key 3 a and the opening key of an incoming call operate different functions depending on the current mode thereof (silent/vibration mode or not).
It is clear that this "message mode" function and carrying out said function help the recipient of a phone call to immediately answer without talking, only sending a command to the first apparatus la, in particular pressing the first dedicated key 3a or the opening key (for example the conventional green key) when the apparatus la is in the silent/vibration mode.
Once the recipient has summoned the "message mode", the first apparatus la allows the recipient to send text messages to the caller via a telecommunications network. The result is the recipient can send and input data in the first apparatus la in a very rapid manner.
The first keyboard 4a is useful for typing a text message by the recipient. The first keyboard 4a can be a soft or hard keyboard. Soft keyboard means that the first apparatus la comprises a screen (not shown in Fig. 1) allowing the display of keys of the keyboard. Hard keyboard means that the keyboard comprises physical keys.
The first apparatus la also comprises a first earphone device 7a that can be internal or external to the first apparatus la. In Fig. 1 it is shown the first earphone device 7a external to the first apparatus la, but it represents only one non-limiting embodiment thereof. In this case the first earphone device 7a is connected to the first apparatus 1 a through a first wireless or cable connection 6a.
Referring again to Fig. 1, the first apparatus la comprises a text-to-speech injection apparatus 20 and a voice message injection module 9. This one is connected (not being shown in figure 1 for the sake of simplicity) to the rest of the apparatus (a smartphone, a traditional mobile phone, a tablet and the like) for sending voice answers, as the voice would arrive from the microphone 5 a.
The text-to-speech injection apparatus 20 comprises a text-to-speech translation apparatus 11, which comprises a database 13. Moreover, the text-to-speech injection apparatus 20 comprises a speech recognition and understanding apparatus 15.
With reference to Fig. 2, a scenario is shown 10 useful for explaining the present invention. The scenario 10 comprises the first apparatus la as described above and a second apparatus lb. It is assumed that a first user of the first apparatus la is a recipient of a phone call and a second user of the second apparatus lb is a caller of the phone call. The first apparatus la and the second apparatus lb are in communication via a telecommunication network 23.
The second apparatus lb is similar to the first apparatus la, indeed the second apparatus lb may also comprise second control means, in particular a second dedicated key 3b, a second keyboard 4b. It also comprises a second microphone 5b and a second earphone device 7b. Similarly, the second earphone device 7b can be connected to the second apparatus lb through a second wireless or cable connection 6b.
The text-to-speech translation apparatus 11 is responsible for capturing a text message and for transforming or translating this text message, sent from a user, for example from the recipient through the first apparatus la, into a speech message.
The database 13 contains a vocabulary useful for the translation, in particular it allows an association between text strings, i.e. words or phrases, and speech signals. Each text string is associated with a speech signal. The speech signal represents the message or a part thereof.
The text-to-speech injection apparatus 20 is associated to a voice/text conversation, established on a channel through the telecommunication network 23, between the recipient of the first apparatus la and the caller of the second apparatus lb.
In addition to that described herein above, when the recipient activates the "message mode", he/she automatically sends, through the voice message injection module 9, an alert voice message (memorized into the apparatus la) to the caller, in particular a message saying that the recipient is not talking, but it is substituted by a text-to-speech injection apparatus 20.
Then, the recipient can write a text message using a keyboard 4a of the first apparatus la in reply to the questions posed by the caller.
Said text message is synthesized into a speech message and transmitted to the caller via the telecommunication network 23, through the remaining part of the apparatus la. The voice message injection module 9 also analyzes the ongoing voice conversation and detects periods of silence of the caller, during which such speech message is injected into the voice conversation.
When a period of silence is detected, the voice message injection module 9 injects the speech message into the same audio channel as the voice conversation so as that the recipient of the second apparatus lb can hear the former text message created by the caller of the first apparatus la.
More in detail the text message is synthesized into the speech message using the database 13 contained by a text-to-speech translation apparatus 11 and speech synthesis software nowadays available on the market.
The recipient of the first apparatus la can send the text message to the text-to-speech translation apparatus 11 using a format similar to SMS ("Short Message Service"), or similar to an IM Service ("Instant Messaging Service") format, in particular "WhatsApp", "Google Talk", "Skype", "Viber" and so on. What is important is that, when a certain part of the phrase introduced by the recipient satisfies the need to answer to a question, even if the entire phrase is not yet finished, the recipient can send this text to be converted into a speech with a simple command, such as return or OK. In this case, when the recipient starts again to introduce words, the text-to-speech translation apparatus 11 is ready again to prepare other phrases or part of them to be translated into speech.
The speech recognition and understanding apparatus 15 receives the voice statements from the caller and analyzes phrases formulated by the caller, extracts some words from these phrases and assigns them a meaning related to the usual phrases utilized in a phone conversation.
The speech recognition and understanding apparatus 15 also stores in the database 13 completed phrases useful for example to answer a question posed by the caller, and it extracts and proposes them to the recipient in accordance to the actual question posed by the caller. For doing this, the speech recognition and understanding apparatus 15 analyzes the meaning of the phrase, in particular a question posed by the caller, and looks for a series of answers in the database 13 that may be appropriate to that question. The answers are stored in the database 13 before or during use.
Alternatively, the speech recognition and understanding apparatus 15 stores in the database 13 both questions and answers. In this way, each question is associated with at least one answer that can be proposed to the recipient.
When a recipient sets its first apparatus la in the "message mode" as described above, the caller can talk with the recipient, even if he/she doesn't say word. Assuming that the caller asks a question, the latter is analyzed through the speech recognition and understanding apparatus 15 that converts a voice signal of the question into a text format (it performs a speech-to-text translation); then it compares said text format with phrases stored in the database 13. It is important to specify that such comparison is performed as syntactic or semantic way.
If the question in a text format is contained within the database 13, the speech recognition and understanding apparatus 15 recovers at least one predefined answer stored associated with such question and proposes it to the recipient in text format, i.e. it facilitates the recipient in typing the answer.
Thus, the recipient will see the answer in text format on a screen of the first apparatus la and he/she can select immediately the answer that he/she prefers, if there are a plurality of answers proposed.
More in detail, for example, a typical phrase/question stored in the database 13 can be "How are you ?"; then associated answers can be "I'm fine." or "Very fine, thank you, and you ?" or "Not too bad, thanks!" and so on. Another typical question could be "Are you busy?"; then associated answers can be "Yes sorry, call me later" or "No, don't worry, what's up?" and so on. It is clear that a huge amount of questions and answers can be stored in the database 13.
Therefore, the feature just described allows to help a user, in particular the recipient, to respond quickly to a question through a text message without having to entirely type the answer on the first keyboard 4a.
Furthermore, the text-to-speech translation apparatus 11 also works in a "learning mode". Such "learning mode" can be activated by a user, for example by the recipient of the phone call, it provides to recognize the voice of the user when the first apparatus la is not in the "message mode", i.e. during a conversation, and it distinguishes, word by word, words that the user is pronouncing.
The text-to-speech translation apparatus 11 also stores the recognized words in an automatic or manual manner. The manual manner provides that the user can validate the storing of the recognized words through his/her apparatus la sending an information to the text-to-speech injection apparatus 20 that performs the validation and confirms the storing of the single word, i.e. the vocal acoustic signal of the single word.
Then, when the first apparatus la operates in the "message mode", the text-to-speech injection apparatus 20 substitutes the validated and recognized words, i.e. the acoustic speech signal of the word, in the place of the synthesized voice words made on the basis of a predefined male or female voice. The male or female voice is nevertheless used by default. Furthermore, the male or female voice is selectable by the user, for example the recipient, through a menu system of the first apparatus la.
The caller of the conversation then can hear the real voice of the recipient and not the predefined male or female voice.
For example, if the recipient of the first apparatus la, during a conversation not in "message mode" with the caller of the second apparatus lb, says the following phrase "Hi Mark, See you later, Bye", the text-to-speech translation apparatus 11 recognizes each single word of that phrase and stores in an automatic o manual manner the acoustic speech signal of each word in the database 13. So, in the database 13, for example, to the word "later" corresponding to the predefined male or female voice is substituted by the acoustic speech signals of the recipient's voice relating to the first apparatus la. Each word of the phrase is then recognized and the respective acoustic speech signal is stored in the database 13.
When the recipient sets his/her first apparatus la in "message mode" and sends a text message to the caller of the second apparatus lb, the text-to-speech translation apparatus 11 substitutes, i.e. chooses, the validated and recognized words, i.e. the acoustic speech signal of the word, in the place of the synthesized voice words made on the basis of the predefined male or female voice. The result of the text-to-speech translation is a speech message that comprises acoustic speech signals of the real voice of the recipient. In addition, after a while the voice messages sent to the caller will be made entirely by the acoustic speech of the real voice of the user.
To speed up the sending of a text message, the first apparatus la identifies and extracts text words, used for creating the speech message, from the database 13 by considering single letters of a word that a user, for example the recipient, is inputting on the keyboard 4a, and it suggests to the user the complete word before it is completely inputted.
More in detail, the first apparatus la identifies text words when a user types a word on the keyboard 4a, then the first apparatus la sends at least one letter of said word to the text-to-speech translation apparatus 11 that queries the database 13 for extracting at least a word that contains those letters. For example, if the user begins to type "He", these letters are sent to the text-to-speech translation apparatus 11 that queries the database 13 for extracting at least a word that contains those letters. So, the database 13 returns at least a word from its vocabulary, i.e. "Hello", "Head", "Hence", "Help" and so on. The text-to-speech translation apparatus 11 then returns these words to the first apparatus la that will display them on the screen thereof. So, the user can select immediately by the keyboard 4a the word he/she prefers without spending time to type the whole word. This feature is particularly efficient is apparatuses like smart phones having a screen, where it is possible to show on multiple lines, placed one above the others, all the possible words that can satisfy the appropriate message to be sent, so that the user can simply choose the most appropriate one, scrolling them vertically. Moreover, the text-to-speech translation apparatus 11 provides for storing all past text messages into the database 13, converted into the speech messages, and suggesting entire phrases to the user, for example the recipient, when at least two consecutive words have been already used in said past text messages.
In other words, when the recipient is typing a message using a keyboard 4a, not only words are suggested to the recipient, but also phrases already used by him/her. The suggestion is shown on the screen of the first apparatus la.
It is clear that everything has been shown with reference to the first apparatus la, can be made also for the second apparatus lb, in the case both the users of the first and second apparatus la, lb desire to start a phone conversation, but both of them don't desire to talk.
Given the duality of the scenario 10, the caller can be the first user of the first apparatus 1 a and the recipient can be the second user of the second apparatus lb and vice versa.
Therefore, the second apparatus lb can comprise all the elements described for the first apparatus la, namely also a text-to-speech injection apparatus and a voice message injection module as described above.
It must be pointed out that the apparatus according to the present invention can be implemented through a computer product which can be loaded into a memory of the first and/or second apparatus la, lb and which comprises portions of software code adapted to implement the method by using existing hardware.
It should also be noted that, due to the increasing computing power of microprocessors used nowadays in mobile phones and the progress made by the software related to both text-to-speech translation and speech recognition, all the features of the apparatus above described can be contained in mobile phones as those in common use. A first advantage of the apparatus and method according to the present invention is that it overcomes the drawbacks of the prior art.
A second advantage of the apparatus and method according to the present invention is substituting the voice of the recipient of a phone call, when he/she decides that is inappropriate to talk, with something that is as similar as far as possible to a normal conversation when the recipient of a phone call answers talking with the caller.
A third advantage of the apparatus and method according to the present invention is helping the recipient of a phone call to answer such call without talking, but by inputting data in the apparatus in a very rapid manner.
The apparatus and method for answering a phone call when a recipient of the phone call decides that it is inappropriate to talk, may be subject to many possible variations without departing from the novelty spirit of the inventive idea; it is also clear that in the practical implementation of the invention the illustrated details may have different shapes or be replaced with other technically equivalent elements.
It can therefore be easily understood that the present invention is not limited to a apparatus and method for answering a phone call when a recipient of the phone call decides that it is inappropriate to talk, but may be subject to many modifications, improvements or replacements of equivalent parts and elements without departing from the novelty spirit of the inventive idea, as clearly specified in the following claims.

Claims

1. An apparatus (la; lb) for answering a phone call when a recipient of said phone call decides that it is inappropriate to talk, said apparatus (l ; lb) comprising:
- control means, in particular a key (3a;3b), for sending a dedicated command that opens a voice conversation with a caller, for putting said apparatus (la; lb) in a message mode and thus answer said phone call;
- a microphone (5a;5b) that is muted after sending said dedicated command for the whole period during which said apparatus (la; lb) is kept in said message mode;
- said control means, in particular a keyboard (4a;4b), being adapted to type a text message by said recipient of said apparatus ( 1 a; 1 b);
- an earphone device (7a; 7b) for listening to said caller;
- a text-to-speech injection apparatus (20) comprising a text-to-speech translation apparatus (11) and a database (13) for synthesizing said text message into a speech message and transmitting said speech message to said caller during said voice conversation; and
- a voice message injection module (9) for sending an alert voice message to said caller, said alert voice message saying that said recipient is not talking, but is substituted by said text-to-speech injection apparatus (20).
2. An apparatus (la; lb) according to claim 1, wherein said text-to-speech translation apparatus (11) being adapted to recognize a voice of said recipient during a conversation and to distinguish, word by word, words that said recipient is pronouncing; said text-to-speech injection apparatus (20) being adapted to store said recognized words that have been used by said recipient during a conversation, and to substitute said recognized and stored words in the place of the synthesized voice words made on the basis of a predefined male or female voice.
3. An apparatus (la; lb) according to claim 1, wherein said voice message injector module (9) analyzes an ongoing voice conversation and detects periods of silence of said caller, during which said speech message is injected into said voice conversation.
4. An apparatus (la; lb) according to claim 1 or 2, wherein said apparatus (la; lb) identifies and extracts text words from said database (13) by considering single letters of a word that said recipient is inputting on said control means, in particular said keyboard (4a;4b), and suggests to said recipient the complete word before it is completely inputted.
5. An apparatus (la; lb) according to one or more of the preceding claims, said text-to- speech injection apparatus (20) further comprising a speech recognition and understanding apparatus (15) that analyzes questions posed by said caller, extracts some words from said questions and assigns them a meaning related to the usual phrases utilized in a phone conversation.
6. An apparatus (la; lb) according to claim 5, wherein said speech recognition and understanding apparatus (15) stores completed phrases to answer said questions posed by said caller in said database (13), and extracts and proposes them to said recipient in accordance to the actual question posed by said caller and analyzed by said speech recognition and understanding apparatus (15).
7. An apparatus (la;lb) according to claim 5 or 6, wherein said speech recognition and understanding apparatus (15) stores in said database (13) both questions and answers and wherein each question is associated with at least one answer.
8. An apparatus (la; lb) according to one or more of the preceding claims, wherein said text-to-speech translation apparatus (11) stores all past text messages into said database (13), converted into said speech messages, and suggests entire phrases to said recipient when at least two consecutive words have been already used in said past text messages.
9. An apparatus (la; lb) according to one or more of the preceding claims, wherein said text-to-speech translation apparatus (11) synthesizes said speech message on the basis of said predefined male or female voice, selectable from a user through a menu system of said apparatus (la; lb).
10. A method for answering a phone call through an apparatus (la;lb) when a recipient of said phone call decides that it is inappropriate to talk, said method comprising the following steps:
- sending a dedicated command though control means, in particular through the selection of a dedicated key (3a;3b) on said apparatus (la; lb), opening a voice conversation with a caller, putting said apparatus (la; lb) in a message mode and thus answering said phone call; - said dedicated command producing the muting of signals coming from a microphone (5a;5b) of said apparatus (la; lb) for the whole period during which said apparatus (la; lb) is kept in said message mode;
- listening to said caller through an earphone device (7a; 7b);
- sending, through a voice message injection module (9), an alert voice message to said caller, said alert voice message saying that said recipient is not talking, but is substituted by a text-to-speech injection apparatus (20);
- writing by said recipient a text message in reply to questions posed by said caller; and
- synthesizing said text message into a speech message through a text-to-speech translation apparatus (11) using a database (13), and transmitting said speech message to said caller during said voice conversation through said text-to-speech injection apparatus (20).
11. A method according to claim 10, said method further comprising the steps of: - recognizing, through said text-to-speech translation apparatus (11), a voice of said recipient during a conversation and distinguishing, word by word, words that said recipient is pronouncing;
- storing, through said text-to-speech injection apparatus (20), said recognized words that have been used by said recipient during a conversation which did not occur in said message mode; and
- substituting said recognized and stored words in said text-to-speech injection apparatus (20) in the place of synthesized voice words made on the basis of a predefined male or female voice.
12. A method according to claim 11, wherein the step of storing words is validated by a user of said apparatus ( 1 a; 1 b) .
13. A method according to claim 11, wherein said step of recognizing a voice of said recipient when said apparatus (la; lb) is not in said message mode is activated by said user.
14. A method according to claim 10, wherein said method further comprising the step of analyzing an ongoing voice conversation and detecting periods of silence of said caller, during which said speech message is injected into said voice conversation through said voice message injector module (9).
15. A method according to one or more of the preceding claims, wherein said method further comprising the step of analyzing questions posed by said caller through a speech recognition and understanding apparatus (15) that extracts some words from said questions and assigns them a meaning related to the usual phrases utilized in a phone conversation.
16. A method according to claim 15, said method further comprising the step of storing in said database (13), through said speech recognition and understanding apparatus (15), both questions and answers, wherein each question is associated with at least one answer.
17. A method according to claims 15 or 16, said method further comprising the step of storing completed phrases to answer said questions posed by said caller in said database (13), and extracting and proposing them to said recipient in accordance to the actual question posed by said caller and analyzed by said speech recognition and understanding apparatus (15).
18. A method according to one or more of the preceding claims, said method further comprising the step of identifying and extracting text words, used for creating said speech message, from said database (13) by considering single letters of a word that said recipient is inputting, and suggesting to said recipient the complete word before it is completely inputted.
19. A method according to one or more of the preceding claims, said method further comprising the step of storing all past text messages, converted into said speech messages, into said database (13) through said text-to-speech translation apparatus (11), and suggesting entire phrases to said recipient when at least two consecutive words have been already used in said past text messages.
20. A method according to one or more of the preceding claims, said method further comprising the step of synthesizing said speech message on the basis of said predefined male or female voice, selectable from said user through a menu system of said apparatus (la;lb).
21. A method according to one or more of the preceding claims, wherein said text message uses a format similar to an SMS ("Short Messaging Service") or an IM Service message ("Instant Messaging Service").
22. A computer product which can be loaded into a memory of said apparatus (la; lb), comprising portions of software code adapted to implement the method according to one or more of claims 10 to 21.
PCT/EP2013/059083 2013-05-02 2013-05-02 An apparatus for answering a phone call when a recipient of the phone call decides that it is inappropriate to talk, and related method WO2014177209A1 (en)

Priority Applications (7)

Application Number Priority Date Filing Date Title
JP2016510947A JP6165321B2 (en) 2013-05-02 2013-05-02 Apparatus and method
EP13721550.5A EP2992666B1 (en) 2013-05-02 2013-05-02 An apparatus for answering a phone call when a recipient of the phone call decides that it is inappropriate to talk, and related method
ES13721550T ES2786079T3 (en) 2013-05-02 2013-05-02 Apparatus for answering a telephone call when a recipient of the telephone call decides that it is inappropriate to speak and related method
KR1020157034137A KR102038827B1 (en) 2013-05-02 2013-05-02 An apparatus for answering a phone call when a recipient of the phone call decides that it is inappropriate to talk, and related method
PCT/EP2013/059083 WO2014177209A1 (en) 2013-05-02 2013-05-02 An apparatus for answering a phone call when a recipient of the phone call decides that it is inappropriate to talk, and related method
US14/787,539 US9924012B2 (en) 2013-05-02 2013-05-02 Apparatus for answering a phone call when a recipient of the phone call decides that it is inappropriate to talk, and related method
CN201380076175.4A CN105210355B (en) 2013-05-02 2013-05-02 Equipment and correlation technique for the answer calls when recipient's judgement of call is not suitable for speaking

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/EP2013/059083 WO2014177209A1 (en) 2013-05-02 2013-05-02 An apparatus for answering a phone call when a recipient of the phone call decides that it is inappropriate to talk, and related method

Publications (1)

Publication Number Publication Date
WO2014177209A1 true WO2014177209A1 (en) 2014-11-06

Family

ID=48366310

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/EP2013/059083 WO2014177209A1 (en) 2013-05-02 2013-05-02 An apparatus for answering a phone call when a recipient of the phone call decides that it is inappropriate to talk, and related method

Country Status (7)

Country Link
US (1) US9924012B2 (en)
EP (1) EP2992666B1 (en)
JP (1) JP6165321B2 (en)
KR (1) KR102038827B1 (en)
CN (1) CN105210355B (en)
ES (1) ES2786079T3 (en)
WO (1) WO2014177209A1 (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106060272A (en) * 2016-07-12 2016-10-26 魏喜国 Bilingual instant translation device taking smart phone as carrier
CN107404432A (en) * 2017-09-02 2017-11-28 刘兴丹 A kind of method, apparatus of a variety of networking sending bulk messages of combination

Families Citing this family (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP3395054B1 (en) * 2015-12-21 2023-11-22 Saronikos Trading and Services, Unipessoal Lda Apparatus and method for managing communications
US11170757B2 (en) * 2016-09-30 2021-11-09 T-Mobile Usa, Inc. Systems and methods for improved call handling
CN107071328B (en) * 2016-12-16 2019-12-03 维沃移动通信有限公司 A kind of video calling processing method and mobile terminal
EP4092998A1 (en) 2017-06-30 2022-11-23 Google LLC Methods, systems, and media for connecting an iot device to a call
EP3646161A1 (en) 2017-06-30 2020-05-06 Google LLC Methods, systems, and media for voice-based call operations
CN109698877A (en) * 2017-10-24 2019-04-30 华为终端(东莞)有限公司 Voice communication method and voice communication assembly
CN108965600B (en) * 2018-07-24 2021-05-04 Oppo(重庆)智能科技有限公司 Voice pickup method and related product
US11381675B2 (en) 2018-12-12 2022-07-05 Samsung Electronics Co., Ltd. Command based interactive system and a method thereof
RU2719659C1 (en) * 2019-01-10 2020-04-21 Общество с ограниченной ответственностью "Центр речевых технологий" (ООО "ЦРТ") Device for recording and controlling input of voice information
CN110602328B (en) * 2019-09-30 2021-10-22 联想(北京)有限公司 Processing method and processing device
US11019207B1 (en) * 2019-11-07 2021-05-25 Hithink Royalflush Information Network Co., Ltd. Systems and methods for smart dialogue communication

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0249575A2 (en) * 1986-04-16 1987-12-16 Call It Co Computerized communications system
EP1465393A1 (en) * 2003-04-01 2004-10-06 Silent Communication Ltd. Apparatus and method for silent communication using pre-recorded audible messages
GB2402650A (en) * 2003-12-31 2004-12-15 Research In Motion Ltd Keyboard arrangement
GB2408170A (en) * 2002-06-07 2005-05-18 Hewlett Packard Development Co Telephone communication with silent response feature
US7106852B1 (en) * 2000-09-08 2006-09-12 Fuji Xerox Co., Ltd. Telephone accessory for generating conversation utterances to a remote listener in response to a quiet selection

Family Cites Families (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS6354855A (en) * 1986-08-26 1988-03-09 Oki Electric Ind Co Ltd Automatic answering telephone set
JPH0774843A (en) * 1993-09-01 1995-03-17 Omron Corp Communication terminal device
JP3165585B2 (en) * 1994-05-13 2001-05-14 シャープ株式会社 Information processing device
WO2000022591A1 (en) * 1998-10-14 2000-04-20 Morris Gary J Communicative environmental alarm system with voice indication
JP2000134298A (en) * 1998-10-21 2000-05-12 Kazuo Ishikawa Character input device
US6438524B1 (en) * 1999-11-23 2002-08-20 Qualcomm, Incorporated Method and apparatus for a voice controlled foreign language translation device
US7083342B2 (en) * 2001-12-21 2006-08-01 Griffin Jason T Keyboard arrangement
GB0213021D0 (en) * 2002-06-07 2002-07-17 Hewlett Packard Co Telephone communication with silent response feature
JP2004129174A (en) * 2002-08-06 2004-04-22 Ricoh Co Ltd Information communication instrument, information communication program, and recording medium
FI20055717A0 (en) * 2005-12-30 2005-12-30 Nokia Corp Code conversion method in a mobile communication system
JP2008072425A (en) * 2006-09-14 2008-03-27 Interfuncs Co Ltd Telephone call controller for portable telephone
JP3165585U (en) 2010-11-11 2011-01-27 有限会社オフィス結アジア Speech synthesizer
JP2012205223A (en) * 2011-03-28 2012-10-22 Sanyo Electric Co Ltd Communication apparatus
EP2536176B1 (en) * 2011-06-16 2016-09-21 Alcatel Lucent Text-to-speech injection apparatus for telecommunication system

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0249575A2 (en) * 1986-04-16 1987-12-16 Call It Co Computerized communications system
US7106852B1 (en) * 2000-09-08 2006-09-12 Fuji Xerox Co., Ltd. Telephone accessory for generating conversation utterances to a remote listener in response to a quiet selection
GB2408170A (en) * 2002-06-07 2005-05-18 Hewlett Packard Development Co Telephone communication with silent response feature
EP1465393A1 (en) * 2003-04-01 2004-10-06 Silent Communication Ltd. Apparatus and method for silent communication using pre-recorded audible messages
GB2402650A (en) * 2003-12-31 2004-12-15 Research In Motion Ltd Keyboard arrangement

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106060272A (en) * 2016-07-12 2016-10-26 魏喜国 Bilingual instant translation device taking smart phone as carrier
CN107404432A (en) * 2017-09-02 2017-11-28 刘兴丹 A kind of method, apparatus of a variety of networking sending bulk messages of combination

Also Published As

Publication number Publication date
EP2992666A1 (en) 2016-03-09
CN105210355B (en) 2019-03-22
KR102038827B1 (en) 2019-10-31
US9924012B2 (en) 2018-03-20
JP6165321B2 (en) 2017-07-19
JP2016524365A (en) 2016-08-12
US20160065711A1 (en) 2016-03-03
ES2786079T3 (en) 2020-10-08
CN105210355A (en) 2015-12-30
KR20160005075A (en) 2016-01-13
EP2992666B1 (en) 2020-02-26

Similar Documents

Publication Publication Date Title
EP2992666B1 (en) An apparatus for answering a phone call when a recipient of the phone call decides that it is inappropriate to talk, and related method
US10255918B2 (en) Command and control of devices and applications by voice using a communication base system
CN102117614B (en) Personalized text-to-speech synthesis and personalized speech feature extraction
US7305068B2 (en) Telephone communication with silent response feature
KR20200016295A (en) Asynchronous multimode messaging system and method
KR20090085376A (en) Service method and apparatus for using speech synthesis of text message
US8229086B2 (en) Apparatus, system and method for providing silently selectable audible communication
US20080096587A1 (en) Telephone for Sending Voice and Text Messages
US20240305707A1 (en) Systems and methods for cellular and landline text-to-audio and audio-to-text conversion
KR20080054591A (en) Method for communicating voice in wireless terminal
US9237224B2 (en) Text interface device and method in voice communication
US20050122959A1 (en) Enhanced telecommunication system
WO2015011217A1 (en) User-interface using rfid-tags or voice as input
JP2005123869A (en) System and method for dictating call content
KR20060130897A (en) Silent call communication appartus and method for mobile terminal
JP2016144024A (en) Telephone apparatus with voice memo storage function

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 13721550

Country of ref document: EP

Kind code of ref document: A1

WWE Wipo information: entry into national phase

Ref document number: 14787539

Country of ref document: US

ENP Entry into the national phase

Ref document number: 2016510947

Country of ref document: JP

Kind code of ref document: A

NENP Non-entry into the national phase

Ref country code: DE

ENP Entry into the national phase

Ref document number: 20157034137

Country of ref document: KR

Kind code of ref document: A

WWE Wipo information: entry into national phase

Ref document number: 2013721550

Country of ref document: EP