WO2015043442A1 - Method, device and mobile terminal for text-to-speech processing - Google Patents

Method, device and mobile terminal for text-to-speech processing Download PDF

Info

Publication number
WO2015043442A1
WO2015043442A1 PCT/CN2014/087137 CN2014087137W WO2015043442A1 WO 2015043442 A1 WO2015043442 A1 WO 2015043442A1 CN 2014087137 W CN2014087137 W CN 2014087137W WO 2015043442 A1 WO2015043442 A1 WO 2015043442A1
Authority
WO
WIPO (PCT)
Prior art keywords
text
speech
voice
interactive interface
application
Prior art date
Application number
PCT/CN2014/087137
Other languages
French (fr)
Inventor
Hui Tang
Original Assignee
Tencent Technology (Shenzhen) Company Limited
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Tencent Technology (Shenzhen) Company Limited filed Critical Tencent Technology (Shenzhen) Company Limited
Publication of WO2015043442A1 publication Critical patent/WO2015043442A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/16Sound input; Sound output
    • G06F3/167Audio in a user interface, e.g. using voice commands for navigating, audio feedback

Definitions

  • the present disclosure relates to the field of computer and mobile communication technologies, and more particularly to a method, device and mobile terminal for text-to-speech processing.
  • a user can easily read texts, such as a novel, a prose or the like, on a browser of mobile terminals.
  • texts such as a novel, a prose or the like
  • text browsing by sight has many limitations in its applications.
  • a user is limited to browse by sight, which adversely affects user experience. Accordingly, it would be advantageous to provide a method to browse texts by sound.
  • the present disclosure discloses a method, a device and a mobile terminal for text-to-speech processing, which provide more approaches to browse text and reduce the effects on the performance of text browsing.
  • a method for text-to-speech processing comprises: receiving, by a device having a processor and a speaker, a voice play request on an interactive interface of a social application, the voice play request comprises selected text; converting, by the device, the selected text into speech according to the voice play request; and outputting, by the speaker of the device, the speech.
  • a device for text-to-speech processing comprises a processor and a non-transitory storage medium.
  • the non-transitory storage medium comprises: a request acquisition module, configured to obtain a voice play request on an interactive interface of a social application, wherein the voice play request comprises selected text; a transforming module, configured to transform the selected text to speech according to the voice play request; and a play control module, configured to play the speech on the interactive interface.
  • a mobile terminal comprises a device for text-to-speech processing.
  • the device comprises a processor and a non-transitory storage medium.
  • the non-transitory storage medium comprises: a request acquisition module, configured to obtain a voice play request on an interactive interface of a social application, wherein the voice play request comprises selected text; a transforming module, configured to transform the selected text to speech according to the voice play request; and a play control module, configured to play the speech on the interactive interface.
  • Fig. 1 is the flow diagram of a text-to-speech processing method in one embodiment of this disclosure.
  • Fig. 2 is the flow diagram of a text-to-speech processing method in one embodiment of this disclosure.
  • Fig. 3 is the flow diagram of a text-to-speech processing method in one embodiment of this disclosure.
  • Fig. 4 is the structural diagram of a device for text-to-speech processing in one embodiment of this disclosure.
  • Fig. 5 is the structural diagram of a transforming module in one embodiment of this disclosure.
  • Fig. 6 is the structural diagram of a device for text-to-speech processing in one embodiment of this disclosure.
  • Fig. 7 is the structural diagram of a device for text-to-speech processing in one embodiment of this disclosure.
  • Fig. 8 is the diagram of application scenarios provided in one embodiment of this disclosure.
  • Interactive interface is the channel of message exchange between human and mobile terminals.
  • the interactive interface is used by users to input messages into mobile terminals and execute operations, while the mobile terminals are used to provide messages for reading, analyzing and judging.
  • Interactive interfaces in the embodiments of this disclosure contain home pages of software or applications, as well as various function buttons, interactive interfaces.
  • the interactive interfaces, of different levels or appearing after different triggering events, are different interactive interfaces, and those of a same level or appearing after one triggering event are the same interactive interface, which is not determined by the content on the interactive interface.
  • the difference between various interactive interfaces is only related to the inherent difference of interactive interface in a certain level but not to the content in the interactive interface.
  • Social applications include instant messaging, text chat, internet forums, blogs, social network services, and any other social software.
  • the mobile terminal may be a mobile phone, a smart phone, a tablet PC or other terminal devices.
  • the device of text-to-speech processing may be a hardware device contained in the mobile terminal, or a text application in the mobile terminal, e. g. a browser application or the like.
  • the text-to-speech processing device can show webpage on basis of Webview, which is a kind of space used to show webpage on basis of Webkit Kernel.
  • the text-to-speech processing method provided in the embodiments of this disclosure may be applied when it is inconvenient to read text.
  • the text-to-speech processing device can transform the text to speech; or when words in a webpage are read by a user on a trip, the text-to-speech processing device can transform the text to speech, or the like.
  • the text-to-speech processing device obtains the text selected for voice play on an interactive interface of a social application, transforms the selected text, generates speech corresponding to the selected text and plays the speech on the interactive interface.
  • the flow diagram of a text-to-speech processing method shows the embodiments of this disclosure.
  • the method in the embodiments of this disclosure comprises steps of S101-S103 as below.
  • S101 obtaining a voice play request on the interactive interface of a social application, in which the voice play request comprises a selected text.
  • the text-to-speech processing device finds a voice play request on the interactive interface of the social application, the text-to-speech processing device will obtain the selected text on the interactive interface.
  • the voice play request comprises the selected text, which may contains Chinese characters, English characters, and so on.
  • the selected text may be the text selected on the interactive interface by a user.
  • the text-to-speech processing device can analyze the voice play request and obtain the text after analysis.
  • the text-to-speech processing device may apply a voice application to obtain speech corresponding to the selected text according to the voice play request.
  • the voice application obtains the speech corresponding to the text, according to the correspondence between the text and the speech.
  • the correspondence between the text and the speech is based on the following: one text is corresponding to only one code information, and one code information is corresponding to only one speech.
  • the voice application may be a built-in voice application or a voice module of instant messaging application, and the system may be an operating system inset in the mobile terminal, e. g. Android system, IOS system, and so on.
  • the text-to-speech processing device can control the voice application to play the speech on the interactive interface.
  • the device can transform the selected text on an interactive interface of social application to speech and play the speech.
  • the device provides more approaches to browse text.
  • the method reduces effects on text browsing caused by character sizes, environmental factors and so on, and improves the intellectuality of the mobile terminal.
  • the flow diagram of another method for text-to-speech processing shows the embodiments of this disclosure.
  • the method in the embodiments comprises steps of S201-S205 as below.
  • S201 obtaining the text selected by a user on an interactive interface, when the request of selecting text on an interactive interface of social application is received.
  • the text-to-speech processing device when the text-to-speech processing device detects the user’s voice play request on the interactive interface of the social application, the text-to-speech processing device will obtain the text of user’s selection on the interactive interface.
  • the user’s voice play request on the interactive interface may be issued by the user through long-pressing on the interactive interface, and then the text-to-speech processing device will display a pull-selection cursor on the interactive interface.
  • the user can select the text for voice play by sliding the pull-selection cursor.
  • the text-to-speech processing device packages the text of the user’s selection on the interactive interface, and generates a voice play request.
  • S203 obtaining the voice play request on the interactive interface of a social application, in which the voice play request comprises the selected text.
  • the text-to-speech processing device finds a voice play request on the interactive interface of a social application, the text-to-speech processing device will obtain the selected text on the interactive interface.
  • the voice play request comprises the selected text, which may contain Chinese characters, English characters, and so on.
  • the selected text may be the text selected on the interactive interface by a user.
  • the text-to-speech processing device can analyze the voice play request and obtain the text after analysis.
  • S204 applying a voice application to obtain speech corresponding to the selected text, according to correspondence between the text and the speech.
  • correspondence between the text and the speech is based on the following: one text is corresponding to only one coding information, and one coding information is corresponding to only one speech.
  • the text-to-speech processing device applies a voice application to obtain the coding information corresponding to the selected text, and then control the voice application to search the speech corresponding to the coding information.
  • the text-to-speech processing device applies the voice application to obtain the coding information 66 of A, and controls the voice application to search pronunciation of A corresponding to the coding information 66.
  • the voice application may be a built-in voice application or a voice module of instant messaging application, and the system may be an operating system inset in the mobile terminal, e. g. Android system, IOS system, and so on.
  • the text-to-speech processing device can control the voice application to play the speech on the interactive interface.
  • the device transforms the text of selection on an interactive interface of a social application to speech and plays the message, which provides more approaches to browse text.
  • the method can reduce effects on text browsing caused by character sizes, environmental factors and so on.
  • One coding information is corresponding to only one text and only one speech, which increases the accuracy of transformation from text to speech.
  • the intellectuality of the mobile terminal is improved.
  • the flow diagram of another method for text-to-speech processing shows the embodiments of this disclosure.
  • the method in the embodiment of this disclosure comprises steps of S301-S307 as below.
  • S301 obtaining the text selected by a user on an interactive interface, when the request of selecting text on the interactive interface of a social application is received.
  • the text-to-speech processing device when the text-to-speech processing device detects a user’s voice play request on the interactive interface of a social application, the text-to-speech processing device will obtain the text of the user’s selection on the interactive interface.
  • the user’s voice play request on the interactive interface may be issued by the user through long-pressing on the interactive interface, and then the text-to-speech processing device will display a pull-selection cursor on the interactive interface and the user can select the text for voice play through sliding the pull-selection cursor.
  • the text-to-speech processing device displays a prompt button of voice play on the interactive interface and the user clicks the prompt button of voice play.
  • the prompt button of voice play is clicked, it means the prompt button of voice play is triggered.
  • the prompt button of voice play may be displayed when the interactive interface appears.
  • the prompt button of voice play may be a prompt message of voice play, used to prompt a user whether to play the selected text.
  • step of S304 is executed when the text-to-speech processing device detects that the prompt button of voice play is triggered.
  • the text-to-speech processing device packages the text of the user’s selection on the interactive interface and generates a voice play request.
  • S305 obtaining a voice play request on the interactive interface of a social application, and the voice play request comprises the selected text.
  • the text-to-speech processing device finds a voice play request on the interactive interface of a social application, the text-to-speech processing device obtains selected text on the interactive interface.
  • the voice play request comprises the selected text, which may contain Chinese characters, English characters, and so on.
  • the selected text may be the text selected on the interactive interface by a user.
  • the text-to-speech processing device can analyze the voice play request and obtain the text after analysis.
  • S306 applying a voice application to obtain the speech corresponding to the selected text, according to correspondence between the text and the speech.
  • correspondence between the text and the speech is based on the following; one text is corresponding to only one coding information, and one coding information is corresponding to only one speech.
  • the text-to-speech processing device may apply a voice application to obtain coding information corresponding to the text, and then control the voice application to search the speech corresponding to the coding information.
  • the text-to-speech processing device can apply the voice application to obtain the coding information 66 of A, and control the voice application to search pronunciation of A corresponding to the coding information 66.
  • the voice application saves the speech, correspondence between the coding information and the text, and correspondence between the coding information and the speech beforehand.
  • the voice application may be a built-in voice application or a voice module of instant messaging application, and the system may be an operating system inset in the mobile terminal, e. g. Android system, IOS system, and so on.
  • the text-to-speech processing device can control the voice application to play the speech on the interactive interface.
  • the device can transform the text of selection on the interactive interface of a social application to speech and play the speech.
  • This device provides more approaches to browse text.
  • the process can reduce effects on text browsing caused by character sizes, environmental factors and so on.
  • One coding information is corresponding to only one text and only one speech, which increases the accuracy of transformation from text to speech.
  • the prompt button of voice play or the prompt message of voice play the automatic voice play of text caused by misoperation can be avoided when user is browsing the interactive interface. Therefore, the intellectuality of mobile terminal is improved.
  • Fig. 4 is the structural diagram of a device for text-to-speech processing provided by the embodiments of this disclosure.
  • the device for text-to-speech processing 1 comprises request acquisition module 11, transforming module 12 and play control module 13.
  • the request acquisition module 11 is used to obtain a voice play request on the interactive interface of a social application, in which the voice play request comprises the selected text.
  • the request acquisition module 11 obtains the selected text on the interactive interface.
  • the voice play request comprises the selected text, which may contain Chinese characters, English characters, and so on.
  • the text of selection may be the text selected on the interactive interface by a user.
  • the request acquisition module 11 can analyze the voice play request and obtain the text after analysis.
  • the transforming module 12 is used to transform the text into speech according to the voice play request.
  • the transforming module 12 applies a voice application to obtain speech corresponding to the selected text, according to the voice play request.
  • the voice application obtains speech corresponding to the text, according to correspondence between the text and the speech.
  • the correspondence between the text and the speech is based on the following; one text is corresponding to only one coding information, and one coding information is corresponding to only one voice.
  • the voice application saves the correspondence between the coding information and the text, and correspondence between the coding information and the speech beforehand.
  • the voice application may be a built-in voice application or a voice module of instant messaging application and the system may be an operating system inset in the mobile terminal, e. g. Android system, IOS system, and so on.
  • Fig. 5 is the structural diagram of the transforming module provided by the embodiments of this disclosure.
  • the transforming module 12 contains code acquisition unit 121 and speech search unit 122 .
  • Code acquisition unit 121 is used to apply a voice application to obtain coding information corresponding to the text.
  • the code acquisition unit 121 applies the voice application to obtain the coding information corresponding to the text. For example, supposing that the text of A is corresponding to coding information 66 and the coding information is corresponding to the pronunciation of A, the code acquisition unit 121 applies the voice application to obtain the coding information 66 of A.
  • Speech search unit 122 is used to control the voice application to search speech corresponding to the coding information.
  • the speech search unit 122 controls the voice application to search the speech corresponding to the coding information. For example, supposing that the text of A is corresponding to coding information 66 and the coding information is corresponding to the pronunciation of A, when the code acquisition unit 121 calls the voice application to get the coding information 66 of A, the speech search unit 122 controls the voice application to search pronunciation of A corresponding to the coding information 66.
  • the play control module 13 is used to play the speech on the interactive interface.
  • the play control module 13 controls the voice application to play the speech on the interactive interface.
  • the device transforms the text of selection on an interactive interface of a social application into speech and plays the information.
  • the device provides more approaches to browse text.
  • the method reduces effects on text browsing caused by character sizes, environmental factors and so on.
  • One coding information is corresponding to only one text and only one speech, which increases the accuracy of transformation from text into speech. The intellectuality of mobile terminal is improved.
  • Fig. 6 is the structural diagram of another device for text-to-speech processing provided by the embodiments of this disclosure.
  • the device for text-to-speech processing 1 may comprise request acquisition module 11, transforming module 12, play control module 13, acquisition module 14 and generating module 15, among which the structure of request acquisition module 11, transforming module 12 and play control module 13 have been described in the introduction of embodiment corresponding to Fig. 4, which will not be described in detail again.
  • Acquisition module 14 is used to obtain the text selected by a user on the interactive interface, when the request of selecting text on the interactive interface of a social application is received.
  • acquisition module 14 obtains the selected text on the interactive interface.
  • the user’s voice play request on the interactive interface may be issued by the user through long-pressing on the interactive interface, and then the text-to-speech processing device 1 displays a pull-selection cursor on the interactive interface and the user can select the text for voice play by sliding the pull-selection cursor.
  • Generating module 15 is used to generate a voice play request according to the text.
  • the text-to-speech processing device packages the text of a user’s selection on the interactive interface and generates voice play request.
  • the device can transform the text of selection for voice play on the interactive interface of a social application to speech and play the information, which avoids the fuzziness in text caused by small-size characters and brings convenience and better feeling in reading to user.
  • the device because one coding information is corresponding to only one text and only one speech, the device increases the accuracy of transformation from text to speech, and the intellectuality of mobile terminal is improved.
  • Fig. 7 is the structural diagram of another device for text-to-speech processing provided by the embodiments of this disclosure.
  • the device for text-to-speech processing 1 may comprise request acquisition module 11, transforming module 12, play control module 13, acquisition module 14, generating module 15, display module 16 and notify module 17, among which the structure of the request acquisition module 11, transforming module 12 and play control module 13 have been described in the introduction of embodiment corresponding to Fig. 4, and the acquisition module 14 and generating module 15 have been described in the introduction of embodiment corresponding to Fig. 6, and they will not be described in detail again.
  • Display module 16 is used to display a prompt button of voice play.
  • the display module 16 displays a prompt button of voice play on the interactive interface.
  • the user can click the prompt button of voice play.
  • the prompt button of voice play is clicked, it means the prompt button of voice play is triggered.
  • the prompt button of voice play might be displayed when the interactive interface appears.
  • the prompt button of voice play may be a prompt message of voice play, used to prompt user whether to play the selected text.
  • Notify module 17 is used to notify the generating module to execute the step of generating a voice play request according to the text, in case that it is detected that the prompt button of voice play is triggered.
  • the notify module 17 informs the generating module 15 to execute the step of generating a voice play request according to the text when the device detects that the prompt button of voice play is triggered.
  • the device transforms the text of selection on the interactive interface of a social application to speech. It provides more approaches to browse text.
  • the method reduces effects on text browsing caused by character sizes, environmental factors and so on.
  • One coding information is corresponding to only one text and only one speech. It increases the accuracy of transformation from text into speech.
  • the prompt button of voice play or the prompt message of voice play the automatic voice play of text caused by misoperation can be avoided, when a user is browsing the interactive interface. Therefore, the intellectuality of mobile terminal is improved.
  • Fig. 8 is the diagram of application scenarios provided by the embodiments of this disclosure.
  • the text-to-speech processing device displays a pull-selection cursor on the interactive interface and the user can select the text for voice play through sliding the pull-selection cursor.
  • the shadowed part in Fig. 8 is the text of user’s selection on the interactive interface of a social application, and the text-to-speech processing device displays a prompt button of voice play (the button “PLAY” in Fig. 8) . If the prompt button of voice play is triggered, the text-to-speech processing device applies the voice application to transform the text into speech, and controls the voice application to play the speech on the interactive interface.
  • the voice application may be a built-in voice application or an voice module of instant messaging application and the system may be an operating system inset in the mobile terminal, e. g. Android system, IOS system, and so on.
  • the device transforms the text of selection on the interactive interface of a social application into speech and plays the speech. It provides more approaches to browse text.
  • the method reduces effects on performance of text browse caused by character sizes, environmental factors and so on.
  • One coding information is corresponding to only one text and only one speech, which increases the accuracy of transformation from text to speech.
  • the prompt button of voice play or the prompt message of voice play the automatic voice play of text caused by misoperation can be avoided, when user is browsing the interactive interface. Therefore, the intellectuality of mobile terminal is improved.
  • the memory medium above may be diskettes, optical disks, Read-Only Memory (ROM) or Random Access Memory (RAM) , or the like.
  • the smart terminal of the present disclosure is not limited to smart phones
  • the server device is not limited to personal computer
  • the disclosed method is also suitable for operating systems other than Android systems.
  • the server device may be a computer, a tablet, a smart phone, or any computing devices.
  • the disclosed methods in the above embodiments may be combined with each other.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Health & Medical Sciences (AREA)
  • General Health & Medical Sciences (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Telephone Function (AREA)
  • Telephonic Communication Services (AREA)
  • User Interface Of Digital Computer (AREA)
  • Information Transfer Between Computers (AREA)

Abstract

The present disclosure discloses a method, a device and a mobile terminal for text-to-speech processing. The method comprises: receiving, by a device having a processor and a speaker, a voice play request on an interactive interface of a social application, the voice play request comprises selected text; converting, by the device, the selected text into speech according to the voice play request; and outputting, by the speaker of the device, the speech.

Description

METHOD, DEVICE AND MOBILE TERMINAL FOR TEXT-TO-SPEECH PROCESSING
CROSS-REFERENCE TO RELATED APPLICATIONS
This application claims priority to a Chinese Patent Application No. 201310442687. X, filed on September 25, 2013, which is incorporated by reference in its entirety.
FIELD OF THE TECHNOLOGY
The present disclosure relates to the field of computer and mobile communication technologies, and more particularly to a method, device and mobile terminal for text-to-speech processing.
BACKGROUND
In the information age, mobile phones and tablet PCs (Personal Computers) , as well as other mobile terminals, have become indispensable part of our lives. These terminals are not only used to communicate with others but also to text messages, take photos, play games and so on.
A user can easily read texts, such as a novel, a prose or the like, on a browser of mobile terminals. However, due to the small size of characters displayed on a mobile terminal screen and other adverse effects on the display screen, such as strong ambient sunlight, rain and so on, text browsing by sight has many limitations in its applications. In addition, a user is limited to browse by sight, which adversely affects user experience. Accordingly, it would be advantageous to provide a method to browse texts by sound.
SUMMARY OF THE DISCLOSURE
The present disclosure discloses a method, a device and a mobile terminal for text-to-speech processing, which provide more approaches to browse text and reduce the effects on the performance of text browsing.
In the first aspect, a method for text-to-speech processing, comprises: receiving, by a device having a processor and a speaker, a voice play request on an interactive interface of a social application, the voice play request comprises selected text; converting, by the device, the selected text into speech according to the voice play request; and outputting, by the speaker of the device, the speech.
In the second aspect, a device for text-to-speech processing, comprises a processor and a non-transitory storage medium. The non-transitory storage medium comprises: a request acquisition module, configured to obtain a voice play request on an interactive interface of a social application, wherein the voice play request comprises selected text; a transforming module, configured to transform the selected text to speech according to the voice play request; and a play control module, configured to play the speech on the interactive interface.
In the third aspect, a mobile terminal comprises a device for text-to-speech processing. The device comprises a processor and a non-transitory storage medium. The non-transitory storage medium comprises: a request acquisition module, configured to obtain a voice play request on an interactive interface of a social application, wherein the voice play request comprises selected text; a transforming module, configured to transform the selected text to speech according to the voice play request; and a play control module, configured to play the speech on the interactive interface.
BRIEF DESCRIPTION OF THE DRAWINGS
In order to make further introduction to the embodiments of this disclosure and technical scheme of the existing technology, the accompanying drawings used in embodiments and description of the existing technology will be introduced briefly. Obviously, the drawings in the description are just some embodiments of this disclosure, according to which other drawings may be obtained without creative labor by a person of skill in the art.
Fig. 1 is the flow diagram of a text-to-speech processing method in one embodiment of this disclosure.
Fig. 2 is the flow diagram of a text-to-speech processing method in one embodiment of this  disclosure.
Fig. 3 is the flow diagram of a text-to-speech processing method in one embodiment of this disclosure.
Fig. 4 is the structural diagram of a device for text-to-speech processing in one embodiment of this disclosure.
Fig. 5 is the structural diagram of a transforming module in one embodiment of this disclosure. 
Fig. 6 is the structural diagram of a device for text-to-speech processing in one embodiment of this disclosure.
Fig. 7 is the structural diagram of a device for text-to-speech processing in one embodiment of this disclosure.
Fig. 8 is the diagram of application scenarios provided in one embodiment of this disclosure.
DETAILED DESCRIPTION OF THE EMBODIMENTS
For a better understanding of the technical scheme thereof, the present disclosure is described in further detail in connection with the accompanying drawings as follows.
The technological scheme in the embodiments of this disclosure will be described clearly and completely, using the accompanying drawings. Obviously, only some embodiments, rather than all of them, will be described below. Any other embodiment obtained by a person of skill in the art on basis of embodiments of this disclosure without any creative labor is in the scope of this invention.
Interactive interface is the channel of message exchange between human and mobile terminals. The interactive interface is used by users to input messages into mobile terminals and execute operations, while the mobile terminals are used to provide messages for reading, analyzing and judging. Interactive interfaces in the embodiments of this disclosure contain home pages of software or applications, as well as various function buttons, interactive interfaces. The interactive interfaces, of different levels or appearing after different triggering events, are different interactive interfaces, and those of a same level or appearing after one triggering event are the same interactive interface, which is not determined by the content on the interactive interface. The difference between  various interactive interfaces is only related to the inherent difference of interactive interface in a certain level but not to the content in the interactive interface. Social applications include instant messaging, text chat, internet forums, blogs, social network services, and any other social software.
In embodiments of this disclosure, the mobile terminal may be a mobile phone, a smart phone, a tablet PC or other terminal devices. The device of text-to-speech processing may be a hardware device contained in the mobile terminal, or a text application in the mobile terminal, e. g. a browser application or the like. The text-to-speech processing device can show webpage on basis of Webview, which is a kind of space used to show webpage on basis of Webkit Kernel.
The text-to-speech processing method provided in the embodiments of this disclosure may be applied when it is inconvenient to read text. For example, when words in an article are read by a user on a bus, the text-to-speech processing device can transform the text to speech; or when words in a webpage are read by a user on a trip, the text-to-speech processing device can transform the text to speech, or the like. The text-to-speech processing device obtains the text selected for voice play on an interactive interface of a social application, transforms the selected text, generates speech corresponding to the selected text and plays the speech on the interactive interface.
The method for text-to-speech processing provided in the embodiments of this disclosure will be described in detail with drawings 1-3 as below.
Referring to Fig. 1, the flow diagram of a text-to-speech processing method shows the embodiments of this disclosure. As shown in Fig. 1, the method in the embodiments of this disclosure comprises steps of S101-S103 as below.
S101: obtaining a voice play request on the interactive interface of a social application, in which the voice play request comprises a selected text.
Specifically, when the text-to-speech processing device finds a voice play request on the interactive interface of the social application, the text-to-speech processing device will obtain the selected text on the interactive interface.
It should be aware that, the voice play request comprises the selected text, which may  contains Chinese characters, English characters, and so on. The selected text may be the text selected on the interactive interface by a user. Preferably, the text-to-speech processing device can analyze the voice play request and obtain the text after analysis.
S102: transforming the selected text into speech according to the voice play request.
Specifically, the text-to-speech processing device may apply a voice application to obtain speech corresponding to the selected text according to the voice play request. Preferably, the voice application obtains the speech corresponding to the text, according to the correspondence between the text and the speech. The correspondence between the text and the speech is based on the following: one text is corresponding to only one code information, and one code information is corresponding to only one speech.
It should be aware that the voice application may be a built-in voice application or a voice module of instant messaging application, and the system may be an operating system inset in the mobile terminal, e. g. Android system, IOS system, and so on.
S103: playing the speech on the interactive interface.
Specifically, the text-to-speech processing device can control the voice application to play the speech on the interactive interface.
In the embodiment, it can transform the selected text on an interactive interface of social application to speech and play the speech. The device provides more approaches to browse text. The method reduces effects on text browsing caused by character sizes, environmental factors and so on, and improves the intellectuality of the mobile terminal.
Referring to Fig. 2, the flow diagram of another method for text-to-speech processing shows the embodiments of this disclosure. As shown in Fig. 2, the method in the embodiments comprises steps of S201-S205 as below.
S201: obtaining the text selected by a user on an interactive interface, when the request of selecting text on an interactive interface of social application is received.
Specifically, when the text-to-speech processing device detects the user’s voice play request  on the interactive interface of the social application, the text-to-speech processing device will obtain the text of user’s selection on the interactive interface.
Preferably, the user’s voice play request on the interactive interface may be issued by the user through long-pressing on the interactive interface, and then the text-to-speech processing device will display a pull-selection cursor on the interactive interface. The user can select the text for voice play by sliding the pull-selection cursor.
S202: generating a voice play request according to the text.
Specifically, the text-to-speech processing device packages the text of the user’s selection on the interactive interface, and generates a voice play request.
S203: obtaining the voice play request on the interactive interface of a social application, in which the voice play request comprises the selected text.
Specifically, when the text-to-speech processing device finds a voice play request on the interactive interface of a social application, the text-to-speech processing device will obtain the selected text on the interactive interface.
The voice play request comprises the selected text, which may contain Chinese characters, English characters, and so on. The selected text may be the text selected on the interactive interface by a user. Preferably, the text-to-speech processing device can analyze the voice play request and obtain the text after analysis.
S204: applying a voice application to obtain speech corresponding to the selected text, according to correspondence between the text and the speech.
Specifically, correspondence between the text and the speech is based on the following: one text is corresponding to only one coding information, and one coding information is corresponding to only one speech. The text-to-speech processing device applies a voice application to obtain the coding information corresponding to the selected text, and then control the voice application to search the speech corresponding to the coding information.
For example, supposing that the text of A is corresponding to coding information 66 and the  coding information 66 is corresponding to the pronunciation of A, the text-to-speech processing device applies the voice application to obtain the coding information 66 of A, and controls the voice application to search pronunciation of A corresponding to the coding information 66.
The voice application may be a built-in voice application or a voice module of instant messaging application, and the system may be an operating system inset in the mobile terminal, e. g. Android system, IOS system, and so on.
S205: playing the speech on the interactive interface.
Specifically, the text-to-speech processing device can control the voice application to play the speech on the interactive interface.
In the embodiment, the device transforms the text of selection on an interactive interface of a social application to speech and plays the message, which provides more approaches to browse text. The method can reduce effects on text browsing caused by character sizes, environmental factors and so on. One coding information is corresponding to only one text and only one speech, which increases the accuracy of transformation from text to speech. The intellectuality of the mobile terminal is improved.
Referring to Fig. 3, the flow diagram of another method for text-to-speech processing shows the embodiments of this disclosure. As shown in Fig. 3, the method in the embodiment of this disclosure comprises steps of S301-S307 as below.
S301: obtaining the text selected by a user on an interactive interface, when the request of selecting text on the interactive interface of a social application is received.
Specifically, when the text-to-speech processing device detects a user’s voice play request on the interactive interface of a social application, the text-to-speech processing device will obtain the text of the user’s selection on the interactive interface.
Preferably, the user’s voice play request on the interactive interface may be issued by the user through long-pressing on the interactive interface, and then the text-to-speech processing device will display a pull-selection cursor on the interactive interface and the user can select the text for voice  play through sliding the pull-selection cursor.
S302: displaying a prompt button of voice play.
Specifically, after the text-to-speech processing device obtains the text of the user’s selection on the interactive interface, the text-to-speech processing device displays a prompt button of voice play on the interactive interface and the user clicks the prompt button of voice play. When the prompt button of voice play is clicked, it means the prompt button of voice play is triggered.
The prompt button of voice play may be displayed when the interactive interface appears. The prompt button of voice play may be a prompt message of voice play, used to prompt a user whether to play the selected text. By displaying the prompt button of voice play or the prompt message of voice play, the automatic voice play of text caused by misoperation can be avoided, when a user is browsing the interactive interface.
S303: executing the step of generating a voice play request according to the selected text when the device detects that the prompt button of voice play is triggered.
Specifically, the step of S304 is executed when the text-to-speech processing device detects that the prompt button of voice play is triggered.
S304: generating a voice play request according to the text.
Specifically, the text-to-speech processing device packages the text of the user’s selection on the interactive interface and generates a voice play request.
S305: obtaining a voice play request on the interactive interface of a social application, and the voice play request comprises the selected text.
Specifically, when the text-to-speech processing device finds a voice play request on the interactive interface of a social application, the text-to-speech processing device obtains selected text on the interactive interface.
The voice play request comprises the selected text, which may contain Chinese characters, English characters, and so on. The selected text may be the text selected on the interactive interface by a user. Preferably, the text-to-speech processing device can analyze the voice play request and  obtain the text after analysis.
S306: applying a voice application to obtain the speech corresponding to the selected text, according to correspondence between the text and the speech.
Specifically, correspondence between the text and the speech is based on the following; one text is corresponding to only one coding information, and one coding information is corresponding to only one speech. The text-to-speech processing device may apply a voice application to obtain coding information corresponding to the text, and then control the voice application to search the speech corresponding to the coding information.
For example, suppose the text of A is corresponding to coding information 66 and the coding information is corresponding to the pronunciation of A, the text-to-speech processing device can apply the voice application to obtain the coding information 66 of A, and control the voice application to search pronunciation of A corresponding to the coding information 66.
Preferably, the voice application saves the speech, correspondence between the coding information and the text, and correspondence between the coding information and the speech beforehand.
It should be aware that the voice application may be a built-in voice application or a voice module of instant messaging application, and the system may be an operating system inset in the mobile terminal, e. g. Android system, IOS system, and so on.
S307: playing the speech on the interactive interface.
Specifically, the text-to-speech processing device can control the voice application to play the speech on the interactive interface.
In the embodiment, the device can transform the text of selection on the interactive interface of a social application to speech and play the speech. This device provides more approaches to browse text. In addition, the process can reduce effects on text browsing caused by character sizes, environmental factors and so on. One coding information is corresponding to only one text and only one speech, which increases the accuracy of transformation from text to speech. In addition, by  displaying the prompt button of voice play or the prompt message of voice play, the automatic voice play of text caused by misoperation can be avoided when user is browsing the interactive interface. Therefore, the intellectuality of mobile terminal is improved.
The text-to-speech processing device provided in the embodiments of this disclosure will be introduced in detail below, using the accompanying drawings 4-7. It should be noticed that the text-to-speech processing device shown in the accompanying drawings 4-7 is used to execute the method in the accompanying drawings 1-3. In order to make better introduction, it only shows some parts related to the embodiments of this disclosure. Reference may be made to the embodiments in accompanying drawings 1-3 to obtain more information about the undisclosed specific technique detail.
Reference is made to Fig. 4, which is the structural diagram of a device for text-to-speech processing provided by the embodiments of this disclosure. As shown in Fig. 4, the device for text-to-speech processing 1 comprises request acquisition module 11, transforming module 12 and play control module 13.
The request acquisition module 11 is used to obtain a voice play request on the interactive interface of a social application, in which the voice play request comprises the selected text.
Specifically, when the device for text-to-speech processing 1 finds a voice play request on the interactive interface of a social application, the request acquisition module 11 obtains the selected text on the interactive interface.
The voice play request comprises the selected text, which may contain Chinese characters, English characters, and so on. The text of selection may be the text selected on the interactive interface by a user. Preferably, the request acquisition module 11 can analyze the voice play request and obtain the text after analysis.
The transforming module 12 is used to transform the text into speech according to the voice play request.
Specifically, the transforming module 12 applies a voice application to obtain speech  corresponding to the selected text, according to the voice play request. Preferably, the voice application obtains speech corresponding to the text, according to correspondence between the text and the speech. The correspondence between the text and the speech is based on the following; one text is corresponding to only one coding information, and one coding information is corresponding to only one voice.
Preferably, the voice application saves the correspondence between the coding information and the text, and correspondence between the coding information and the speech beforehand.
The voice application may be a built-in voice application or a voice module of instant messaging application and the system may be an operating system inset in the mobile terminal, e. g. Android system, IOS system, and so on.
Specifically, further reference is made to Fig. 5, which is the structural diagram of the transforming module provided by the embodiments of this disclosure. As shown in Fig. 5, the transforming module 12 contains code acquisition unit 121 and speech search unit 122 .
Code acquisition unit 121 is used to apply a voice application to obtain coding information corresponding to the text.
Specifically, the code acquisition unit 121 applies the voice application to obtain the coding information corresponding to the text. For example, supposing that the text of A is corresponding to coding information 66 and the coding information is corresponding to the pronunciation of A, the code acquisition unit 121 applies the voice application to obtain the coding information 66 of A.
Speech search unit 122 is used to control the voice application to search speech corresponding to the coding information.
Specifically, the speech search unit 122 controls the voice application to search the speech corresponding to the coding information. For example, supposing that the text of A is corresponding to coding information 66 and the coding information is corresponding to the pronunciation of A, when the code acquisition unit 121 calls the voice application to get the coding information 66 of A, the speech search unit 122 controls the voice application to search pronunciation of A corresponding  to the coding information 66.
The play control module 13 is used to play the speech on the interactive interface.
Specifically, the play control module 13 controls the voice application to play the speech on the interactive interface.
In the embodiment, the device transforms the text of selection on an interactive interface of a social application into speech and plays the information. The device provides more approaches to browse text. In addition, the method reduces effects on text browsing caused by character sizes, environmental factors and so on. One coding information is corresponding to only one text and only one speech, which increases the accuracy of transformation from text into speech. The intellectuality of mobile terminal is improved.
Reference is made to Fig. 6, which is the structural diagram of another device for text-to-speech processing provided by the embodiments of this disclosure. As shown in Fig. 6, the device for text-to-speech processing 1 may comprise request acquisition module 11, transforming module 12, play control module 13, acquisition module 14 and generating module 15, among which the structure of request acquisition module 11, transforming module 12 and play control module 13 have been described in the introduction of embodiment corresponding to Fig. 4, which will not be described in detail again.
Acquisition module 14 is used to obtain the text selected by a user on the interactive interface, when the request of selecting text on the interactive interface of a social application is received.
Specifically, when the text-to-speech processing device 1 finds a user’s voice play request on the interactive interface of a social application, acquisition module 14 obtains the selected text on the interactive interface.
Preferably, the user’s voice play request on the interactive interface may be issued by the user through long-pressing on the interactive interface, and then the text-to-speech processing device 1 displays a pull-selection cursor on the interactive interface and the user can select the text for voice play by sliding the pull-selection cursor.
Generating module 15 is used to generate a voice play request according to the text.
Specifically, the text-to-speech processing device packages the text of a user’s selection on the interactive interface and generates voice play request.
In the embodiment, the device can transform the text of selection for voice play on the interactive interface of a social application to speech and play the information, which avoids the fuzziness in text caused by small-size characters and brings convenience and better feeling in reading to user. In addition, because one coding information is corresponding to only one text and only one speech, the device increases the accuracy of transformation from text to speech, and the intellectuality of mobile terminal is improved.
Reference is made to Fig. 7, which is the structural diagram of another device for text-to-speech processing provided by the embodiments of this disclosure. As shown in Fig. 7, the device for text-to-speech processing 1 may comprise request acquisition module 11, transforming module 12, play control module 13, acquisition module 14, generating module 15, display module 16 and notify module 17, among which the structure of the request acquisition module 11, transforming module 12 and play control module 13 have been described in the introduction of embodiment corresponding to Fig. 4, and the acquisition module 14 and generating module 15 have been described in the introduction of embodiment corresponding to Fig. 6, and they will not be described in detail again.
Display module 16 is used to display a prompt button of voice play.
Specifically, after the acquisition module 14 obtains the text of a user’s selection on the interactive interface, the display module 16 displays a prompt button of voice play on the interactive interface. The user can click the prompt button of voice play. When the prompt button of voice play is clicked, it means the prompt button of voice play is triggered.
The prompt button of voice play might be displayed when the interactive interface appears. The prompt button of voice play may be a prompt message of voice play, used to prompt user whether to play the selected text. By displaying the prompt button of voice play or the prompt message of voice play, the automatic voice play of text caused by misoperation can be avoided, when  the user is browsing the interactive interface.
Notify module 17 is used to notify the generating module to execute the step of generating a voice play request according to the text, in case that it is detected that the prompt button of voice play is triggered.
Specifically, the notify module 17 informs the generating module 15 to execute the step of generating a voice play request according to the text when the device detects that the prompt button of voice play is triggered.
In the embodiment, the device transforms the text of selection on the interactive interface of a social application to speech. It provides more approaches to browse text. In addition, the method reduces effects on text browsing caused by character sizes, environmental factors and so on.One coding information is corresponding to only one text and only one speech. It increases the accuracy of transformation from text into speech. In addition, by displaying the prompt button of voice play or the prompt message of voice play, the automatic voice play of text caused by misoperation can be avoided, when a user is browsing the interactive interface. Therefore, the intellectuality of mobile terminal is improved.
There is also a provided mobile terminal in the embodiment of this disclosure, comprising the text-to-speech processing device shown in embodiments corresponding to Fig. 4 -Fig. 7, as well as applications in embodiments above. The mobile terminal in the embodiment of this disclosure can be used in the above methods.
Reference is made to Fig. 8, which is the diagram of application scenarios provided by the embodiments of this disclosure. As shown in Fig. 8, after receiving the user’s request of text selection on the interactive interface of a social application, the text-to-speech processing device displays a pull-selection cursor on the interactive interface and the user can select the text for voice play through sliding the pull-selection cursor.
The shadowed part in Fig. 8 is the text of user’s selection on the interactive interface of a social application, and the text-to-speech processing device displays a prompt button of voice  play (the button “PLAY” in Fig. 8) . If the prompt button of voice play is triggered, the text-to-speech processing device applies the voice application to transform the text into speech, and controls the voice application to play the speech on the interactive interface.
The voice application may be a built-in voice application or an voice module of instant messaging application and the system may be an operating system inset in the mobile terminal, e. g. Android system, IOS system, and so on.
In the embodiment, the device transforms the text of selection on the interactive interface of a social application into speech and plays the speech. It provides more approaches to browse text. In addition, the method reduces effects on performance of text browse caused by character sizes, environmental factors and so on. One coding information is corresponding to only one text and only one speech, which increases the accuracy of transformation from text to speech. Moreover, by displaying the prompt button of voice play or the prompt message of voice play, the automatic voice play of text caused by misoperation can be avoided, when user is browsing the interactive interface. Therefore, the intellectuality of mobile terminal is improved.
Person of skill in the art can be aware that the whole or part of process in the embodiments may be realized by involved hardware under control of computer program, which may be stored in a memory medium. When the program is executed, flow processes in the embodiments above may be contained. The memory medium above may be diskettes, optical disks, Read-Only Memory (ROM) or Random Access Memory (RAM) , or the like.
All disclosures above are the preferred the embodiments and it does not intend to limit the range of the invention. Therefore, any equivalent change according to the claims of the invention is in range of this invention.
It must be noted that the smart terminal of the present disclosure is not limited to smart phones, the server device is not limited to personal computer, and the disclosed method is also suitable for operating systems other than Android systems. The server device may be a computer, a tablet, a smart phone, or any computing devices. The disclosed methods in the above embodiments  may be combined with each other.
Disclosed above are only embodiments of the present disclosure and these embodiments are not intended to be limiting the scope of the present disclosure, hence any equivalent variations made based on the prospectus and accompanying drawings of the present disclosure, or any direct or indirect use based thereon in other related fields shall fall within the scope of the present disclosure.

Claims (15)

  1.  A method for text-to-speech processing, comprising:
    receiving, by a device having a processor and a speaker, a voice play request on an interactive interface of a social application, the voice play request comprising selected text;
    converting, by the device, the selected text into speech according to the voice play request; and
    outputting, by the speaker of the device, the speech.
  2. The method of claim 1, wherein converting the selected text into the speech according to the voice play request comprises:
    applying a voice application to obtain the speech corresponding to the selected text, according to the correspondence between text and speech.
  3.  The method of claim 2, wherein the correspondence between the text and the speech comprises:
    the text that corresponds to only one coding information; and
    the coding information that corresponds to only one speech.
  4.  The method of claim 3, wherein applying a voice application to obtain the speech comprises:
    applying the voice application to obtain the coding information corresponding to the selected text; and
    controlling the voice application to search speech corresponding to the coding information.
  5.  The method of claim 1, further comprising:
    obtaining the text selected by a user on an interactive interface; and
    generating the voice play request according to the selected text.
  6.  The method of claim 5, further comprising:
    displaying a prompt button of voice play; and
    generating a voice play request according to the selected text, when the prompt button of voice play is triggered.
  7.  The method of claim 2 or claim 4, wherein the voice application comprises at least one of a built-in voice application and a voice module of instant messaging application.
  8.  A device for text-to-speech processing, comprising a processor and a non-transitory storage medium, the non-transitory storage medium comprising:
    a request acquisition module, configured to obtain a voice play request on an interactive interface of a social application, wherein the voice play request comprises selected text;
    a transforming module, configured to transform the selected text to speech according to the voice play request; and
    a play control module, configured to play the speech on the interactive interface.
  9.  The device of claim 8, wherein the transforming module is configured to apply a voice application to obtain speech corresponding to the selected text according to the correspondence between the text and the speech.
  10.  The device of claim 9, wherein the correspondence between the text and the speech comprises:
    the text that corresponds to only one coding information; and
    the coding information that corresponds to only one speech.
  11.  The device of claim 10, wherein the transforming module comprises:
    a code acquisition unit, configured to apply a voice application to obtain the coding information corresponding to the text; and
    a speech search unit, configured to control the voice application to search speech corresponding to the coding information.
  12.  The device of claim 8, further comprising:
    an acquisition module, configured to obtain the text selected by a user on the interactive interface, when a request of selecting text on the interactive interface of the social application is received, and
    a generating module, configured to generate a voice play request according to the text.
  13.  The device of claim 12, further comprising:
    a display module, configured to display a prompt button of voice play; and
    a notify module, configured to notify the generating module to execute the step of generating a voice play request according to the text, when the prompt button of voice play is triggered.
  14.  The device of claim 9 or claim 11, wherein the voice application comprises at least one of a built-in voice application and a voice module of instant messaging application.
  15.  A mobile terminal, comprising a device for text-to-speech processing, wherein the device comprises a processor and a non-transitory storage medium, the non-transitory storage medium comprising:
    a request acquisition module, configured to obtain a voice play request on an interactive interface of a social application, wherein the voice play request comprises selected text;
    a transforming module, configured to transform the selected text to speech according to the voice play request; and
    a play control module, configured to play the speech on the interactive interface.
PCT/CN2014/087137 2013-09-25 2014-09-23 Method, device and mobile terminal for text-to-speech processing WO2015043442A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201310442687.X 2013-09-25
CN201310442687.XA CN104142778B (en) 2013-09-25 2013-09-25 A kind of method of text-processing, device and mobile terminal

Publications (1)

Publication Number Publication Date
WO2015043442A1 true WO2015043442A1 (en) 2015-04-02

Family

ID=51851970

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2014/087137 WO2015043442A1 (en) 2013-09-25 2014-09-23 Method, device and mobile terminal for text-to-speech processing

Country Status (2)

Country Link
CN (1) CN104142778B (en)
WO (1) WO2015043442A1 (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2020023068A1 (en) * 2018-07-24 2020-01-30 Google Llc Systems and methods for a text-to-speech interface
US11145288B2 (en) 2018-07-24 2021-10-12 Google Llc Systems and methods for a text-to-speech interface

Families Citing this family (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105898603A (en) * 2015-12-15 2016-08-24 乐视网信息技术(北京)股份有限公司 Voice danmaku generation method and device
CN107015975A (en) * 2016-01-27 2017-08-04 阿里巴巴集团控股有限公司 A kind of information output method and device
CN105955609A (en) * 2016-04-25 2016-09-21 乐视控股(北京)有限公司 Voice reading method and apparatus
CN106791078A (en) * 2016-12-18 2017-05-31 程在舒 The speech playing method and application of mobile terminal new information and Domestic News
CN108605074B (en) * 2017-01-26 2021-01-05 华为技术有限公司 Method and equipment for triggering voice function
CN108874266A (en) * 2018-06-27 2018-11-23 北京微播视界科技有限公司 Text playback method, client, terminal and storage medium
CN109241541A (en) * 2018-08-14 2019-01-18 平安普惠企业管理有限公司 Exchange method and terminal device based on voice conversion

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1941747A (en) * 2005-09-27 2007-04-04 腾讯科技(深圳)有限公司 Demand telecommunicating method and system
CN101075983A (en) * 2006-12-15 2007-11-21 腾讯科技(深圳)有限公司 Instant speech telecommunication terminal, server, system and instant speech telecommunication method
US8027837B2 (en) * 2006-09-15 2011-09-27 Apple Inc. Using non-speech sounds during text-to-speech synthesis
CN202907188U (en) * 2011-09-02 2013-04-24 辜进荣 Information transmitter for mobile phone communication

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1187683C (en) * 2001-06-20 2005-02-02 英华达(南京)科技有限公司 Portable voice broadcast E-mail device and method
CN101187855A (en) * 2006-11-16 2008-05-28 王铁兵 Mobile phone for voice reading
CN101616303A (en) * 2008-06-27 2009-12-30 深圳市同洲电子股份有限公司 A kind of method, device and receiving terminal for digital television of realizing automatic reading function

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1941747A (en) * 2005-09-27 2007-04-04 腾讯科技(深圳)有限公司 Demand telecommunicating method and system
US8027837B2 (en) * 2006-09-15 2011-09-27 Apple Inc. Using non-speech sounds during text-to-speech synthesis
CN101075983A (en) * 2006-12-15 2007-11-21 腾讯科技(深圳)有限公司 Instant speech telecommunication terminal, server, system and instant speech telecommunication method
CN202907188U (en) * 2011-09-02 2013-04-24 辜进荣 Information transmitter for mobile phone communication

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2020023068A1 (en) * 2018-07-24 2020-01-30 Google Llc Systems and methods for a text-to-speech interface
US11145288B2 (en) 2018-07-24 2021-10-12 Google Llc Systems and methods for a text-to-speech interface

Also Published As

Publication number Publication date
CN104142778B (en) 2017-06-13
CN104142778A (en) 2014-11-12

Similar Documents

Publication Publication Date Title
WO2015043442A1 (en) Method, device and mobile terminal for text-to-speech processing
JP6169590B2 (en) Adaptive input language switching
US11119627B2 (en) Information display method, device, apparatus and storage medium
US20140325323A1 (en) Online video playing method and apparatus and computer readable medium
CN106251869B (en) Voice processing method and device
US20140380375A1 (en) Page turning method, page turning apparatus and terminal as well as computer readable medium
CN105630787B (en) Animation realization method and device based on dynamic portable network graphics
WO2014176906A1 (en) Online video playing method and apparatus and computer readable medium
CN108449255B (en) Comment interaction method and equipment, client device and electronic equipment
CN107169147B (en) Data processing method and device and electronic equipment
CN109817204A (en) Voice interactive method and device, electronic equipment, readable storage medium storing program for executing
CN109683760B (en) Recent content display method, device, terminal and storage medium
CN111897607A (en) Application interface loading and interaction method, device and storage medium
CN107562324B (en) Data display control method and terminal
CN113641921A (en) Page display method and device and page display device
CN110618811B (en) Information presentation method and device
CN110088750B (en) Method and system for providing context function in static webpage
CN111143090A (en) Application interaction method, device and computer-readable storage medium
CN111506848A (en) Webpage processing method, device, equipment and readable storage medium
KR102255369B1 (en) Method for providing alternative service and electronic device thereof
CN113450762A (en) Character reading method, device, terminal and storage medium
CN111158678B (en) Video playing method and device, client device and electronic device
CN111399722A (en) Mail signature generation method, device, terminal and storage medium
CN111859999A (en) Message translation method, device, storage medium and electronic equipment
US11122324B2 (en) Method for displaying video related service, storage medium, and electronic device therefor

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 14849672

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase
32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 17.08.2016)

122 Ep: pct application non-entry in european phase

Ref document number: 14849672

Country of ref document: EP

Kind code of ref document: A1