WO2019080833A1 - Voice communication method and voice communication apparatus - Google Patents

Voice communication method and voice communication apparatus

Info

Publication number
WO2019080833A1
Authority
WO
WIPO (PCT)
Prior art keywords
application
voice
text information
terminal device
voice data
Prior art date
Application number
PCT/CN2018/111424
Other languages
French (fr)
Chinese (zh)
Inventor
李凤彬
Original Assignee
华为技术有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 华为技术有限公司
Publication of WO2019080833A1

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04M TELEPHONIC COMMUNICATION
    • H04M1/00 Substation equipment, e.g. for use by subscribers
    • H04M1/72 Mobile telephones; Cordless telephones, i.e. devices for establishing wireless links to base stations without route selection
    • H04M1/724 User interfaces specially adapted for cordless or mobile telephones
    • H04M1/72403 User interfaces specially adapted for cordless or mobile telephones with means for local support of applications that increase the functionality
    • H04M1/72406 User interfaces specially adapted for cordless or mobile telephones with means for local support of applications that increase the functionality by software upgrading or downloading
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00 Speech synthesis; Text to speech systems
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04M TELEPHONIC COMMUNICATION
    • H04M1/00 Substation equipment, e.g. for use by subscribers
    • H04M1/72 Mobile telephones; Cordless telephones, i.e. devices for establishing wireless links to base stations without route selection
    • H04M1/724 User interfaces specially adapted for cordless or mobile telephones
    • H04M1/72403 User interfaces specially adapted for cordless or mobile telephones with means for local support of applications that increase the functionality
    • H04M1/7243 User interfaces specially adapted for cordless or mobile telephones with means for local support of applications that increase the functionality with interactive means for internal management of messages
    • H04M1/72433 User interfaces specially adapted for cordless or mobile telephones with means for local support of applications that increase the functionality with interactive means for internal management of messages for voice messaging, e.g. dictaphones
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04M TELEPHONIC COMMUNICATION
    • H04M1/00 Substation equipment, e.g. for use by subscribers
    • H04M1/72 Mobile telephones; Cordless telephones, i.e. devices for establishing wireless links to base stations without route selection
    • H04M1/724 User interfaces specially adapted for cordless or mobile telephones
    • H04M1/72403 User interfaces specially adapted for cordless or mobile telephones with means for local support of applications that increase the functionality
    • H04M1/7243 User interfaces specially adapted for cordless or mobile telephones with means for local support of applications that increase the functionality with interactive means for internal management of messages
    • H04M1/72436 User interfaces specially adapted for cordless or mobile telephones with means for local support of applications that increase the functionality with interactive means for internal management of messages for text messaging, e.g. SMS or e-mail
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04M TELEPHONIC COMMUNICATION
    • H04M1/00 Substation equipment, e.g. for use by subscribers
    • H04M1/72 Mobile telephones; Cordless telephones, i.e. devices for establishing wireless links to base stations without route selection
    • H04M1/725 Cordless telephones
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04W WIRELESS COMMUNICATION NETWORKS
    • H04W88/00 Devices specially adapted for wireless communication networks, e.g. terminals, base stations or access point devices
    • H04W88/02 Terminal devices

Definitions

  • the present application relates to the field of information technology, and in particular, to a voice communication method and a voice communication device.
  • TTY: TeleTYpe
  • TDD: Text Telephone Devices for the Deaf
  • A TTY device is designed for people who are deaf or hearing impaired. Once a terminal device is connected to a TTY device, the terminal device supports sending and receiving text.
  • TTY devices support both listening and speaking modes, so deaf people can conveniently use the phone for real-time communication.
  • However, the TTY device is a dedicated hardware device, and it communicates with the terminal device through an analog signal. During data transmission, conversion, and processing, signal loss and distortion inevitably occur, so the bit error rate is relatively high. In addition, the TTY device is heavy and bulky, which makes it inconvenient to carry.
  • the present invention provides a voice communication method and a voice communication device, which can avoid the problems of high bit error rate and inconvenient carrying when using traditional TTY devices for voice communication.
  • the present application provides a voice communication method, the method comprising: acquiring, by a first application on a terminal device, text information input by a user; converting, by the first application, the text information into voice data; transmitting, by the first application, the voice data to a voice transmission module of the terminal device; and transmitting, by the voice transmission module, the voice data to other terminal devices that perform voice communication with the terminal device.
  • the first application mentioned in the embodiment of the present application has the function of realizing mutual conversion between text and voice.
  • the first application may encode the text information based on the TTY protocol to obtain voice data.
  • the first application may decode the voice data based on the TTY protocol to obtain text information.
  • Compared with voice communication using a traditional TTY device, in the technical solution provided by the present application the conversion between the text information input by the user and the voice data is performed by the first application, that is, by digital signal processing. Conversion between analog and digital signals is no longer involved, which reduces the bit error rate.
  • the user no longer needs to carry the TTY device with him, and the first application on the terminal device can implement voice communication, thereby avoiding the disadvantage that the TTY device is inconvenient to carry.
  • the first application converts the text information into the voice data, including: the first application encodes the text information based on the text phone TTY protocol to obtain the voice data.
  • the voice transmission module sending the voice data to other terminal devices that perform voice communication with the terminal device includes: the voice transmission module sends the voice data to the other terminal devices with which the terminal device performs voice communication.
  • the terminal device further includes a display module
  • the method further includes: the first application transmits the acquired text information input by the user to the display module; and the display module displays the text information on the interface of the first application.
  • the present application provides a voice communication method, the method includes: a voice transmission module in a terminal device receives voice data sent by another terminal device; the voice transmission module sends the voice data to a first application on the terminal device; and the first application converts the voice data into text information.
  • the first application converts the voice data into text information, including: the first application decodes the voice data based on the text phone TTY protocol to obtain text information.
  • the terminal device further includes a display module
  • the method further includes: the first application sends the text information to the display module; and the display module displays the text information on the interface of the first application.
  • the terminal device further includes a display module
  • the method further includes: the first application corrects the text information to obtain corrected text information; the first application sends the corrected text information to the display module; and the display module displays the corrected text information.
  • the display module that directly displays the decoded text information and displays the corrected text information may be the same display module.
  • the present application provides a voice communication device having the function of implementing the voice communication method of the first aspect described above.
  • These functions can be implemented in hardware, or by hardware executing the corresponding software.
  • the hardware or software includes one or more units corresponding to the functions described above.
  • the present application provides a voice communication device having a function of implementing the voice communication method of the second aspect described above.
  • These functions can be implemented in hardware, or by hardware executing the corresponding software.
  • the hardware or software includes one or more units corresponding to the functions described above.
  • the present application provides a voice communication device including a transceiver, a processor, and a memory. The processor is configured to control the transceiver to send and receive signals, the memory is configured to store a computer program, and the processor is configured to call and run the computer program from the memory, so that the voice communication device performs the voice communication method in the first aspect and any possible implementation thereof described above.
  • the present application provides a voice communication device including a transceiver, a processor, and a memory. The processor is configured to control the transceiver to send and receive signals, the memory is configured to store a computer program, and the processor is configured to call and run the computer program from the memory, so that the voice communication device performs the voice communication method in the second aspect and any possible implementation thereof.
  • the present application provides a communication device, which may be a voice communication device in the above method design, or a chip disposed in a voice communication device.
  • the communication device includes a memory, a communication interface, and a processor, wherein the memory is configured to store computer executable program code, the program code including instructions that, when executed by the processor, cause the communication device to perform the method of the first aspect or the second aspect described above and any one of its possible implementations.
  • the application provides a computer program product, comprising computer program code that, when run on a computer, causes the computer to perform the method in the first aspect or the second aspect and any one of the possible implementations described above.
  • the present application provides a chip system, including a processor, for implementing the functions involved in the voice communication device in the first aspect or the second aspect and any one of the possible implementation manners, for example, receiving or processing the voice data and/or text information involved in the above method.
  • the chip system further includes a memory for holding program instructions and data necessary for the voice communication device to perform the voice communication method.
  • the chip system can be composed of chips, and can also include chips and other discrete devices.
  • FIG. 1 is a schematic diagram of the working principle of the TTY device.
  • FIG. 2 is a structural block diagram of a TTY device.
  • FIG. 3 is a schematic flowchart of a voice communication method 300 provided by the present application.
  • FIG. 4 is an application scenario applicable to the voice communication method provided by the present application.
  • FIG. 5 is another application scenario of a method applicable to voice communication provided by the present application.
  • FIG. 6 is a schematic block diagram of a voice communication device 600 provided by the present application.
  • FIG. 7 is a schematic diagram of a voice communication device 700 provided by the present application.
  • FIG. 8 is a schematic structural diagram of a terminal device 800 provided by the present application.
  • The TeleTYpe (TTY) service, or the Text Telephone Devices for the Deaf (TDD) service, is a mobile "voice" service for deaf people.
  • TTY services or TDD services can generally support three TTY function modes: Full mode (also called TTY mode), Voice Carry Over (VCO) mode, and Hearing Carry Over (HCO) mode.
  • the TTY mode supports sending and receiving TTY text at the same time.
  • In TTY mode, when the terminal device is connected to the TTY device, only character transmission can be performed, and a voice call cannot be made.
  • the terminal device can normally talk with other terminal devices.
  • In VCO mode, the terminal device can switch between receiving TTY text and transmitting voice; this mode is mainly intended for deaf users.
  • In HCO mode, the terminal device can switch between sending TTY text and listening to the call; this mode is mainly intended for users who cannot speak.
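  • As an illustration only, the three modes described above can be written down as a small lookup table that records, for each mode, whether the local user sends and receives TTY text or voice. The names `Channel`, `ModeConfig`, `TTY_MODES`, `needs_text_encoder`, and `needs_text_decoder` are hypothetical and do not come from the application.

```python
from dataclasses import dataclass
from enum import Enum


class Channel(Enum):
    """How one direction of the call is carried for the local user."""
    TEXT = "tty_text"
    VOICE = "voice"


@dataclass(frozen=True)
class ModeConfig:
    send: Channel     # what the local user produces
    receive: Channel  # what the local user consumes


# Hypothetical table derived from the mode descriptions above.
TTY_MODES = {
    "FULL": ModeConfig(send=Channel.TEXT, receive=Channel.TEXT),
    "VCO": ModeConfig(send=Channel.VOICE, receive=Channel.TEXT),
    "HCO": ModeConfig(send=Channel.TEXT, receive=Channel.VOICE),
}


def needs_text_encoder(mode: str) -> bool:
    """True if outgoing text must be encoded into TTY voice data."""
    return TTY_MODES[mode].send is Channel.TEXT


def needs_text_decoder(mode: str) -> bool:
    """True if incoming TTY voice data must be decoded into text."""
    return TTY_MODES[mode].receive is Channel.TEXT
```

  • With such a table, an application could decide per call direction whether the text conversion path is needed at all.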
  • TTY devices work similarly to TDD devices. The following only describes TTY devices as an example.
  • FIG. 1 is a schematic diagram of the working principle of the TTY device.
  • the existing TTY solution requires a dedicated TTY device to connect to a terminal device.
  • text input and display can be performed on the TTY device to implement communication with the peer.
  • the use of the TTY device can be roughly expressed as follows:
  • the TTY device provides a 3.5 mm audio connector for connection with the terminal device. If the audio connector of the terminal device does not match the audio connector provided by the TTY device, it can be converted by a patch cord.
  • The working principles of the TTY device and the TDD device are similar. To avoid repetition, the following describes the TTY device as an example.
  • FIG. 2 is a structural block diagram of a TTY device.
  • the implementation of the TTY service mainly includes two parts: an encoder and a decoder.
  • the encoder is responsible for detecting a Baudot tone transmitted on a Pulse Code Modulation (PCM) link and parsing it into the corresponding TTY character information for transmission to the decoder.
  • After receiving the TTY character information, the decoder is responsible for restoring the TTY character information to the corresponding Baudot tone.
  • the FER in Fig. 2 represents a frame error rate (FER).
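  • The mapping between TTY characters and Baudot codes can be sketched as follows. This is a minimal sketch, not the codec of the application: it assumes the letters-shift table of the 5-bit ITA2 (Baudot-Murray) alphabet as commonly reproduced, omits the figures shift, and omits tone modulation and demodulation entirely. The names `LETTERS`, `encode_text`, and `decode_codes` are hypothetical, and the table should be checked against the relevant standard before any real use.

```python
# Letters-shift table, indexed by 5-bit code (assumed ITA2 ordering).
# "\x00" marks codes that carry no printable letter here
# (null, figures shift, letters shift).
LETTERS = "\x00E\nA SIU\rDRJNFCKTZLWHYPQOBG\x00MXV\x00"

CHAR_TO_CODE = {ch: code for code, ch in enumerate(LETTERS) if ch != "\x00"}


def encode_text(text: str) -> list[int]:
    """Map each character to a 5-bit code; only letters, space, CR and LF."""
    codes = []
    for ch in text.upper():
        if ch not in CHAR_TO_CODE:
            raise ValueError(f"character {ch!r} is not in the letters table")
        codes.append(CHAR_TO_CODE[ch])
    return codes


def decode_codes(codes: list[int]) -> str:
    """Map 5-bit codes back to characters, skipping non-printing codes."""
    chars = []
    for code in codes:
        if not 0 <= code < len(LETTERS):
            raise ValueError(f"code {code} is not a 5-bit value")
        ch = LETTERS[code]
        if ch != "\x00":
            chars.append(ch)
    return "".join(chars)
```

  • Because the alphabet has a single case, a round trip through `encode_text` and `decode_codes` returns the text in upper case.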
  • the existing TTY device has a high purchase cost. In addition, because the TTY device and the terminal device communicate through an analog signal, signal loss and distortion are inevitable during signal transmission, processing, and conversion, so the bit error rate is also high. Furthermore, TTY devices are generally heavy and bulky, which makes them inconvenient to carry.
  • the present application provides a method for implementing voice communication, which can avoid problems such as the high bit error rate and the inconvenience of carrying that arise when a conventional TTY hardware device is used.
  • the user can implement voice communication by using the application software.
  • the application software running on the terminal device that implements the function of converting between text and voice needs to acquire the same authority as the call module on the terminal device.
  • Hereinafter, the software that runs on the terminal device and implements the conversion between text and voice is referred to as the first application.
  • the first application mentioned in the embodiments of the present application differs from data service-based applications (for example, WeChat): the first application is based on the voice service. In other words, the first application relies on a real-time communication network. Moreover, because the voice service is supported by all carrier networks, the first application can be used in any area with signal coverage.
  • the first application needs to have the same privileges as a normal call. Because user privacy is involved, this requires a high level of authorization.
  • the first application requires an Android installation package (Android Package, APK) to interact with the underlying driver software, and the underlying driver software can only be modified by the original equipment manufacturer (OEM). Therefore, the first application in the embodiments of this application is not suitable for being provided separately by a third party.
  • the voice communication method provided by the present application can be applied to various scenarios. For example, a terminal device running the first application communicates with a legacy TTY device. Alternatively, two terminal devices that both run the first application communicate with each other.
  • FIG. 3 is a schematic flowchart of a method 300 for voice communication provided by the present application.
  • the terminal device that performs the method 300 needs to run the first application, and at least should also have a voice transmission module.
  • the first application is application software that converts text to voice and voice to text.
  • the voice transmission module can communicate with the first application, so as to send the voice data output by the first application to other terminal devices that perform voice communication with the terminal device.
  • the voice transmission module may send the voice data sent by the other terminal device to the first application, so that the voice data is converted into text information by the first application.
  • the mutual conversion between the text information and the voice can be realized through the first application on the terminal device.
  • the terminal can support the sending and receiving of text, thereby helping the deaf or hearing impaired person to perform normal voice communication.
  • the first application on the terminal device obtains text information input by the user.
  • the user can use various input methods for text input based on the operating system running on the terminal device.
  • the terminal device can provide modes such as "listen only (HCO)", "speak only (VCO)", and "FULL" for the user to select. A description of these modes can be found above.
  • the user can enter the text to be sent on the interface of the first application.
  • the text entered by the user can be displayed in real time on the interface of the first application.
  • users can also input by voice or text.
  • In FULL mode, the user can input the text to be sent on the interface of the first application, and the information sent by the other party is shown to the user as text on the interface of the first application.
  • the first application converts the text information into voice data.
  • the first application can convert the text information into voice data.
  • the first application can encode the text input by the user based on the TTY protocol to obtain voice data.
  • the first application sends the voice data to the voice transmission module.
  • the voice transmission module sends the voice data to other terminal devices that perform voice communication with the terminal device.
  • the first application can convert the text input by the user into voice data in real time, and send the voice data to the voice transmission module.
  • the voice transmission module on the terminal device is responsible for transmitting voice data over the cellular network to other terminal devices that perform voice communication with the terminal device.
  • For convenience of description, the terminal device described in steps 310-340 is referred to as the first terminal device. Other terminal devices that perform voice communication with the first terminal device are referred to as second terminal devices.
  • After the voice transmission module on the second terminal device receives the voice data from the first terminal device, the voice transmission module sends the voice data to the first application on the second terminal device.
  • the first application on the second terminal device converts the voice data into text information. In this way, the user of the second terminal device can receive the text information sent by the first terminal device. Since the text information input by the user can be converted into voice data in real time, real-time communication between the two parties is realized.
  • the conversion between the text information input by the user and the voice data is performed by the first application, that is, by digital signal processing. Conversion between analog and digital signals is no longer involved, which reduces the bit error rate.
  • the user no longer needs to carry the TTY device, and voice communication can be realized through the first application on the terminal device, thereby avoiding the disadvantage that the TTY device is inconvenient to carry.
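  • The flow of steps 310-340 and the matching receive path can be sketched as follows. This is a structural sketch under stated assumptions, not the implementation of the application: the classes `FirstApplication`, `VoiceTransmissionModule`, and `TerminalDevice` are hypothetical, UTF-8 bytes stand in for TTY-encoded voice data, and an in-memory queue stands in for the cellular network.

```python
class FirstApplication:
    """Converts between text and voice data (placeholder codec)."""

    def text_to_voice(self, text: str) -> bytes:
        # Stand-in for TTY encoding: UTF-8 bytes play the role of voice data.
        return text.encode("utf-8")

    def voice_to_text(self, voice_data: bytes) -> str:
        # Stand-in for TTY decoding.
        return voice_data.decode("utf-8")


class VoiceTransmissionModule:
    """Carries voice data to the peer device (in-memory stand-in)."""

    def __init__(self) -> None:
        self.peer = None
        self.inbox = []

    def connect(self, peer: "VoiceTransmissionModule") -> None:
        self.peer = peer
        peer.peer = self

    def send(self, voice_data: bytes) -> None:
        if self.peer is None:
            raise RuntimeError("no call is established")
        self.peer.inbox.append(voice_data)

    def receive(self) -> bytes:
        return self.inbox.pop(0)


class TerminalDevice:
    def __init__(self) -> None:
        self.app = FirstApplication()
        self.voice = VoiceTransmissionModule()

    def send_text(self, text: str) -> None:
        # Obtain text, convert it to voice data, hand it to the
        # voice transmission module, which sends it to the peer.
        voice_data = self.app.text_to_voice(text)
        self.voice.send(voice_data)

    def receive_text(self) -> str:
        # Voice transmission module passes received voice data to the
        # first application, which converts it back to text.
        return self.app.voice_to_text(self.voice.receive())
```

  • In this sketch both devices run the same first application, which corresponds to the scenario in which neither party uses a hardware TTY device.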
  • FIG. 4 is an application scenario of a voice communication method applicable to an embodiment of the present application.
  • the first application on the terminal device A obtains text information input by the user.
  • the first application gets the text entered by the user as "Long time no see, is it good recently?"
  • the first application converts text input by the user into voice data.
  • the first application running on the terminal device A directly encodes the text input by the user into voice data that conforms to the TTY protocol.
  • In step 402, the encoded voice data is a digital signal.
  • the first application sends the voice data to the voice transmission module.
  • the voice transmission module receives the voice data sent by the first application, modulates the voice data, and sends it.
  • the voice transmission module can be a modulation module (Modem).
  • the first application sends the encoded voice data to the modulation module on the terminal device A.
  • the modulation module modulates the voice data into an analog signal and sends the analog signal to the operator network, and the operator network sends the analog signal to the terminal device B.
  • the modulation module will send feedback information to terminal device A in real time.
  • the receiver on the terminal device A forwards the received information to the first application, and the first application displays the prompt information on the interface to prompt the user.
  • the first application performs text prompts on the software interface, for example, displays prompts such as "will resend the input text", "network exception, please wait a little".
  • the first application can perform voice prompts, or both voice prompts and text prompts.
  • the terminal device B receives the analog signal sent by the terminal device A, and obtains voice data after demodulation.
  • the voice transmission module on the terminal device B receives the analog signal from the terminal device A, and demodulates the analog signal to obtain voice data.
  • the voice data at this time is a digital signal.
  • the first application on the terminal device B converts the voice data into text information.
  • the terminal device B first demodulates the analog signal into a digital signal.
  • the digital signal is then forwarded to the first application running on terminal device B.
  • the first application decodes the digital signal (ie, voice data) based on the TTY protocol to obtain the decoded text.
  • the text decoded by the first application on the terminal device B is "Long time no see, is it good recently?"
  • the terminal device may further include a display module.
  • the display module on the terminal device B displays the decoded text information in real time.
  • the display module displays the text "Long time no see, is it good recently?" in real time on the interface of the first application.
  • the terminal device B may buffer a preset number of data packets (e.g., 10) and display text at a set speed, so that network fluctuations do not result in a poor user experience.
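  • One way to picture such buffering is a small queue that releases text for display only after a preset number of packets has arrived. The sketch below is an assumption about how this could be done, not the design of the application; the class `DisplayBuffer` and its methods are hypothetical, and the caller is assumed to invoke `pop_for_display` once per display tick to obtain the set display speed.

```python
from collections import deque


class DisplayBuffer:
    """Holds decoded text chunks until enough packets are buffered."""

    def __init__(self, preset_packets: int = 10) -> None:
        self.preset_packets = preset_packets
        self._queue = deque()
        self._started = False

    def push(self, text_chunk: str) -> None:
        """Store the text decoded from one received data packet."""
        self._queue.append(text_chunk)
        if len(self._queue) >= self.preset_packets:
            self._started = True

    def pop_for_display(self):
        """Return the next chunk to display, or None while still buffering."""
        if not self._started or not self._queue:
            return None
        chunk = self._queue.popleft()
        if not self._queue:
            # Buffer drained: wait for it to refill before displaying again.
            self._started = False
        return chunk
```

  • Re-buffering after the queue drains is a design choice of this sketch; a real application would also need a timeout so that a short final message is still shown.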
  • FIG. 4 shows an application scenario in which both parties to the voice communication run the first application on their terminal devices, and the first application performs the mutual conversion between text and voice.
  • Alternatively, one party to the call uses the voice communication method provided by the present application, and the other party to the call continues to use a traditional TTY device.
  • ADC: analog-to-digital converter
  • DAC: digital-to-analog converter
  • FIG. 5 is another application scenario applicable to the voice communication method provided by the present application.
  • the terminal device C adopts the voice communication method in the embodiment of the present application, and the terminal device C runs the first application.
  • the terminal device D is connected to a conventional TTY device. Since the TTY device converts between analog and digital signals when processing and transmitting data packets, the bit error rate is high. Therefore, the data packets that the terminal device C receives from the terminal device D may contain errors.
  • the present application is specifically directed to such an application scenario, and it is proposed that the first application on the terminal device C can perform error correction on data packets from other terminal devices.
  • the voice transmission module on the terminal device C sends the voice data to the first application on the terminal device C.
  • the first application decodes the voice data based on the TTY protocol, and converts the voice data into text information. Thereafter, the first application can correct the decoded text information to obtain the corrected text. Finally, the corrected text is displayed by the display module on the terminal device C.
  • error correction of the data packet can also be performed by other applications on the terminal device than the first application.
  • For example, an expert system capable of self-learning runs on the terminal device C. The first application sends the decoded text information to the expert system, and the expert system performs error correction on the text information.
  • the function of the expert system is to perform secondary correction on the decoded text (for example, similar to text correction in a word processor).
  • This is a comparison-based correction method that relies on an existing text library.
  • the system can compare the decoded text with the existing text in the text library by means of fuzzy matching. If the decoded text is not found in the text library, the system marks it and suggests the closest matching text.
  • the text library can be upgraded together with the operating system running on the terminal device to enhance the correction capability.
  • the text library can also be upgraded continuously along with upgrades of the APK.
  • the central server can perform push upgrade at any time to enhance the error resistance capability of the expert system.
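  • The fuzzy-matching correction described above can be sketched with the Python standard library. This is only an illustration: the application does not specify a matching algorithm, so `difflib.get_close_matches` is a swapped-in technique, and the function name `correct_text`, the word-level splitting, and the `cutoff` value are assumptions.

```python
import difflib


def correct_text(decoded_text: str, text_library: list[str], cutoff: float = 0.6):
    """Check each decoded word against the text library.

    Returns a list of (word, found_in_library, suggestion) tuples.
    Words missing from the library are marked (found_in_library is False)
    and paired with the closest library entry, or None if nothing is close.
    Matching is case-sensitive.
    """
    results = []
    for word in decoded_text.split():
        if word in text_library:
            results.append((word, True, word))
            continue
        matches = difflib.get_close_matches(word, text_library, n=1, cutoff=cutoff)
        suggestion = matches[0] if matches else None
        results.append((word, False, suggestion))
    return results
```

  • For instance, with a library containing "long", "time", "no", and "see", the decoded word "lomg" is marked and "long" is suggested, while a word with no close entry is marked without a suggestion.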
  • the expert system in the embodiment of the present application can also be understood as an application software having a text correction function.
  • the function of the expert system can be integrated into the first application, or the expert system can be a separate application that interacts with the first application.
  • the embodiments of the present application are not limited in any way.
  • the voice communication method provided by the present application is described in detail above with reference to FIG. 3 to FIG. 5. It can be seen that, in the embodiments of the present application, implementing the function of the TTY device in software avoids the high bit error rate and the inconvenience of carrying that arise when a traditional TTY device is used.
  • users who are deaf or unable to speak are no longer limited to costly hardware TTY devices, and a terminal device designed according to the voice communication method of the embodiments of the present application can reduce the purchase cost.
  • the voice communication device provided by the present application will be described below.
  • FIG. 6 is a schematic diagram of a voice communication device 600 provided by the present application. As shown in FIG. 6, the device 600 includes a first application 610 and a voice transmission module 620, where:
  • the first application 610 is configured to obtain text information input by the user, and convert the text information into voice data;
  • the first application 610 is further configured to send the voice data to the voice transmission module 620;
  • the voice transmission module 620 is configured to send the voice data to other terminal devices that perform voice communication with the device 600.
  • the units in the voice communication device 600 of the embodiment of the present application, and the other operations or functions described above, are respectively intended to implement the corresponding processes or operations in the voice communication method provided by the present application. For brevity, details are not repeated here.
  • By implementing the function of the TTY device in software, the voice communication device provided by the present application can avoid the problems of high bit error rate and inconvenient carrying that arise when a traditional TTY device is used.
  • FIG. 7 is a schematic diagram of a voice communication device 700 provided by the present application. As shown in FIG. 7, the apparatus 700 includes a voice transmission module 710 and a first application 720, where:
  • the voice transmission module 710 is configured to receive voice data sent by another terminal device, and send the voice data to the first application 720;
  • the first application 720 is configured to convert the voice data into text information.
  • the units in the voice communication device 700 of the embodiment of the present application, and the other operations or functions described above, are respectively intended to implement the corresponding processes or operations in the voice communication method provided by the present application. For brevity, details are not repeated here.
  • By implementing the function of the TTY device in software, the voice communication device provided by the present application can avoid the problems of high bit error rate and inconvenient carrying that arise when a traditional TTY device is used.
  • The first application 610 in FIG. 6 and the first application 720 in FIG. 7 are drawn with dashed boxes to indicate that the first application is application software.
  • FIG. 8 is a schematic structural diagram of a terminal device 800 provided by the present application.
  • The terminal device 800 includes an input unit 814 and a processor 804.
  • The terminal device 800 can also include a memory 819 for storing computer instructions.
  • The input unit 814 is configured to receive text information input by the user.
  • The processor 804 is configured to run the computer instructions stored in the memory 819, convert the text information into voice data, and send the voice data to the transceiver 808.
  • The transceiver 808 is configured to send the voice data to other terminal devices that perform voice communication with the terminal device.
  • The terminal device 800 herein is one party to the voice communication, and the other terminal devices are the other party.
  • The processor 804 may be configured to perform the actions implemented by the terminal device described in the foregoing method embodiments.
  • The transceiver 808 may be configured to perform the receiving or sending actions of the terminal device described in the foregoing method embodiments.
  • The processor 804 and the memory 819 described above may be integrated into one processing device, and the processor 804 is configured to execute program code stored in the memory 819 to implement the above functions.
  • In a specific implementation, the memory 819 can also be integrated in the processor 804.
  • The terminal device 800 described above may also include a power source 812 for providing power to various devices or circuits in the terminal device 800.
  • The terminal device 800 described above may include an antenna 810 for transmitting, as a wireless signal, the data or information output by the transceiver 808.
  • The terminal device 800 may further include one or more of a display unit 816, an audio circuit 818, a camera 820, a sensor 822, and the like.
  • The audio circuit 818 may also include a speaker 8182, a microphone 8184, and the like.
  • The terminal device involved in the embodiments of the present application is not limited to a mobile phone, a tablet, a smart watch, and the like, and also includes any terminal device having a TTY service function.
  • The processor may be a central processing unit (CPU), a microprocessor, an application-specific integrated circuit (ASIC), or one or more integrated circuits for controlling execution of the program of the present application.
  • The processor can include a digital signal processor device, a microprocessor device, an analog-to-digital converter, a digital-to-analog converter, and the like.
  • The processor can distribute the control and signal processing functions of the mobile device among these devices according to their respective capabilities.
  • The processor can include functionality to operate one or more software programs, which can be stored in memory.
  • The functions of the processor may be implemented by hardware, or by hardware executing corresponding software.
  • The hardware or software includes one or more modules corresponding to the functions described above.
  • The memory may be a read-only memory (ROM) or another type of static storage device that can store static information and instructions, or a random access memory (RAM) or another type of dynamic storage device that can store information and instructions. It may also be an electrically erasable programmable read-only memory (EEPROM), a compact disc read-only memory (CD-ROM) or other optical disc storage (including a compact disc, a laser disc, an optical disc, a digital versatile disc, a Blu-ray disc, etc.), a magnetic disk storage medium or other magnetic storage device, or any other medium that can be used to carry or store desired program code in the form of instructions or data structures and that can be accessed by a computer, but is not limited thereto.
  • If the above functions are implemented in the form of software and sold or used as stand-alone products, they can be stored in a computer-readable storage medium.
  • Based on such an understanding, the part of the technical solution of the present application that is essential or that contributes to the prior art, or a part of the technical solution, may be embodied in the form of a software product stored in a storage medium.
  • The software product includes a number of instructions to cause a computer device (which may be a personal computer, a server, a network device, etc.) to perform all or part of the steps of the methods described in the various embodiments of the present application.
  • The foregoing storage medium includes any medium that can store program code, such as a USB flash drive, a removable hard disk, a read-only memory (ROM), a random access memory (RAM), a magnetic disk, or an optical disc.
  • The units described above as separate components may or may not be physically separated.
  • The components displayed as units may or may not be physical units; that is, they may be located in one place or distributed across multiple network units.
  • The purpose of the technical solution of the embodiments of the present application may be achieved by selecting some or all of the units according to actual needs.
  • The functional units in the embodiments of the present application may be integrated into one processing unit, each unit may exist physically separately, or two or more units may be integrated into one unit.

Abstract

Provided in the present application is a voice communication method, capable of avoiding the problems of the high bit error rate and inconvenient carrying when using traditional TTY devices for voice communication. The method comprises: a first application in a terminal device receives text information inputted by a user; the first application converts the text information into voice data; the first application sends the voice data to a voice transmission module in the terminal device; and the voice transmission module sends the voice data to another terminal device performing voice communication with said terminal device.

Description

Voice communication method and voice communication device
The present application claims priority to Chinese Patent Application No. 201711001961.4, filed with the Chinese Patent Office on October 24, 2017 and entitled "Voice communication method and voice communication device", which is incorporated herein by reference in its entirety.
Technical field
The present application relates to the field of information technology, and in particular, to a voice communication method and a voice communication device.
Background
The teletypewriter (TeleTYpe, TTY) or Text Telephone Devices for the Deaf (TDD) service is a mobile "voice" service provided specifically for people who are deaf or mute. A device with this function is called a TTY device and is designed for people who are deaf, mute, or hearing impaired. After a TTY device is plugged into a terminal device, the terminal device supports sending and receiving text. In addition to the deaf-mute mode, TTY devices support a hear-only (no speaking) mode and a speak-only (no hearing) mode, so that deaf or mute users can also conveniently use the telephone for real-time communication.
However, a TTY device is dedicated hardware, and it communicates with the terminal device through analog signals. During data transmission, conversion, and processing, signal loss and distortion inevitably occur, so the bit error rate is relatively high. In addition, TTY devices are heavy and bulky, which makes them very inconvenient to carry.
Summary of the invention
The present application provides a voice communication method and a voice communication device that can avoid problems such as a high bit error rate and inconvenient carrying when a traditional TTY device is used for voice communication.
In a first aspect, the present application provides a voice communication method. The method includes: a first application on a terminal device obtains text information input by a user; the first application converts the text information into voice data; the first application sends the voice data to a voice transmission module of the terminal device; and the voice transmission module sends the voice data to other terminal devices that perform voice communication with the terminal device.
The first application in the embodiments of the present application has the function of converting between text and voice. For example, the first application may encode text information based on the TTY protocol to obtain voice data. Alternatively, the first application may decode voice data based on the TTY protocol to obtain text information.
For details of the TTY protocol, reference may be made to the prior art; they are not described in detail herein.
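As a rough illustration of the kind of character coding on which TTY text is conventionally based, the following sketch frames letters as 5-bit Baudot (ITA2) codes and decodes them again. It is not taken from the present application: the letters-only table, the least-significant-bit-first order, the single stop bit, and the function names `encode_text` and `decode_bits` are all simplifying assumptions made for this example.

```python
from typing import List

# Assumption: letters-shift subset of the 5-bit ITA2 (Baudot) alphabet.
# The string index is the 5-bit code; "\x00" marks codes not used in this sketch.
ITA2_LETTERS = "\x00E\nA SIU\rDRJNFCKTZLWHYPQOBG\x00MXV\x00"


def encode_text(text: str) -> List[int]:
    """Frame each character as: start bit 0, five data bits (LSB first), stop bit 1."""
    bits: List[int] = []
    for ch in text.upper():
        code = ITA2_LETTERS.find(ch)
        if code <= 0:  # not found (-1) or the unused NUL slot (0)
            raise ValueError(f"unsupported character: {ch!r}")
        bits.append(0)
        bits.extend((code >> k) & 1 for k in range(5))
        bits.append(1)
    return bits


def decode_bits(bits: List[int]) -> str:
    """Inverse of encode_text: split into 7-bit frames and look up each code."""
    chars = []
    for i in range(0, len(bits), 7):
        frame = bits[i:i + 7]
        if len(frame) != 7 or frame[0] != 0 or frame[6] != 1:
            raise ValueError("malformed frame")
        code = sum(bit << k for k, bit in enumerate(frame[1:6]))
        chars.append(ITA2_LETTERS[code])
    return "".join(chars)
```

A real implementation would also handle the figures shift, digits, and punctuation, and would follow the framing required by the applicable TTY standard.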
Compared with voice communication using a traditional TTY device, in the technical solution provided by the present application the conversion between the text information input by the user and the voice data is performed entirely by the first application. In other words, all processing is performed on digital signals and no conversion between analog and digital signals is involved, so the bit error rate can be reduced. In addition, the user no longer needs to carry a TTY device: voice communication can be carried out through the first application on the terminal device, which avoids the inconvenience of carrying a TTY device.
With reference to the first aspect, in some implementations of the first aspect, the first application converting the text information into voice data includes: the first application encodes the text information based on the text telephone (TTY) protocol to obtain the voice data.
With reference to the first aspect, in some implementations of the first aspect, the voice transmission module sending the voice data to other terminal devices that perform voice communication with the terminal device includes: the voice transmission module sends the voice data through a cellular network to the other terminal devices that perform voice communication with the terminal device.
With reference to the first aspect, in some implementations of the first aspect, the terminal device further includes a display module, and the method further includes: the first application sends the obtained text information input by the user to the display module; and the display module displays the text information on the interface of the first application.
In a second aspect, the present application provides a voice communication method. The method includes: a voice transmission module in a terminal device receives voice data sent by another terminal device; the voice transmission module sends the voice data to a first application on the terminal device; and the first application converts the voice data into text information.
With reference to the second aspect, in some implementations of the second aspect, the first application converting the voice data into text information includes: the first application decodes the voice data based on the text telephone (TTY) protocol to obtain the text information.
With reference to the second aspect, in some implementations of the second aspect, the terminal device further includes a display module, and the method further includes: the first application sends the text information to the display module; and the display module displays the text information on the interface of the first application.
With reference to the second aspect, in some implementations of the second aspect, the terminal device further includes a display module, and the method further includes: the first application corrects the text information to obtain corrected text information; the first application sends the corrected text information to the display module; and the display module displays the corrected text information.
It should be understood that the display module that directly displays the decoded text information and the display module that displays the corrected text information may be the same display module.
In a third aspect, the present application provides a voice communication device that has the function of implementing the voice communication method of the first aspect. These functions may be implemented by hardware, or by hardware executing corresponding software. The hardware or software includes one or more units corresponding to the functions described above.
In a fourth aspect, the present application provides a voice communication device that has the function of implementing the voice communication method of the second aspect. These functions may be implemented by hardware, or by hardware executing corresponding software. The hardware or software includes one or more units corresponding to the functions described above.
In a fifth aspect, the present application provides a voice communication device that includes a transceiver, a processor, and a memory. The processor is configured to control the transceiver to send and receive signals, the memory is configured to store a computer program, and the processor is configured to call and run the computer program from the memory, so that the voice communication device performs the voice communication method of the first aspect or any possible implementation thereof.
In a sixth aspect, the present application provides a voice communication device that includes a transceiver, a processor, and a memory. The processor is configured to control the transceiver to send and receive signals, the memory is configured to store a computer program, and the processor is configured to call and run the computer program from the memory, so that the voice communication device performs the voice communication method of the second aspect or any possible implementation thereof.
In a seventh aspect, the present application provides a communication device. The communication device may be the voice communication device in the above method designs, or a chip disposed in the voice communication device. The communication device includes a memory, a communication interface, and a processor. The memory is configured to store computer-executable program code that includes instructions; when the processor executes the instructions, the communication device performs the method of the first aspect or the second aspect, or of any possible implementation thereof.
In an eighth aspect, the present application provides a computer program product. The computer program product includes computer program code that, when run on a computer, causes the computer to perform the method of the first aspect or the second aspect, or of any possible implementation thereof.
In a ninth aspect, the present application provides a chip system. The chip system includes a processor configured to implement the functions of the voice communication device involved in the first aspect or the second aspect, or in any possible implementation thereof, for example, receiving or processing the voice data and/or text information involved in the above methods. In one possible design, the chip system further includes a memory configured to store the program instructions and data that the voice communication device needs in order to perform the voice communication method. The chip system may consist of a chip, or may include a chip and other discrete devices.
In the embodiments of the present application, implementing the function of the TTY device in software avoids problems such as a high bit error rate and inconvenient carrying when a traditional TTY hardware device is used for voice communication.
Brief description of the drawings
FIG. 1 is a schematic diagram of the working principle of a TTY device.
FIG. 2 is a structural block diagram of a TTY device.
FIG. 3 is a schematic flowchart of a voice communication method 300 provided by the present application.
FIG. 4 shows an application scenario to which the voice communication method provided by the present application is applicable.
FIG. 5 shows another application scenario to which the voice communication method provided by the present application is applicable.
FIG. 6 is a schematic block diagram of a voice communication device 600 provided by the present application.
FIG. 7 is a schematic diagram of a voice communication device 700 provided by the present application.
FIG. 8 is a schematic structural diagram of a terminal device 800 provided by the present application.
Detailed description
The technical solutions of the present application are described below with reference to the accompanying drawings.
For ease of understanding, the concepts involved in the present application are first briefly introduced.
The teletypewriter (TeleTYpe, TTY) service or Text Telephone Devices for the Deaf (TDD) service is a mobile "voice" service for people who are deaf or mute. The TTY or TDD service generally supports three TTY function modes: Full mode (also called TTY mode), Voice Carry Over (VCO) mode (speak only, no hearing), and Hearing Carry Over (HCO) mode (hear only, no speaking). The TTY mode supports both sending and receiving TTY text. In TTY mode, when the terminal device is connected to a TTY device, only character transmission is possible and voice calls cannot be made. When the terminal device is disconnected from the TTY device, it can make normal calls with other terminal devices. In VCO mode, the terminal device can switch between receiving TTY text and sending voice; this mode is mainly intended for users who are deaf. In HCO mode, the terminal device can switch between sending TTY text and listening to the call; this mode is mainly intended for users who cannot speak.
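The three modes described above can be summarized as a small capability table. The following sketch is only an illustration of that summary; the names `TtyMode`, `Capabilities`, and `is_allowed` are hypothetical and do not appear in the present application.

```python
from dataclasses import dataclass
from enum import Enum


@dataclass(frozen=True)
class Capabilities:
    send_text: bool
    receive_text: bool
    send_voice: bool
    receive_voice: bool


class TtyMode(Enum):
    # Full (TTY) mode: text in both directions, no voice call.
    FULL = Capabilities(send_text=True, receive_text=True, send_voice=False, receive_voice=False)
    # Voice Carry Over: the user speaks and reads the incoming text.
    VCO = Capabilities(send_text=False, receive_text=True, send_voice=True, receive_voice=False)
    # Hearing Carry Over: the user types and listens to the incoming voice.
    HCO = Capabilities(send_text=True, receive_text=False, send_voice=False, receive_voice=True)


def is_allowed(mode: TtyMode, action: str) -> bool:
    """Return whether the named action (a Capabilities field) is available in the mode."""
    return getattr(mode.value, action)
```

For example, `is_allowed(TtyMode.VCO, "send_voice")` is true, whereas `is_allowed(TtyMode.VCO, "send_text")` is false.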
TTY devices and TDD devices work in a similar way. Only the TTY device is described below as an example.
The working principle of the TTY device is briefly described below.
FIG. 1 is a schematic diagram of the working principle of a TTY device. Referring to FIG. 1, the existing TTY solution requires a dedicated TTY device connected to a terminal device. Once the user has connected the call on the terminal device, text can be entered and displayed on the TTY device to communicate with the peer.
The use of a TTY device can be roughly described as follows: the TTY device provides a 3.5 mm audio connector for connection to the terminal device. If the audio connector of the terminal device does not match the audio connector provided by the TTY device, an adapter cable can be used.
The working principles of the TTY device and the TDD device are similar. To avoid redundancy, the following description uses the TTY device as an example.
FIG. 2 is a structural block diagram of a TTY device. As shown in FIG. 2, the implementation of the TTY service mainly comprises two parts: an encoder and a decoder. The encoder detects the Baudot tones transmitted on the pulse code modulation (PCM) link, parses them into the corresponding TTY character information, and sends that information to the decoder. After receiving the TTY character information, the decoder restores it to the corresponding Baudot tones.
FER in FIG. 2 denotes the frame error rate (Frame Error Rate, FER).
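The Baudot tones handled by the encoder and decoder of FIG. 2 can be illustrated with a minimal frequency-shift keying sketch: each bit becomes a short tone in a stream of PCM samples, and the receiver decides each bit by comparing the energy at the two tone frequencies. The parameter values below (8000 Hz sample rate, 45.45 baud, 1400 Hz for bit 1, 1800 Hz for bit 0) are the values conventionally used for Baudot TTY in North America and are assumptions here; the present application does not state them. The function names are likewise hypothetical.

```python
import math
from typing import List

SAMPLE_RATE = 8000               # Hz; assumed narrowband PCM rate
BAUD = 45.45                     # bits per second; assumed Baudot rate
MARK_HZ, SPACE_HZ = 1400, 1800   # assumed tones for bit 1 and bit 0
SAMPLES_PER_BIT = round(SAMPLE_RATE / BAUD)


def modulate(bits: List[int]) -> List[float]:
    """Turn each bit into SAMPLES_PER_BIT samples of the matching tone."""
    samples: List[float] = []
    for bit in bits:
        freq = MARK_HZ if bit else SPACE_HZ
        samples.extend(
            math.sin(2 * math.pi * freq * n / SAMPLE_RATE)
            for n in range(SAMPLES_PER_BIT)
        )
    return samples


def tone_energy(chunk: List[float], freq: float) -> float:
    """Energy of the chunk at one frequency (single-bin correlation)."""
    re = sum(s * math.cos(2 * math.pi * freq * n / SAMPLE_RATE) for n, s in enumerate(chunk))
    im = sum(s * math.sin(2 * math.pi * freq * n / SAMPLE_RATE) for n, s in enumerate(chunk))
    return re * re + im * im


def demodulate(samples: List[float]) -> List[int]:
    """Decide each bit by comparing mark and space energy in its time slot."""
    bits = []
    for start in range(0, len(samples) - SAMPLES_PER_BIT + 1, SAMPLES_PER_BIT):
        chunk = samples[start:start + SAMPLES_PER_BIT]
        bits.append(1 if tone_energy(chunk, MARK_HZ) > tone_energy(chunk, SPACE_HZ) else 0)
    return bits
```

This sketch assumes the receiver already knows where each bit begins; a practical decoder would also need to detect the start bit and tolerate noise and timing drift.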
Existing TTY devices are expensive to purchase. Moreover, because the TTY device and the terminal device communicate through analog signals, signal loss and distortion inevitably occur during signal transmission, processing, and conversion, so the bit error rate is relatively high. In addition, TTY devices are generally heavy and bulky and are inconvenient to carry.
To this end, the present application provides a method for implementing voice communication that can avoid problems such as a high bit error rate and inconvenient carrying when a traditional TTY hardware device is used.
In the embodiments of the present application, application software that converts between text and voice runs on the terminal device, and the user can carry out voice communication by using this application software.
In the embodiments of the present application, the application software that runs on the terminal device and converts between text and voice needs to obtain the same permissions as the call module on the terminal device. For brevity, this software is referred to below as the first application.
It should be noted that the first application in the embodiments of the present application differs from applications based on data services (for example, WeChat): the first application described herein is based on the voice service. In other words, the first application relies on a real-time communication network. Moreover, the voice service is supported by all operator networks, so the first application can be used anywhere with signal coverage.
In addition, the first application herein needs to have the same permissions as an ordinary call. Because user privacy is involved, this requires a high level of authorization. Taking the Android operating system as an example, the first application requires the Android package (APK) and the underlying driver software to work together, and the underlying driver software can be modified only by the original equipment manufacturer (OEM). Therefore, the first application in the embodiments of the present application is not suitable for being provided by a third party alone.
The voice communication method provided by the present application is applicable to a variety of scenarios, for example, communication between a terminal device running the first application and a traditional TTY device, or communication between a terminal device running the first application and another terminal device.
Referring to FIG. 3, FIG. 3 is a schematic flowchart of a voice communication method 300 provided by the present application.
It should be noted that the terminal device that performs the method 300 needs to run the first application and should also have at least a voice transmission module. The first application is application software that converts between text and voice. By interacting with the first application, the voice transmission module can send the voice data output by the first application to other terminal devices that perform voice communication with the terminal device. Alternatively, the voice transmission module can send voice data received from other terminal devices to the first application, which then converts the voice data into text information.
It can be understood that the first application on the terminal device allows text information and voice to be converted into each other. Thus, if the user of the terminal device is deaf, mute, or hearing impaired, the terminal device can support sending and receiving text and thereby help the user carry out normal voice communication.
310. The first application on the terminal device obtains text information input by the user.
Depending on the operating system running on the terminal device, the user can enter text using any of various input methods.
Further, the terminal device can offer modes such as "hear only, no speaking (HCO)", "speak only, no hearing (VCO)", and "FULL" for the user to choose from. These modes are described above.
In the hear-only (HCO) mode, the user can enter the text to be sent on the interface of the first application, and the entered text can be displayed on the interface in real time. In the speak-only (VCO) mode, the user can also provide input by voice or by text. In FULL mode, the user can enter the text to be sent on the interface of the first application, and information sent by the other party is displayed as text on the interface of the first application so that the user can read it.
320. The first application converts the text information into voice data.
The first application can convert the text information into voice data. For example, the first application can encode the text input by the user based on the TTY protocol to obtain the voice data.
330. The first application sends the voice data to the voice transmission module.
340. The voice transmission module sends the voice data to other terminal devices that perform voice communication with the terminal device.
While the user is entering text, the first application can convert the entered text into voice data in real time and send the voice data to the voice transmission module. The voice transmission module on the terminal device is responsible for sending the voice data through the cellular network to the other terminal devices that perform voice communication with the terminal device.
For convenience of description, the terminal device described in steps 310-340 is referred to as the first terminal device, and the other terminal device that performs voice communication with the first terminal device is referred to as the second terminal device.
After the voice transmission module on the second terminal device receives the voice data from the first terminal device, it sends the voice data to the first application on the second terminal device. The first application on the second terminal device converts the voice data into text information. In this way, the user of the second terminal device receives the text information sent by the first terminal device. Because the text information input by the user can be converted into voice data in real time, the two parties to the voice communication can communicate in real time.
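The flow of steps 310-340 and the receiving side just described can be sketched as two cooperating objects on each terminal device. This is only an illustration under stated assumptions: the class and method names are hypothetical, UTF-8 bytes stand in for the TTY-encoded voice data, and a direct function call stands in for the cellular network.

```python
from typing import Callable, List, Optional


class VoiceTransmissionModule:
    """Hypothetical stand-in for the voice transmission module (for example, a modem)."""

    def __init__(self) -> None:
        self.network: Optional[Callable[[bytes], None]] = None     # delivers data to the peer
        self.on_receive: Optional[Callable[[bytes], None]] = None  # set by the first application

    def send(self, voice_data: bytes) -> None:
        # Step 340: pass the voice data on toward the other terminal device.
        if self.network is None:
            raise RuntimeError("no network attached")
        self.network(voice_data)

    def receive(self, voice_data: bytes) -> None:
        # Receiving side: hand incoming voice data to the first application.
        if self.on_receive is not None:
            self.on_receive(voice_data)


class FirstApplication:
    """Hypothetical first application that converts between text and voice data."""

    def __init__(self, transport: VoiceTransmissionModule) -> None:
        self.transport = transport
        self.received: List[str] = []
        transport.on_receive = self._on_voice_data

    def submit_text(self, text: str) -> None:
        # Steps 310-330: obtain text, convert it, and pass it to the module.
        voice_data = text.encode("utf-8")  # placeholder for TTY-protocol encoding
        self.transport.send(voice_data)

    def _on_voice_data(self, voice_data: bytes) -> None:
        # Convert received voice data back into text for display.
        self.received.append(voice_data.decode("utf-8"))


def demo() -> List[str]:
    modem_a, modem_b = VoiceTransmissionModule(), VoiceTransmissionModule()
    app_a, app_b = FirstApplication(modem_a), FirstApplication(modem_b)
    modem_a.network = modem_b.receive  # direct call stands in for the cellular network
    app_a.submit_text("Long time no see")
    return app_b.received
```

Keeping the conversion in the application object and the delivery in the transmission object mirrors the division of work between the first application and the voice transmission module described above.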
与使用传统的TTY设备相比,用户输入的文字信息与语音数据之间相互转换的过程全部通过第一应用实现,也就说,都是对数字信号的处理,不再涉及模拟信号和数字信号之间的转换,因而可以降低误码率。同时,用户不再需要随身携带TTY设备,通过终端设备上的第一应用就可以实现语音交流和沟通,避免了TTY设备携带不便的缺点。Compared with the traditional TTY device, the process of converting the text information and the voice data input by the user is realized by the first application, that is, the processing of the digital signal is no longer involved, and the analog signal and the digital signal are no longer involved. The conversion between them can thus reduce the bit error rate. At the same time, the user no longer needs to carry the TTY device with him, and the voice communication and communication can be realized through the first application on the terminal device, thereby avoiding the disadvantage that the TTY device is inconvenient to carry.
参见图4,图4是适用于本申请实施例的语音通信方法的一种应用场景。Referring to FIG. 4, FIG. 4 is an application scenario of a voice communication method applicable to an embodiment of the present application.
401. The first application on terminal device A obtains the text information input by the user.
For example, the first application obtains the user's input "Long time no see, how have you been?"
402. The first application converts the text input by the user into voice data.
While the user is typing, the first application running on terminal device A encodes the entered text directly, according to the TTY protocol, into voice data that conforms to the TTY protocol.
In step 402, the voice data obtained by encoding is a digital signal.
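The encoding in step 402 can be illustrated with a minimal sketch. The description does not specify the bit-level format of the TTY protocol, so the framing below (one start bit, eight data bits sent least-significant bit first, one stop bit per UTF-8 byte) is purely an assumption for illustration, and the function name `encode_text` is hypothetical.

```python
from typing import List


def encode_text(text: str) -> List[int]:
    """Encode text into a flat list of bits, one 10-bit frame per UTF-8 byte.

    Assumed framing (not taken from the application): start bit 0,
    eight data bits LSB first, stop bit 1.
    """
    bits: List[int] = []
    for byte in text.encode("utf-8"):
        bits.append(0)                                   # start bit
        bits.extend((byte >> i) & 1 for i in range(8))   # data bits, LSB first
        bits.append(1)                                   # stop bit
    return bits


frame = encode_text("A")  # 'A' is 0x41, so the data bits are 1,0,0,0,0,0,1,0
```

The result is already a digital signal, which matches the point made in step 402: no analog stage is involved until the voice transmission module modulates the data.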
403. The first application sends the voice data to the voice transmission module. The voice transmission module receives the voice data sent by the first application, modulates it, and transmits it.
For example, the voice transmission module may be a modulation module (modem). The first application sends the encoded voice data to the modulation module on terminal device A. The modulation module modulates the voice data into an analog signal and sends it to the operator network, which delivers the analog signal to terminal device B.
If an abnormal condition (for example, a network failure or loss of network connection) occurs while the analog signal is being sent, the modulation module sends feedback information to terminal device A in real time. The receiver on terminal device A forwards the received information to the first application, which displays a prompt on its interface to notify the user.
In VCO mode, the first application displays a text prompt on the software interface, for example "The entered text will be resent shortly" or "Network error, please wait". In HCO mode, the first application may give a voice prompt, or may give a voice prompt and a text prompt at the same time.
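The mode-dependent prompting described above can be sketched as a small selection function. The names `Mode` and `prompt_channels` and the flag `hco_also_text` are hypothetical and are introduced only for illustration; the application states the behavior, not an interface.

```python
from enum import Enum
from typing import Set


class Mode(Enum):
    VCO = "vco"
    HCO = "hco"


def prompt_channels(mode: Mode, hco_also_text: bool = False) -> Set[str]:
    """Return the channels on which an abnormal-condition prompt is given.

    VCO mode: text prompt on the software interface.
    HCO mode: voice prompt, optionally together with a text prompt.
    """
    if mode is Mode.VCO:
        return {"text"}
    return {"voice", "text"} if hco_also_text else {"voice"}
```

For instance, `prompt_channels(Mode.HCO, hco_also_text=True)` selects both a voice prompt and a text prompt.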
404. Terminal device B receives the analog signal sent by terminal device A and demodulates it to obtain the voice data.
The voice transmission module on terminal device B receives the analog signal from terminal device A and demodulates it to obtain the voice data. At this point the voice data is a digital signal.
405. The first application on terminal device B converts the voice data into text information.
In steps 404-405, after receiving the analog signal, terminal device B first demodulates it into a digital signal and then forwards the digital signal to the first application running on terminal device B. The first application decodes the digital signal (that is, the voice data) based on the TTY protocol to obtain the decoded text.
For example, the text decoded by the first application on terminal device B is "Long time no see, how have you been?"
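The decoding in step 405 can be sketched in the same spirit. The bit-level format of the TTY protocol is not given in the description, so this example assumes a hypothetical framing of one start bit (0), eight data bits sent least-significant bit first, and one stop bit (1) per UTF-8 byte; `decode_bits` is an illustrative name, not part of the application.

```python
from typing import List


def decode_bits(bits: List[int]) -> str:
    """Decode a stream of 10-bit frames (assumed framing) back into text."""
    if len(bits) % 10 != 0:
        raise ValueError("bit stream is not a whole number of 10-bit frames")
    data = bytearray()
    for i in range(0, len(bits), 10):
        frame = bits[i:i + 10]
        if frame[0] != 0 or frame[9] != 1:
            raise ValueError(f"framing error at bit {i}")
        # Reassemble the byte from its data bits, LSB first.
        data.append(sum(bit << k for k, bit in enumerate(frame[1:9])))
    # Undecodable byte sequences are replaced rather than raising, so that
    # corrupted input still yields text that a later correction stage can inspect.
    return data.decode("utf-8", errors="replace")


text = decode_bits([0, 1, 0, 0, 0, 0, 0, 1, 0, 1])  # one frame carrying 0x41
```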
Further, the terminal device may also include a display module.
406. The display module on terminal device B displays the decoded text information in real time.
Specifically, the display module displays the text "Long time no see, how have you been?" in real time on the interface of the first application.
In step 406, terminal device B may maintain a buffer holding a preset number of data packets (for example, 10) and display the text at a set speed, so that network fluctuations do not degrade the user experience.
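The buffering in step 406 could look roughly like the sketch below. The class name `DisplayBuffer`, the `chars_per_tick` parameter, and the policy of rejecting packets when the buffer is full are assumptions made for illustration; the description only states that a buffer of a preset number of packets is used and that text is shown at a set speed.

```python
from collections import deque


class DisplayBuffer:
    """Hold up to `capacity` decoded packets and release text at a fixed rate."""

    def __init__(self, capacity: int = 10, chars_per_tick: int = 2) -> None:
        self.capacity = capacity
        self.chars_per_tick = chars_per_tick
        self._packets: deque = deque()
        self._pending = ""  # text taken from packets but not yet displayed

    def push(self, text: str) -> bool:
        """Queue one decoded packet; return False if the buffer is full."""
        if len(self._packets) >= self.capacity:
            return False
        self._packets.append(text)
        return True

    def tick(self) -> str:
        """Return the next slice of text to display (may be empty)."""
        while len(self._pending) < self.chars_per_tick and self._packets:
            self._pending += self._packets.popleft()
        out = self._pending[:self.chars_per_tick]
        self._pending = self._pending[self.chars_per_tick:]
        return out
```

Calling `tick()` from a display timer decouples the display speed from packet arrival times, which is the smoothing effect the step describes.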
FIG. 4 above takes as its application scenario the case in which both parties to the call use the voice communication method provided by the present application; that is, both parties run the first application on their terminal devices and use it to convert between text and voice. In another application scenario, one party to the call uses the voice communication method provided by the present application, while the other party continues to use a traditional TTY device.
As described above, communication between a traditional TTY device and a terminal device passes through multiple analog-to-digital conversions (ADC) and/or digital-to-analog conversions (DAC). These conversions inevitably distort and degrade the signal, resulting in a relatively high bit error rate.
Referring to FIG. 5, FIG. 5 shows another application scenario to which the voice communication method provided by the present application is applicable. As shown in FIG. 5, terminal device C uses the voice communication method of the embodiments of the present application and runs the first application. Terminal device D is connected to a traditional TTY device. Because the TTY device converts between analog and digital signals while processing and transmitting data packets, its bit error rate is relatively high, so the data packets that terminal device C receives from terminal device D may contain bit errors. For this scenario specifically, the present application proposes that the first application on terminal device C can perform error correction on data packets from other terminal devices.
Specifically, after the voice transmission module on terminal device C receives the voice data sent by terminal device D, it sends the voice data to the first application on terminal device C. The first application decodes the voice data based on the TTY protocol, converting it into text information. The first application can then correct the decoded text information to obtain corrected text. Finally, the display module on terminal device C displays the corrected text.
In addition, error correction of the data packets may also be performed by an application on the terminal device other than the first application. For example, a self-learning expert system may run on terminal device C; the first application sends the decoded text information to the expert system, which corrects the errors in the text information.
In the embodiments of the present application, the role of the expert system is to perform a second-pass correction on the decoded text (similar, for example, to spelling correction in a Word document). This is a comparison-based correction method that relies on an existing lexicon. The system can compare the decoded text against the entries in the lexicon, for example by fuzzy matching. If the decoded text cannot be found in the lexicon, the system marks it and suggests the closest entry.
For example, if "hello" is corrupted into "helle" because of the network or for other reasons, the expert system can underline "helle" with a wavy line and suggest the closest entry, "hello".
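The lexicon-based correction can be sketched with fuzzy matching from the Python standard library. This is only one possible way to realize the comparison; the application does not name a matching algorithm, and the function name `check_word`, the sample lexicon, and the similarity cutoff of 0.6 are assumptions made for illustration.

```python
import difflib
from typing import List, Optional, Tuple


def check_word(word: str, lexicon: List[str]) -> Tuple[bool, Optional[str]]:
    """Return (flagged, suggestion) for one decoded word.

    A word found in the lexicon is not flagged. Otherwise it is flagged and
    the closest lexicon entry, if any is similar enough, is suggested.
    """
    if word in lexicon:
        return False, None
    matches = difflib.get_close_matches(word, lexicon, n=1, cutoff=0.6)
    return True, (matches[0] if matches else None)


lexicon = ["hello", "help", "world"]
flagged, suggestion = check_word("helle", lexicon)  # flagged, suggests "hello"
```

A user interface could render a flagged word with a wavy underline and offer the suggestion, as in the "helle" example. Because the lexicon is plain data, it can be replaced during an upgrade without changing the matching code.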
Because the correction is based on comparison against a lexicon, and the lexicon is in practice updated continually, the lexicon can be upgraded along with the operating system running on the terminal device, which strengthens the correction capability.
Taking the Android operating system as an example, the lexicon can be upgraded along with APK upgrades.
It can be understood that, because the expert system is application software running on the terminal device, a central server can push upgrades to it at any time, strengthening the expert system's resistance to bit errors.
Alternatively, the expert system in the embodiments of the present application may be understood as application software with a text correction function. In a specific implementation, the functions of the expert system may be integrated into the first application, or may be implemented separately and interact with the first application; the embodiments of the present application impose no limitation in this respect.
The voice communication method provided by the present application has been described in detail above with reference to FIG. 3 to FIG. 5. As can be seen, by implementing the functions of a TTY device in software, the embodiments of the present application avoid the problems of traditional TTY devices, such as a high bit error rate and poor portability.
In addition, deaf and speech-impaired users no longer have to purchase costly hardware TTY devices; a terminal device designed according to the voice communication method of the embodiments of the present application is sufficient, which reduces the purchase cost.
The voice communication apparatus provided by the present application is described below.
FIG. 6 is a schematic diagram of a voice communication apparatus 600 provided by the present application. As shown in FIG. 6, the apparatus 600 includes a first application 610 and a voice transmission module 620, wherein:
the first application 610 is configured to obtain text information input by a user and to convert the text information into voice data;
the first application 610 is further configured to send the voice data to the voice transmission module 620;
the voice transmission module 620 is configured to send the voice data to other terminal devices that are in voice communication with the apparatus 600.
The units of the voice communication apparatus 600 of the embodiments of the present application, and the other operations or functions described above, respectively implement the corresponding procedures or operations of the voice communication method provided by the present application. For brevity, details are not repeated here.
By implementing the functions of a TTY device in software, the voice communication apparatus provided by the present application avoids the problems of traditional TTY devices, such as a high bit error rate and poor portability.
FIG. 7 is a schematic diagram of a voice communication apparatus 700 provided by the present application. As shown in FIG. 7, the apparatus 700 includes a voice transmission module 710 and a first application 720, wherein:
the voice transmission module 710 is configured to receive voice data sent by another terminal device and to send the voice data to the first application 720;
the first application 720 is configured to convert the voice data into text information.
The units of the voice communication apparatus 700 of the embodiments of the present application, and the other operations or functions described above, respectively implement the corresponding procedures or operations of the voice communication method provided by the present application. For brevity, details are not repeated here.
By implementing the functions of a TTY device in software, the voice communication apparatus provided by the present application avoids the problems of traditional TTY devices, such as a high bit error rate and poor portability.
It should be noted that the first application 610 in FIG. 6 and the first application 720 in FIG. 7 are drawn with dashed boxes to indicate that the first application is application software.
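The division of labor in apparatus 600 and apparatus 700 can be sketched as two small classes connected by a callback. All class, method, and parameter names here are hypothetical, the cellular link is replaced by a direct function call, and plain UTF-8 encoding stands in for the TTY encoding and decoding, which the description does not specify at the bit level.

```python
from typing import Callable, List


class SendingApparatus:
    """Sketch of apparatus 600: first application plus voice transmission module."""

    def __init__(self, transmit: Callable[[bytes], None]) -> None:
        self._transmit = transmit  # stands in for voice transmission module 620

    def send_text(self, text: str) -> None:
        voice_data = text.encode("utf-8")  # placeholder for TTY encoding (610)
        self._transmit(voice_data)


class ReceivingApparatus:
    """Sketch of apparatus 700: voice transmission module plus first application."""

    def __init__(self, on_text: Callable[[str], None]) -> None:
        self._on_text = on_text  # e.g. hands the text to a display module

    def on_voice_data(self, voice_data: bytes) -> None:
        # Module 710 passes the data to the first application (720), which decodes it.
        self._on_text(voice_data.decode("utf-8"))


shown: List[str] = []
receiver = ReceivingApparatus(on_text=shown.append)
sender = SendingApparatus(transmit=receiver.on_voice_data)
sender.send_text("Long time no see")
```

Keeping the transmission path behind a callback mirrors the description's point that the first application is software, separate from the module that actually sends or receives the data.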
When the voice communication apparatus in the embodiments of the present application is specifically a terminal device, the structure of the terminal device may be as shown in FIG. 8. FIG. 8 is a schematic structural diagram of a terminal device 800 provided by the present application.
As shown in FIG. 8, the terminal device 800 includes an input unit 814 and a processor 804. The terminal device 800 may further include a memory 819 for storing computer instructions.
The input unit 814 is configured to receive text information input by the user.
The processor 804 is configured to run the computer instructions stored in the memory 819, convert the text information into voice data, and send the voice data to a transceiver 808.
The transceiver 808 is configured to send the voice data to other terminal devices that are in voice communication with the terminal device.
It should be understood that the terminal device 800 here is one party to the voice communication, and the other terminal devices are the other party.
Further, the processor 804 may be configured to perform the actions described in the foregoing method embodiments as being implemented internally by the terminal device, and the transceiver 808 may be configured to perform the receiving and sending actions of the terminal device described in the foregoing method embodiments. For details, refer to the description of the foregoing method embodiments; they are not repeated here.
The processor 804 and the memory 819 may be combined into a single processing apparatus, in which the processor 804 executes the program code stored in the memory 819 to implement the functions described above. In a specific implementation, the memory 819 may also be integrated into the processor 804.
The terminal device 800 may further include a power supply 812 for supplying power to the components and circuits of the terminal device 800. The terminal device 800 may include an antenna 810 for transmitting, as a wireless signal, the data or information output by the transceiver 808.
In addition, to provide more complete functionality, the terminal device 800 may further include one or more of a display unit 816, an audio circuit 818, a camera 820, a sensor 822, and the like. The audio circuit may further include a speaker 8182, a microphone 8184, and the like.
The terminal devices involved in the embodiments of the present application are not limited to mobile phones, tablets, smart watches, and the like; they also include any terminal device capable of establishing a TTY service.
In the above embodiments, the processor may be a central processing unit (CPU), a microprocessor, an application-specific integrated circuit (ASIC), or one or more integrated circuits for controlling execution of the programs of the solution of the present application. For example, the processor may include a digital signal processor device, a microprocessor device, an analog-to-digital converter, a digital-to-analog converter, and the like. The processor may distribute the control and signal processing functions of the mobile device among these devices according to their respective capabilities. In addition, the processor may include functionality for operating one or more software programs, which may be stored in the memory.
The functions of the processor may be implemented by hardware, or by hardware executing corresponding software. The hardware or software includes one or more modules corresponding to the functions described above.
The memory may be a read-only memory (ROM) or another type of static storage device that can store static information and instructions, or a random access memory (RAM) or another type of dynamic storage device that can store information and instructions. It may also be an electrically erasable programmable read-only memory (EEPROM), a compact disc read-only memory (CD-ROM) or other optical disc storage (including compact discs, laser discs, optical discs, digital versatile discs, Blu-ray discs, and the like), a magnetic disk storage medium or other magnetic storage device, or any other medium that can be used to carry or store desired program code in the form of instructions or data structures and that can be accessed by a computer, but is not limited to these.
In view of the foregoing description, those skilled in the art will appreciate that the methods of the embodiments herein may be implemented in hardware (for example, logic circuits), in software, or in a combination of hardware and software. Whether these methods are performed in hardware or in software depends on the specific application and the design constraints of the technical solution. A skilled person may use different approaches to implement the described functions for each particular application, but such implementations should not be considered to go beyond the scope of the present application.
When the above functions are implemented in software and sold or used as an independent product, they may be stored in a computer-readable storage medium. In this case, the technical solution of the present application in essence, or the part that contributes to the prior art, or a part of the technical solution, may be embodied as a software product. The computer software product is stored in a storage medium and includes a number of instructions that cause a computer device (which may be a personal computer, a server, a network device, or the like) to perform all or some of the steps of the methods described in the embodiments of the present application. The storage medium includes any medium that can store program code, such as a USB flash drive, a removable hard disk, a read-only memory (ROM), a random access memory (RAM), a magnetic disk, or an optical disc.
The units described above as separate components may or may not be physically separate, and the components shown as units may or may not be physical units; that is, they may be located in one place or distributed across multiple network units. Some or all of the units may be selected according to actual needs to achieve the purpose of the technical solutions of the embodiments of the present application.
In addition, the functional units in the embodiments of the present application may be integrated into one processing unit, each unit may exist physically on its own, or two or more units may be integrated into one unit.
The foregoing describes only specific embodiments of the present application, but the scope of protection of the present application is not limited to them. Any variation or replacement that a person skilled in the art could readily conceive within the technical scope disclosed in the present application shall fall within the scope of protection of the present application. Therefore, the scope of protection of the present application shall be determined by the scope of protection of the claims.

Claims (18)

  1. A voice communication method, characterized in that the method comprises:
    a first application in a terminal device acquires text information input by a user;
    the first application converts the text information into voice data;
    the first application sends the voice data to a voice transmission module of the terminal device;
    the voice transmission module sends the voice data to other terminal devices that are in voice communication with the terminal device.
  2. The method according to claim 1, wherein the first application converting the text information into voice data comprises:
    the first application encodes the text information based on a text telephone (TTY) protocol to obtain the voice data.
  3. The method according to claim 1 or 2, wherein the voice transmission module sending the voice data to other terminal devices that are in voice communication with the terminal device comprises:
    the voice transmission module sends the voice data, over a cellular network, to the other terminal devices that are in voice communication with the terminal device.
  4. The method according to any one of claims 1 to 3, wherein the terminal device further comprises a display module, and the method further comprises:
    the first application sends the acquired text information input by the user to the display module;
    the display module displays the text information on an interface of the first application.
  5. A voice communication method, characterized in that the method comprises:
    a voice transmission module in a terminal device receives voice data sent by another terminal device;
    the voice transmission module sends the voice data to a first application on the terminal device;
    the first application converts the voice data into text information.
  6. The method according to claim 5, wherein the first application converting the voice data into text information comprises:
    the first application decodes the voice data based on a text telephone (TTY) protocol to obtain the text information.
  7. The method according to claim 5 or 6, wherein the voice transmission module receiving voice data sent by another terminal device comprises:
    the voice transmission module receives, over a cellular network, the voice data sent by the other terminal device.
  8. The method according to any one of claims 5 to 7, wherein the terminal device further comprises a display module, and the method further comprises:
    the first application sends the text information to the display module;
    the display module displays the text information.
  9. The method according to any one of claims 5 to 7, wherein the terminal device further comprises a display module, and the method further comprises:
    the first application corrects the text information to obtain corrected text information;
    the first application sends the corrected text information to the display module;
    the display module displays the corrected text.
  10. A voice communication apparatus, characterized by comprising a first application and a voice transmission module, wherein:
    the first application is configured to acquire text information input by a user and to convert the text information into voice data;
    the first application sends the voice data to the voice transmission module;
    the voice transmission module is configured to send the voice data to other terminal devices that are in voice communication with the apparatus.
  11. The apparatus according to claim 10, wherein the first application is configured to convert the text information into the voice data based on a text telephone (TTY) protocol.
  12. The apparatus according to claim 10 or 11, wherein the voice transmission module sends the voice data, over a cellular network, to the other terminal devices that are in voice communication with the apparatus.
  13. The apparatus according to any one of claims 10 to 12, wherein the apparatus further comprises:
    a display module, configured to receive the text information sent by the first application and to display the text information on an interface of the first application.
  14. A voice communication apparatus, characterized by comprising a voice transmission module and a first application, wherein:
    the voice transmission module is configured to receive voice data sent by another terminal device and to send the voice data to the first application;
    the first application is configured to convert the voice data into text information.
  15. The apparatus according to claim 14, wherein the first application decodes the voice data based on a text telephone (TTY) protocol to obtain the text information.
  16. The apparatus according to claim 14 or 15, wherein the voice transmission module receives, over a cellular network, the voice data sent by the other terminal device.
  17. The apparatus according to any one of claims 14 to 16, wherein the apparatus further comprises:
    a display module, configured to receive the text information sent by the first application and to display the text information on an interface of the first application.
  18. The apparatus according to any one of claims 14 to 16, wherein the first application is further configured to correct the text information to obtain corrected text information, and to send the corrected text information to a display module on the apparatus;
    the display module is configured to receive the text information sent by the first application and to display the corrected text information on an interface of the first application.
PCT/CN2018/111424 2017-10-24 2018-10-23 Voice communication method and voice communication apparatus WO2019080833A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201711001961.4 2017-10-24
CN201711001961.4A CN109698877A (en) 2017-10-24 2017-10-24 Voice communication method and voice communication assembly

Publications (1)

Publication Number Publication Date
WO2019080833A1 true WO2019080833A1 (en) 2019-05-02

Family

ID=66227781

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2018/111424 WO2019080833A1 (en) 2017-10-24 2018-10-23 Voice communication method and voice communication apparatus

Country Status (2)

Country Link
CN (1) CN109698877A (en)
WO (1) WO2019080833A1 (en)

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102821196A (en) * 2012-07-25 2012-12-12 江西好帮手电子科技有限公司 Text-speech matching conversation method of mobile terminal as well as mobile terminal thereof
CN105210355A (en) * 2013-05-02 2015-12-30 萨罗尼科斯贸易与服务一人有限公司 Ultrasonically cleaning vessels and pipes
EP2536176B1 (en) * 2011-06-16 2016-09-21 Alcatel Lucent Text-to-speech injection apparatus for telecommunication system

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101123630A (en) * 2006-08-07 2008-02-13 英华达(南京)科技有限公司 Communication method and system for voice and text conversion
CN101500028A (en) * 2008-01-28 2009-08-05 英华达(上海)电子有限公司 Communication terminal using read-write mode and method for implementing read-write mode communication
CN201860365U (en) * 2010-05-26 2011-06-08 康佳集团股份有限公司 Mobile phone device for deaf-mute
CN103428663A (en) * 2012-05-25 2013-12-04 深圳信息职业技术学院 Communication method and system realized based on TTS control center
CN103973877A (en) * 2013-02-06 2014-08-06 北京壹人壹本信息科技有限公司 Method and device for using characters to realize real-time communication in mobile terminal

Also Published As

Publication number Publication date
CN109698877A (en) 2019-04-30

Similar Documents

Publication Publication Date Title
US8571545B2 (en) Short-range wireless relay method and system
US20100263015A1 (en) Wireless Interface for Set Top Box
US8504015B2 (en) Short-range wireless relay method and system
US8774706B2 (en) Short-range wireless mobile terminal method and system
US10827455B1 (en) Method and apparatus for sending a notification to a short-range wireless communication audio output device
JP2009518920A (en) VoIP accessories
TW201724879A (en) Sending a transcript of a voice conversation during telecommunication
US20050049879A1 (en) Communication device capable of interworking between voice communications and text communications
US7162012B2 (en) Apparatus and method for transitioning between TTY and voice transmission modes
JP5966917B2 (en) Relay device
WO2019080833A1 (en) Voice communication method and voice communication apparatus
US10863024B2 (en) System, user equipment, server, computer program product and method for providing access to mobile communication services
KR100544036B1 (en) SMS system of internet visual phone
JP2006510318A (en) Changing the operating mode of the wireless communication device using the voice service option
KR100724928B1 (en) Device and method of informing communication using push to talk scheme in mobile communication terminal
JP6015349B2 (en) Relay device and communication system
KR20200026166A (en) Method and system for providing calling supplementary service
JP6464971B2 (en) Wireless terminal device
CN112751975A (en) Playing method, device, storage medium and system of call hold tone
US20240121580A1 (en) Voice and text communications management related to accessibility enhancement of a calling experience
JP6590276B2 (en) Communication device
US20230247136A1 (en) Automated attendant that specifies audio transmission characteristics for calls
JP6274263B2 (en) Relay device
US11445064B2 (en) Method for establishing a communication with an interactive server
JP2017022680A (en) Communication device

Legal Events

Date Code Title Description
121 EP: The EPO has been informed by WIPO that EP was designated in this application

Ref document number: 18870764

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 EP: PCT application non-entry in European phase

Ref document number: 18870764

Country of ref document: EP

Kind code of ref document: A1