WO2017067319A1 - 信息传输方法和装置、及终端 - Google Patents

信息传输方法和装置、及终端 Download PDF

Info

Publication number
WO2017067319A1
WO2017067319A1 PCT/CN2016/096644 CN2016096644W WO2017067319A1 WO 2017067319 A1 WO2017067319 A1 WO 2017067319A1 CN 2016096644 W CN2016096644 W CN 2016096644W WO 2017067319 A1 WO2017067319 A1 WO 2017067319A1
Authority
WO
WIPO (PCT)
Prior art keywords
information
voice
vibration
conversion
utterance
Prior art date
Application number
PCT/CN2016/096644
Other languages
English (en)
French (fr)
Inventor
么文琦
Original Assignee
中兴通讯股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 中兴通讯股份有限公司 filed Critical 中兴通讯股份有限公司
Publication of WO2017067319A1 publication Critical patent/WO2017067319A1/zh

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/06Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M1/00Substation equipment, e.g. for use by subscribers
    • H04M1/72Mobile telephones; Cordless telephones, i.e. devices for establishing wireless links to base stations without route selection
    • H04M1/724User interfaces specially adapted for cordless or mobile telephones
    • H04M1/72475User interfaces specially adapted for cordless or mobile telephones specially adapted for disabled users
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M1/00Substation equipment, e.g. for use by subscribers
    • H04M1/72Mobile telephones; Cordless telephones, i.e. devices for establishing wireless links to base stations without route selection
    • H04M1/724User interfaces specially adapted for cordless or mobile telephones
    • H04M1/72475User interfaces specially adapted for cordless or mobile telephones specially adapted for disabled users
    • H04M1/72478User interfaces specially adapted for cordless or mobile telephones specially adapted for disabled users for hearing-impaired users

Definitions

  • the present disclosure relates to the field of communication technologies, for example, to an information transmission method and apparatus, and a terminal.
  • Mobile phones have been widely used as a communication tool, and users can conveniently communicate with others in real-time voice or text using a mobile phone.
  • the mobile phone can collect the sound emitted by the user through the microphone, thereby implementing voice communication.
  • the user cannot or is inconvenient to issue a voice scene, the user cannot use the mobile phone for voice communication.
  • voice calls cannot be made using the mobile phone in the related art. Therefore, people hope that there is a mobile phone that can meet the needs of voice transmission without the need for users to make voices.
  • current terminals are difficult to meet such needs.
  • the present disclosure proposes an information transmission method and apparatus, and a terminal, which enable a user to use a terminal for voice communication without issuing a sound.
  • the embodiment of the present disclosure provides an information transmission method, including:
  • the conversion information is text information or voice information
  • the conversion information is transmitted over a communication network.
  • the method further includes:
  • the voice information is transmitted to a microphone output port.
  • it also includes:
  • the acquiring the plurality of pieces of conversion information includes: acquiring a plurality of pieces of text information corresponding to the vibration utterance according to the matching relationship between the stored vibration utterance and the voice signal;
  • the presenting the plurality of conversion information includes: displaying the plurality of text information
  • the acquiring the plurality of pieces of conversion information includes: acquiring a plurality of pieces of voice information corresponding to the vibration utterance according to the matching relationship between the stored vibration utterance and the voice signal;
  • the presenting the plurality of conversion information includes: playing the plurality of voice information.
  • the method further includes:
  • it also includes:
  • the embodiment of the present disclosure further provides an information transmission device, which is disposed on the terminal, and includes:
  • a sound pickup unit configured to obtain a vibration sound of a human throat
  • the conversion unit is configured to acquire, according to the matching relationship between the stored vibration utterance and the voice signal, the conversion information corresponding to the vibration utterance; the conversion information is text information or voice information;
  • a transmission unit configured to transmit the conversion information over a communication network.
  • the conversion unit transmits the voice information to a microphone output port.
  • it also includes:
  • the learning unit is configured to acquire a plurality of conversion information, and present the plurality of conversion information, and adjust a matching relationship between the vibration sounding and the voice signal according to the selection result of the plurality of conversion information by the user.
  • the learning unit includes at least one of the following subunits:
  • the first learning subunit is configured to acquire a plurality of text information corresponding to the vibration utterance according to the matching relationship between the stored vibration utterance and the voice signal; and display the plurality of text information;
  • the second learning subunit is configured to acquire a plurality of voice information corresponding to the vibration utterance according to the matching relationship between the stored vibration utterance and the voice signal; and play the plurality of voice information.
  • it also includes:
  • the noise filtering unit is configured to perform noise filtering on the acquired vibration sound.
  • the embodiment of the present disclosure further proposes a terminal, where the terminal includes any one of the above information transmission devices.
  • Embodiments of the present disclosure also provide a non-transitory computer readable storage medium storing computer executable instructions for performing any of the above information transmission methods.
  • An embodiment of the present disclosure further provides an electronic device, including:
  • At least one processor and,
  • the memory stores instructions executable by the one processor, the instructions being executed by the at least one processor to enable the at least one processor to implement any of the information transfer methods described above.
  • the technical solution provided by the present disclosure directly converts the vibration sound emitted by the human throat into conversion information, does not require the actual sound to be emitted in the middle, converts the information into voice information, and transmits the conversion information to the communication network through the communication network.
  • the other party after the voice is restored, can hear the voice of the call originator. In this way, voice calls can be made "quiet". The whole process does not need to be pronounced, and it is not easy to be discovered by others.
  • special people such as aphasia patients can use the vibration of the throat to make voice calls, thus providing a voice call for aphasia patients.
  • the communication terminal provides a solution for a call in a scene that is inconvenient to make a sound but wants to make a voice call.
  • Figure 1 is a schematic view of the human body sounding
  • FIG. 2 is a flowchart of an information transmission method according to an embodiment of the present disclosure
  • FIG. 3 is a schematic structural diagram of an information transmission apparatus according to an embodiment of the present disclosure.
  • FIG. 4 is a schematic structural diagram of hardware of an electronic device according to an embodiment of the present disclosure.
  • the present disclosure provides an information transmission method and apparatus, and a terminal
  • the information transmission method and apparatus provided by the embodiments of the present disclosure and the principles on which the terminal is based are first described.
  • Human pronunciation can be divided into four steps: tone production, vibration, resonance and reintegration.
  • the sound is produced by the movement of the lung exhalation airflow;
  • the vibration is the basic sound of the throat vocal cord vibration;
  • the resonance is the enlarged voice of the pharynx, the mouth and the nasal cavity above the throat;
  • the expanded pronunciation is the basic of the expansion of the tongue, teeth, lips and sputum. Sound, and become a recognizable sound.
  • FIG. 1 is a schematic diagram of the human body sounding.
  • the throat sound band 1 generates a basic sound by vibration, and the basic sound is enlarged by the transformation of the tongue, the teeth, the lips and the ankle in the oral cavity 2, and is an identifiable sound.
  • Deaf-mute people can't make ordinary people's recognizable sounds, but usually the deaf-mute throat sounds can still vibrate, so you can use the vibration of the throat sound to make voice communication.
  • a communication tool such as a mobile phone
  • the mobile phone picks up the user's voice through the microphone
  • the input signal of the microphone is the user's voice signal
  • the output of the microphone is the corresponding voice sampling signal, wherein the voice sampling signal is an analog signal.
  • the voice sampled signal is converted into a digital signal by analog-to-digital conversion, and then transmitted from the communication network by means of modulation and carrier.
  • the voice sampling signal is restored to a sound signal in the mobile phone at the other end of the communication network, thereby realizing a long-distance voice call.
  • the embodiment of the present disclosure proposes an information transmission method, which is applicable to a terminal.
  • the method includes steps 100, 300 and 500.
  • step 100 obtaining a vibration sound of the human throat
  • step 300 according to the matching relationship between the stored vibration utterance and the voice signal, the conversion information corresponding to the vibration utterance is acquired; the conversion information is text information or voice information;
  • the conversion information refers to information represented by a human language, for example, the conversion information is voice information or text information.
  • the conversion information is voice information or text information.
  • voice information or text information By transforming information, a person with ordinary communication skills can understand the information or opinions that the presenter wants to express. For example, the sound of a vibrating sound is “ ⁇ ”, it is difficult for ordinary people to understand the information to be expressed by the vibrating sound, and the corresponding voice information or text information obtained by the conversion is “you Ok, ordinary people can understand the information or opinions that the converted voice information or text information should express.
  • step 500 the conversion information is transmitted over a communication network.
  • the method may further include step 400.
  • step 400 if the conversion information is voice information, the voice information is transmitted to a microphone output port.
  • the format of the conversion information corresponds to the voice sampling signal of the microphone output port, so that the function modules existing in the terminal can be fully utilized to avoid too much modification of the hardware part.
  • the information transmission method further includes:
  • the obtaining the multiple conversion information includes:
  • the presenting the plurality of conversion information includes: displaying the plurality of text information.
  • the acquiring the plurality of conversion information includes: acquiring a plurality of voice information corresponding to the vibration utterance according to the matching relationship between the stored vibration utterance and the voice signal; and the presenting the plurality of conversion information includes: playing the plurality of voices information.
  • a plurality of conversion results can also be selected by voice during machine learning.
  • the above adjustment process can be a machine learning process, and machine learning is performed for each vibration recognition sound of the mobile phone, and the correctness of the vibration sound signal conversion can be continuously improved through the user's later adjustment.
  • the user can establish a friendly interaction with the machine and continuously train the voice transmission device to correctly identify the vibration occurrence, which can provide a reliable guarantee for sound recognition in the future.
  • the method may further include include:
  • step 200 noise filtering processing is performed on the acquired vibration utterance.
  • the signal outside this range and amplitude can be used as noise to filter it. After the filtered signal is identified, it is converted into a corresponding converted signal.
  • the information transmission method may further include:
  • Receiving voice information transmitted through the communication network Receiving voice information transmitted through the communication network, converting the voice information into text information, and presenting the text information.
  • the user opens the terminal, places the terminal in the throat, sounds through the breathing airflow, and discriminates the vibration sound recognized by the terminal during the learning process to train the correct rate of terminal recognition.
  • the user uses the terminal to make a call.
  • the device is placed in the throat, the sound is emitted through the breathing airflow, the terminal collects the vibration, and the necessary conversion is performed, and is transmitted through the voice channel of the terminal.
  • the above method can be applied only to the terminal that initiates the voice call, and there is no special requirement for the terminal that receives the voice call, the transmitting end of the voice transmission, and the use environment of the operator.
  • the vibration of the human body is collected by the terminal, and the vibration generated by the human throat is directly converted into conversion information, and the actual sound is not required to be sent in the middle, and the converted information is voice information through the communication network.
  • the conversion information is sent to the other party, and the voice of the call originator can be heard by the voice recovery. In this way, a "quiet" voice call can be made. The whole process does not need to be pronounced, and it is not easy to be discovered by others.
  • it provides a communication terminal for voice a voice call to aphasia patients, and on the other hand, a scene that is inconvenient to make a voice but wants to make a voice call. The next call provides a solution.
  • an information transmission apparatus which is disposed on a terminal.
  • an information transmission apparatus includes a sound pickup unit 10, and converts Unit 30 and transmission unit 40, wherein:
  • the sound pickup unit 10 is configured to acquire the vibration sound of the human throat
  • the converting unit 30 is configured to obtain a matching relationship between the stored sound and the voice signal according to the stored vibration Taking the conversion information corresponding to the vibration utterance; the conversion information is text information or voice information;
  • the transmission unit 40 is arranged to transmit the conversion information over a communication network.
  • the conversion unit 30 transmits the voice information to a microphone output port.
  • the information transmission device may further include:
  • the learning unit 50 is configured to acquire a plurality of conversion information, and present the plurality of conversion information, and adjust a matching relationship between the vibration utterance and the voice signal according to the selection result of the plurality of conversion information by the user.
  • the learning unit 50 adjusts the matching relationship, the conversion process of the conversion unit 30 will be changed.
  • the learning unit 50 includes at least one of the following subunits:
  • the first learning subunit is configured to acquire a plurality of text information corresponding to the vibration utterance according to the matching relationship between the stored vibration utterance and the voice signal; and display the plurality of text information;
  • the second learning subunit is configured to acquire a plurality of voice information corresponding to the vibration utterance according to the matching relationship between the stored vibration utterance and the voice signal; and play the plurality of voice information.
  • the information transmission device may further include:
  • the noise filtering unit 20 is configured to perform noise filtering processing on the acquired vibration sound.
  • the information transmission apparatus may further include a receiving unit 60, configured to:
  • the embodiments of the present disclosure further provide a terminal, where the terminal includes any information transmission apparatus provided by an embodiment of the present disclosure.
  • the embodiment of the present disclosure further provides an electronic device.
  • the electronic device includes:
  • One or more processors 1000, one processor 1000 is taken as an example in FIG. 4;
  • the electronic device may further include: an input device 3000 and an output device 4000.
  • the processor 1000, the memory 2000, the input device 3000, and the output device in the electronic device 4000 can be connected by bus or other means, and the connection by bus is taken as an example in FIG.
  • the memory 2000 is a non-transitory computer readable storage medium that can be used to store software programs, computer executable programs, and modules.
  • the processor 1000 executes various functional applications and data processing by executing software programs, instructions, and units stored in the memory 2000, that is, implementing the information transmission method of the above method embodiments.
  • the memory 2000 may include a storage program area and an storage data area, wherein the storage program area may store an operating system, an application required for at least one function; the storage data area may store data created according to usage of the terminal, and the like. Further, the memory 2000 may include a high speed random access memory, and may also include a nonvolatile memory such as at least one magnetic disk storage device, flash memory device, or other nonvolatile memory device. In some embodiments, memory 2000 can optionally include memory remotely located relative to processor 1000, which can be connected to the electronic device over a network. Examples of such networks include, but are not limited to, the Internet, intranets, local area networks, mobile communication networks, and combinations thereof.
  • the input device 3000 of the present embodiment may include a microphone for acquiring a vibration sound of the human throat, and may further include receiving input digital or character information, and other input devices for generating a key signal input related to user setting and function control of the terminal, Such as buttons or touch screens.
  • the output device 4000 may include a display device such as a display screen, and an audio playback device such as a speaker.
  • the electronic device of the present embodiment may further include a communication device 5000 that transmits and/or receives information over a communication network.
  • the embodiment further provides a non-transitory computer readable storage medium storing computer executable instructions for executing any of the above information transmission methods .
  • the information transmission method and device and the terminal of the embodiment of the present disclosure directly convert the vibration sound emitted by the human throat into the conversion information, and do not need to send the actual sound in the middle, and transmit the conversion information to the other party through the communication network, and the other party undergoes voice restoration.
  • the voice of the call originator can be heard, and the aphasia patient can provide a communication terminal that can make a voice call, and also provides a solution for the call in a scene that is inconvenient to make a voice but wants to make a voice call. .

Landscapes

  • Engineering & Computer Science (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Artificial Intelligence (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Multimedia (AREA)
  • Telephonic Communication Services (AREA)
  • Telephone Function (AREA)

Abstract

一种信息传输方法和装置、及终端;该方法包括:获取人体喉部的震动发声;根据存储的震动发声与语音信号之间的匹配关系,获取震动发声对应的转换信息;所述转换信息为文字信息或语音信息;通过通信网络传输所述转换信息。

Description

信息传输方法和装置、及终端 技术领域
本公开涉及通信技术领域,例如涉及信息传输方法和装置、及终端。
背景技术
手机作为一种通信工具已经得到了广泛的使用,用户使用手机可以方便地和其他人进行实时语音或文字通信。相关技术中,手机可通过麦克风收集用户发出的声音,从而实现语音通信。然而对于用户无法或者不方便发出语音场景,则用户无法用手机进行语音通信,例如,聋哑人无法发出可辨识的语音,则不能使用相关技术中的手机进行语音通话,普通用户在不方便发出语音的情况下,也不能使用相关技术中的手机进行语音通话。因此人们希望有一款手机能够满足无需用户发出语音,也可以进行语音通信的需求,然而,目前的终端难以满足这种需求。
发明内容
为了解决上述问题,本公开提出了一种信息传输方法和装置、及终端,能够实现用户无需发出声音即可使用终端进行语音通信的需求。
本公开实施例提出了一种信息传输方法,包括:
获取人体喉部的震动发声;
根据存储的震动发声与语音信号之间的匹配关系,获取震动发声对应的转换信息;所述转换信息为文字信息或语音信息;以及
通过通信网络传输所述转换信息。
可选地,在所述获取震动发声对应的转换信息之后,还包括:
在转换信息为语音信息的情况下,将所述语音信息发送至麦克风输出端口。
可选地,还包括:
获取多个转换信息,并呈现所述多个转换信息,根据对多个转换信息的选择结果,对震动发声与语音信号之间的匹配关系进行调整。
可选地,所述获取多个转换信息包括:根据存储的震动发声与语音信号之间的匹配关系,获取震动发声对应的多个文字信息;
所述呈现多个转换信息包括:显示所述多个文字信息;
可选地,所述获取多个转换信息包括:根据存储的震动发声与语音信号之间的匹配关系,获取震动发声对应的多个语音信息;
所述呈现多个转换信息包括:播放所述多个语音信息。
可选地,在所述获取人体喉部的震动发声之后,在所述根据存储的震动发声与语音信号之间的匹配关系,获取震动发声对应的转换信息之前,还包括:
对获取的震动发声进行噪声滤除处理。
可选地,还包括:
接收通过通信网络传输的语音信息;
将所述语音信息转换为文字信息;以及
呈现所述文字信息。
本公开实施例还提出了一种信息传输装置,设置在终端上,包括:
拾音单元,设置为获取人体喉部的震动发声;
转换单元,设置为根据存储的震动发声与语音信号之间的匹配关系,获取震动发声对应的转换信息;所述转换信息为文字信息或语音信息;以及
传输单元,设置为通过通信网络传输所述转换信息。
可选地,在转换信息为语音信息的情况下,所述转换单元将所述语音信息发送至麦克风输出端口。
可选地,还包括:
学习单元,设置为获取多个转换信息,并呈现所述多个转换信息,根据用户对多个转换信息的选择结果,对震动发声与语音信号之间的匹配关系进行调整。
可选地,所述学习单元包括如下子单元的至少一个:
第一学习子单元,设置为根据存储的震动发声与语音信号之间的匹配关系,获取震动发声对应的多个文字信息;并显示所述多个文字信息;
第二学习子单元,设置为根据存储的震动发声与语音信号之间的匹配关系,获取震动发声对应的多个语音信息;并播放所述多个语音信息。
可选地,还包括:
滤噪单元,设置为对获取的震动发声进行噪声滤除处理。
可选地,还包括接收单元,设置为
接收通过通信网络传输的语音信息;
将所述语音信息转换为文字信息;以及
呈现所述文字信息。
本公开实施例还提出了一种终端,所述终端包括上述任一种信息传输装置。
本公开实施例还提供了一种非瞬时性计算机可读存储介质,存储有计算机可执行指令,所述计算机可执行指令用于执行上述任一种信息传输方法。
本公开实施例还提供了一种电子设备,包括:
至少一个处理器;以及,
与所述至少一个处理器通信连接的存储器;其中,
所述存储器存储有可被所述一个处理器执行的指令,所述指令被所述至少一个处理器执行,以使所述至少一个处理器能够实现上述任一种信息传输方法。
与相关技术相比,本公开提供的技术方案,通过将人喉部发出的震动发声直接转换为转换信息,中间不需要实际声音的发出,转换信息为语音信息,通过通信网络将转换信息发送给对方,对方经过语音还原便可以听到通话发起方的声音,通过这种方式,可以做到“安静的”进行语音电话。整个过程不需发音,也不容易被旁人发觉,一方面,失语症患者之类的特殊人群,可以利用喉部的震动发声进行语音通话,从而给失语症患者提供了一种可以进行语音通话的通信终端,另一方面,为不方便发出声音的但又希望进行语音通话的场景下的通话提供了一种解决方法。
附图说明
下面对本公开实施例中的附图进行说明,实施例中的附图是用于对本公开实施例的理解,与说明书一起用于解释本公开实施例,并不构成对本公开实施例保护范围的限制。
图1为人体发声的示意图;
图2为本公开实施例提供的信息传输方法的流程图;
图3为本公开实施例提供的信息传输装置的结构组成示意图;
图4为本公开实施例提供的电子设备的硬件结构示意图。
实施方式
为了便于本领域技术人员的理解,下面结合附图对本公开作相关的描述, 并不能用来限制本公开实施例的保护范围。需要说明的是,在不冲突的情况下,本公开实施例及实施例中的各种方式可以相互组合。
在介绍本公开实施例提出了一种信息传输方法和装置、及终端之前,首先对本公开实施例提供的信息传输方法和装置、及终端所基于的原理进行相关说明。
人类发音可分为四个步骤:产音,振动,共鸣和改扩发音。产音是由于肺呼气气流移动而产生;振动是喉声带振动而产生基本音;共鸣是喉以上的咽、口腔、鼻腔扩大声音;改扩发音是舌、齿、唇和腭改造扩大的基本音,而成为可辨识的声音。请参阅图1,为人体发声的示意图,如图1所示,喉声带1通过振动产生基本音,基本音经过口腔2中的舌、齿、唇和腭的改造扩大,为可辨识的声音。
聋哑人不能发出普通人可辨识的声音,但是通常聋哑人的喉声带依然可以震动发声,因此,可以利用喉声带的震动发声来进行语音通信。
相关技术中,以手机之类的通信工具为例,手机通过麦克风拾取用户的声音,麦克风的输入信号为用户的声音信号,麦克风的输出为对应的语音采样信号,其中,语音采样信号为模拟信号,语音采样信号经过模数转换转换为数字信号,再通过调制和载波的方式从通信网络进行发送。
其中,语音采样信号在通信网络中的另一端的手机中将还原为声音信号,从而实现远距离的语音通话。
本公开实施例提出了一种信息传输方法,该方法可应用于终端。参见图2,所述方法包括步骤100,300和500。
在步骤100中,获取人体喉部的震动发声;
在步骤300中,根据存储的震动发声与语音信号之间的匹配关系,获取震动发声对应的转换信息;所述转换信息为文字信息或语音信息;
其中,所述转换信息指人类的语言所代表信息,例如,转换信息是语音信息或者文字信息。通过转换信息,具有普通交流能力的人能够理解表达者所要表达的信息或观点。例如,一条震动发声的发音为“诶哦啊”,普通人难以理解该震动发声所要表达的信息,经过转换得到的对应的语音信息或文字信息为“你 好啊”,普通人能够理解转换后的语音信息或文字信息所要表达的信息或观点。
在步骤500中,通过通信网络传输所述转换信息。
本公开实施例中,在步骤300之后,该方法还可以包括步骤400。
在步骤400中,在转换信息为语音信息的情况下,将所述语音信息传输至麦克风输出端口。转换信息的格式对应麦克风输出端口的语音采样信号,这样,可以充分利用终端中已有的功能模块,避免对硬件部分的改动太大。
本公开实施例中,所述信息传输方法还包括:
获取多个转换信息,并呈现所述多个转换信息,根据用户对多个转换信息的选择结果,对震动发声与语音信号之间的匹配关系进行调整。
其中,所述获取多个转换信息包括:
根据存储的震动发声与语音信号之间的匹配关系,获取震动发声对应的多个文字信息;
所述呈现多个转换信息包括:显示所述多个文字信息。
或者,所述获取多个转换信息包括:根据存储的震动发声与语音信号之间的匹配关系,获取震动发声对应的多个语音信息;所述呈现多个转换信息包括:播放所述多个语音信息。
由于聋哑人只能通过文字查看多个转换信息,即多个识别结果,因此需要将震动发声转换为对应的文字信息。在转换过程中,可以根据存储的震动发声与语音信号之间的匹配关系,获取震动发声对应的多个语音信息,然后,根据语音信息与文字信息之间的对应关系,分别获取多个语音信息对应的文字信息,从而获取震动发声对应的多个文字信息。
对于普通人,如果希望使用本公开提供的信息传输装置,在机器学习的过程中,也可以通过语音来对多个转换结果进行选择。
上述调整过程可以上是机器学习的过程,针对每次手机识别的震动发声做机器学习,并通过用户后期的调整,能过不断提高震动发声信号转换的正确性。在使用初期,用户可以与机器建立一个友好的互动,不断训练语音传输设备对震动发生的识别的正确率,可以为日后更可好高效的声音识别提供可靠保证。
本公开实施例中,在步骤100之后,和步骤300之前,所述方法还可以包 括:
在步骤200中,对获取的震动发声进行噪声滤除处理。
因为人体喉部发音时震动频率及幅度是有一定范围的,故可以将在这个范围及幅度外的信号作为噪音,将其过滤。这样过滤后的信号通过识别之后,转换为对应的转换信号。
所述信息传输方法还可以包括:
接收通过通信网络传输的语音信息,将所述语音信息转换为文字信息,呈现所述文字信息。
下面结合实施场景进行示例性说明。
用户打开终端,将终端置于喉部,通过呼吸气流发声,并在学习过程对终端识别的震动声音做甄别,以训练终端识别的正确率。
用户使用该终端拨打电话,在通话时,将设备置于喉部,通过呼吸气流发声,终端收集震动,并做必要转换,通过终端的语音信道发送出去。对接受方而言,使用普通的电话,手机,就可以听见用户的语音。
需要说明的是,上述方法可只应用于发起语音通话的终端中,对接收语音通话的终端,传送语音的传送端,以及运营商的使用环境等均无特别需求。
本公开实施例中,通过终端对人体喉部的震动发声进行收集,通过将人喉部发出的震动发声直接转换为转换信息,中间不需要实际声音的发出,转换信息为语音信息,通过通信网络将转换信息发送给对方,对方经过语音还原便可以听到通话发起方的声音,通过这种方式,可以做到“安静的”进行语音电话。整个过程不需发音,也不容易被旁人发觉,一方面,给失语症患者提供了一种可以进行语音通话的通信终端,另一方面,为不方便发出声音的但又希望进行语音通话的场景下的通话提供了一种解决方法。
基于与上述实施例相同或相似的构思,本公开实施例还提供一种信息传输装置,设置在终端上,参见图3,本公开实施例提出的一种信息传输装置包括拾音单元10,转换单元30和传输单元40,其中:
拾音单元10,设置为获取人体喉部的震动发声;
转换单元30,设置为根据存储的震动发声与语音信号之间的匹配关系,获 取震动发声对应的转换信息;所述转换信息为文字信息或语音信息;以及
传输单元40,设置为通过通信网络传输所述转换信息。
本公开实施例中,在转换信息为语音信息的情况下,所述转换单元30将所述语音信息发送至麦克风输出端口。
本公开实施例中,所述信息传输装置还可以包括:
学习单元50,设置为获取多个转换信息,并呈现所述多个转换信息,根据用户对多个转换信息的选择结果,对震动发声与语音信号之间的匹配关系进行调整。
学习单元50对匹配关系进行调整之后,将改变转换单元30的转换过程。
本公开实施例中,所述学习单元50包括如下子单元的至少一个:
第一学习子单元,设置为根据存储的震动发声与语音信号之间的匹配关系,获取震动发声对应的多个文字信息;并显示所述多个文字信息;
第二学习子单元,设置为根据存储的震动发声与语音信号之间的匹配关系,获取震动发声对应的多个语音信息;并播放所述多个语音信息。
本公开实施例中,所述信息传输装置还可以包括:
滤噪单元20,设置为对获取的震动发声进行噪声滤除处理。
本公开实施例中,所述信息传输装置还可以包括接收单元60,设置为:
接收通过通信网络传输的语音信息;
将所述语音信息转换为文字信息;以及
呈现所述文字信息。
基于与上述实施例相同或相似的构思,本公开实施例还提供一种终端,所述终端包括本公开实施例提供的任一信息传输装置。
基于与上述实施例相同或相似的构思,本公开实施例还提供了一种电子设备,参见图4,该电子设备包括:
一个或多个处理器1000,图4中以一个处理器1000为例;
存储器2000。
所述电子设备还可以包括:输入装置3000和输出装置4000。
所述电子设备中的处理器1000、存储器2000、输入装置3000和输出装置 4000可以通过总线或者其他方式连接,图4中以通过总线连接为例。
存储器2000作为一种非瞬时性计算机可读存储介质,可用于存储软件程序、计算机可执行程序以及模块。处理器1000通过运行存储在存储器2000中的软件程序、指令以及单元,从而执行各种功能应用以及数据处理,即实现上述方法实施例的信息传输方法。
存储器2000可以包括存储程序区和存储数据区,其中,存储程序区可存储操作系统、至少一个功能所需要的应用程序;存储数据区可存储根据终端的使用所创建的数据等。此外,存储器2000可以包括高速随机存取存储器,还可以包括非易失性存储器,例如至少一个磁盘存储器件、闪存器件、或其他非易失性存储器件。在一些实施例中,存储器2000可选包括相对于处理器1000远程设置的存储器,这些远程存储器可以通过网络连接至电子设备。上述网络的实例包括但不限于互联网、企业内部网、局域网、移动通信网及其组合。
本实施例的输入装置3000可包括麦克风,获取人体喉部的震动发声,还可以包括接收输入的数字或字符信息,以及产生与终端的用户设置以及功能控制有关的键信号输入的其他输入装置,比如按键或触摸屏。输出装置4000可包括显示屏等显示设备,以及扬声器等音频播放设备。
本实施例的电子设备还可以包括通信装置5000,通过通信网络传输和/或接收信息。
基于与上述实施例相同或相似的构思,本实施例还提供了一种非瞬时性计算机可读存储介质,存储有计算机可执行指令,该计算机可执行指令用于执行上述任意一种信息传输方法。
需要说明的是,本领域普通技术人员可理解实现上述实施例方法中的全部或部分流程,是可以通过计算机程序来执行相关的硬件来完成的,该程序可存储于一个非瞬时性计算机可读存储介质中,该程序在执行时,可包括如上述方法的实施例的流程,其中,该计算机可读存储介质可以为磁碟、光盘、只读存储记忆体(ROM)或随机存储记忆体(RAM)等。
工业实用性
本公开实施例的信息传输方法和装置以及终端,通过将人喉部发出的震动发声直接转换为转换信息,中间不需要实际声音的发出,通过通信网络将转换信息发送给对方,对方经过语音还原便可以听到通话发起方的声音,给失语症患者提供了一种可以进行语音通话的通信终端,也为不方便发出声音的但又希望进行语音通话的场景下的通话提供了一种解决方法。

Claims (15)

  1. 一种信息传输方法,包括:
    获取人体喉部的震动发声;
    根据存储的震动发声与语音信号之间的匹配关系,获取震动发声对应的转换信息;所述转换信息为文字信息或语音信息;以及
    通过通信网络传输所述转换信息。
  2. 根据权利要求1所述的方法,在所述获取震动发声对应的转换信息之后,还包括:
    在转换信息为语音信息的情况下,将所述语音信息发送至麦克风输出端口。
  3. 根据权利要求1所述的方法,还包括:
    获取多个转换信息,并呈现所述多个转换信息,根据对多个转换信息的选择结果,对震动发声与语音信号之间的匹配关系进行调整。
  4. 根据权利要求3所述的方法,其中,
    所述获取多个转换信息包括:根据存储的震动发声与语音信号之间的匹配关系,获取震动发声对应的多个文字信息;
    所述呈现多个转换信息包括:显示所述多个文字信息。;
  5. 根据权利要求3所述的方法,其中,
    所述获取多个转换信息包括:根据存储的震动发声与语音信号之间的匹配关系,获取震动发声对应的多个语音信息;
    所述呈现多个转换信息包括:播放所述多个语音信息。
  6. 根据权利要求3所述的方法,在所述获取人体喉部的震动发声之后,在所述根据存储的震动发声与语音信号之间的匹配关系,获取震动发声对应的转换信息之前,还包括:
    对获取的震动发声进行噪声滤除处理。
  7. 根据权利要求1-6任一项所述的方法,还包括:
    接收通过通信网络传输的语音信息;
    将所述语音信息转换为文字信息;以及
    呈现所述文字信息。
  8. 一种信息传输装置,设置在终端上,包括:
    拾音单元,设置为获取人体喉部的震动发声;
    转换单元,设置为根据存储的震动发声与语音信号之间的匹配关系,获取震动发声对应的转换信息;所述转换信息为文字信息或语音信息;以及
    传输单元,设置为通过通信网络传输所述转换信息。
  9. 根据权利要求8所述的信息传输装置,其中,在转换信息为语音信息的情况下,所述转换单元将所述语音信息发送至麦克风输出端口。
  10. 根据权利要求8所述的信息传输装置,还包括:
    学习单元,设置为获取多个转换信息,并呈现所述多个转换信息,根据对多个转换信息的选择结果,对震动发声与语音信号之间的匹配关系进行调整。
  11. 根据权利要求10所述的信息传输装置,其中,所述学习单元包括如下子单元的至少一个:
    第一学习子单元,设置为根据存储的震动发声与语音信号之间的匹配关系,获取震动发声对应的多个文字信息;并显示所述多个文字信息;以及
    第二学习子单元,设置为根据存储的震动发声与语音信号之间的匹配关系,获取震动发声对应的多个语音信息;并播放所述多个语音信息。
  12. 根据权利要求10所述的信息传输装置,还包括:
    滤噪单元,设置为对获取的震动发声进行噪声滤除处理。
  13. 根据权利要求8所述的信息传输装置,还包括接收单元,设置为:
    接收通过通信网络传输的语音信息;
    将所述语音信息转换为文字信息;以及
    呈现所述文字信息。
  14. 一种终端,包括权利要求8-12中任一项所述的信息传输装置。
  15. 一种非瞬时性计算机可读存储介质,存储有计算机可执行指令,所述计算机可执行指令用于执行权利要求1-7任一项所述的信息传输方法。
PCT/CN2016/096644 2015-10-21 2016-08-25 信息传输方法和装置、及终端 WO2017067319A1 (zh)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201510695959.6 2015-10-21
CN201510695959.6A CN106612364A (zh) 2015-10-21 2015-10-21 一种信息传输方法和装置、及终端

Publications (1)

Publication Number Publication Date
WO2017067319A1 true WO2017067319A1 (zh) 2017-04-27

Family

ID=58556653

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2016/096644 WO2017067319A1 (zh) 2015-10-21 2016-08-25 信息传输方法和装置、及终端

Country Status (2)

Country Link
CN (1) CN106612364A (zh)
WO (1) WO2017067319A1 (zh)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109598991B (zh) * 2019-01-11 2021-06-15 张翩 一种英语发音教学系统、装置及方法
CN110956949B (zh) * 2019-10-24 2022-10-04 中国人民解放军军事科学院国防科技创新研究院 一种口含式缄默通信方法与系统
CN115910027B (zh) * 2023-03-08 2023-05-09 深圳市九天睿芯科技有限公司 一种辅助发声方法及装置

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101950249A (zh) * 2010-07-14 2011-01-19 北京理工大学 默声音符编码字符输入方法和装置
CN202796043U (zh) * 2012-09-07 2013-03-13 四川长虹电器股份有限公司 一种语音识别系统
CN104965597A (zh) * 2015-07-28 2015-10-07 芜湖科创生产力促进中心有限责任公司 感应式喉头送话器装置及其使用方法
CN204798078U (zh) * 2015-07-28 2015-11-25 安徽机电职业技术学院 基于三维压力检测的喉头送话器装置
CN105105898A (zh) * 2015-07-28 2015-12-02 安徽机电职业技术学院 基于三维压力检测的喉头送话器装置及其使用方法
CN105147429A (zh) * 2015-07-28 2015-12-16 安徽工程大学 喉头送话器装置及其使用方法
CN204951248U (zh) * 2015-07-28 2016-01-13 安徽工程大学 喉头送话器装置

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101950249A (zh) * 2010-07-14 2011-01-19 北京理工大学 默声音符编码字符输入方法和装置
CN202796043U (zh) * 2012-09-07 2013-03-13 四川长虹电器股份有限公司 一种语音识别系统
CN104965597A (zh) * 2015-07-28 2015-10-07 芜湖科创生产力促进中心有限责任公司 感应式喉头送话器装置及其使用方法
CN204798078U (zh) * 2015-07-28 2015-11-25 安徽机电职业技术学院 基于三维压力检测的喉头送话器装置
CN105105898A (zh) * 2015-07-28 2015-12-02 安徽机电职业技术学院 基于三维压力检测的喉头送话器装置及其使用方法
CN105147429A (zh) * 2015-07-28 2015-12-16 安徽工程大学 喉头送话器装置及其使用方法
CN204951248U (zh) * 2015-07-28 2016-01-13 安徽工程大学 喉头送话器装置

Also Published As

Publication number Publication date
CN106612364A (zh) 2017-05-03

Similar Documents

Publication Publication Date Title
Chern et al. A smartphone-based multi-functional hearing assistive system to facilitate speech recognition in the classroom
WO2017067319A1 (zh) 信息传输方法和装置、及终端
EP3982358A2 (en) Whisper conversion for private conversations
JP2009178783A (ja) コミュニケーションロボット及びその制御方法
US11589173B2 (en) Hearing aid comprising a record and replay function
WO2020142679A1 (en) Audio signal processing for automatic transcription using ear-wearable device
US9295423B2 (en) System and method for audio kymographic diagnostics
US10848855B2 (en) Method, electronic device and recording medium for compensating in-ear audio signal
WO2017140153A1 (zh) 语音控制方法及装置
JP7284570B2 (ja) 音声再生システムおよびプログラム
WO2020079918A1 (ja) 情報処理装置及び情報処理方法
TWI548278B (zh) 音視訊同步控制設備及方法
JP2019110447A (ja) 電子機器、電子機器の制御方法、及び、電子機器の制御プログラム
JP6150707B2 (ja) 音声データ合成端末、音声データ記録端末、音声データ合成方法、音声出力方法、及びプログラム
WO2018088210A1 (ja) 情報処理装置および方法、並びにプログラム
US10299050B2 (en) Mobile audio receiver
JP2020124444A (ja) 発声補助装置および発声補助システム
JP3227725U (ja) 文字表示機能付き補聴システム
US20230047187A1 (en) Extraneous voice removal from audio in a communication session
JP2023023032A (ja) 手話情報伝送装置、手話情報出力装置、手話情報伝送システム及びプログラム
US10580431B2 (en) Auditory interpretation device with display
JP2014216787A (ja) 会議端末装置及び増幅率登録方法
WO2023171124A1 (ja) 情報処理装置、情報処理方法、情報処理プログラム及び情報処理システム
TWI719699B (zh) 人工智慧輔助說好話的方法
TWM560746U (zh) 可優化外部的語音信號裝置

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 16856753

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 16856753

Country of ref document: EP

Kind code of ref document: A1