CN109587042B - Voice conversion communication terminal - Google Patents

Voice conversion communication terminal Download PDF

Info

Publication number
CN109587042B
CN109587042B CN201811620039.8A CN201811620039A CN109587042B CN 109587042 B CN109587042 B CN 109587042B CN 201811620039 A CN201811620039 A CN 201811620039A CN 109587042 B CN109587042 B CN 109587042B
Authority
CN
China
Prior art keywords
voice
data
message
combined
communication terminal
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201811620039.8A
Other languages
Chinese (zh)
Other versions
CN109587042A (en
Inventor
王碧芳
李雪
张帆
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shanghai Zhize Communication Services Co.,Ltd.
Original Assignee
Wuhan Polytechnic University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Wuhan Polytechnic University filed Critical Wuhan Polytechnic University
Priority to CN201811620039.8A priority Critical patent/CN109587042B/en
Publication of CN109587042A publication Critical patent/CN109587042A/en
Application granted granted Critical
Publication of CN109587042B publication Critical patent/CN109587042B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L51/00User-to-user messaging in packet-switching networks, transmitted according to store-and-forward or real-time protocols, e.g. e-mail
    • H04L51/06Message adaptation to terminal or network requirements
    • H04L51/066Format adaptation, e.g. format conversion or compression
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L51/00User-to-user messaging in packet-switching networks, transmitted according to store-and-forward or real-time protocols, e.g. e-mail
    • H04L51/07User-to-user messaging in packet-switching networks, transmitted according to store-and-forward or real-time protocols, e.g. e-mail characterised by the inclusion of specific contents
    • H04L51/10Multimedia information

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Multimedia (AREA)
  • Telephonic Communication Services (AREA)

Abstract

The invention discloses a voice conversion communication terminal, which comprises: a voice encoder for generating voice data and message data of data from an original voice, combining the voice data and the message to generate combined data, and transmitting the combined data to a data transceiving channel; a voice decoder for separating voice data and message data from the combined data after receiving the combined message data from the data transceiving channel, and reconstructing the separated voice data and message data to obtain redesigned combined data; a data transceiving channel receiving the combined message from the speech encoder and passing the combined message to the speech decoder. The invention realizes the voice type conversion when the user sends voice through the voice conversion communication terminal, and meets the user-defined requirement on the type of the voice information to be sent.

Description

Voice conversion communication terminal
Technical Field
The invention relates to the field of communication, in particular to a voice conversion communication terminal.
Background
At present, the speech recognition technology is developed very rapidly, and has been applied to various technical fields, such as personal computers or mobile phone terminals for identification, the speech recognition technology is mostly applied to single electronic terminals, and with the continuous maturity of internet technology, the speech recognition technology has a very broad prospect when being applied to Web pages in order to further facilitate users to access the internet. While voice recognition is adopted as a protection means, the user's customization demand for sending voice information is continuously increased, and a voice file that the user wants to send can be reflected by a plurality of different voice types at a receiving end, so that a new scheme needs to be provided to realize the conversion of the voice types when the user sends voice.
Disclosure of Invention
The invention aims to provide a voice conversion communication terminal to solve the customization requirement of a user on the type of voice information to be sent.
To solve the above problems, the present invention provides a voice conversion communication terminal comprising: the voice coder generates voice data and message data of data from original voice, combines the voice data and the message to generate combined data, and transmits the combined data to the data transceiving channel; the voice decoder is used for separating voice data and message data from the combined data after receiving the combined message data from the data transceiving channel, and reconstructing the separated voice data and the separated message data to obtain redesigned combined data; and the data transceiving channel receives the combined message from the voice coder and transmits the combined message to the voice decoder.
Wherein the speech encoder comprises: a message data generating unit for extracting and generating message data from the original voice; a voice data generation unit which extracts and generates voice data from the original voice; and a combined data generation unit for integrating the message data and the voice data to generate combined data.
Wherein the voice decoder includes: a voice data separating unit separating voice data and message data from the combined data; and a message reconstruction unit for reconstructing the separated voice data and message data to generate re-designed combined data.
The voice data is data corresponding to original voice, the message data includes voice type, and the voice data and the message data have a one-to-one mapping relation.
Wherein the voice font of the original voice is different from the voice font of the redesigned combined data.
Wherein the voice conversion communication terminal selects a voice type through the voice server.
The separated voice data and message data are respectively sent, the message data are sent to a multimedia message service center, the voice data are sent to a voice server, and the voice server records the voice type.
The invention has the beneficial effects that: different from the situation of the prior art, the invention provides the voice conversion communication terminal, which realizes the conversion of the voice type when a user sends voice and meets the customized requirement of the user on the type of the voice information to be sent.
Drawings
Fig. 1 is a schematic structural diagram of an embodiment of a voice conversion communication terminal according to the present invention;
FIG. 2 is a system diagram of an embodiment of a voice converting communication terminal of the present invention;
fig. 3 is a system flow diagram of an embodiment of a voice converting communication terminal according to the present invention.
Detailed Description
The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be obtained by a person skilled in the art without any inventive step based on the embodiments of the present invention, are within the scope of the present invention.
Referring to fig. 1, fig. 1 is a schematic structural diagram of an embodiment of a voice conversion communication terminal in the present invention, in which 110 is a data transceiving channel, 120 is a voice decoder, 121 is a data reconstructing unit, 122 is a combined data separating unit, 130 is a voice encoder, 131 is a message data generator, 132 is a voice data generator, and 133 is a combined data generating unit. The voice converting communication terminal of the present invention includes a data transceiving channel 110, a voice decoder 120 and a voice encoder 130, wherein the voice decoder 120 includes a data reconstruction unit 121 and a combined data separation unit 122, and the voice encoder 130 includes a message data generation unit 131, a voice data generation unit 132 and a combined data generation unit 133. Specifically, in the data transceiving channel 110, a delivery service of data is implemented by inputting and outputting data, a combined message from the voice encoder is received and delivered to the voice decoder, and the data transceiving channel 110 can display the contents of the input and output messages; in the speech encoder 130, the original speech is converted into speech data by the speech data generating unit 132, message data is extracted and generated by the message data generating unit 131, and the speech data and the message data are combined in the combined data generating unit 133 to generate combined data; in the voice decoder 120, the combined data is separated into voice data and message data by the combined data separation unit 122, and the separated voice data and message data are reconstructed by the data reconstruction unit 121 to obtain redesigned combined data; the voice data is corresponding to the original voice file, the message data comprises the voice type, and one voice data has one message data corresponding to the voice data.
Further, the operation of the voice conversion communication terminal is described with reference to fig. 2 and fig. 3, fig. 2 is a system schematic diagram of an embodiment of the voice conversion communication terminal in the present invention, where 201 is a first communication terminal, 202 is a second communication terminal, 203 is a voice server, and 204 is a multimedia message service center; fig. 3 is a system flow diagram of an embodiment of a voice converting communication terminal according to the present invention. The first communication terminal 201 and the second communication terminal 202 are both the above-described voice conversion communication terminal, and the structure and function thereof are kept consistent with those of the above-described voice conversion communication terminal. In this embodiment, the work flow of the voice conversion communication terminal is as follows:
and S101, generating message data and voice data by the original voice through the first communication terminal, and combining the message data and the voice data to generate combined data. In this step, the first communication terminal 201 generates message data from the original voice by the message data generating unit 131 in the voice encoder 130, generates voice data by the voice data generating unit 132, and generates combined data from the message data and the voice data by the combined data generating unit 133; the voice data is data corresponding to an original voice file, the message data comprises voice types, and the voice data and the message data have a one-to-one mapping relation.
And S102, after the voice type is selected, the combined data is separated, the separated voice data is sent to a voice server, and the voice type is recorded by the voice server. In this step, after selecting the type of the voice to be transmitted from the voice server, the combined data of the voice to be transmitted is separated into voice data and message data by the combined data separation unit 122 in the first communication terminal 201, the voice data of the type is transmitted to the voice server 203, and the voice server 203 records the type of the voice to be transmitted; the voice server provides different types of voices, which can be selected or downloaded by the user of the first communication terminal 201, and the types of the voices can be distinguished by parameters such as audio frequency and the like. In one particular embodiment, the user can achieve the effect of changing voice by selecting or downloading different types of voice in the voice server 203.
S103, the first communication terminal sends the separated message data to the second communication terminal through the multimedia message service center. In this step, the first communication terminal 201 sends the message data of the voice type to be sent to the second communication terminal 202 through the multimedia message service center 204, and the message data received by the second communication terminal 202 is used for downloading the voice data corresponding to the message data to the voice server 203.
The second communication terminal downloads voice data corresponding to the received message data from the voice server S104. In this step, the second communication terminal 202 downloads voice data corresponding to the received message data from the voice server 203, the voice data corresponding to the message data.
And S105, the second communication terminal reconstructs the received message data and the voice data to obtain the redesigned combined data. In this step, the data reconstruction unit 121 of the voice decoder 120 in the second communication terminal 202 performs data reconstruction on the received message data and voice data in the voice type selected when the first communication terminal 201 transmits the received message data and voice data, and obtains redesigned combined data after the data reconstruction and provides the combined data to the user; the voice font of the original voice is different from the voice font of the redesigned combined data. The redesigned combined data is generated by original voice after being disassembled and converted in type, in a specific embodiment, the original voice of the user holding the first communication terminal 201 is type a, and voice type B is selected during transmission, and after the steps of S101 to S105, the user of the second communication terminal can receive the combined data of voice type B, thereby realizing the conversion of voice type when the user transmits voice.
Different from the situation of the prior art, the invention provides the voice conversion communication terminal, which realizes the conversion of the voice type when a user sends voice and meets the customized requirement of the user on the type of the voice information to be sent.
It should be noted that the above embodiments belong to the same inventive concept, and the description of each embodiment has a different emphasis, and reference may be made to the description in other embodiments where the description in individual embodiments is not detailed.
The above-mentioned embodiments only express the embodiments of the present invention, and the description thereof is more specific and detailed, but not construed as limiting the scope of the invention. It should be noted that, for a person skilled in the art, several variations and modifications can be made without departing from the inventive concept, which falls within the scope of the present invention. Therefore, the protection scope of the present patent shall be subject to the appended claims.

Claims (3)

1. A voice conversion communication terminal, comprising:
the voice coder is used for generating voice data and message data of data by original voice, combining the voice data and the message to generate combined data and transmitting the combined data to a data receiving and transmitting channel, wherein the voice data is the data corresponding to the original voice, the message data comprises the type of the voice, and the voice data and the message data have a one-to-one mapping relation;
the voice decoder separates voice data and message data from the combined data after receiving the combined data from the data transceiving channel; carrying out data reconstruction on the received voice data and the message data in the voice type selected when other voice conversion terminals send the voice data and the message data;
a data transceiving channel receiving the combined data from the speech encoder and transferring the combined data to the speech decoder;
the speech encoder includes:
a message data generation unit which extracts and generates the message data from the original voice;
a voice data generation unit that extracts and generates the voice data from the original voice;
a combined data generating unit which integrates the message data and the voice data to generate the combined data;
the working process of the voice conversion communication terminal is as follows:
s101, generating message data and voice data by an original voice through a voice encoder of a voice conversion communication terminal, and combining to generate combined data;
s102, the voice conversion communication terminal selects a voice type through a voice server, performs combined data separation after the voice type is selected, sends the separated voice data to the voice server and records the voice type by the voice server;
s103, the voice conversion communication terminal sends the separated message data to a second communication terminal through a multimedia message service center;
s104, the second communication terminal downloads the voice data corresponding to the received message data from the voice server;
and S105, the second communication terminal reconstructs the received message data and the voice data according to the voice type selected when the voice conversion terminal sends the message data and the voice data, and the reconstructed data obtains the redesigned combined data and provides the combined data for the user.
2. The voice converting communication terminal according to claim 1, wherein the voice decoder comprises:
a combined data separating unit that separates voice data and message data from the combined data;
and the data reconstruction unit reconstructs the received voice data and the message data according to the voice type selected when the other voice conversion terminal sends the voice data and the message data.
3. The voice converting communication terminal according to claim 1, wherein a voice genre of the original voice is different from a voice genre of the redesigned combined data.
CN201811620039.8A 2018-12-28 2018-12-28 Voice conversion communication terminal Active CN109587042B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201811620039.8A CN109587042B (en) 2018-12-28 2018-12-28 Voice conversion communication terminal

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201811620039.8A CN109587042B (en) 2018-12-28 2018-12-28 Voice conversion communication terminal

Publications (2)

Publication Number Publication Date
CN109587042A CN109587042A (en) 2019-04-05
CN109587042B true CN109587042B (en) 2022-01-21

Family

ID=65932206

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201811620039.8A Active CN109587042B (en) 2018-12-28 2018-12-28 Voice conversion communication terminal

Country Status (1)

Country Link
CN (1) CN109587042B (en)

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104144097A (en) * 2013-05-07 2014-11-12 百度在线网络技术(北京)有限公司 Voice message transmission system, sending end, receiving end and voice message transmission method
CN104299619A (en) * 2014-09-29 2015-01-21 广东欧珀移动通信有限公司 Method and device for processing audio file

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1523913A (en) * 2003-09-05 2004-08-25 江剑锋 IP telephone set for mobile phone and mobile phone thereof, method for realizing mobile IP telephone
CN100558119C (en) * 2005-07-29 2009-11-04 华为技术有限公司 A kind of terminal, communication equipment, communication system and communication means
WO2010030285A1 (en) * 2008-09-12 2010-03-18 Research In Motion Corporation Obtaining information associated with established sessions
US20130160063A1 (en) * 2011-12-20 2013-06-20 Usman Rashid Network delivery of broadcast media content streams
CN105573988A (en) * 2015-04-28 2016-05-11 宇龙计算机通信科技(深圳)有限公司 Voice conversion method and terminal

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104144097A (en) * 2013-05-07 2014-11-12 百度在线网络技术(北京)有限公司 Voice message transmission system, sending end, receiving end and voice message transmission method
CN104299619A (en) * 2014-09-29 2015-01-21 广东欧珀移动通信有限公司 Method and device for processing audio file

Also Published As

Publication number Publication date
CN109587042A (en) 2019-04-05

Similar Documents

Publication Publication Date Title
CN100546322C (en) Chat and tele-conferencing system with the translation of Text To Speech and speech-to-text
CN101345776B (en) Content adapting implementing method and content adapting server
CN109547844A (en) Audio/video pushing method and plug-flow client based on WebRTC agreement
CN104144097A (en) Voice message transmission system, sending end, receiving end and voice message transmission method
TW201006190A (en) Open architecture based domain dependent real time multi-lingual communication service
TW543311B (en) Static information knowledge used with binary compression methods
EP2924985A1 (en) Low-bit-rate video conference system and method, sending end device, and receiving end device
CN110971685B (en) Content processing method, content processing device, computer equipment and storage medium
CN104735389A (en) Information processing method and equipment
KR20040048086A (en) Mobile communications system and method for transmitting multimedia message
CN109587042B (en) Voice conversion communication terminal
CN114356335A (en) Data processing method, device, equipment and medium
CN109587041B (en) Voice conversion communication control system
CN107835150B (en) Full-media customer service scheduling method and system
CN110149631B (en) Method and system suitable for cloud loudspeaker box connection establishment
CN100574339C (en) Converting text information into stream media or multimedia and then the method that is received by terminal
CN101471891B (en) Method, system for real time displaying input state, and sending party/receiving party client terminals
CN1316748C (en) Communication system and method utilizing request-reply communication patterns for data compression
CN100452778C (en) Multimedia content interaction system based on instantaneous communication and its realizing method
CN116781653A (en) Message processing method, device, electronic equipment, system and storage medium
CN105357171A (en) Communication method and terminal
MX2008008188A (en) Distribution of information in telecommunication systems.
EP2469851A1 (en) System and method for generating interactive voice and video response menu
CN106131030A (en) The distribution method of a kind of high-speed data and device
CN101312549B (en) Method for converting text information into stream media of multimedia and further receiving by terminal

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
TR01 Transfer of patent right
TR01 Transfer of patent right

Effective date of registration: 20230512

Address after: 230000 B-2704, wo Yuan Garden, 81 Ganquan Road, Shushan District, Hefei, Anhui.

Patentee after: HEFEI LONGZHI ELECTROMECHANICAL TECHNOLOGY Co.,Ltd.

Address before: Wuhan vocational and technical college, no.463, Guanshan Avenue, Hongshan District, Wuhan City, Hubei Province, 430074

Patentee before: WUHAN POLYTECHNIC

TR01 Transfer of patent right
TR01 Transfer of patent right

Effective date of registration: 20231212

Address after: Room 1703, No. 310 Wuning South Road, Jing'an District, Shanghai, 200040

Patentee after: Tao Yewen

Address before: 230000 B-2704, wo Yuan Garden, 81 Ganquan Road, Shushan District, Hefei, Anhui.

Patentee before: HEFEI LONGZHI ELECTROMECHANICAL TECHNOLOGY Co.,Ltd.

TR01 Transfer of patent right
TR01 Transfer of patent right

Effective date of registration: 20240322

Address after: Room 554, 8th Floor, Building 2, No. 8589 Nanfeng Road, Fengxian District, Shanghai, 2014

Patentee after: Shanghai Zhize Communication Services Co.,Ltd.

Country or region after: China

Address before: Room 1703, No. 310 Wuning South Road, Jing'an District, Shanghai, 200040

Patentee before: Tao Yewen

Country or region before: China