CN202026434U - Voice conversion STB (set top box) - Google Patents

Voice conversion STB (set top box) Download PDF

Info

Publication number
CN202026434U
CN202026434U CN2011201311724U CN201120131172U CN202026434U CN 202026434 U CN202026434 U CN 202026434U CN 2011201311724 U CN2011201311724 U CN 2011201311724U CN 201120131172 U CN201120131172 U CN 201120131172U CN 202026434 U CN202026434 U CN 202026434U
Authority
CN
China
Prior art keywords
input
language
voice
output
characters
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CN2011201311724U
Other languages
Chinese (zh)
Inventor
林辉荣
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Guangdong Unionman Technology Co Ltd
Original Assignee
Guangdong Unionman Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Guangdong Unionman Technology Co Ltd filed Critical Guangdong Unionman Technology Co Ltd
Priority to CN2011201311724U priority Critical patent/CN202026434U/en
Application granted granted Critical
Publication of CN202026434U publication Critical patent/CN202026434U/en
Anticipated expiration legal-status Critical
Expired - Fee Related legal-status Critical Current

Links

Images

Landscapes

  • Machine Translation (AREA)

Abstract

The utility model relates to the technical field relative to an STB (set top box), in particular to a voice conversion STB, which comprises a main body, an AV (audio/video) input interface and an AV output interface, wherein the AV input interface and the AV output interface are connected with the main body. The main body comprises a voice-character conversion device, a character translation device and a voice synthesis device, the input end of the voice-character conversion device is connected with the AV input interface of the STB, the output end of the voice-character conversion device is connected with the input end of the character translation device, the output end of the character translation device is connected with the input end of the voice synthesis device, and the output end of the voice synthesis device is connected with the AV output interface. According to the voice conversion STB, voice can be converted according to the requirement of a user, not only is the requirement of the user met, but also a TV (television) station achieves better audience rating, and unnecessary voice transcribe conversion is saved.

Description

A kind of speech conversion set-top box
Technical field
The utility model relates to the set-top box correlative technology field, particularly a kind of speech conversion set-top box.
Background technology
Now some voice controls and voice are read and write appears on the digital equipment than higher-end, the voice of the non-master that present TV is seen generally all are the artificial treatment records, some still can not adopt the master voice through the TV programme of human translation like this, spectators can only understand programme information by captions, even some program does not have captions.Like this, spectators are very inconvenient when watching, and also can miss many highlights.
The utility model content
The utility model provides a kind of speech conversion set-top box, can not directly translate the technical problem of TV programme voice with the set-top box that solves prior art.
The technical solution adopted in the utility model is as follows:
A kind of speech conversion set-top box, comprise main body, and the audio frequency and video input interface and the audio and video output interface that are connected with main body, described main body comprises: language and characters conversion equipment, character translation device and speech synthetic device, the input of described language and characters conversion equipment is connected with the audio frequency and video input interface of set-top box, the output of language and characters conversion equipment is connected with the input of character translation device, the output of character translation device is connected with the input of speech synthetic device, and the output of speech synthetic device is connected with audio and video output interface.
As a kind of preferred version, described language and characters conversion equipment also comprises: preserve sound bank, speech analysis module and language and characters comparison module by voice and related text thereof;
The input of described speech analysis module is connected with the input of language and characters conversion equipment, obtains the voice of input;
The output of speech analysis module is connected with the first input end of language and characters comparison module, and second input of language and characters comparison module is connected with sound bank, and the output of language and characters comparison module is connected with the output of language and characters conversion equipment.
As a kind of preferred version, described speech synthetic device is the digital speech synthesizer.
As a kind of preferred version, described speech synthetic device is the mandarin pronunciation synthesizer.
The utility model makes set-top box to change voice according to user's requirement, has promptly satisfied user's requirement, also makes TV station obtain better audience ratings and unnecessary voice recording conversion.
Description of drawings
Fig. 1 is a system architecture diagram of the present utility model.
Fig. 2 is a workflow of the present utility model.
Embodiment
The utility model is described in more detail below in conjunction with the drawings and specific embodiments.
Present embodiment is a kind of speech conversion set-top box, comprise main body, and the audio frequency and video input interface and the audio and video output interface that are connected with main body, the audio frequency and video input interface is connected with program source, obtain programme information, audio and video output interface is connected with TV, the output program, described main body comprises: the language and characters conversion equipment, character translation device and digital speech synthesizer, the input of described language and characters conversion equipment is connected with the audio frequency and video input interface of set-top box, the output of language and characters conversion equipment is connected with the input of character translation device, the output of character translation device is connected with the input of digital speech synthesizer, and the output of digital speech synthesizer is connected with audio and video output interface.
Described language and characters conversion equipment comprises: preserve sound bank, speech analysis module and language and characters comparison module by voice and related text thereof;
The input of described speech analysis module is connected with the input of language and characters conversion equipment, obtains the voice of input;
The output of speech analysis module is connected with the first input end of language and characters comparison module, and second input of language and characters comparison module is connected with sound bank, and the output of language and characters comparison module is connected with the output of language and characters conversion equipment.
Its realization flow is:
1. set-top box is carried out preliminary treatment and is set up sound bank voice signal.
2. speech analysis module is extracted from tone color, speaker's individual information and the acoustic feature value parameter thereof that speech analysis goes out the speaker that grab of program and is carried out speech reconstructing.
3. the language and characters comparison module carries out the capable comparison of wave mode superposition algorithm rebuilding good voice with the voice in the sound bank, draws Word message.
Again Word message by change the language of customer requirements.
Again Word message is converted to the corresponding digital synthetic speech.

Claims (4)

1. speech conversion set-top box, comprise main body, and the audio frequency and video input interface and the audio and video output interface that are connected with main body, it is characterized in that, described main body comprises: the language and characters conversion equipment, character translation device and speech synthetic device, the input of described language and characters conversion equipment is connected with the audio frequency and video input interface of set-top box, the output of language and characters conversion equipment is connected with the input of character translation device, the output of character translation device is connected with the input of speech synthetic device, and the output of speech synthetic device is connected with audio and video output interface.
2. set-top box according to claim 1 is characterized in that, described language and characters conversion equipment also comprises: preserve sound bank, speech analysis module and language and characters comparison module by voice and related text thereof;
The input of described speech analysis module is connected with the input of language and characters conversion equipment, obtains the voice of input;
The output of speech analysis module is connected with the first input end of language and characters comparison module, and second input of language and characters comparison module is connected with sound bank, and the output of language and characters comparison module is connected with the output of language and characters conversion equipment.
3. set-top box according to claim 1 is characterized in that, described speech synthetic device is the digital speech synthesizer.
4. set-top box according to claim 1 is characterized in that, described speech synthetic device is the mandarin pronunciation synthesizer.
CN2011201311724U 2011-04-29 2011-04-29 Voice conversion STB (set top box) Expired - Fee Related CN202026434U (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN2011201311724U CN202026434U (en) 2011-04-29 2011-04-29 Voice conversion STB (set top box)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN2011201311724U CN202026434U (en) 2011-04-29 2011-04-29 Voice conversion STB (set top box)

Publications (1)

Publication Number Publication Date
CN202026434U true CN202026434U (en) 2011-11-02

Family

ID=44851367

Family Applications (1)

Application Number Title Priority Date Filing Date
CN2011201311724U Expired - Fee Related CN202026434U (en) 2011-04-29 2011-04-29 Voice conversion STB (set top box)

Country Status (1)

Country Link
CN (1) CN202026434U (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102779508A (en) * 2012-03-31 2012-11-14 安徽科大讯飞信息科技股份有限公司 Speech corpus generating device and method, speech synthesizing system and method
CN104252861A (en) * 2014-09-11 2014-12-31 百度在线网络技术(北京)有限公司 Video voice conversion method, video voice conversion device and server
CN104936015A (en) * 2015-06-24 2015-09-23 冯旋宇 Set top box language control method and system
CN106384593A (en) * 2016-09-05 2017-02-08 北京金山软件有限公司 Voice information conversion and information generation method and device

Cited By (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102779508A (en) * 2012-03-31 2012-11-14 安徽科大讯飞信息科技股份有限公司 Speech corpus generating device and method, speech synthesizing system and method
CN102779508B (en) * 2012-03-31 2016-11-09 科大讯飞股份有限公司 Sound bank generates Apparatus for () and method therefor, speech synthesis system and method thereof
CN104252861A (en) * 2014-09-11 2014-12-31 百度在线网络技术(北京)有限公司 Video voice conversion method, video voice conversion device and server
WO2016037440A1 (en) * 2014-09-11 2016-03-17 百度在线网络技术(北京)有限公司 Video voice conversion method and device and server
CN104252861B (en) * 2014-09-11 2018-04-13 百度在线网络技术(北京)有限公司 Video speech conversion method, device and server
CN104936015A (en) * 2015-06-24 2015-09-23 冯旋宇 Set top box language control method and system
CN106384593A (en) * 2016-09-05 2017-02-08 北京金山软件有限公司 Voice information conversion and information generation method and device
CN110060687A (en) * 2016-09-05 2019-07-26 北京金山软件有限公司 A kind of conversion of voice messaging, information generating method and device
CN106384593B (en) * 2016-09-05 2019-11-01 北京金山软件有限公司 A kind of conversion of voice messaging, information generating method and device

Similar Documents

Publication Publication Date Title
CN104252861B (en) Video speech conversion method, device and server
CN105679348B (en) A kind of audio/video player and method
CN105845125B (en) Phoneme synthesizing method and speech synthetic device
US9547642B2 (en) Voice to text to voice processing
WO2020098115A1 (en) Subtitle adding method, apparatus, electronic device, and computer readable storage medium
CN201319640Y (en) Digital television receiving terminal capable of synchronously translating in real time
CN106653036B (en) Audio mixing code-transferring method based on OTT boxes
CN202026434U (en) Voice conversion STB (set top box)
CN102893313A (en) System for translating spoken language into sign language for the deaf
CN103491429A (en) Audio processing method and audio processing equipment
CN106098054A (en) The defecator of speaker noise and method in a kind of speech recognition
CN103226947A (en) Mobile terminal-based audio processing method and device
CN110349582B (en) Display device and far-field voice processing circuit
US20120130720A1 (en) Information providing device
CN109346057A (en) A kind of speech processing system of intelligence toy for children
CN111447519A (en) Smart speaker, interaction method based on smart speaker and program product
WO2023045954A1 (en) Speech synthesis method and apparatus, electronic device, and readable storage medium
CN202796043U (en) Voice recognition system
CN110767233A (en) Voice conversion system and method
CN202652435U (en) Digital television set top box capable of automatically generating subtitles
CN211860471U (en) Intelligent sound box
CN102110459B (en) Playing terminal and multimedia file playing method and device thereof
CN101188664A (en) STB with voice prompt
CN209030351U (en) A kind of television terminal with interpretative function
CN107393566A (en) The audio-frequency decoding method and device of a kind of Intelligent story device

Legal Events

Date Code Title Description
C14 Grant of patent or utility model
GR01 Patent grant
C17 Cessation of patent right
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20111102

Termination date: 20140429