CN202026434U

CN202026434U - Voice conversion STB (set top box)

Info

Publication number: CN202026434U
Application number: CN2011201311724U
Authority: CN
Inventors: 林辉荣
Original assignee: Guangdong Unionman Technology Co Ltd
Current assignee: Guangdong Unionman Technology Co Ltd
Priority date: 2011-04-29
Filing date: 2011-04-29
Publication date: 2011-11-02
Anticipated expiration: 2021-04-29

Abstract

The utility model relates to the technical field relative to an STB (set top box), in particular to a voice conversion STB, which comprises a main body, an AV (audio/video) input interface and an AV output interface, wherein the AV input interface and the AV output interface are connected with the main body. The main body comprises a voice-character conversion device, a character translation device and a voice synthesis device, the input end of the voice-character conversion device is connected with the AV input interface of the STB, the output end of the voice-character conversion device is connected with the input end of the character translation device, the output end of the character translation device is connected with the input end of the voice synthesis device, and the output end of the voice synthesis device is connected with the AV output interface. According to the voice conversion STB, voice can be converted according to the requirement of a user, not only is the requirement of the user met, but also a TV (television) station achieves better audience rating, and unnecessary voice transcribe conversion is saved.

Description

A kind of speech conversion set-top box

Technical field

The utility model relates to the set-top box correlative technology field, particularly a kind of speech conversion set-top box.

Background technology

Now some voice controls and voice are read and write appears on the digital equipment than higher-end, the voice of the non-master that present TV is seen generally all are the artificial treatment records, some still can not adopt the master voice through the TV programme of human translation like this, spectators can only understand programme information by captions, even some program does not have captions.Like this, spectators are very inconvenient when watching, and also can miss many highlights.

The utility model content

The utility model provides a kind of speech conversion set-top box, can not directly translate the technical problem of TV programme voice with the set-top box that solves prior art.

The technical solution adopted in the utility model is as follows:

A kind of speech conversion set-top box, comprise main body, and the audio frequency and video input interface and the audio and video output interface that are connected with main body, described main body comprises: language and characters conversion equipment, character translation device and speech synthetic device, the input of described language and characters conversion equipment is connected with the audio frequency and video input interface of set-top box, the output of language and characters conversion equipment is connected with the input of character translation device, the output of character translation device is connected with the input of speech synthetic device, and the output of speech synthetic device is connected with audio and video output interface.

As a kind of preferred version, described language and characters conversion equipment also comprises: preserve sound bank, speech analysis module and language and characters comparison module by voice and related text thereof;

The input of described speech analysis module is connected with the input of language and characters conversion equipment, obtains the voice of input;

The output of speech analysis module is connected with the first input end of language and characters comparison module, and second input of language and characters comparison module is connected with sound bank, and the output of language and characters comparison module is connected with the output of language and characters conversion equipment.

As a kind of preferred version, described speech synthetic device is the digital speech synthesizer.

As a kind of preferred version, described speech synthetic device is the mandarin pronunciation synthesizer.

The utility model makes set-top box to change voice according to user's requirement, has promptly satisfied user's requirement, also makes TV station obtain better audience ratings and unnecessary voice recording conversion.

Description of drawings

Fig. 1 is a system architecture diagram of the present utility model.

Fig. 2 is a workflow of the present utility model.

Embodiment

The utility model is described in more detail below in conjunction with the drawings and specific embodiments.

Present embodiment is a kind of speech conversion set-top box, comprise main body, and the audio frequency and video input interface and the audio and video output interface that are connected with main body, the audio frequency and video input interface is connected with program source, obtain programme information, audio and video output interface is connected with TV, the output program, described main body comprises: the language and characters conversion equipment, character translation device and digital speech synthesizer, the input of described language and characters conversion equipment is connected with the audio frequency and video input interface of set-top box, the output of language and characters conversion equipment is connected with the input of character translation device, the output of character translation device is connected with the input of digital speech synthesizer, and the output of digital speech synthesizer is connected with audio and video output interface.

Described language and characters conversion equipment comprises: preserve sound bank, speech analysis module and language and characters comparison module by voice and related text thereof;

Its realization flow is:

1. set-top box is carried out preliminary treatment and is set up sound bank voice signal.

2. speech analysis module is extracted from tone color, speaker's individual information and the acoustic feature value parameter thereof that speech analysis goes out the speaker that grab of program and is carried out speech reconstructing.

3. the language and characters comparison module carries out the capable comparison of wave mode superposition algorithm rebuilding good voice with the voice in the sound bank, draws Word message.

Again Word message by change the language of customer requirements.

Again Word message is converted to the corresponding digital synthetic speech.

Claims

1. speech conversion set-top box, comprise main body, and the audio frequency and video input interface and the audio and video output interface that are connected with main body, it is characterized in that, described main body comprises: the language and characters conversion equipment, character translation device and speech synthetic device, the input of described language and characters conversion equipment is connected with the audio frequency and video input interface of set-top box, the output of language and characters conversion equipment is connected with the input of character translation device, the output of character translation device is connected with the input of speech synthetic device, and the output of speech synthetic device is connected with audio and video output interface.

2. set-top box according to claim 1 is characterized in that, described language and characters conversion equipment also comprises: preserve sound bank, speech analysis module and language and characters comparison module by voice and related text thereof;

3. set-top box according to claim 1 is characterized in that, described speech synthetic device is the digital speech synthesizer.

4. set-top box according to claim 1 is characterized in that, described speech synthetic device is the mandarin pronunciation synthesizer.