CN104219459A - Video language translation method and system and intelligent display device - Google Patents

Video language translation method and system and intelligent display device Download PDF

Info

Publication number
CN104219459A
CN104219459A CN201410522251.6A CN201410522251A CN104219459A CN 104219459 A CN104219459 A CN 104219459A CN 201410522251 A CN201410522251 A CN 201410522251A CN 104219459 A CN104219459 A CN 104219459A
Authority
CN
China
Prior art keywords
video
current
broadcast
object language
image frame
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201410522251.6A
Other languages
Chinese (zh)
Inventor
张卓立
张清华
丁伯炉
赵冀杨
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shanghai Moruan Communication Technology Co Ltd
Original Assignee
Shanghai Moruan Communication Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shanghai Moruan Communication Technology Co Ltd filed Critical Shanghai Moruan Communication Technology Co Ltd
Priority to CN201410522251.6A priority Critical patent/CN104219459A/en
Publication of CN104219459A publication Critical patent/CN104219459A/en
Pending legal-status Critical Current

Links

Abstract

The invention discloses a video language translation method, a video language translation system and an intelligent display device. The video language translation method comprises the following steps of: obtaining current video which includes a current image frame and/or current language; automatically obtaining target language; recognizing characters and/or voice of the current video, playing the video when the recognizing result shows that the target language is met; translating the characters and/or voice of the current image frame, and adding the translating result into the current video; playing the video. According to the video language translation method and system and the intelligent display device provided by the invention, non-native language programs or video can be converted into the programs or video with the language similar to the native language for audiences, thus helping the audiences overcome the language barrier, and improving view experience when the audiences watch the programs or video through networks.

Description

Video language interpretation method, system and intelligent display device
Technical field
The present invention relates to a kind of video language interpretation method, system and intelligent display device.
Background technology
Along with popularizing of intelligent television and network, increasing televiewer can select to watch the video on network.Than traditional TV channel, the TV programme that can watch on network or the kind of video and quantity are all far away more than the former.Magnanimity program or video on network, not only comprised and the program of national this area also comprised a large amount of external programs.This has just brought a problem, and the video language play may not be mother tongue or spectators' mother tongue, and aphasis can affect spectators' viewing experience greatly.
Obviously, if can manage, in progress non-mother tongue program and video, necessarily process, to help spectators to overcome aphasis, can greatly promote spectators' viewing experience, more effectively utilize the resource on the Internet.
Summary of the invention
The technical problem to be solved in the present invention is to be difficult to overcome aphasis thereby the not good defect of perception in order to overcome in prior art when spectators or user watch program or video by network, proposes a kind of video language interpretation method, system and intelligent display device.
The present invention solves above-mentioned technical problem by following technical proposals:
The invention provides a kind of video language interpretation method, it is characterized in that, comprise the following steps:
S 1, a display device obtains current video, current video comprises current image frame and/or current speech;
S 2, this display device communicates by letter with a terminal equipment, and obtain and language be set as object language on this terminal equipment;
S 3, this display device carries out word identification and/or speech recognition to current video, at recognition result for meet this object language in the situation that, using current video as video to be broadcast and perform step S 5, in the situation that recognition result is not meet default object language to perform step S 4;
S 4, this display device is translated as this object language by the word in current image frame and/or current speech, and translation result is added in current video, to form video to be broadcast;
S 5, this display device plays video to be broadcast
It will be appreciated by those skilled in the art that and adopt any equipment to carry out the broadcasting of video, in fact all need first to obtain video, and then play.When both interval very in short-term, be just equivalent to instant broadcasting for the audience.Above-mentioned current video is the video that equipment obtains, and before video playback or in playing process, needs to carry out above-mentioned steps S 2~S 4.
Step S wherein 2to the automatic acquisition of object language, can be that display language on the terminal equipment arranging according to user obtains automatically, generally think that user arranges on terminal equipment language is spectators' mother tongue, be also object language.Here said terminal equipment can be user's the mobile terminal such as smart mobile phone, panel computer.Through above-mentioned steps S 4after processing, at least guaranteed in the situation that the video obtaining and mother tongue are not inconsistent, the content that translation is obtained is additional in video to be play, thereby helps spectators to be crossed at least to a certain extent aphasis more to understand the content of broadcasting, improves its viewing experience.
Preferably, current video comprises current image frame and current speech, this step S 4comprise the following steps:
S 41, current speech is translated as to this object language to form voice to be broadcast;
S 42, by the character translation in current image frame, be this object language, and the word that translation is obtained be added into corresponding former word in current image frame present position around, to form picture frame to be broadcast;
S 43, according to the sequential of voice to be broadcast and picture frame to be broadcast, the two is synthesized to video to be broadcast.
Preferably, current video has a label;
Step S 41for: this display device is searched for the object language voice packet of current video on network according to this label, if search, adopt object language voice packet coupling current speech, to obtain voice to be broadcast, if search, current speech is not translated as to this object language to form voice to be broadcast;
Step S 42for: this display device is searched for the object language captions bag of current video on network according to this label, if search, adopt the word in object language captions bag coupling current image frame, to form picture frame to be broadcast, if do not search, by the character translation in current image frame, be this object language, and the word that translation is obtained be added into corresponding former word in current image frame present position around, to form picture frame to be broadcast.
The label here can be the combination of the information such as the title, network resources address of current video or much information, and it can reflect the feature of current video, thereby is convenient to carry out according to it search of voice packet or captions bag.
The present invention also provides a kind of video language interpretation method, it is characterized in that, comprises the following steps:
S 1, a display device obtains current video, current video comprises current image frame and/or current speech;
S 2, this display device transfers to a terminal equipment by current video;
S 3, this terminal equipment read self language is set as object language, and current video is carried out to word identification and/or speech recognition, in the situation that recognition result is to meet default object language, using current video as video to be broadcast and perform step S 5, in the situation that recognition result is not meet default object language to perform step S 4;
S 4, this terminal equipment is translated as this object language by the word in current image frame and/or current speech, and translation result is added in current video, to form video to be broadcast, then video to be broadcast is sent to this display device;
S 5, this display device plays video to be broadcast.
Should be understood that, although adopt said method to increase the transfer of data between display device and terminal equipment, greatly reduce the performance requirement for display device itself simultaneously.Even if video playback apparatus non intelligentization of display device, comparatively old-fashioned herein, as long as it has ability to play and data transmission capabilities, and do not need data-handling capacity, can realize equally non-mother tongue video is converted to a certain extent to the function for the video of mother tongue.Need the terminal equipment of certain data-handling capacity can adopt smart mobile phone or wearable device etc.
Transfer of data can adopt wired form, such as the transmission such as USB, MHL interface (being Mobile High-Definition Link, is mobile terminal high-definition audio and video standard interface), also can utilize Radio Transmission Technology, such as WIFI.
Preferably, current video comprises current image frame and current speech, this step S 4comprise the following steps:
S 41, this terminal equipment is translated as this object language to form voice to be broadcast by current speech;
S 42, this terminal equipment is this object language by the character translation in current image frame, and the word that translation is obtained be added into corresponding former word in current image frame present position around, to form picture frame to be broadcast;
S 43, this terminal equipment synthesizes video to be broadcast according to the sequential of voice to be broadcast and picture frame to be broadcast by the two;
S 44, this terminal equipment is sent to this display device by video to be broadcast.
Preferably, current video has a label;
Step S 41for: this terminal equipment is searched for the object language voice packet of current video on network according to this label, if search, adopt object language voice packet coupling current speech, to obtain voice to be broadcast, if search, current speech is not translated as to this object language to form voice to be broadcast;
Step S 42for: this terminal equipment is searched for the object language captions bag of current video on network according to this label, if search, adopt the word in object language captions bag coupling current image frame, to form picture frame to be broadcast, if do not search, by the character translation in current image frame, be this object language, and the word that translation is obtained be added into corresponding former word in current image frame present position around, to form picture frame to be broadcast.
The present invention also provides a kind of intelligent display device, comprises a video acquiring module, an object language module, an identification module and a playing module.
This video acquiring module is used for obtaining current video, and current video comprises current image frame and/or current speech.This object language module, for communicating by letter with terminal equipment, is obtained the language that arranges on terminal equipment, is defaulted as object language.
This identification module is for carrying out word identification and/or speech recognition to current video, at recognition result for meet this object language in the situation that, current video is sent to a playing module as video to be broadcast, in the situation that recognition result is not meet default object language, word in current image frame and/or current speech are translated as to this object language, and translation result is added in current video, to form video to be broadcast.This playing module is for playing video to be broadcast.
Preferably, current video comprises current image frame and current speech, and this identification module comprises an Audio Processing Unit, a graphics processing unit and a synthesis unit.
Wherein, this Audio Processing Unit is for being translated as this object language to form voice to be broadcast by current speech, it is this object language by the character translation of current image frame that this graphics processing unit is used for, and the word that translation is obtained is added into corresponding former word, and in current image frame, present position is around, to form picture frame to be broadcast, this synthesis unit is for synthesizing video to be broadcast according to the sequential of voice to be broadcast and picture frame to be broadcast by the two.
Preferably, current video has a label, and this identification module also comprises a web search unit.This web search unit for searching for the object language voice packet of current video on network according to this label, if search, adopt object language voice packet coupling current speech, to obtain voice to be broadcast, and according to this label, on network, search for the object language captions bag of current video, if search, adopt the word in object language captions bag coupling current image frame, to form picture frame to be broadcast, if search object language voice packet, do not enable this Audio Processing Unit, if search object language captions Bao Ze, do not enable this graphics processing unit, if searching, object language voice packet and object language captions Bao Jun enable this synthesis unit.
The present invention also provides a kind of video translation system, comprises a display device and a terminal equipment.
This display device comprises a video acquiring module, a transport module and a playing module, this video acquiring module is used for obtaining current video, current video comprises current image frame and/or current speech, this transport module is for transferring to current video this terminal equipment and receive video to be broadcast from this terminal equipment, and this playing module is used for playing video to be broadcast.
This terminal equipment comprises a transport module and an identification module, this transport module be used for and this display device between video receiving, this identification module is for carrying out word identification and/or speech recognition to current video, at recognition result for meet this object language in the situation that, current video is sent to a playing module as video to be broadcast, in the situation that recognition result is not meet default object language, word in current image frame and/or current speech are translated as to this object language, and translation result is added in current video, to form video to be broadcast.
Preferably, this display device can be intelligent television, and this terminal equipment can be smart mobile phone or wearable device.
Preferably, current video comprises current image frame and current speech, and this identification module comprises an Audio Processing Unit, a graphics processing unit and a synthesis unit.
This Audio Processing Unit is for being translated as this object language to form voice to be broadcast by current speech, it is this object language by the character translation of current image frame that this graphics processing unit is used for, and the word that translation is obtained is added into corresponding former word, and in current image frame, present position is around, to form picture frame to be broadcast, this synthesis unit is for synthesizing video to be broadcast according to the sequential of voice to be broadcast and picture frame to be broadcast by the two.
Preferably, current video has a label, and this identification module also comprises a web search unit.This web search unit for searching for the object language voice packet of current video on network according to this label, if search, adopt object language voice packet coupling current speech, to obtain voice to be broadcast, and according to this label, on network, search for the object language captions bag of current video, if search, adopt the word in object language captions bag coupling current image frame, to form picture frame to be broadcast, if search object language voice packet, do not enable this Audio Processing Unit, if search object language captions Bao Ze, do not enable this graphics processing unit, if searching, object language voice packet and object language captions Bao Jun enable this synthesis unit.
Meeting on the basis of this area general knowledge, above-mentioned each optimum condition, can combination in any, obtains the preferred embodiments of the invention.
Positive progressive effect of the present invention is:
Video language interpretation method of the present invention, system and intelligent display device, can be for spectators be by non-mother tongue program or intimate program or the video that is similar to mother tongue that be immediately converted to of video, help spectators to overcome aphasis, improve by the viewing experience of network or television-viewing program or video.
Accompanying drawing explanation
Fig. 1 is the flow chart of the video language interpretation method of the embodiment of the present invention 1.
Fig. 2 is the flow chart of the video language interpretation method of the embodiment of the present invention 2.
Fig. 3 is the schematic diagram of the intelligent display device of the embodiment of the present invention 3.
Fig. 4 is the schematic diagram of the video translation system of the embodiment of the present invention 3.
Embodiment
Below in conjunction with accompanying drawing, provide preferred embodiment of the present invention, to describe technical scheme of the present invention in detail, but therefore do not limit the present invention among described scope of embodiments.
Embodiment 1
As shown in Figure 1, the video language interpretation method of the present embodiment comprises the following steps:
S 1, obtain current video, current video comprises current image frame and current speech, current video also has a label;
S 2, automatic acquisition user terminal equipment language is set as object language;
S 3, current video is carried out to word identification and speech recognition, at recognition result for meet this object language in the situation that, using current video as video to be broadcast and perform step S 5, in the situation that recognition result is not meet default object language to perform step S 41;
S 41, according to this label, on network, search for the object language voice packet of current video, if search, adopt object language voice packet coupling current speech, to obtain voice to be broadcast, if search, current speech is not translated as to this object language to form voice to be broadcast;
S 42, according to this label, on network, search for the object language captions bag of current video, if search, adopt the word in object language captions bag coupling current image frame, to form picture frame to be broadcast, if do not search, by the character translation in current image frame, be this object language, and the word that translation is obtained be added into corresponding former word in current image frame present position around, to form picture frame to be broadcast;
S 43, according to the sequential of voice to be broadcast and picture frame to be broadcast, the two is synthesized to video to be broadcast;
S 5, play video to be broadcast.
In the present embodiment, the network resources address that this label is current video, this object language is Chinese.And current video comprises some two field picture frames.
Embodiment 2
Shown in figure 2, the video language interpretation method of the present embodiment comprises the following steps:
S 1, a display device obtains current video, current video comprises current image frame and current speech, and current video also has a label;
S 2, this display device transfers to a terminal equipment by current video;
S 3, this terminal equipment carries out word identification and speech recognition to current video, in the situation that recognition result is to meet default object language, using current video as video to be broadcast and perform step S 5, in the situation that recognition result is not meet default object language to perform step S 41;
S 41, this terminal equipment searches for the object language voice packet of current video on network according to this label, if search, adopt object language voice packet coupling current speech, to obtain voice to be broadcast, if search, current speech is not translated as to this object language to form voice to be broadcast;
S 42, this terminal equipment searches for the object language captions bag of current video on network according to this label, if search, adopt the word in object language captions bag coupling current image frame, to form picture frame to be broadcast, if do not search, by the character translation in current image frame, be this object language, and the word that translation is obtained be added into corresponding former word in current image frame present position around, to form picture frame to be broadcast;
S 43, this terminal equipment synthesizes video to be broadcast according to the sequential of voice to be broadcast and picture frame to be broadcast by the two;
S 44, this terminal equipment is sent to this display device by video to be broadcast;
S 5, this display device plays video to be broadcast.
Although the present embodiment adopts said method to increase the transfer of data between display device and terminal equipment, greatly reduce the performance requirement for display device itself simultaneously.And terminal equipment in the present embodiment is a wearing equipment.
Embodiment 3
Shown in figure 3, the intelligent display device of the present embodiment comprises a video acquiring module 1, an object language module 2, an identification module 3 and a playing module 4.
This video acquiring module is used for obtaining current video, and current video comprises current image frame and current speech, and has a label.This object language module is for communicating by letter with terminal equipment, automatic acquisition terminal equipment language is set, using it as object language.
This identification module comprises an Audio Processing Unit, a graphics processing unit, a web search unit and a synthesis unit.
Wherein, this web search unit for searching for the object language voice packet of current video on network according to this label, if search, adopt object language voice packet coupling current speech, to obtain voice to be broadcast, and according to this label, on network, search for the object language captions bag of current video, if search, adopt the word in object language captions bag coupling current image frame, to form picture frame to be broadcast, if search object language voice packet, do not enable this Audio Processing Unit, if search object language captions Bao Ze, do not enable this graphics processing unit, if searching, object language voice packet and object language captions Bao Jun enable this synthesis unit.
This Audio Processing Unit is for being translated as this object language to form voice to be broadcast by current speech, it is this object language by the character translation of current image frame that this graphics processing unit is used for, and the word that translation is obtained is added into corresponding former word, and in current image frame, present position is around, to form picture frame to be broadcast, this synthesis unit is for synthesizing video to be broadcast according to the sequential of voice to be broadcast and picture frame to be broadcast by the two.This playing module is for playing video to be broadcast.
The present embodiment adopts said method to take intelligent display device as main body, has reduced the transfer of data between display device and terminal equipment, but higher to the performance requirement of intelligent display device itself.Intelligent display device in the present embodiment is intelligent television.
Embodiment 4
Shown in figure 4, the video translation system of the present embodiment includes a display device and a terminal equipment.
This display device comprises a video acquiring module 11, a transport module 13 and a playing module 12, and this video acquiring module 11 is for obtaining current video, and current video comprises current image frame and current speech, also has a label.This transport module 13 is for transferring to current video on this terminal equipment and receiving video to be broadcast from this terminal equipment, and this playing module 12 is for playing video to be broadcast.
This terminal equipment comprises a transport module 22 and an identification module 21, this transport module 22 for complete and this display device between video receiving, this identification module 21 is for carrying out word identification and speech recognition to current video, at recognition result for meet this object language in the situation that, current video is sent to a playing module as video to be broadcast, in the situation that recognition result is not meet default object language, word in current image frame and current speech are translated as to this object language, and translation result is added in current video, to form video to be broadcast.
This identification module 21 comprises an Audio Processing Unit, a graphics processing unit, a web search unit and a synthesis unit.
This web search unit for searching for the object language voice packet of current video on network according to this label, if search, adopt object language voice packet coupling current speech, to obtain voice to be broadcast, and according to this label, on network, search for the object language captions bag of current video, if search, adopt the word in object language captions bag coupling current image frame, to form picture frame to be broadcast, if search object language voice packet, do not enable this Audio Processing Unit, if search object language captions Bao Ze, do not enable this graphics processing unit, if searching, object language voice packet and object language captions Bao Jun enable this synthesis unit.
This Audio Processing Unit is for being translated as this object language to form voice to be broadcast by current speech, it is this object language by the character translation of current image frame that this graphics processing unit is used for, and the word that translation is obtained is added into corresponding former word, and in current image frame, present position is around, to form picture frame to be broadcast, this synthesis unit is for synthesizing video to be broadcast according to the sequential of voice to be broadcast and picture frame to be broadcast by the two.
Although more than described the specific embodiment of the present invention, it will be understood by those of skill in the art that these only illustrate, protection scope of the present invention is limited by appended claims.Those skilled in the art is not deviating under the prerequisite of principle of the present invention and essence, can make various changes or modifications to these execution modes, but these changes and modification all fall into protection scope of the present invention.

Claims (13)

1. a video language interpretation method, is characterized in that, comprises the following steps:
S 1, a display device obtains current video, current video comprises current image frame and/or current speech;
S 2, this display device communicates by letter with a terminal equipment, and obtain and language be set as object language on this terminal equipment;
S 3, this display device carries out word identification and/or speech recognition to current video, at recognition result for meet this object language in the situation that, using current video as video to be broadcast and perform step S 5, in the situation that recognition result is not meet default object language to perform step S 4;
S 4, this display device is translated as this object language by the word in current image frame and/or current speech, and translation result is added in current video, to form video to be broadcast;
S 5, this display device plays video to be broadcast.
2. video language interpretation method as claimed in claim 1, is characterized in that, current video comprises current image frame and current speech, this step S 4comprise the following steps:
S 41, this display device is translated as this object language to form voice to be broadcast by current speech;
S 42, this display device is this object language by the character translation in current image frame, and the word that translation is obtained be added into corresponding former word in current image frame present position around, to form picture frame to be broadcast;
S 43, this display device synthesizes video to be broadcast according to the sequential of voice to be broadcast and picture frame to be broadcast by the two.
3. video language interpretation method as claimed in claim 2, is characterized in that, current video has a label;
Step S 41for: this display device is searched for the object language voice packet of current video on network according to this label, if search, adopt object language voice packet coupling current speech, to obtain voice to be broadcast, if search, current speech is not translated as to this object language to form voice to be broadcast;
Step S 42for: this display device is searched for the object language captions bag of current video on network according to this label, if search, adopt the word in object language captions bag coupling current image frame, to form picture frame to be broadcast, if do not search, by the character translation in current image frame, be this object language, and the word that translation is obtained be added into corresponding former word in current image frame present position around, to form picture frame to be broadcast.
4. a video language interpretation method, is characterized in that, comprises the following steps:
S 1, a display device obtains current video, current video comprises current image frame and/or current speech;
S 2, this display device transfers to a terminal equipment by current video;
S 3, this terminal equipment read self language is set as object language, and current video is carried out to word identification and/or speech recognition, in the situation that recognition result is to meet default object language, using current video as video to be broadcast and perform step S 5, in the situation that recognition result is not meet default object language to perform step S 4;
S 4, this terminal equipment is translated as this object language by the word in current image frame and/or current speech, and translation result is added in current video, to form video to be broadcast, then video to be broadcast is sent to this display device;
S 5, this display device plays video to be broadcast.
5. video language interpretation method as claimed in claim 4, is characterized in that, current video comprises current image frame and current speech, this step S 4comprise the following steps:
S 41, this terminal equipment is translated as this object language to form voice to be broadcast by current speech;
S 42, this terminal equipment is this object language by the character translation in current image frame, and the word that translation is obtained be added into corresponding former word in current image frame present position around, to form picture frame to be broadcast;
S 43, this terminal equipment synthesizes video to be broadcast according to the sequential of voice to be broadcast and picture frame to be broadcast by the two;
S 44, this terminal equipment is sent to this display device by video to be broadcast.
6. video language interpretation method as claimed in claim 5, is characterized in that, current video has a label;
Step S 41for: this terminal equipment is searched for the object language voice packet of current video on network according to this label, if search, adopt object language voice packet coupling current speech, to obtain voice to be broadcast, if search, current speech is not translated as to this object language to form voice to be broadcast;
Step S 42for: this terminal equipment is searched for the object language captions bag of current video on network according to this label, if search, adopt the word in object language captions bag coupling current image frame, to form picture frame to be broadcast, if do not search, by the character translation in current image frame, be this object language, and the word that translation is obtained be added into corresponding former word in current image frame present position around, to form picture frame to be broadcast.
7. an intelligent display device, is characterized in that, comprising:
One video acquiring module, for obtaining current video, current video comprises current image frame and/or current speech;
One object language module, for communicating by letter with terminal equipment, and obtains and language is set as object language on terminal equipment;
One identification module, for current video is carried out to word identification and/or speech recognition, at recognition result for meet this object language in the situation that, current video is sent to a playing module as video to be broadcast, in the situation that recognition result is not meet default object language, word in current image frame and/or current speech are translated as to this object language, and translation result is added in current video, to form video to be broadcast;
This playing module, for playing video to be broadcast.
8. intelligent display device as claimed in claim 7, is characterized in that, current video comprises current image frame and current speech, and this identification module comprises an Audio Processing Unit, a graphics processing unit and a synthesis unit;
This Audio Processing Unit is for being translated as this object language to form voice to be broadcast by current speech, it is this object language by the character translation of current image frame that this graphics processing unit is used for, and the word that translation is obtained is added into corresponding former word, and in current image frame, present position is around, to form picture frame to be broadcast, this synthesis unit is for synthesizing video to be broadcast according to the sequential of voice to be broadcast and picture frame to be broadcast by the two.
9. intelligent display device as claimed in claim 8, is characterized in that, current video has a label, and this identification module also comprises a web search unit;
This web search unit for searching for the object language voice packet of current video on network according to this label, if search, adopt object language voice packet coupling current speech, to obtain voice to be broadcast, and according to this label, on network, search for the object language captions bag of current video, if search, adopt the word in object language captions bag coupling current image frame, to form picture frame to be broadcast, if search object language voice packet, do not enable this Audio Processing Unit, if search object language captions Bao Ze, do not enable this graphics processing unit, if searching, object language voice packet and object language captions Bao Jun enable this synthesis unit.
10. a video translation system, is characterized in that, comprises a display device and a terminal equipment;
This display device comprises a video acquiring module, a transport module and a playing module, this video acquiring module is used for obtaining current video, current video comprises current image frame and/or current speech, this transport module is for transferring to current video this terminal equipment and receive video to be broadcast from this terminal equipment, and this playing module is used for playing video to be broadcast:
This terminal equipment comprises a transport module and an identification module, this transport module be used for and this display device between video receiving, this identification module is for carrying out word identification and/or speech recognition to current video, at recognition result for meet this object language in the situation that, current video is sent to a playing module as video to be broadcast, in the situation that recognition result is not meet default object language, word in current image frame and/or current speech are translated as to this object language, and translation result is added in current video, to form video to be broadcast.
11. video translation systems as claimed in claim 10, is characterized in that, current video comprises current image frame and current speech, and this identification module comprises an Audio Processing Unit, a graphics processing unit and a synthesis unit;
This Audio Processing Unit is for being translated as this object language to form voice to be broadcast by current speech, it is this object language by the character translation of current image frame that this graphics processing unit is used for, and the word that translation is obtained is added into corresponding former word, and in current image frame, present position is around, to form picture frame to be broadcast, this synthesis unit is for synthesizing video to be broadcast according to the sequential of voice to be broadcast and picture frame to be broadcast by the two.
12. video translation systems as claimed in claim 11, is characterized in that, current video has a label, and this identification module also comprises a web search unit;
This web search unit for searching for the object language voice packet of current video on network according to this label, if search, adopt object language voice packet coupling current speech, to obtain voice to be broadcast, and according to this label, on network, search for the object language captions bag of current video, if search, adopt the word in object language captions bag coupling current image frame, to form picture frame to be broadcast, if search object language voice packet, do not enable this Audio Processing Unit, if search object language captions Bao Ze, do not enable this graphics processing unit, if searching, object language voice packet and object language captions Bao Jun enable this synthesis unit.
13. video translation systems as described in any one in claim 10-12, is characterized in that, this display device is intelligent television, and this terminal equipment is smart mobile phone or wearable device.
CN201410522251.6A 2014-09-30 2014-09-30 Video language translation method and system and intelligent display device Pending CN104219459A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201410522251.6A CN104219459A (en) 2014-09-30 2014-09-30 Video language translation method and system and intelligent display device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201410522251.6A CN104219459A (en) 2014-09-30 2014-09-30 Video language translation method and system and intelligent display device

Publications (1)

Publication Number Publication Date
CN104219459A true CN104219459A (en) 2014-12-17

Family

ID=52100552

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201410522251.6A Pending CN104219459A (en) 2014-09-30 2014-09-30 Video language translation method and system and intelligent display device

Country Status (1)

Country Link
CN (1) CN104219459A (en)

Cited By (22)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104936015A (en) * 2015-06-24 2015-09-23 冯旋宇 Set top box language control method and system
CN106340291A (en) * 2016-09-27 2017-01-18 广东小天才科技有限公司 Bilingual subtitle production method and system
WO2017008241A1 (en) * 2015-07-14 2017-01-19 张阳 Subtitle control method and system for ktv song selection system
CN107484002A (en) * 2017-08-25 2017-12-15 四川长虹电器股份有限公司 The method of intelligent translation captions
CN108259750A (en) * 2018-02-07 2018-07-06 商丘职业技术学院 Course recording system, video process apparatus and record class method based on fixed seat in the plane
CN108764967A (en) * 2018-04-28 2018-11-06 北京鸿途信达科技股份有限公司 Internet advertising generation method and device
CN109255130A (en) * 2018-07-17 2019-01-22 北京赛思美科技术有限公司 A kind of method, system and the equipment of language translation and study based on artificial intelligence
CN109472035A (en) * 2018-11-12 2019-03-15 深圳市友杰智新科技有限公司 Switch the control system and method for interpretive scheme
CN110648553A (en) * 2019-09-26 2020-01-03 北京声智科技有限公司 Site reminding method, electronic equipment and computer readable storage medium
CN110765787A (en) * 2019-10-21 2020-02-07 深圳传音控股股份有限公司 Information interaction real-time translation method, medium and terminal
US10652622B2 (en) 2017-06-27 2020-05-12 At&T Intellectual Property I, L.P. Method and apparatus for providing content based upon a selected language
CN111356025A (en) * 2018-12-24 2020-06-30 深圳Tcl新技术有限公司 Multi-subtitle display method, intelligent terminal and storage medium
CN111448527A (en) * 2020-01-14 2020-07-24 深圳市元征科技股份有限公司 Vehicle diagnosis process playback method and device and readable storage medium
CN111986656A (en) * 2020-08-31 2020-11-24 上海松鼠课堂人工智能科技有限公司 Teaching video automatic caption processing method and system
CN112163102A (en) * 2020-09-29 2021-01-01 北京字跳网络技术有限公司 Search content matching method and device, electronic equipment and storage medium
CN112584209A (en) * 2020-12-04 2021-03-30 深圳创维-Rgb电子有限公司 Display method, display device, storage medium and smart television
WO2021057957A1 (en) * 2019-09-27 2021-04-01 深圳市万普拉斯科技有限公司 Video call method and apparatus, computer device and storage medium
CN114257862A (en) * 2020-09-24 2022-03-29 北京字跳网络技术有限公司 Video generation method, device, equipment and storage medium
CN115484477A (en) * 2021-05-31 2022-12-16 上海哔哩哔哩科技有限公司 Subtitle generating method and device
CN115484478A (en) * 2021-05-31 2022-12-16 上海哔哩哔哩科技有限公司 Subtitle processing method and device
WO2023097446A1 (en) * 2021-11-30 2023-06-08 深圳传音控股股份有限公司 Video processing method, smart terminal, and storage medium
CN114257862B (en) * 2020-09-24 2024-05-14 北京字跳网络技术有限公司 Video generation method, device, equipment and storage medium

Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20080066138A1 (en) * 2006-09-13 2008-03-13 Nortel Networks Limited Closed captioning language translation
CN101472104A (en) * 2007-12-26 2009-07-01 英业达股份有限公司 Video signal display device with translation function
CN101924863A (en) * 2010-05-21 2010-12-22 中山大学 Digital television equipment
CN103067775A (en) * 2013-01-28 2013-04-24 Tcl集团股份有限公司 Subtitle display method for audio/video terminal, audio/video terminal and server
CN103106190A (en) * 2011-11-09 2013-05-15 财团法人资讯工业策进会 Instant translation system and method for digital television
CN103226947A (en) * 2013-03-27 2013-07-31 广东欧珀移动通信有限公司 Mobile terminal-based audio processing method and device
CN103488630A (en) * 2013-09-29 2014-01-01 小米科技有限责任公司 Method, device and terminal for processing picture
US20140053171A1 (en) * 2006-04-14 2014-02-20 At&T Intellectual Property Ii, L.P. On-Demand Language Translation for Television Programs
CN103634645A (en) * 2013-12-10 2014-03-12 青岛海尔软件有限公司 Method for controlling smart television by using mobile phone

Patent Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20140053171A1 (en) * 2006-04-14 2014-02-20 At&T Intellectual Property Ii, L.P. On-Demand Language Translation for Television Programs
US20080066138A1 (en) * 2006-09-13 2008-03-13 Nortel Networks Limited Closed captioning language translation
CN101472104A (en) * 2007-12-26 2009-07-01 英业达股份有限公司 Video signal display device with translation function
CN101924863A (en) * 2010-05-21 2010-12-22 中山大学 Digital television equipment
CN103106190A (en) * 2011-11-09 2013-05-15 财团法人资讯工业策进会 Instant translation system and method for digital television
CN103067775A (en) * 2013-01-28 2013-04-24 Tcl集团股份有限公司 Subtitle display method for audio/video terminal, audio/video terminal and server
CN103226947A (en) * 2013-03-27 2013-07-31 广东欧珀移动通信有限公司 Mobile terminal-based audio processing method and device
CN103488630A (en) * 2013-09-29 2014-01-01 小米科技有限责任公司 Method, device and terminal for processing picture
CN103634645A (en) * 2013-12-10 2014-03-12 青岛海尔软件有限公司 Method for controlling smart television by using mobile phone

Cited By (24)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104936015A (en) * 2015-06-24 2015-09-23 冯旋宇 Set top box language control method and system
WO2017008241A1 (en) * 2015-07-14 2017-01-19 张阳 Subtitle control method and system for ktv song selection system
CN106340291A (en) * 2016-09-27 2017-01-18 广东小天才科技有限公司 Bilingual subtitle production method and system
US10652622B2 (en) 2017-06-27 2020-05-12 At&T Intellectual Property I, L.P. Method and apparatus for providing content based upon a selected language
CN107484002A (en) * 2017-08-25 2017-12-15 四川长虹电器股份有限公司 The method of intelligent translation captions
CN108259750A (en) * 2018-02-07 2018-07-06 商丘职业技术学院 Course recording system, video process apparatus and record class method based on fixed seat in the plane
CN108764967A (en) * 2018-04-28 2018-11-06 北京鸿途信达科技股份有限公司 Internet advertising generation method and device
CN109255130A (en) * 2018-07-17 2019-01-22 北京赛思美科技术有限公司 A kind of method, system and the equipment of language translation and study based on artificial intelligence
CN109472035A (en) * 2018-11-12 2019-03-15 深圳市友杰智新科技有限公司 Switch the control system and method for interpretive scheme
CN109472035B (en) * 2018-11-12 2023-05-09 深圳市友杰智新科技有限公司 Control system and method for switching translation mode
CN111356025A (en) * 2018-12-24 2020-06-30 深圳Tcl新技术有限公司 Multi-subtitle display method, intelligent terminal and storage medium
CN110648553A (en) * 2019-09-26 2020-01-03 北京声智科技有限公司 Site reminding method, electronic equipment and computer readable storage medium
CN110648553B (en) * 2019-09-26 2021-05-28 北京声智科技有限公司 Site reminding method, electronic equipment and computer readable storage medium
WO2021057957A1 (en) * 2019-09-27 2021-04-01 深圳市万普拉斯科技有限公司 Video call method and apparatus, computer device and storage medium
CN110765787A (en) * 2019-10-21 2020-02-07 深圳传音控股股份有限公司 Information interaction real-time translation method, medium and terminal
CN111448527A (en) * 2020-01-14 2020-07-24 深圳市元征科技股份有限公司 Vehicle diagnosis process playback method and device and readable storage medium
CN111986656A (en) * 2020-08-31 2020-11-24 上海松鼠课堂人工智能科技有限公司 Teaching video automatic caption processing method and system
CN114257862A (en) * 2020-09-24 2022-03-29 北京字跳网络技术有限公司 Video generation method, device, equipment and storage medium
CN114257862B (en) * 2020-09-24 2024-05-14 北京字跳网络技术有限公司 Video generation method, device, equipment and storage medium
CN112163102A (en) * 2020-09-29 2021-01-01 北京字跳网络技术有限公司 Search content matching method and device, electronic equipment and storage medium
CN112584209A (en) * 2020-12-04 2021-03-30 深圳创维-Rgb电子有限公司 Display method, display device, storage medium and smart television
CN115484477A (en) * 2021-05-31 2022-12-16 上海哔哩哔哩科技有限公司 Subtitle generating method and device
CN115484478A (en) * 2021-05-31 2022-12-16 上海哔哩哔哩科技有限公司 Subtitle processing method and device
WO2023097446A1 (en) * 2021-11-30 2023-06-08 深圳传音控股股份有限公司 Video processing method, smart terminal, and storage medium

Similar Documents

Publication Publication Date Title
CN104219459A (en) Video language translation method and system and intelligent display device
US8656281B2 (en) Information processing apparatus, information processing method, information processing system, and program
US20150113558A1 (en) Receiver apparatus, broadcast/communication-cooperation system, and broadcast/communication-cooperation method
CN109729420A (en) Image processing method and device, mobile terminal and computer readable storage medium
US11227620B2 (en) Information processing apparatus and information processing method
US20080066097A1 (en) Method Of Realizing Interactive Advertisement Under Digital Braodcasting Environment By Extending Program Associated Data-Broadcasting To Internet Area
KR101293301B1 (en) System and method for serching images using caption of moving picture in keyword
CN102291615B (en) Television program precision searching and detail viewing device and method based on one-way network
CN105992041A (en) Method and device for encoding a captured screenshot and controlling program content switching based on the captured screenshot
CN105744346A (en) Caption switching method and device
CN102088631B (en) Live and demand broadcast method of digital television (TV) programs as well as related device and system
EP3142373A1 (en) Channel classification method and device
CN108810580B (en) Media content pushing method and device
CN102193794A (en) Linking real time media context to related applications and services
JP4774462B2 (en) Reception device, reception method, and reception program
CN105847946A (en) Screen transmission video processing method
TWI439123B (en) Set-top box and method for searching characters thereof
CN105872728A (en) Screen transfer video processing method for multi-screen interaction
CN103477653A (en) Supplying apparatus, supplying method, receiving apparatus, receiving method, program, and broadcasting system
CN102595232B (en) Relative information search method of digital television programs and digital television receiving terminal
CN105847971A (en) Method for processing screen transmission video
CN103501457B (en) The method and apparatus that a kind of program is play
CN103686336A (en) Video playing control method and device
CN107465946B (en) Video playing method, device, system and terminal equipment
CN108363770A (en) A kind of set-top box supports multipath extraction keyword and the method and system of search

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20141217

RJ01 Rejection of invention patent application after publication