CN104219459A - Video language translation method and system and intelligent display device - Google Patents
Video language translation method and system and intelligent display device Download PDFInfo
- Publication number
- CN104219459A CN104219459A CN201410522251.6A CN201410522251A CN104219459A CN 104219459 A CN104219459 A CN 104219459A CN 201410522251 A CN201410522251 A CN 201410522251A CN 104219459 A CN104219459 A CN 104219459A
- Authority
- CN
- China
- Prior art keywords
- video
- current
- broadcast
- object language
- image frame
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Abstract
The invention discloses a video language translation method, a video language translation system and an intelligent display device. The video language translation method comprises the following steps of: obtaining current video which includes a current image frame and/or current language; automatically obtaining target language; recognizing characters and/or voice of the current video, playing the video when the recognizing result shows that the target language is met; translating the characters and/or voice of the current image frame, and adding the translating result into the current video; playing the video. According to the video language translation method and system and the intelligent display device provided by the invention, non-native language programs or video can be converted into the programs or video with the language similar to the native language for audiences, thus helping the audiences overcome the language barrier, and improving view experience when the audiences watch the programs or video through networks.
Description
Technical field
The present invention relates to a kind of video language interpretation method, system and intelligent display device.
Background technology
Along with popularizing of intelligent television and network, increasing televiewer can select to watch the video on network.Than traditional TV channel, the TV programme that can watch on network or the kind of video and quantity are all far away more than the former.Magnanimity program or video on network, not only comprised and the program of national this area also comprised a large amount of external programs.This has just brought a problem, and the video language play may not be mother tongue or spectators' mother tongue, and aphasis can affect spectators' viewing experience greatly.
Obviously, if can manage, in progress non-mother tongue program and video, necessarily process, to help spectators to overcome aphasis, can greatly promote spectators' viewing experience, more effectively utilize the resource on the Internet.
Summary of the invention
The technical problem to be solved in the present invention is to be difficult to overcome aphasis thereby the not good defect of perception in order to overcome in prior art when spectators or user watch program or video by network, proposes a kind of video language interpretation method, system and intelligent display device.
The present invention solves above-mentioned technical problem by following technical proposals:
The invention provides a kind of video language interpretation method, it is characterized in that, comprise the following steps:
S
1, a display device obtains current video, current video comprises current image frame and/or current speech;
S
2, this display device communicates by letter with a terminal equipment, and obtain and language be set as object language on this terminal equipment;
S
3, this display device carries out word identification and/or speech recognition to current video, at recognition result for meet this object language in the situation that, using current video as video to be broadcast and perform step S
5, in the situation that recognition result is not meet default object language to perform step S
4;
S
4, this display device is translated as this object language by the word in current image frame and/or current speech, and translation result is added in current video, to form video to be broadcast;
S
5, this display device plays video to be broadcast
It will be appreciated by those skilled in the art that and adopt any equipment to carry out the broadcasting of video, in fact all need first to obtain video, and then play.When both interval very in short-term, be just equivalent to instant broadcasting for the audience.Above-mentioned current video is the video that equipment obtains, and before video playback or in playing process, needs to carry out above-mentioned steps S
2~S
4.
Step S wherein
2to the automatic acquisition of object language, can be that display language on the terminal equipment arranging according to user obtains automatically, generally think that user arranges on terminal equipment language is spectators' mother tongue, be also object language.Here said terminal equipment can be user's the mobile terminal such as smart mobile phone, panel computer.Through above-mentioned steps S
4after processing, at least guaranteed in the situation that the video obtaining and mother tongue are not inconsistent, the content that translation is obtained is additional in video to be play, thereby helps spectators to be crossed at least to a certain extent aphasis more to understand the content of broadcasting, improves its viewing experience.
Preferably, current video comprises current image frame and current speech, this step S
4comprise the following steps:
S
41, current speech is translated as to this object language to form voice to be broadcast;
S
42, by the character translation in current image frame, be this object language, and the word that translation is obtained be added into corresponding former word in current image frame present position around, to form picture frame to be broadcast;
S
43, according to the sequential of voice to be broadcast and picture frame to be broadcast, the two is synthesized to video to be broadcast.
Preferably, current video has a label;
Step S
41for: this display device is searched for the object language voice packet of current video on network according to this label, if search, adopt object language voice packet coupling current speech, to obtain voice to be broadcast, if search, current speech is not translated as to this object language to form voice to be broadcast;
Step S
42for: this display device is searched for the object language captions bag of current video on network according to this label, if search, adopt the word in object language captions bag coupling current image frame, to form picture frame to be broadcast, if do not search, by the character translation in current image frame, be this object language, and the word that translation is obtained be added into corresponding former word in current image frame present position around, to form picture frame to be broadcast.
The label here can be the combination of the information such as the title, network resources address of current video or much information, and it can reflect the feature of current video, thereby is convenient to carry out according to it search of voice packet or captions bag.
The present invention also provides a kind of video language interpretation method, it is characterized in that, comprises the following steps:
S
1, a display device obtains current video, current video comprises current image frame and/or current speech;
S
2, this display device transfers to a terminal equipment by current video;
S
3, this terminal equipment read self language is set as object language, and current video is carried out to word identification and/or speech recognition, in the situation that recognition result is to meet default object language, using current video as video to be broadcast and perform step S
5, in the situation that recognition result is not meet default object language to perform step S
4;
S
4, this terminal equipment is translated as this object language by the word in current image frame and/or current speech, and translation result is added in current video, to form video to be broadcast, then video to be broadcast is sent to this display device;
S
5, this display device plays video to be broadcast.
Should be understood that, although adopt said method to increase the transfer of data between display device and terminal equipment, greatly reduce the performance requirement for display device itself simultaneously.Even if video playback apparatus non intelligentization of display device, comparatively old-fashioned herein, as long as it has ability to play and data transmission capabilities, and do not need data-handling capacity, can realize equally non-mother tongue video is converted to a certain extent to the function for the video of mother tongue.Need the terminal equipment of certain data-handling capacity can adopt smart mobile phone or wearable device etc.
Transfer of data can adopt wired form, such as the transmission such as USB, MHL interface (being Mobile High-Definition Link, is mobile terminal high-definition audio and video standard interface), also can utilize Radio Transmission Technology, such as WIFI.
Preferably, current video comprises current image frame and current speech, this step S
4comprise the following steps:
S
41, this terminal equipment is translated as this object language to form voice to be broadcast by current speech;
S
42, this terminal equipment is this object language by the character translation in current image frame, and the word that translation is obtained be added into corresponding former word in current image frame present position around, to form picture frame to be broadcast;
S
43, this terminal equipment synthesizes video to be broadcast according to the sequential of voice to be broadcast and picture frame to be broadcast by the two;
S
44, this terminal equipment is sent to this display device by video to be broadcast.
Preferably, current video has a label;
Step S
41for: this terminal equipment is searched for the object language voice packet of current video on network according to this label, if search, adopt object language voice packet coupling current speech, to obtain voice to be broadcast, if search, current speech is not translated as to this object language to form voice to be broadcast;
Step S
42for: this terminal equipment is searched for the object language captions bag of current video on network according to this label, if search, adopt the word in object language captions bag coupling current image frame, to form picture frame to be broadcast, if do not search, by the character translation in current image frame, be this object language, and the word that translation is obtained be added into corresponding former word in current image frame present position around, to form picture frame to be broadcast.
The present invention also provides a kind of intelligent display device, comprises a video acquiring module, an object language module, an identification module and a playing module.
This video acquiring module is used for obtaining current video, and current video comprises current image frame and/or current speech.This object language module, for communicating by letter with terminal equipment, is obtained the language that arranges on terminal equipment, is defaulted as object language.
This identification module is for carrying out word identification and/or speech recognition to current video, at recognition result for meet this object language in the situation that, current video is sent to a playing module as video to be broadcast, in the situation that recognition result is not meet default object language, word in current image frame and/or current speech are translated as to this object language, and translation result is added in current video, to form video to be broadcast.This playing module is for playing video to be broadcast.
Preferably, current video comprises current image frame and current speech, and this identification module comprises an Audio Processing Unit, a graphics processing unit and a synthesis unit.
Wherein, this Audio Processing Unit is for being translated as this object language to form voice to be broadcast by current speech, it is this object language by the character translation of current image frame that this graphics processing unit is used for, and the word that translation is obtained is added into corresponding former word, and in current image frame, present position is around, to form picture frame to be broadcast, this synthesis unit is for synthesizing video to be broadcast according to the sequential of voice to be broadcast and picture frame to be broadcast by the two.
Preferably, current video has a label, and this identification module also comprises a web search unit.This web search unit for searching for the object language voice packet of current video on network according to this label, if search, adopt object language voice packet coupling current speech, to obtain voice to be broadcast, and according to this label, on network, search for the object language captions bag of current video, if search, adopt the word in object language captions bag coupling current image frame, to form picture frame to be broadcast, if search object language voice packet, do not enable this Audio Processing Unit, if search object language captions Bao Ze, do not enable this graphics processing unit, if searching, object language voice packet and object language captions Bao Jun enable this synthesis unit.
The present invention also provides a kind of video translation system, comprises a display device and a terminal equipment.
This display device comprises a video acquiring module, a transport module and a playing module, this video acquiring module is used for obtaining current video, current video comprises current image frame and/or current speech, this transport module is for transferring to current video this terminal equipment and receive video to be broadcast from this terminal equipment, and this playing module is used for playing video to be broadcast.
This terminal equipment comprises a transport module and an identification module, this transport module be used for and this display device between video receiving, this identification module is for carrying out word identification and/or speech recognition to current video, at recognition result for meet this object language in the situation that, current video is sent to a playing module as video to be broadcast, in the situation that recognition result is not meet default object language, word in current image frame and/or current speech are translated as to this object language, and translation result is added in current video, to form video to be broadcast.
Preferably, this display device can be intelligent television, and this terminal equipment can be smart mobile phone or wearable device.
Preferably, current video comprises current image frame and current speech, and this identification module comprises an Audio Processing Unit, a graphics processing unit and a synthesis unit.
This Audio Processing Unit is for being translated as this object language to form voice to be broadcast by current speech, it is this object language by the character translation of current image frame that this graphics processing unit is used for, and the word that translation is obtained is added into corresponding former word, and in current image frame, present position is around, to form picture frame to be broadcast, this synthesis unit is for synthesizing video to be broadcast according to the sequential of voice to be broadcast and picture frame to be broadcast by the two.
Preferably, current video has a label, and this identification module also comprises a web search unit.This web search unit for searching for the object language voice packet of current video on network according to this label, if search, adopt object language voice packet coupling current speech, to obtain voice to be broadcast, and according to this label, on network, search for the object language captions bag of current video, if search, adopt the word in object language captions bag coupling current image frame, to form picture frame to be broadcast, if search object language voice packet, do not enable this Audio Processing Unit, if search object language captions Bao Ze, do not enable this graphics processing unit, if searching, object language voice packet and object language captions Bao Jun enable this synthesis unit.
Meeting on the basis of this area general knowledge, above-mentioned each optimum condition, can combination in any, obtains the preferred embodiments of the invention.
Positive progressive effect of the present invention is:
Video language interpretation method of the present invention, system and intelligent display device, can be for spectators be by non-mother tongue program or intimate program or the video that is similar to mother tongue that be immediately converted to of video, help spectators to overcome aphasis, improve by the viewing experience of network or television-viewing program or video.
Accompanying drawing explanation
Fig. 1 is the flow chart of the video language interpretation method of the embodiment of the present invention 1.
Fig. 2 is the flow chart of the video language interpretation method of the embodiment of the present invention 2.
Fig. 3 is the schematic diagram of the intelligent display device of the embodiment of the present invention 3.
Fig. 4 is the schematic diagram of the video translation system of the embodiment of the present invention 3.
Embodiment
Below in conjunction with accompanying drawing, provide preferred embodiment of the present invention, to describe technical scheme of the present invention in detail, but therefore do not limit the present invention among described scope of embodiments.
Embodiment 1
As shown in Figure 1, the video language interpretation method of the present embodiment comprises the following steps:
S
1, obtain current video, current video comprises current image frame and current speech, current video also has a label;
S
2, automatic acquisition user terminal equipment language is set as object language;
S
3, current video is carried out to word identification and speech recognition, at recognition result for meet this object language in the situation that, using current video as video to be broadcast and perform step S
5, in the situation that recognition result is not meet default object language to perform step S
41;
S
41, according to this label, on network, search for the object language voice packet of current video, if search, adopt object language voice packet coupling current speech, to obtain voice to be broadcast, if search, current speech is not translated as to this object language to form voice to be broadcast;
S
42, according to this label, on network, search for the object language captions bag of current video, if search, adopt the word in object language captions bag coupling current image frame, to form picture frame to be broadcast, if do not search, by the character translation in current image frame, be this object language, and the word that translation is obtained be added into corresponding former word in current image frame present position around, to form picture frame to be broadcast;
S
43, according to the sequential of voice to be broadcast and picture frame to be broadcast, the two is synthesized to video to be broadcast;
S
5, play video to be broadcast.
In the present embodiment, the network resources address that this label is current video, this object language is Chinese.And current video comprises some two field picture frames.
Embodiment 2
Shown in figure 2, the video language interpretation method of the present embodiment comprises the following steps:
S
1, a display device obtains current video, current video comprises current image frame and current speech, and current video also has a label;
S
2, this display device transfers to a terminal equipment by current video;
S
3, this terminal equipment carries out word identification and speech recognition to current video, in the situation that recognition result is to meet default object language, using current video as video to be broadcast and perform step S
5, in the situation that recognition result is not meet default object language to perform step S
41;
S
41, this terminal equipment searches for the object language voice packet of current video on network according to this label, if search, adopt object language voice packet coupling current speech, to obtain voice to be broadcast, if search, current speech is not translated as to this object language to form voice to be broadcast;
S
42, this terminal equipment searches for the object language captions bag of current video on network according to this label, if search, adopt the word in object language captions bag coupling current image frame, to form picture frame to be broadcast, if do not search, by the character translation in current image frame, be this object language, and the word that translation is obtained be added into corresponding former word in current image frame present position around, to form picture frame to be broadcast;
S
43, this terminal equipment synthesizes video to be broadcast according to the sequential of voice to be broadcast and picture frame to be broadcast by the two;
S
44, this terminal equipment is sent to this display device by video to be broadcast;
S
5, this display device plays video to be broadcast.
Although the present embodiment adopts said method to increase the transfer of data between display device and terminal equipment, greatly reduce the performance requirement for display device itself simultaneously.And terminal equipment in the present embodiment is a wearing equipment.
Embodiment 3
Shown in figure 3, the intelligent display device of the present embodiment comprises a video acquiring module 1, an object language module 2, an identification module 3 and a playing module 4.
This video acquiring module is used for obtaining current video, and current video comprises current image frame and current speech, and has a label.This object language module is for communicating by letter with terminal equipment, automatic acquisition terminal equipment language is set, using it as object language.
This identification module comprises an Audio Processing Unit, a graphics processing unit, a web search unit and a synthesis unit.
Wherein, this web search unit for searching for the object language voice packet of current video on network according to this label, if search, adopt object language voice packet coupling current speech, to obtain voice to be broadcast, and according to this label, on network, search for the object language captions bag of current video, if search, adopt the word in object language captions bag coupling current image frame, to form picture frame to be broadcast, if search object language voice packet, do not enable this Audio Processing Unit, if search object language captions Bao Ze, do not enable this graphics processing unit, if searching, object language voice packet and object language captions Bao Jun enable this synthesis unit.
This Audio Processing Unit is for being translated as this object language to form voice to be broadcast by current speech, it is this object language by the character translation of current image frame that this graphics processing unit is used for, and the word that translation is obtained is added into corresponding former word, and in current image frame, present position is around, to form picture frame to be broadcast, this synthesis unit is for synthesizing video to be broadcast according to the sequential of voice to be broadcast and picture frame to be broadcast by the two.This playing module is for playing video to be broadcast.
The present embodiment adopts said method to take intelligent display device as main body, has reduced the transfer of data between display device and terminal equipment, but higher to the performance requirement of intelligent display device itself.Intelligent display device in the present embodiment is intelligent television.
Embodiment 4
Shown in figure 4, the video translation system of the present embodiment includes a display device and a terminal equipment.
This display device comprises a video acquiring module 11, a transport module 13 and a playing module 12, and this video acquiring module 11 is for obtaining current video, and current video comprises current image frame and current speech, also has a label.This transport module 13 is for transferring to current video on this terminal equipment and receiving video to be broadcast from this terminal equipment, and this playing module 12 is for playing video to be broadcast.
This terminal equipment comprises a transport module 22 and an identification module 21, this transport module 22 for complete and this display device between video receiving, this identification module 21 is for carrying out word identification and speech recognition to current video, at recognition result for meet this object language in the situation that, current video is sent to a playing module as video to be broadcast, in the situation that recognition result is not meet default object language, word in current image frame and current speech are translated as to this object language, and translation result is added in current video, to form video to be broadcast.
This identification module 21 comprises an Audio Processing Unit, a graphics processing unit, a web search unit and a synthesis unit.
This web search unit for searching for the object language voice packet of current video on network according to this label, if search, adopt object language voice packet coupling current speech, to obtain voice to be broadcast, and according to this label, on network, search for the object language captions bag of current video, if search, adopt the word in object language captions bag coupling current image frame, to form picture frame to be broadcast, if search object language voice packet, do not enable this Audio Processing Unit, if search object language captions Bao Ze, do not enable this graphics processing unit, if searching, object language voice packet and object language captions Bao Jun enable this synthesis unit.
This Audio Processing Unit is for being translated as this object language to form voice to be broadcast by current speech, it is this object language by the character translation of current image frame that this graphics processing unit is used for, and the word that translation is obtained is added into corresponding former word, and in current image frame, present position is around, to form picture frame to be broadcast, this synthesis unit is for synthesizing video to be broadcast according to the sequential of voice to be broadcast and picture frame to be broadcast by the two.
Although more than described the specific embodiment of the present invention, it will be understood by those of skill in the art that these only illustrate, protection scope of the present invention is limited by appended claims.Those skilled in the art is not deviating under the prerequisite of principle of the present invention and essence, can make various changes or modifications to these execution modes, but these changes and modification all fall into protection scope of the present invention.
Claims (13)
1. a video language interpretation method, is characterized in that, comprises the following steps:
S
1, a display device obtains current video, current video comprises current image frame and/or current speech;
S
2, this display device communicates by letter with a terminal equipment, and obtain and language be set as object language on this terminal equipment;
S
3, this display device carries out word identification and/or speech recognition to current video, at recognition result for meet this object language in the situation that, using current video as video to be broadcast and perform step S
5, in the situation that recognition result is not meet default object language to perform step S
4;
S
4, this display device is translated as this object language by the word in current image frame and/or current speech, and translation result is added in current video, to form video to be broadcast;
S
5, this display device plays video to be broadcast.
2. video language interpretation method as claimed in claim 1, is characterized in that, current video comprises current image frame and current speech, this step S
4comprise the following steps:
S
41, this display device is translated as this object language to form voice to be broadcast by current speech;
S
42, this display device is this object language by the character translation in current image frame, and the word that translation is obtained be added into corresponding former word in current image frame present position around, to form picture frame to be broadcast;
S
43, this display device synthesizes video to be broadcast according to the sequential of voice to be broadcast and picture frame to be broadcast by the two.
3. video language interpretation method as claimed in claim 2, is characterized in that, current video has a label;
Step S
41for: this display device is searched for the object language voice packet of current video on network according to this label, if search, adopt object language voice packet coupling current speech, to obtain voice to be broadcast, if search, current speech is not translated as to this object language to form voice to be broadcast;
Step S
42for: this display device is searched for the object language captions bag of current video on network according to this label, if search, adopt the word in object language captions bag coupling current image frame, to form picture frame to be broadcast, if do not search, by the character translation in current image frame, be this object language, and the word that translation is obtained be added into corresponding former word in current image frame present position around, to form picture frame to be broadcast.
4. a video language interpretation method, is characterized in that, comprises the following steps:
S
1, a display device obtains current video, current video comprises current image frame and/or current speech;
S
2, this display device transfers to a terminal equipment by current video;
S
3, this terminal equipment read self language is set as object language, and current video is carried out to word identification and/or speech recognition, in the situation that recognition result is to meet default object language, using current video as video to be broadcast and perform step S
5, in the situation that recognition result is not meet default object language to perform step S
4;
S
4, this terminal equipment is translated as this object language by the word in current image frame and/or current speech, and translation result is added in current video, to form video to be broadcast, then video to be broadcast is sent to this display device;
S
5, this display device plays video to be broadcast.
5. video language interpretation method as claimed in claim 4, is characterized in that, current video comprises current image frame and current speech, this step S
4comprise the following steps:
S
41, this terminal equipment is translated as this object language to form voice to be broadcast by current speech;
S
42, this terminal equipment is this object language by the character translation in current image frame, and the word that translation is obtained be added into corresponding former word in current image frame present position around, to form picture frame to be broadcast;
S
43, this terminal equipment synthesizes video to be broadcast according to the sequential of voice to be broadcast and picture frame to be broadcast by the two;
S
44, this terminal equipment is sent to this display device by video to be broadcast.
6. video language interpretation method as claimed in claim 5, is characterized in that, current video has a label;
Step S
41for: this terminal equipment is searched for the object language voice packet of current video on network according to this label, if search, adopt object language voice packet coupling current speech, to obtain voice to be broadcast, if search, current speech is not translated as to this object language to form voice to be broadcast;
Step S
42for: this terminal equipment is searched for the object language captions bag of current video on network according to this label, if search, adopt the word in object language captions bag coupling current image frame, to form picture frame to be broadcast, if do not search, by the character translation in current image frame, be this object language, and the word that translation is obtained be added into corresponding former word in current image frame present position around, to form picture frame to be broadcast.
7. an intelligent display device, is characterized in that, comprising:
One video acquiring module, for obtaining current video, current video comprises current image frame and/or current speech;
One object language module, for communicating by letter with terminal equipment, and obtains and language is set as object language on terminal equipment;
One identification module, for current video is carried out to word identification and/or speech recognition, at recognition result for meet this object language in the situation that, current video is sent to a playing module as video to be broadcast, in the situation that recognition result is not meet default object language, word in current image frame and/or current speech are translated as to this object language, and translation result is added in current video, to form video to be broadcast;
This playing module, for playing video to be broadcast.
8. intelligent display device as claimed in claim 7, is characterized in that, current video comprises current image frame and current speech, and this identification module comprises an Audio Processing Unit, a graphics processing unit and a synthesis unit;
This Audio Processing Unit is for being translated as this object language to form voice to be broadcast by current speech, it is this object language by the character translation of current image frame that this graphics processing unit is used for, and the word that translation is obtained is added into corresponding former word, and in current image frame, present position is around, to form picture frame to be broadcast, this synthesis unit is for synthesizing video to be broadcast according to the sequential of voice to be broadcast and picture frame to be broadcast by the two.
9. intelligent display device as claimed in claim 8, is characterized in that, current video has a label, and this identification module also comprises a web search unit;
This web search unit for searching for the object language voice packet of current video on network according to this label, if search, adopt object language voice packet coupling current speech, to obtain voice to be broadcast, and according to this label, on network, search for the object language captions bag of current video, if search, adopt the word in object language captions bag coupling current image frame, to form picture frame to be broadcast, if search object language voice packet, do not enable this Audio Processing Unit, if search object language captions Bao Ze, do not enable this graphics processing unit, if searching, object language voice packet and object language captions Bao Jun enable this synthesis unit.
10. a video translation system, is characterized in that, comprises a display device and a terminal equipment;
This display device comprises a video acquiring module, a transport module and a playing module, this video acquiring module is used for obtaining current video, current video comprises current image frame and/or current speech, this transport module is for transferring to current video this terminal equipment and receive video to be broadcast from this terminal equipment, and this playing module is used for playing video to be broadcast:
This terminal equipment comprises a transport module and an identification module, this transport module be used for and this display device between video receiving, this identification module is for carrying out word identification and/or speech recognition to current video, at recognition result for meet this object language in the situation that, current video is sent to a playing module as video to be broadcast, in the situation that recognition result is not meet default object language, word in current image frame and/or current speech are translated as to this object language, and translation result is added in current video, to form video to be broadcast.
11. video translation systems as claimed in claim 10, is characterized in that, current video comprises current image frame and current speech, and this identification module comprises an Audio Processing Unit, a graphics processing unit and a synthesis unit;
This Audio Processing Unit is for being translated as this object language to form voice to be broadcast by current speech, it is this object language by the character translation of current image frame that this graphics processing unit is used for, and the word that translation is obtained is added into corresponding former word, and in current image frame, present position is around, to form picture frame to be broadcast, this synthesis unit is for synthesizing video to be broadcast according to the sequential of voice to be broadcast and picture frame to be broadcast by the two.
12. video translation systems as claimed in claim 11, is characterized in that, current video has a label, and this identification module also comprises a web search unit;
This web search unit for searching for the object language voice packet of current video on network according to this label, if search, adopt object language voice packet coupling current speech, to obtain voice to be broadcast, and according to this label, on network, search for the object language captions bag of current video, if search, adopt the word in object language captions bag coupling current image frame, to form picture frame to be broadcast, if search object language voice packet, do not enable this Audio Processing Unit, if search object language captions Bao Ze, do not enable this graphics processing unit, if searching, object language voice packet and object language captions Bao Jun enable this synthesis unit.
13. video translation systems as described in any one in claim 10-12, is characterized in that, this display device is intelligent television, and this terminal equipment is smart mobile phone or wearable device.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201410522251.6A CN104219459A (en) | 2014-09-30 | 2014-09-30 | Video language translation method and system and intelligent display device |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201410522251.6A CN104219459A (en) | 2014-09-30 | 2014-09-30 | Video language translation method and system and intelligent display device |
Publications (1)
Publication Number | Publication Date |
---|---|
CN104219459A true CN104219459A (en) | 2014-12-17 |
Family
ID=52100552
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201410522251.6A Pending CN104219459A (en) | 2014-09-30 | 2014-09-30 | Video language translation method and system and intelligent display device |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN104219459A (en) |
Cited By (22)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104936015A (en) * | 2015-06-24 | 2015-09-23 | 冯旋宇 | Set top box language control method and system |
CN106340291A (en) * | 2016-09-27 | 2017-01-18 | 广东小天才科技有限公司 | Bilingual subtitle production method and system |
WO2017008241A1 (en) * | 2015-07-14 | 2017-01-19 | 张阳 | Subtitle control method and system for ktv song selection system |
CN107484002A (en) * | 2017-08-25 | 2017-12-15 | 四川长虹电器股份有限公司 | The method of intelligent translation captions |
CN108259750A (en) * | 2018-02-07 | 2018-07-06 | 商丘职业技术学院 | Course recording system, video process apparatus and record class method based on fixed seat in the plane |
CN108764967A (en) * | 2018-04-28 | 2018-11-06 | 北京鸿途信达科技股份有限公司 | Internet advertising generation method and device |
CN109255130A (en) * | 2018-07-17 | 2019-01-22 | 北京赛思美科技术有限公司 | A kind of method, system and the equipment of language translation and study based on artificial intelligence |
CN109472035A (en) * | 2018-11-12 | 2019-03-15 | 深圳市友杰智新科技有限公司 | Switch the control system and method for interpretive scheme |
CN110648553A (en) * | 2019-09-26 | 2020-01-03 | 北京声智科技有限公司 | Site reminding method, electronic equipment and computer readable storage medium |
CN110765787A (en) * | 2019-10-21 | 2020-02-07 | 深圳传音控股股份有限公司 | Information interaction real-time translation method, medium and terminal |
US10652622B2 (en) | 2017-06-27 | 2020-05-12 | At&T Intellectual Property I, L.P. | Method and apparatus for providing content based upon a selected language |
CN111356025A (en) * | 2018-12-24 | 2020-06-30 | 深圳Tcl新技术有限公司 | Multi-subtitle display method, intelligent terminal and storage medium |
CN111448527A (en) * | 2020-01-14 | 2020-07-24 | 深圳市元征科技股份有限公司 | Vehicle diagnosis process playback method and device and readable storage medium |
CN111986656A (en) * | 2020-08-31 | 2020-11-24 | 上海松鼠课堂人工智能科技有限公司 | Teaching video automatic caption processing method and system |
CN112163102A (en) * | 2020-09-29 | 2021-01-01 | 北京字跳网络技术有限公司 | Search content matching method and device, electronic equipment and storage medium |
CN112584209A (en) * | 2020-12-04 | 2021-03-30 | 深圳创维-Rgb电子有限公司 | Display method, display device, storage medium and smart television |
WO2021057957A1 (en) * | 2019-09-27 | 2021-04-01 | 深圳市万普拉斯科技有限公司 | Video call method and apparatus, computer device and storage medium |
CN114257862A (en) * | 2020-09-24 | 2022-03-29 | 北京字跳网络技术有限公司 | Video generation method, device, equipment and storage medium |
CN115484477A (en) * | 2021-05-31 | 2022-12-16 | 上海哔哩哔哩科技有限公司 | Subtitle generating method and device |
CN115484478A (en) * | 2021-05-31 | 2022-12-16 | 上海哔哩哔哩科技有限公司 | Subtitle processing method and device |
WO2023097446A1 (en) * | 2021-11-30 | 2023-06-08 | 深圳传音控股股份有限公司 | Video processing method, smart terminal, and storage medium |
CN114257862B (en) * | 2020-09-24 | 2024-05-14 | 北京字跳网络技术有限公司 | Video generation method, device, equipment and storage medium |
Citations (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20080066138A1 (en) * | 2006-09-13 | 2008-03-13 | Nortel Networks Limited | Closed captioning language translation |
CN101472104A (en) * | 2007-12-26 | 2009-07-01 | 英业达股份有限公司 | Video signal display device with translation function |
CN101924863A (en) * | 2010-05-21 | 2010-12-22 | 中山大学 | Digital television equipment |
CN103067775A (en) * | 2013-01-28 | 2013-04-24 | Tcl集团股份有限公司 | Subtitle display method for audio/video terminal, audio/video terminal and server |
CN103106190A (en) * | 2011-11-09 | 2013-05-15 | 财团法人资讯工业策进会 | Instant translation system and method for digital television |
CN103226947A (en) * | 2013-03-27 | 2013-07-31 | 广东欧珀移动通信有限公司 | Mobile terminal-based audio processing method and device |
CN103488630A (en) * | 2013-09-29 | 2014-01-01 | 小米科技有限责任公司 | Method, device and terminal for processing picture |
US20140053171A1 (en) * | 2006-04-14 | 2014-02-20 | At&T Intellectual Property Ii, L.P. | On-Demand Language Translation for Television Programs |
CN103634645A (en) * | 2013-12-10 | 2014-03-12 | 青岛海尔软件有限公司 | Method for controlling smart television by using mobile phone |
-
2014
- 2014-09-30 CN CN201410522251.6A patent/CN104219459A/en active Pending
Patent Citations (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20140053171A1 (en) * | 2006-04-14 | 2014-02-20 | At&T Intellectual Property Ii, L.P. | On-Demand Language Translation for Television Programs |
US20080066138A1 (en) * | 2006-09-13 | 2008-03-13 | Nortel Networks Limited | Closed captioning language translation |
CN101472104A (en) * | 2007-12-26 | 2009-07-01 | 英业达股份有限公司 | Video signal display device with translation function |
CN101924863A (en) * | 2010-05-21 | 2010-12-22 | 中山大学 | Digital television equipment |
CN103106190A (en) * | 2011-11-09 | 2013-05-15 | 财团法人资讯工业策进会 | Instant translation system and method for digital television |
CN103067775A (en) * | 2013-01-28 | 2013-04-24 | Tcl集团股份有限公司 | Subtitle display method for audio/video terminal, audio/video terminal and server |
CN103226947A (en) * | 2013-03-27 | 2013-07-31 | 广东欧珀移动通信有限公司 | Mobile terminal-based audio processing method and device |
CN103488630A (en) * | 2013-09-29 | 2014-01-01 | 小米科技有限责任公司 | Method, device and terminal for processing picture |
CN103634645A (en) * | 2013-12-10 | 2014-03-12 | 青岛海尔软件有限公司 | Method for controlling smart television by using mobile phone |
Cited By (24)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104936015A (en) * | 2015-06-24 | 2015-09-23 | 冯旋宇 | Set top box language control method and system |
WO2017008241A1 (en) * | 2015-07-14 | 2017-01-19 | 张阳 | Subtitle control method and system for ktv song selection system |
CN106340291A (en) * | 2016-09-27 | 2017-01-18 | 广东小天才科技有限公司 | Bilingual subtitle production method and system |
US10652622B2 (en) | 2017-06-27 | 2020-05-12 | At&T Intellectual Property I, L.P. | Method and apparatus for providing content based upon a selected language |
CN107484002A (en) * | 2017-08-25 | 2017-12-15 | 四川长虹电器股份有限公司 | The method of intelligent translation captions |
CN108259750A (en) * | 2018-02-07 | 2018-07-06 | 商丘职业技术学院 | Course recording system, video process apparatus and record class method based on fixed seat in the plane |
CN108764967A (en) * | 2018-04-28 | 2018-11-06 | 北京鸿途信达科技股份有限公司 | Internet advertising generation method and device |
CN109255130A (en) * | 2018-07-17 | 2019-01-22 | 北京赛思美科技术有限公司 | A kind of method, system and the equipment of language translation and study based on artificial intelligence |
CN109472035A (en) * | 2018-11-12 | 2019-03-15 | 深圳市友杰智新科技有限公司 | Switch the control system and method for interpretive scheme |
CN109472035B (en) * | 2018-11-12 | 2023-05-09 | 深圳市友杰智新科技有限公司 | Control system and method for switching translation mode |
CN111356025A (en) * | 2018-12-24 | 2020-06-30 | 深圳Tcl新技术有限公司 | Multi-subtitle display method, intelligent terminal and storage medium |
CN110648553A (en) * | 2019-09-26 | 2020-01-03 | 北京声智科技有限公司 | Site reminding method, electronic equipment and computer readable storage medium |
CN110648553B (en) * | 2019-09-26 | 2021-05-28 | 北京声智科技有限公司 | Site reminding method, electronic equipment and computer readable storage medium |
WO2021057957A1 (en) * | 2019-09-27 | 2021-04-01 | 深圳市万普拉斯科技有限公司 | Video call method and apparatus, computer device and storage medium |
CN110765787A (en) * | 2019-10-21 | 2020-02-07 | 深圳传音控股股份有限公司 | Information interaction real-time translation method, medium and terminal |
CN111448527A (en) * | 2020-01-14 | 2020-07-24 | 深圳市元征科技股份有限公司 | Vehicle diagnosis process playback method and device and readable storage medium |
CN111986656A (en) * | 2020-08-31 | 2020-11-24 | 上海松鼠课堂人工智能科技有限公司 | Teaching video automatic caption processing method and system |
CN114257862A (en) * | 2020-09-24 | 2022-03-29 | 北京字跳网络技术有限公司 | Video generation method, device, equipment and storage medium |
CN114257862B (en) * | 2020-09-24 | 2024-05-14 | 北京字跳网络技术有限公司 | Video generation method, device, equipment and storage medium |
CN112163102A (en) * | 2020-09-29 | 2021-01-01 | 北京字跳网络技术有限公司 | Search content matching method and device, electronic equipment and storage medium |
CN112584209A (en) * | 2020-12-04 | 2021-03-30 | 深圳创维-Rgb电子有限公司 | Display method, display device, storage medium and smart television |
CN115484477A (en) * | 2021-05-31 | 2022-12-16 | 上海哔哩哔哩科技有限公司 | Subtitle generating method and device |
CN115484478A (en) * | 2021-05-31 | 2022-12-16 | 上海哔哩哔哩科技有限公司 | Subtitle processing method and device |
WO2023097446A1 (en) * | 2021-11-30 | 2023-06-08 | 深圳传音控股股份有限公司 | Video processing method, smart terminal, and storage medium |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN104219459A (en) | Video language translation method and system and intelligent display device | |
US8656281B2 (en) | Information processing apparatus, information processing method, information processing system, and program | |
US20150113558A1 (en) | Receiver apparatus, broadcast/communication-cooperation system, and broadcast/communication-cooperation method | |
CN109729420A (en) | Image processing method and device, mobile terminal and computer readable storage medium | |
US11227620B2 (en) | Information processing apparatus and information processing method | |
US20080066097A1 (en) | Method Of Realizing Interactive Advertisement Under Digital Braodcasting Environment By Extending Program Associated Data-Broadcasting To Internet Area | |
KR101293301B1 (en) | System and method for serching images using caption of moving picture in keyword | |
CN102291615B (en) | Television program precision searching and detail viewing device and method based on one-way network | |
CN105992041A (en) | Method and device for encoding a captured screenshot and controlling program content switching based on the captured screenshot | |
CN105744346A (en) | Caption switching method and device | |
CN102088631B (en) | Live and demand broadcast method of digital television (TV) programs as well as related device and system | |
EP3142373A1 (en) | Channel classification method and device | |
CN108810580B (en) | Media content pushing method and device | |
CN102193794A (en) | Linking real time media context to related applications and services | |
JP4774462B2 (en) | Reception device, reception method, and reception program | |
CN105847946A (en) | Screen transmission video processing method | |
TWI439123B (en) | Set-top box and method for searching characters thereof | |
CN105872728A (en) | Screen transfer video processing method for multi-screen interaction | |
CN103477653A (en) | Supplying apparatus, supplying method, receiving apparatus, receiving method, program, and broadcasting system | |
CN102595232B (en) | Relative information search method of digital television programs and digital television receiving terminal | |
CN105847971A (en) | Method for processing screen transmission video | |
CN103501457B (en) | The method and apparatus that a kind of program is play | |
CN103686336A (en) | Video playing control method and device | |
CN107465946B (en) | Video playing method, device, system and terminal equipment | |
CN108363770A (en) | A kind of set-top box supports multipath extraction keyword and the method and system of search |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20141217 |
|
RJ01 | Rejection of invention patent application after publication |