CN103297710A - Audio and video recorded broadcast device capable of marking Chinese and foreign language subtitles automatically in real time for Chinese - Google Patents

Audio and video recorded broadcast device capable of marking Chinese and foreign language subtitles automatically in real time for Chinese Download PDF

Info

Publication number
CN103297710A
CN103297710A CN2013102435501A CN201310243550A CN103297710A CN 103297710 A CN103297710 A CN 103297710A CN 2013102435501 A CN2013102435501 A CN 2013102435501A CN 201310243550 A CN201310243550 A CN 201310243550A CN 103297710 A CN103297710 A CN 103297710A
Authority
CN
China
Prior art keywords
chinese
module
audio
foreign language
video
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN2013102435501A
Other languages
Chinese (zh)
Other versions
CN103297710B (en
Inventor
不公告发明人
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
QINGHAI HANLA INFORMATION TECHNOLOGY CO., LTD.
Original Assignee
Jiangsu Huayin Information Science & Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Jiangsu Huayin Information Science & Technology Co Ltd filed Critical Jiangsu Huayin Information Science & Technology Co Ltd
Priority to CN201310243550.1A priority Critical patent/CN103297710B/en
Publication of CN103297710A publication Critical patent/CN103297710A/en
Application granted granted Critical
Publication of CN103297710B publication Critical patent/CN103297710B/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Abstract

The invention discloses an audio and video recorded broadcast device capable of marking Chinese and foreign language subtitles automatically in real time for Chinese and belongs to the technical field of speech and image data processing devices. According to the technical scheme, the audio and video recorded broadcast device comprises a microphone and camera module 1, an audio and video synchronizing signal marking module 2, an audio language audio signal extraction module 3, a Chinese speech recognition module 4, a Chinese-to-foreign-language machine translation module 5, a video frame or image frame subtitle overlaying machine module 6, an audio and video coding and compression module 7, a network transmission module 8, a server module 9 with audio and video decoding and decompression software, a network transmission module 10 and a client module 11 with audio and video broadcasting software. By the aid of the audio and video recorded broadcast device, Chinese can be well spread throughout the world, and the communication between the Chinese culture and the world culture is facilitated.

Description

Foreign language caption phonotape and videotape recorded broadcast equipment during Chinese marks automatically in real time
Technical field
The technical program belongs to voice and image-data processing apparatus technical field.
Background technology
The Chinese character of the present sound image data of Chinese on the market or foreign language or its contrast subtitle superposition, generally convert the Chinese in the sound image data of Chinese to Chinese character or foreign language by manual type, giving Chinese character or the foreign language caption that video pictures or image frame subtitle superposition machine will express the Chinese meaning again is superimposed upon on video pictures or the image frame, owing to there are a large amount of real-time or non real-time sound image datas of Chinese, comprise sound image datas such as telerecording and film, therefore, can be very time-consuming and be difficult to accomplish real-time Transmission if depend merely on the mode that adopts artificial conversion, appearance along with the sound image technology of numeral, particularly computer system occurs for the treatment of the technology of video image data, more and more need to have a kind of technology appearance that can in real time the phonotape and videotape of Chinese speech be converted to foreign language caption in Chinese speech and the filling automatically, and this technology that can be automatically converts Chinese and middle foreign language caption in real time to according to Chinese speech not only can be moved in the computer system of band hanzi system, can also be or else be to move in the Hesperian computer system of representative with the U.S. with the ASCII character system of 128 characters only with hanzi system, to satisfy increasingly extensive utilization and the cloud computing of the Internet, the world, Internet of Things and the appearance of Chinese language craze all over the world, Chinese and Western culture exchanges the needs of new situations more and more frequently.
Summary of the invention
The proposition of the technical program is exactly in order to solve above-mentioned these problems that occurs.The technical program solves the problem of above-mentioned appearance by the technology that adopts foreign language caption phonotape and videotape recorded broadcast equipment in the automatic mark in real time of following Chinese specifically:
The audio-video recorded broadcast equipment that the technical program adopts comprises: microphone and camara module 1, audio-visual synchronization signal mark module 2, sound language audio signal extraction module 3, Chinese speech identification module 4, the machine translation module 5 of foreign language translated in Chinese, video pictures or image frame subtitle superposition machine module 6, audio/video coding compression module 7, network transmission module 8, band audio/video decoding PKUNZIP server module 9, network transmission module 10, the client modules 11 of band audio frequency and video phonotape and videotape playout software.
Carry out according to the following steps during audio-video recorded broadcast equipment work that the technical program adopts: at the scene during Chinese speech phonotape and videotape recorded broadcast in real time, described recorded broadcast equipment is by microphone and camara module 1, with Chinese speech and field scene typing and be stored in the system of described recorded broadcast equipment, computer in the system is at first carried out the audio signal synchronizing signal mark of the corresponding Chinese sound language of recording by video pictures or image frame in the image data of above-mentioned camara module 1 production and above-mentioned microphone by audio-visual synchronization signal mark module 2 and is stored in the stocking system of audio-video recorded broadcast equipment, to extract by sound language audio signal extraction module 3 with the audio signal of the Chinese sound language of synchronizing signal mark then, after extracting, the audio signal of the Chinese sound language of band synchronizing signal mark passes to the Chinese speech identification module 4 in the computer again, Chinese speech identification module 4 is identified as Chinese speech the Chinese phonetics codes represented of 26 Latin alphabets of usefulness of the identical synchronizing signal mark with the Chinese speech of identifying of band, again the machine translation module 5 of translating into foreign language by Chinese with above-mentioned Chinese phonetics codes translate into represent with 26 Latin alphabets have the foreign language sentence of the appointment of identical synchronizing signal mark with corresponding Chinese phonetics codes sentence, again the Chinese phonetics codes captions of above-mentioned band synchronizing signal mark or foreign language caption or their contrast text subtitles are transferred to existing video pictures or image frame subtitle superposition machine module 6, corresponding relation according to Chinese phonetics codes captions or foreign language caption or their contrast text subtitles and video pictures or image frame synchronizing signal mark is superimposed upon caption information on video pictures or the image frame, and encode and compress by above-mentioned audio/video coding compression module 7, after above-mentioned coding and compression, be transferred to network transmission module 8 again, again by network transmission module 8 will encode with compression after above-mentioned band with identical synchronizing signal mark in video pictures or the image frame of foreign language caption and Chinese speech be transferred to broadband network, broadband network is transferred to it on band audio/video decoding PKUNZIP server module 9 of appointment and stores, the client modules 11 of band audio frequency and video phonotape and videotape playout software by network transmission module 10 log on above-mentioned band audio/video decoding PKUNZIP server module 9 just can watch in real time above-mentioned scene be with in real time in the video image data picture of foreign language caption and Chinese speech.
Machine translation module 5 embedded Chinese characters and the Chinese phonetic alphabet and the Chinese voice code bidirectional modular converter that foreign language translated in above-mentioned Chinese speech identification module 4 and Chinese.
Above-mentioned network transmission module 8 or network transmission module 10, can be that the cable network transport module also can be 3G, 4G, wifi, wimax, blue tooth radio network transport module, when adopting the cable network transport module, above-mentioned broadband network is wired broadband network, when adopting wireless network transmission module, above-mentioned broadband network is wireless broadband network.
Above-mentioned Chinese phonetics codes, in the computer of hanzi system, can convert Chinese character, Chinese phonetics codes, the Chinese phonetic alphabet to by described Chinese character and the Chinese phonetic alphabet and Chinese voice code bidirectional modular converter, Chinese character can be separately or Chinese phonetics codes and Chinese character, the Chinese phonetic alphabet, and the foreign language contrast of meaning unanimity shows, stores, output.
Above-mentioned Chinese phonetics codes is to be unit with the word, here regard single Chinese character as monosyllable, according to the phonetic in " Scheme for the Chinese Phonetic Alphabet " of each syllable of forming this word, with and only use 26 Latin alphabets to the initial consonant of the Chinese phonetic alphabet, referral letter, simple or compound vowel of a Chinese syllable, tone is taked to encode earlier and is spelt by the sequential encoding of " the sign indicating number+sign indicating number+rhythm sign indicating number that is situated between+double sound insulation of accent sign indicating number saves symbol " successively, and directly express Chinese information by the coding of the phonetic code that obtains, when direct term syllable code is represented Chinese information, its usage in punctuation is identical with English usage in punctuation, a plurality of syllables of same word will have the space to separate without the space continuous programming code during coding between word and the word.
Because the Chinese phonetics codes that the technical program adopts 26 Latin alphabets to represent is expressed Chinese information, and when direct term syllable code is represented Chinese information, its usage in punctuation is identical with English usage in punctuation, like this with regard to the expression punctuation mark that guaranteed Chinese information interior all in full accord with ASCII character, also namely with ASCII character 100% compatibility, above-mentioned like this Chinese speech identification module, the machine translation module, the phonetic synthesis module is because the Chinese information of handling is used with the on all four Chinese phonetics codes of ASCII character represents, so just make these modules in the computer of ASCII character system, to move, because forming the module of whole system can move in the computer of ASCII character system, therefore, whole system can be moved in the computer of ASCII character system.
Had after the technical program, Chinese information can be in the ASCII of Hanzi internal code system and non-Hanzi internal code system transmits in the computer information system of code system unblockedly and handles, and along with increasingly extensive utilization and the cloud computing of the Internet, the world, Internet of Things and the appearance of Chinese language craze all over the world, make Chinese and the viewing and emulating mutually to exchange and bring great convenience of real-time non real-time image data that with English is the countries in the world of representative, particularly convenient foreign spectators see that by the real time video data limit of China Chinese news limit learns to speak Chinese, Chinese character, the Chinese phonetic alphabet and Chinese phonetics codes, thereby Chinese can extensivelyr be propagated into all over the world better, promote Chinese culture and the mutual of world culture to exchange.
Description of drawings
It is foreign language caption phonotape and videotape recorded broadcast device systems schematic diagram during Chinese of the present invention marks automatically in real time shown in the accompanying drawing
Embodiment
Be further described below in conjunction with the specific embodiment of the present invention of embodiment.
(1) following method is adopted in the coding method of each syllable sound, rhyme, tone of the Chinese phonetics codes that adopts of the technical program:
Annotate: the symbol behind the dash "-" is Chinese phonetic symbols, and the preceding letter of dash "-" is the coding of each syllable sound, rhyme, tone of Chinese of adopting, below all with, below abbreviate the following table of comparisons as code table.
Here it is worthy of note: when keying in the punctuation mark of Chinese phonetics codes and Chinese phonetics codes statement with keyboard, 26 Latin alphabets forming the Chinese phonetics codes coding are identical corresponding fully one by one with 26 letter keys of western language QWERTY keyboard, the punctuation mark key of Chinese phonetics codes statement is also identical corresponding one by one with the punctuation mark key of western language QWERTY keyboard, when input Chinese speech code letter and punctuation mark, the corresponding same keys position that only need impact the western language QWERTY keyboard gets final product.
1, the coded identification of sound sign indicating number adopts the letter character with the initial consonant of Scheme for the Chinese Phonetic Alphabet basically identical, such as the coding form of this sound sign indicating number below adopting:
b—b ; p—p ; m—m ; f—f ; d—d ; t—t ;
n—n ; l—l ; g—g ; k—k ; h—h ;
j—zh, j ; q—ch,q ; x—sh,x ;r—r ;
z—z ; c—c ; s—s ; y—y ; w—w 。
2, Chinese phonetic alphabet referral letter (ü) adopts a letter representation in 26 Latin alphabets, such as the coding form of this sign indicating number that is situated between below adopting:
i—i ; u—u ; y—ü 。
3, the coding of rhythm sign indicating number, to the letter representation of single vowel in 26 Latin alphabets of (ü) employing, other adopts the letter character identical with the Chinese phonetic alphabet, the composite vowel of the Chinese phonetic alphabet can adopt " Scheme for the Chinese Phonetic Alphabet " identical form, also can adopt a consonant to encode, come the simple or compound vowel of a Chinese syllable of the Chinese phonetic alphabet is encoded such as this letter character below adopting:
a—a ; o—o ; e—e ; i—i ; u—u ; y—ü ;
k—ao ; c— ai ; s—an ; x—ou ; w—ei ; n—en ;
z—ua ; l—uo ; b—ang ; d—ong ; p—eng ;
q—ing ; g—ng ; er—er ;
R-i; " when i only pieced together mutually with Chinese Pin Yin pseudonym zh, ch, sh, the i of the Chinese phonetic alphabet represented with the coding r of phonetic code ".That is: the zhi of the Chinese phonetic alphabet, chi, shi represent with jr, qr, the xr of phonetic code respectively.Press two key position inputs of J and R or Q and R or X and R and E and R when jr or qr or xr and the input of er keyboard respectively.
4, the coding of transferring sign indicating number is except adopting a no consonant v of Chinese to represent going up the sound (∨) of the Chinese phonetic alphabet, and other adopts vowel to represent the tone of Chinese, comes the tone of the Chinese phonetic alphabet is encoded such as the letter below adopting:
A---, high and level tone; E-/, rising tone; V-∨, last sound; U-, falling tone;
O-phonetic is not marked tone mark softly, softly.
(2) utilize the Chinese phonetics codes Chinese information of above-mentioned coding to represent to adopt following method:
Be unit with the word, here regard single Chinese character as monosyllable, according to the phonetic in " Scheme for the Chinese Phonetic Alphabet " of each syllable of forming this word, press the sequential encoding of " the sound sign indicating number+sign indicating number+rhythm sign indicating number that is situated between+double sound insulation of accent sign indicating number saves symbol " successively, a plurality of syllables of same word separate write the two or more syllables of a word together without the space, and the coding between word and the word separates with the space, when Chinese information represents to be in the Chinese phonetics codes state, its six kinds of periods, seven kinds of labels adopt and English identical form with the number of dividing a word with a hyphen at the end of a line;
Here owing to regard the independent Chinese character that uses as monosyllable, therefore, the method of Chinese character encoding of the present invention is identical with Chinese single syllable Methods for Coding, adopt the single syllable coding by obtaining the word coding behind the word write the two or more syllables of a word together in the present invention, we will be called phrase by one group of word that several words are formed, the coding of phrase that the present invention adopts is identical with the coding of Chinese sentence, because phrase and Chinese sentence can be represented in word, therefore the coding of the coding of the phrase that adopts in the present invention and Chinese sentence can be realized by the coding of word, and do not need the special coding of a cover formulated in addition in phrase and Chinese sentence, generally when whole sentence entire chapter is the unit representation Chinese information with the word, when understanding, generally do not need to carry out the selection of unisonance words, sound the sentence that can not produce ambiguity in principle, also can not produce ambiguity when expressing with coding.
Be the specific implementation step that example illustrates the technical program with the voice by the Chinese sentence of microphone input below:
At the scene during Chinese speech phonotape and videotape recorded broadcast in real time, described recorded broadcast equipment is by microphone and camara module 1, with Chinese speech and field scene typing and be stored in the system of described recorded broadcast equipment, computer in the system is at first carried out the audio signal synchronizing signal mark of the corresponding Chinese sound language of recording by video pictures or image frame in the image data of above-mentioned camara module 1 production and above-mentioned microphone by audio-visual synchronization signal mark module 2 and is stored in the stocking system of audio-video recorded broadcast equipment, and audio-visual synchronization signal mark module 2 is made the synchronizing signal mark and can be adopted the technology of existing making video pictures or image frame and audio sync timestamp mark to carry out.
To extract by sound language audio signal extraction module 3 with the audio signal of the Chinese sound language of synchronizing signal mark then, the Chinese sound language audio digital signals that extracting method can directly be pressed system stores extracts, also can extract again there being Chinese sound language audio signal to convert the Chinese sound language audio digital signals of system stores to analog signal by the D/A digiverter, more original method is that the mode that Chinese sound language audio signal is play the sound of Chinese sound language by loudspeaker extracts, and does not just enumerate one by one here.
Pass to the Chinese speech identification module 4 in the computer after the audio signal of the Chinese sound language of band synchronizing signal mark extracts again, Chinese speech identification module 4 is identified as the Chinese phonetics codes that 26 Latin alphabets of usefulness of the identical synchronizing signal mark with the Chinese speech of identifying of band are represented with Chinese speech.
When adopting Chinese-voice-code voice identification module 4 to carry out Chinese speech identification, this Chinese speech identification module is with the primitive of Chinese syllable as identification, by searching Chinese syllable sound template and the Chinese speech syllabified code table of comparisons that is stored in advance in the computer system, identify corresponding Chinese syllable phonetic code after the coupling, when importing continuously, voice just obtain continuous Chinese syllable phonetic code string, the above-mentioned Chinese syllable phonetic code that obtains was ganged up the mode of checking thesaurus and carried out by word segmentation, to the multiple segmentation of words, carry out the segmentation of words again after can differentiating according to means such as the contact of Chinese lexical syntactic context and statistical laws, write the two or more syllables of a word together between the syllable of same word and the syllable taked in the word that is syncopated as, and the mode in space is represented between word and the word.
Exemplify the example that Chinese speech is carried out Chinese phonetics codes identification with the inventive method below:
1. Chinese speech converts Chinese phonetics codes to:
Such as: we extract the Chinese speech of the following Chinese sentence in the image data:
" we use Latin every day.”
(1) by searching Chinese syllable sound template and the Chinese speech syllabified code table of comparisons that is stored in advance in the computer system, identify corresponding Chinese syllable phonetic code string after the coupling:
Between Wov mno mwv tisa xrv ydu laa dqa wnv .(syllable and the syllable space is arranged)
Or wo vMn oMw vTis aXr vYd uLa aDq aWn v. (not having the space between syllable and the syllable)
(the schwa symbol o among the skilled back mno can omit when not causing audio mixing, more than below all with.)
Added underscore in order to allow everybody see the letter that will represent tone here clearly, the tool sound insulation joint effect simultaneously of the tone letter in the phonetic code, tone does not have underscore in the actual speech sign indicating number, and tone is held concurrently and can conveniently be distinguished every syllabic sign behind the skilled Chinese phonetics codes.
So just, finished the irrelevant pure speech recognition process of dictionary scale of the complexity of a system and system.
(2) the phonetic code string is carried out the segmentation of words, finally finishing with the word is the phonetic code conversion of unit.
By searching the Chinese phonetics codes word dictionary of the good word of branch that is stored in advance in the computer system, with a plurality of syllable write the two or more syllables of a word together of same word, separate the Chinese phonetics codes that just obtains following our final needs with the space between word and the word:
Wovmno mwvtisa xrvydu laadqawnv.
Again the machine translation module (5) of translating into foreign language by Chinese with above-mentioned Chinese phonetics codes translate into represent with 26 Latin alphabets have the foreign language sentence of the appointment of identical synchronizing signal mark with corresponding Chinese phonetics codes sentence:
Call Chinese and translate into the machine translation module (5) of foreign language, the Chinese information with the Chinese speech representation that obtains converts foreign language to again, is example here with English, to other foreign language too, just differs one here for example.
(annotate: above is the implication of understanding Chinese phonetics codes for convenience with the Chinese character with the Chinese phonetics codes contrast that hereinafter occurs, actually do not occur in that pure ASCII character system is in service, more than below all with)
Chinese information such as the Chinese speech representation that will obtain above:
wovmno mwvtisa xrvydu laadqawnv .
Call Chinese and translate into the machine translation module (5) of foreign language and obtain following translation switch process, finally obtain the english sentence of above-mentioned corresponding Chinese phonetics codes:
1.wovmno the Chinese information of mwvtisa xrvydu laadqawnv .(Chinese speech representation)
We use Latin every day.(Chinese information of representing with Chinese character)
A) Chinese dictionary of looking into the mark word part of speech that is stored in advance in the computer system is set up word part of speech string: (part of sentence in the bracket is part of speech, below all with)
Wovmno(personal pronoun 1)+mwvtisa(time noun 1)+xrvydu(verb 1)+laadqawnv(noun 2).
Our (personal pronoun 1)+every day (time noun 1)+use (verb 1)+Latin (noun 2).
B) look into the table that is stored in advance in the computer system according to the sentence part of speech string that obtains above and be stored in Chinese sentence patterns in the table in advance:
(the sentence element string done of part of speech and this word is formed sentence pattern, below all with)
Wovmno(personal pronoun 1 is made subject)+mwvtisa (time noun 1 is made time adverbial)+xrvydu(verb 1 makes predicate)+laadqawnv (object made in noun 2)
Our (personal pronoun 1 is made subject)+every day (time noun 1 is made time adverbial)+use (predicate made in verb 1)+Latin (object made in noun 2)
C) table look-up according to the Chinese sentence patterns that obtains above and be stored in the English sentence of the correspondence in the table in advance:
Wovmno(personal pronoun 1 is made subject)+xrvydu (predicate made in verb 1)+laadqawnv(noun 2 makes object)+mwvtisa(time noun 1 makes time adverbial)
Our (personal pronoun 1 is made subject)+use (predicate made in verb 1)+Latin (object made in noun 2)
+ every day (time noun 1 is made time adverbial)
Look into the Chinese-English dictionary that is stored in advance in the computer system this moment and carry out the conversion of word or the phrase meaning, and export in proper order by this sentence pattern and just to finish the conversion that English translated in Chinese, but in order to show the amphicheirality of this machine translation process, further conversion below we remake:
D) according to above obtain English sentence and table look-up and be stored in the table and corresponding English word or the consistent part of speech string of phrase part of speech in advance: (this part of speech string also can extract from the object language sentence pattern that obtains and obtain, below all with)
Wovmno(personal pronoun 1)+xrvydu(verb 1)+laadqawnv(noun 2)+mwvtisa(time noun 1).
We (personal pronoun 1)+use (verb 1)+Latin (noun 2)+every day (time noun 1).
E) look into that the Chinese-English dictionary that is stored in advance in the computer system carries out the conversion of word or the phrase meaning and by the order output of top resulting English sentence:
We(personal pronoun 1) every day(time noun 1 latin(noun 2 use(verb 1))).
we use latin every day.
So just, finished the conversion that English translated in Chinese.
Further after obtaining Chinese phonetics codes, when needing, Chinese phonetics codes can convert Chinese character or the Chinese phonetic alphabet to by Chinese character and the Chinese phonetic alphabet and Chinese voice code bidirectional modular converter, this Chinese phonetics codes Chinese character modular converter can be embedded in the Chinese speech identification module 4, this moment, whole system had to operate in the computer of hanzi system, Chinese phonetics codes or Chinese character or the Chinese phonetic alphabet can be separately or Chinese phonetics codes and Chinese character, the Chinese phonetic alphabet, the foreign language contrast of meaning unanimity shows, stores, exports, and detailed process is as follows:
Convert Chinese phonetics codes to Chinese character by calling Chinese phonetics codes Chinese character bi-directional conversion modular computer by following steps:
Can easily Chinese phonetics codes be converted to Chinese character and the Chinese phonetic alphabet by searching Chinese phonetics codes respectively with the Chinese character and the Chinese phonetic alphabet table of comparisons that with the word are unit, such as:
Wovmno is by looking into the sound sign indicating number, sign indicating number is situated between, the rhythm sign indicating number, transfer sign indicating number and the Chinese phonetic alphabet table of comparisons or obtain w ǒ men according to the Chinese phonetics codes syllable of this table of comparisons generation or word and pinyin syllable or the word table of comparisons, finding with the word by w ǒ men is the Chinese character of unit again, when being that the phonetic code of unit is by after to be the Chinese phonetic alphabet of unit with the word with the Chinese character that is unit with the word set up corresponding relation with the word, in case the phonetic code that need be unit with the word can no longer need by being the Chinese phonetic alphabet of unit with the word, directly sets up corresponding relation and carries out corresponding conversion with the Chinese character that with the word is unit.Such as: wovmno can be converted to w ǒ men, can convert " we " to by w ǒ men again, wovmno and " we " have just directly set up corresponding relation like this, can not change by Chinese phonetic alphabet w ǒ men when needing, and directly between wovmno and " us ", realize the bidirectional reversible conversion.
When meeting homonym, carrying out with the word after can differentiating according to means such as the contact of Chinese lexical syntactic context and statistical laws is that the Chinese character of unit is selected.Such as: filled mailbag on the ysvlune.Filled crude oil on the ysvlune.Can know in conjunction with contextual contact: " ysvlune " in one of the front represents cruise, and " ysvlune " in one of the back represents oil tanker, and these two words can convert " having filled mailbag on the cruise " and " having filled crude oil on the oil tanker " respectively to.To other word situation also.
The result of above-mentioned bidirectional reversible conversion both can show separately also can contrast demonstration, such as:
Former sentence: " we use the Chinese character and latin literary composition every day." can reversibly be converted to following several form with the inventive method computer:
1.“Wǒmen měitiān shǐyòng lādīngwěn。”
2.“wovmno mwvtisa xrvydu laadqawnv.”
3.“Wǒmen měitiān shǐyòng lādīngwěn。”
We use Latin every day.
4.“wovmno mwvtisa xrvydu laadqawnv.”
We use Latin every day.
5. “Wǒmen měitiān shǐyòng lādīngwěn。”
“wovmno mwvtisa xrvydu laadqawnv.”
In order to allow the foreigner or Chinese ethnic group more implication and the learning Chinese of ground, aspect understanding Chinese, also can in the word of each contrast, insert corresponding foreign language word or minority language, such as adding the note that corresponding English word is made the Chinese meaning in the word below:
“wovmno Wǒmen mwvtisa měitiān xrvydu shǐyòng laadqawnv lādīngwěn 。”
We We every every day day uses use Latin Latine.
Above following Chinese subtitle or the Chinese captions mentioned in the technical program like this just can be Chinese phonetics codes, Chinese character and the Chinese phonetic alphabet.
Again the Chinese phonetics codes captions of above-mentioned band synchronizing signal mark or foreign language caption or their contrast text subtitles are transferred to traditional video pictures or image frame subtitle superposition machine module 6, corresponding relation according to Chinese phonetics codes captions or foreign language caption or their contrast text subtitles and video pictures or image frame synchronizing signal mark is superimposed upon caption information on video pictures or the image frame, and is synthesized together storage or output synchronously.
We adopt said method to realize Chinese speech real-time imaging data is transformed into the real-time imaging data of foreign language caption in Chinese speech and the filling like this, in like manner also can adopt identical method to realize above process and result to other foreign language, just tire out no longer one by one here and state.
Last and encode through audio/video coding compression module 7 and compress by the real-time imaging data of foreign language caption in the above-mentioned Chinese speech that obtains and the filling, after above-mentioned coding and compression, be transferred to network transmission module 8 again, again by network transmission module 8 will encode with compression after above-mentioned band with identical synchronizing signal mark in video pictures or the image frame of foreign language caption and Chinese speech be transferred to broadband network, broadband network is transferred to it on band audio/video decoding PKUNZIP server module 9 of appointment and stores, the client modules 11 of band audio frequency and video phonotape and videotape playout software logs on above-mentioned band audio/video decoding PKUNZIP server module 9 by network transmission module 10 just can watch the above-mentioned scene video image data picture of foreign language caption and Chinese speech in the band in real time in real time, like this we just by the equipment of present technique finished real-time Chinese speech phonotape and videotape be converted into real-time Chinese speech also annotate in the recorded broadcast process of audio and video files of foreign language datum.
By that analogy, use said method, we can also realize that Chinese stores to the conversion of other foreign language and corresponding captions thereof and with synchronous corresponding video pictures or the synthetic stack of image frame or and can watch Chinese speech after the conversion and the audio and video files picture of middle foreign language caption in real time by described network transmission and server and client.Can download these audio and video files and convert the various forms of being convenient to play to from server when further needing and play for TV station or multimedia machine.
It is worthy of note at last: the machine translation module 5 that foreign language translated in above-mentioned Chinese can adopt a kind of Chinese and foreign language bidirectional reversible machine translation module of using Chinese phonetics codes, and above-mentioned two kinds of machine translation modules all can embedded Chinese character and the Chinese phonetic alphabet and Chinese voice code bidirectional modular converter.
Above-mentioned network transmission module 8 or network transmission module 10, can be that the cable network transport module also can be 3G, 4G, wifi, wimax, blue tooth radio network transport module, when adopting the cable network transport module, above-mentioned broadband network is wired broadband network, when adopting wireless network transmission module, above-mentioned broadband network is wireless broadband network, because above-mentioned network all is prior art, object lesson is not here just tired stating one by one.

Claims (10)

1. foreign language caption phonotape and videotape recorded broadcast equipment during a Chinese marks automatically in real time is characterized in that: comprise microphone and camara module (1), audio-visual synchronization signal mark module (2), sound language audio signal extraction module (3), Chinese speech identification module (4), the machine translation module (5) of foreign language translated in Chinese, video pictures or image frame subtitle superposition machine module (6), audio/video coding compression module (7), network transmission module (8), band audio/video decoding PKUNZIP server module (9), network transmission module (10), the client modules (11) of band audio frequency and video phonotape and videotape playout software.
2. foreign language caption phonotape and videotape recorded broadcast equipment during Chinese as claimed in claim 1 marks automatically in real time, carry out according to the following steps when it is characterized in that the work of this equipment: at the scene during Chinese speech phonotape and videotape recorded broadcast in real time, described recorded broadcast equipment is by microphone and camara module (1), with Chinese speech and field scene typing and be stored in the system of described recorded broadcast equipment, computer in the system is at first carried out the audio signal synchronizing signal mark of the corresponding Chinese sound language of recording by video pictures or image frame in the image data of above-mentioned camara module (1) production and above-mentioned microphone by audio-visual synchronization signal mark module (2) and is stored in the stocking system of audio-video recorded broadcast equipment, to extract by sound language audio signal extraction module (3) with the audio signal of the Chinese sound language of synchronizing signal mark then, after extracting, the audio signal of the Chinese sound language of band synchronizing signal mark passes to the Chinese speech identification module (4) in the computer again, Chinese speech identification module (4) is identified as Chinese speech the Chinese phonetics codes represented of 26 Latin alphabets of usefulness of the identical synchronizing signal mark with the Chinese speech of identifying of band, again the machine translation module (5) of translating into foreign language by Chinese with above-mentioned Chinese phonetics codes translate into represent with 26 Latin alphabets have the foreign language sentence of the appointment of identical synchronizing signal mark with corresponding Chinese phonetics codes sentence, again the Chinese phonetics codes captions of above-mentioned band synchronizing signal mark or foreign language caption or their contrast text subtitles are transferred to existing video pictures or image frame subtitle superposition machine module (6), corresponding relation according to Chinese phonetics codes captions or foreign language caption or their contrast text subtitles and video pictures or image frame synchronizing signal mark is superimposed upon caption information on video pictures or the image frame, and encode and compress by above-mentioned audio/video coding compression module (7), after above-mentioned coding and compression, be transferred to network transmission module (8) again, again by network transmission module (8) will encode with compression after above-mentioned band with identical synchronizing signal mark in video pictures or the image frame of foreign language caption and Chinese speech be transferred to broadband network, broadband network is transferred to it on band audio/video decoding PKUNZIP server module (9) of appointment and stores, the client modules (11) of band audio frequency and video phonotape and videotape playout software by network transmission module (10) log on above-mentioned band audio/video decoding PKUNZIP server module (9) just can watch in real time that above-mentioned scene is real-time and be with in the video image data picture of foreign language caption and Chinese speech.
3. foreign language caption phonotape and videotape recorded broadcast equipment during Chinese as claimed in claim 1 marks automatically in real time, it is characterized in that: the embedded Chinese character of machine translation module (5) and the Chinese phonetic alphabet and the Chinese voice code bidirectional modular converter of foreign language translated in described Chinese speech identification module (4) and Chinese.
4. foreign language caption phonotape and videotape recorded broadcast equipment during Chinese as claimed in claim 1 marks automatically in real time, it is characterized in that: above-mentioned network transmission module (8) or network transmission module (10), can be that the cable network transport module also can be 3G, 4G, wifi, wimax, blue tooth radio network transport module, when adopting the cable network transport module, above-mentioned broadband network is wired broadband network, when adopting wireless network transmission module, above-mentioned broadband network is wireless broadband network.
5. as foreign language caption phonotape and videotape recorded broadcast equipment in claim 2 or the automatic mark in real time of 3 described Chinese, it is characterized in that: described Chinese phonetics codes is to be unit with the word, here regard single Chinese character as monosyllable, according to the phonetic in " Scheme for the Chinese Phonetic Alphabet " of each syllable of forming this word, with and only use 26 Latin alphabets to the initial consonant of the Chinese phonetic alphabet, referral letter, simple or compound vowel of a Chinese syllable, tone is taked to encode earlier and is spelt by the sequential encoding of " the sign indicating number+sign indicating number+rhythm sign indicating number that is situated between+double sound insulation of accent sign indicating number saves symbol " successively, and directly express Chinese information by the coding of the phonetic code that obtains, when direct term syllable code is represented Chinese information, its usage in punctuation is identical with English usage in punctuation, a plurality of syllables of same word will have the space to separate without the space continuous programming code during coding between word and the word.
6. foreign language caption phonotape and videotape recorded broadcast equipment during Chinese as claimed in claim 5 marks automatically in real time, it is characterized in that: described Chinese phonetics codes is that initial consonant is all represented with the consonant Latin alphabet, be used for the initial consonant of phonetic code of expression Chinese information except the initial consonant zh of " Scheme for the Chinese Phonetic Alphabet ", ch, sh uses j respectively, q, outside three consonant Latin alphabets of x are represented, remaining initial consonant use with " Scheme for the Chinese Phonetic Alphabet " in the consonant Latin alphabet of same-sign represent, zhi in " Scheme for the Chinese Phonetic Alphabet ", chi, shi uses the jr of phonetic code respectively, qr, xr represents, er in " Scheme for the Chinese Phonetic Alphabet " represents with the er of phonetic code, presses two key position inputs of J and R or Q and R or X and R and E and R when jr or qr or xr and the input of er keyboard respectively.
7. foreign language caption phonotape and videotape recorded broadcast equipment during Chinese as claimed in claim 5 marks automatically in real time, it is characterized in that: described Chinese phonetics codes is represented single vowel in " Scheme for the Chinese Phonetic Alphabet " originally and the ü in the referral letter with an alphabetical y in 26 letters, the coding of all the other single vowels and referral letter adopt with " Scheme for the Chinese Phonetic Alphabet " in the single vowel symbol identical with referral letter.
8. the automatic in real time foreign language caption phonotape and videotape recorded broadcast equipment in the mark of Chinese as claimed in claim 5 is characterized in that: described Chinese phonetics codes composite vowel except use with " Scheme for the Chinese Phonetic Alphabet " in identical symbolic representation, represent with a consonant.
9. foreign language caption phonotape and videotape recorded broadcast equipment during Chinese as claimed in claim 5 marks automatically in real time, it is characterized in that: described Chinese phonetics codes it transfer sign indicating number to represent with four vowels and the no alphabetical v of Chinese, with Latin alphabet a, e, v, u, o represent respectively in " Scheme for the Chinese Phonetic Alphabet " high and level tone-, rising tone e :/, last v: ∨, falling tone u:, o does not mark softly.
10. as foreign language caption phonotape and videotape recorded broadcast equipment in claim 2 or the automatic mark in real time of 3 described Chinese, it is characterized in that: described Chinese phonetics codes, in the computer of hanzi system, can convert Chinese character, Chinese phonetics codes, the Chinese phonetic alphabet to by described Chinese character and the Chinese phonetic alphabet and Chinese voice code bidirectional modular converter, Chinese character can be separately or Chinese phonetics codes and Chinese character, the Chinese phonetic alphabet, and the foreign language contrast of meaning unanimity shows, stores, output.
CN201310243550.1A 2013-06-19 2013-06-19 Chinese mark the most in real time in foreign language caption phonotape and videotape recorded broadcast equipment Expired - Fee Related CN103297710B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201310243550.1A CN103297710B (en) 2013-06-19 2013-06-19 Chinese mark the most in real time in foreign language caption phonotape and videotape recorded broadcast equipment

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201310243550.1A CN103297710B (en) 2013-06-19 2013-06-19 Chinese mark the most in real time in foreign language caption phonotape and videotape recorded broadcast equipment

Publications (2)

Publication Number Publication Date
CN103297710A true CN103297710A (en) 2013-09-11
CN103297710B CN103297710B (en) 2016-08-17

Family

ID=49097961

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201310243550.1A Expired - Fee Related CN103297710B (en) 2013-06-19 2013-06-19 Chinese mark the most in real time in foreign language caption phonotape and videotape recorded broadcast equipment

Country Status (1)

Country Link
CN (1) CN103297710B (en)

Cited By (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104333817A (en) * 2014-11-07 2015-02-04 重庆晋才富熙科技有限公司 Method for quickly marking video
CN104698998A (en) * 2013-12-05 2015-06-10 上海能感物联网有限公司 Robot system under Chinese speech field control
CN104916285A (en) * 2014-10-13 2015-09-16 江苏华音信息科技有限公司 Full-automatic foreign language voice field control automobile driving controller apparatus
CN106340294A (en) * 2016-09-29 2017-01-18 安徽声讯信息技术有限公司 Synchronous translation-based news live streaming subtitle on-line production system
CN107154173A (en) * 2017-04-06 2017-09-12 苏州爱灵格教育科技有限公司 A kind of interactive learning methods and system
CN107316642A (en) * 2017-06-30 2017-11-03 联想(北京)有限公司 Video file method for recording, audio file method for recording and mobile terminal
CN108597497A (en) * 2018-04-03 2018-09-28 中译语通科技股份有限公司 A kind of accurate synchronization system of subtitle language and method, information data processing terminal
CN109448466A (en) * 2019-01-08 2019-03-08 上海健坤教育科技有限公司 The learning method of too many levels training mode based on video teaching
CN110727854A (en) * 2019-08-21 2020-01-24 北京奇艺世纪科技有限公司 Data processing method and device, electronic equipment and computer readable storage medium
CN114007116A (en) * 2022-01-05 2022-02-01 凯新创达(深圳)科技发展有限公司 Video processing method and video processing device

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101118540A (en) * 2006-08-02 2008-02-06 苗玉水 Chinese characters phonetic and Chinese voice code bidirectional reversible transform method
CN101118541A (en) * 2006-08-03 2008-02-06 苗玉水 Chinese-voice-code voice recognizing method
CN101131689A (en) * 2006-08-22 2008-02-27 苗玉水 Bidirectional mechanical translation method for sentence pattern conversion between Chinese language and foreign language
US20100278507A1 (en) * 2009-04-30 2010-11-04 Mitac International Corp. Subtitle Generation System and Method Thereof
CN103309855A (en) * 2013-06-18 2013-09-18 江苏华音信息科技有限公司 Audio-video recording and broadcasting device capable of translating speeches and marking subtitles automatically in real time for Chinese and foreign languages
CN103902530A (en) * 2012-12-30 2014-07-02 上海能感物联网有限公司 Audio and video recording and broadcasting method for automatically annotating Chinese and foreign language subtitles in Chinese in real time

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101118540A (en) * 2006-08-02 2008-02-06 苗玉水 Chinese characters phonetic and Chinese voice code bidirectional reversible transform method
CN101118541A (en) * 2006-08-03 2008-02-06 苗玉水 Chinese-voice-code voice recognizing method
CN101131689A (en) * 2006-08-22 2008-02-27 苗玉水 Bidirectional mechanical translation method for sentence pattern conversion between Chinese language and foreign language
US20100278507A1 (en) * 2009-04-30 2010-11-04 Mitac International Corp. Subtitle Generation System and Method Thereof
CN103902530A (en) * 2012-12-30 2014-07-02 上海能感物联网有限公司 Audio and video recording and broadcasting method for automatically annotating Chinese and foreign language subtitles in Chinese in real time
CN103309855A (en) * 2013-06-18 2013-09-18 江苏华音信息科技有限公司 Audio-video recording and broadcasting device capable of translating speeches and marking subtitles automatically in real time for Chinese and foreign languages

Cited By (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104698998A (en) * 2013-12-05 2015-06-10 上海能感物联网有限公司 Robot system under Chinese speech field control
CN104916285A (en) * 2014-10-13 2015-09-16 江苏华音信息科技有限公司 Full-automatic foreign language voice field control automobile driving controller apparatus
CN104333817A (en) * 2014-11-07 2015-02-04 重庆晋才富熙科技有限公司 Method for quickly marking video
CN106340294A (en) * 2016-09-29 2017-01-18 安徽声讯信息技术有限公司 Synchronous translation-based news live streaming subtitle on-line production system
CN107154173A (en) * 2017-04-06 2017-09-12 苏州爱灵格教育科技有限公司 A kind of interactive learning methods and system
WO2019000721A1 (en) * 2017-06-30 2019-01-03 联想(北京)有限公司 Video file recording method, audio file recording method, and mobile terminal
CN107316642A (en) * 2017-06-30 2017-11-03 联想(北京)有限公司 Video file method for recording, audio file method for recording and mobile terminal
CN108597497A (en) * 2018-04-03 2018-09-28 中译语通科技股份有限公司 A kind of accurate synchronization system of subtitle language and method, information data processing terminal
CN108597497B (en) * 2018-04-03 2020-09-08 中译语通科技股份有限公司 Subtitle voice accurate synchronization system and method and information data processing terminal
CN109448466A (en) * 2019-01-08 2019-03-08 上海健坤教育科技有限公司 The learning method of too many levels training mode based on video teaching
CN110727854A (en) * 2019-08-21 2020-01-24 北京奇艺世纪科技有限公司 Data processing method and device, electronic equipment and computer readable storage medium
CN110727854B (en) * 2019-08-21 2022-07-12 北京奇艺世纪科技有限公司 Data processing method and device, electronic equipment and computer readable storage medium
CN114007116A (en) * 2022-01-05 2022-02-01 凯新创达(深圳)科技发展有限公司 Video processing method and video processing device

Also Published As

Publication number Publication date
CN103297710B (en) 2016-08-17

Similar Documents

Publication Publication Date Title
CN103297710B (en) Chinese mark the most in real time in foreign language caption phonotape and videotape recorded broadcast equipment
CN103309855A (en) Audio-video recording and broadcasting device capable of translating speeches and marking subtitles automatically in real time for Chinese and foreign languages
CN102479208B (en) The various webpage information search transition translation of Chinese phonetics codes method
CN101118541B (en) Chinese-voice-code voice recognizing method
CN111968649A (en) Subtitle correction method, subtitle display method, device, equipment and medium
CN101482975A (en) Method and apparatus for converting words into animation
CN105159870A (en) Processing system for precisely completing continuous natural speech textualization and method for precisely completing continuous natural speech textualization
CN101923858A (en) Real-time and synchronous mutual translation voice terminal
Dreuw et al. SignSpeak-understanding, recognition, and translation of sign languages
CN103902531A (en) Audio and video recording and broadcasting method for Chinese and foreign language automatic real-time voice translation and subtitle annotation
CN103902529A (en) Audio-video recording and broadcasting method capable of automatically annotating with Chinese and foreign language subtitles for foreign languages
CN101118540A (en) Chinese characters phonetic and Chinese voice code bidirectional reversible transform method
CN101123089B (en) Voice mixing method for Chinese voice code
CN103854648A (en) Chinese and foreign language voiced image data bidirectional reversible voice converting and subtitle labeling method
CN103646645A (en) Method based on voice translation text output
CN103853709A (en) Method for automatically adding Chinese/foreign language subtitles for Chinese voiced image materials by computer
CN110851564B (en) Voice data processing method and related device
CN103297709A (en) Device for adding Chinese subtitles to Chinese audio video data
CN103905743A (en) Phonotape and videotape recording and broadcasting method for automatic and real-time Chinese subtitles labeling with Chinese language
CN103853705A (en) Real-time voice subtitle translation method of Chinese voice and foreign language voice of computer
CN103297711A (en) Recorded broadcast device capable of marking Chinese subtitles automatically in real time for Chinese
CN103902530A (en) Audio and video recording and broadcasting method for automatically annotating Chinese and foreign language subtitles in Chinese in real time
CN103164396A (en) Chinese-Uygur language-Kazakh-Kirgiz language electronic dictionary and automatic translating Chinese-Uygur language-Kazakh-Kirgiz language method thereof
CN103854647A (en) Chinese-foreign-language bidirectional real time voice translation wireless mobile communication device
CN100458668C (en) Input method for Chinese character of first pronunciation

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C41 Transfer of patent application or patent right or utility model
TA01 Transfer of patent application right

Effective date of registration: 20160719

Address after: 810003 Qinghai city of Xining province Qinghai Biotechnology Industrial Park by the four Road No. 26 building 510 room hatch

Applicant after: QINGHAI HANLA INFORMATION TECHNOLOGY CO., LTD.

Address before: Taicang City, Suzhou City, Jiangsu Province, and 215411 Metro Jianxiong Road No. 20

Applicant before: Jiangsu Huayin Information Science & Technology Co., Ltd.

C14 Grant of patent or utility model
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20160817

Termination date: 20200619