Comprise the information storage medium that is used for multilingual caption data and the equipment thereof that use text data and downloadable fonts
Technical field
The present invention relates to a kind of record thereon and be used to use text data and downloadable fonts to support the information storage medium of multilingual captions, and equipment.
Background technology
Conventional digital multi-usage CD (DVD) uses bitmap images as captions.The caption data of bitmap images is nondestructively encoded and is recorded on the DVD that can write down 32 kinds of captions thereon at most.
Now, the data structure of the video data on the DVD of one of several types as traditional multimedia information storage medium will be explained.
Fig. 1 is the diagrammatic sketch of the data structure of DVD.
With reference to Fig. 1, be divided into VMG zone and a plurality of VTS zone as the disk space of the DVD of multimedia storage medium.Heading message and be stored in the VMG zone about the information of title menu, and be stored in a plurality of VTS zone about the information of title.The VMG zone comprises that 2 to 3 files and each VTS zone comprise 3 to 12 files.
Fig. 2 is the detailed view in VMG zone.
With reference to Fig. 2, the VMG zone comprises: storage is about the VMGI zone of the additional information of VMG; Storage is about the VOBS zone of the video information (object video) of menu; With the backup zone that is used for VMGI.These zones exist as file and between them the VOBS zone exist for optional.
In the VTS zone, about being stored as the information of the title of reproduction units with about information as the VOBS of video data.In a VTS, at least one title is recorded.
Fig. 3 is the detailed view in VTS zone.
With reference to Fig. 3, the VTS zone comprises: video title set information (VTSI), as the VOBS of the video data of menu screen, as the VOBS of the video data of video title set and the Backup Data of VTSI.Be used for the display menu screen VOBS exist for optional.Each VOBS is divided into VOB and unit (cell) as record cell once more.A VOB comprises a plurality of unit.The all time low unit of mentioning in the present invention is the unit.
Fig. 4 is the detailed view as the VOBS of video data.
With reference to Fig. 4, a VOBS comprises a plurality of VOB, and a VOB comprises a plurality of unit.The unit comprises a plurality of VOBU.VOBU is coded data according to Motion Picture Experts Group's (MPEG) method of using the encoding moving pictures in DVD.According to the MPEG method, because image is by the space-time compressed encoding, so for to picture decoding, need previous or successive image.Therefore, for the random access function of supporting to begin from the optional position, each predetermined image is carried out the intraframe coding (intra encoding) that does not need previous or successive image by its reproduction.This image is called as interior picture of frame or the I picture among the MPEG, and the image between I picture and next I picture is called as image sets (GOP).Usually, GOP comprises 12 to 15 pictures.
MPEG definition is used for video data and voice data are encapsulated as the system coding (ISO/IEC13818-1) of a bit stream.Two kinds of multiplexing methods of this system coding definition, these two kinds of multiplexing methods comprise: program stream (PS) multiplexing method, be applicable to produce a program and with this program storage in information storage medium; With the transmission flow multiplex method, it is suitable for producing and transmitting a plurality of programs.In these methods, DVD adopts the PS coding method.According to the PS coding method, video data and voice data be that unit is cut apart and is re-used by the time division (time division) of wrapping with bag (PCK) respectively.Except that being named as private data stream (private stream) by the video of MPEG definition and the data the voice data and also being included among the PCK so that these data can be re-used with the Voice ﹠ Video data.
VOBU comprises a plurality of PCK.PCK among a plurality of PCK is a navigation bag (NV_PCK).Then, remainder comprises video packets (V_PCK), audio pack (A_PCK) and subframe bag (SP_PCK).The video data that is included in the video packets comprises a plurality of GOP.
SP_PCK is used for X-Y scheme data and caption data.That is, in DVD, encode by the method identical with being used for 2 dimension graph datas with the caption data of the stacked appearance of video pictures.That is, for DVD, be used to support multilingual independent coding method not to be used and after each caption data is converted to graph data, graph data is processed and be recorded subsequently by a kind of coding method.The graph data that is used for captions is called as subframe (sub picture).Subframe comprises subframe unit (SPU).A subframe unit is corresponding to a graph data table.
Fig. 5 is the diagrammatic sketch that shows the relation between SPU and the SP_PCK.
With reference to Fig. 5, a SPU comprises subframe unit header (SPUH), pixel data (PXD) and subframe display control sequence table (SP_DCSQT), and it is cut apart and be registered as the SP_PCK of a plurality of 2048 bytes with this order.At this moment, if the last data item of SPU does not have SP_PCK of complete filling, the remainder of so last SP_PCK is filled to have the size identical with other SP_PCK.Therefore, a SPU comprises a plurality of SP_PCK.
In SPUH, write down the size of whole SPU and begin the position of SP_DCSQT data from it.The PXD data are by obtaining the subframe coding.The pixel data that forms subframe can have 4 dissimilar values, its be can by 2 bit values represent and have respectively binary value 00,01,10 and 11 background (background), pattern pixel (pattern pixel), emphasize pixel (emphasispixel)-1 and emphasize pixel-2.Therefore, subframe can be regarded as one group and has the data of four pixel values and be formed multirow.Every row is carried out coding.As shown in Figure 6, SPU is encoded by run length (run-length).That is, if 1 to 3 intended pixel data item is continuous, the number of contiguous pixels (No_P) is represented with 2 bits so, and 2-bit pixel data value (PD) is recorded thereafter.If 4 to 15 pixel data items are continuous, first 2 bit is registered as 0 so, and No_P comes record by using 4 bits then, and PD comes record by using 2 bits.If 16 to 63 pixel data items are continuous, first 4 bit is registered as 0 so, and No_P is recorded by using 8 bits then, and PD is recorded by using 2 bits.If the pixel data item arrives the ending of row continuously, first 14 bit is registered as 0 so, and PD is recorded by using 2 bits then.If when the arrangement that to the coding of row when finishing with the byte is unit is not implemented, 4 bits are registered as 0 so.The length of coded data can not surpass 1440 bits in delegation.
Fig. 7 is the diagrammatic sketch of the data structure of SP_DCSQT.
With reference to Fig. 7, SP_DCSQT comprises the display control information that is used to export the PXD data.SP_DCSQT comprises a plurality of subframe display control sequences (SP_DCSQ).A SP_DCSQT is one group of demonstration control command (SP_DCCMD) once carrying out, and comprises: SP_DCSQ_STM, expression start time; SP_NXT_DCSQ_SA comprises the information about the position of next SP_DCSQ; With a plurality of SP_DCCMD.
SP_DCCMD is the control information that how to be combined and to export about pixel data (PXD) and video pictures, and comprise the pixel data colouring information, about with the information of video data contrast with about the information of output time and deadline.
Fig. 8 is the reference figure that shows the output situation of considering the subframe data.
With reference to Fig. 8, pixel data itself nondestructively is encoded to PXD.SP_DCSQT comprises the information about the SP viewing area, and this SP viewing area is as the subframe viewing area that shows subframe therein in the video display area of video image region; And about the start time of output and the information of deadline.
In DVD, the subframe data that are used for the caption data of maximum 32 kinds of different languages can be re-used and are recorded with video data.Stream id that the differentiation of these different languages is provided by mpeg system coding and time stream id that defines in DVD carry out.Therefore, if the user selects a kind of language, SPU only extracts from having corresponding to the stream id of selected language and the SP_PCK of time stream id so, and is decoded subsequently, and caption data is extracted.Then, according to showing control command control output.
The fact that is re-used with video data of caption data has caused many problems as mentioned above.
At first, the quantity of the bit that will produce for the subframe data when video data is encoded should be considered.That is because caption data is converted into graph data and processed, think each language and the quantity of the data that produce differs from one another and enormous amount.Usually, after the coding of moving image is performed once, thereby the subframe data that are used for each language are suitable for each regional DVD with the output that is added to coding and are made by multiplexing once more.Yet, according to this language, thus the enormous amount of subframe data when the subframe data by with video data when multiplexing, the total quantity of the bit of generation surpasses MAD.In addition, owing to the subframe data are re-used between video data, so the starting point of each VOBU is according to regional and different.Because the starting point of VOBU is managed individually, so when newly beginning multiplexing process, this information should be updated.
Secondly,,, for example export bilingual simultaneously, because caption data is only exported a kind of language so the subframe data can not be used to other purposes because the content of picture can not be learnt each time.
Summary of the invention
The invention provides a kind of information storage medium that uses a kind of data structure to write down the subframe data thereon, wherein, when video data is encoded, do not need to consider in advance the bit stream of subframe data with generation, and a kind of equipment that is used for it.
The present invention also provides a kind of information storage medium that uses a kind of data structure to write down the subframe data thereon, and wherein, the subframe data can be used for the purpose except that captions, and the equipment that is used for it.
Will be in ensuing description part set forth the present invention other aspect and/or advantage, some will be clearly by describing, and perhaps can learn through enforcement of the present invention.
According to an aspect of the present invention, provide a kind of information storage medium of recording video data thereon, this information storage medium comprises: a plurality of montages, as the record unit of stored video data therein; With the text data of captions, itself and a plurality of montage are stored dividually and can be stacked and exportable subsequently with the image according to video data, and text data comprises the data of the captions that are used to provide at least a language.
Information storage medium can comprise: character font data with a plurality of montages record dividually, is used for the diagrammatic representation of text data, and can uses in text data.
When text data is multilingual, can be recorded in independent space for multilingual every kind of text data.
Text data can comprise the character data that can be converted into graph data and be used for the output synchronizing information of synchronizing pattern data and video data.
Text data can comprise the character data that can be converted into graph data and indication when graph data can with according to the image of video data when stacked with the output position information of the position of display graphics data.
Text data can comprise the character data that can be converted into graph data and be used to represent the information of the output of the graph data of multiple size when graph data and image are stacked.
But video data can be divided into the unit of successively reproducing, and is limited corresponding to the size of all text datas of a unit.
But video data can be divided into the unit of a plurality of successively reproducings, is divided into a plurality of group of languages corresponding to each text data that reproduces unit, and the size that forms a kind of all text datas of group of languages is limited.
With Unicode represent and write down form text data data to support multilingual character set.
When the text data of captions only was used as the ASCII of basic English character group and forms as the character of one of ISO8859-1 of the Latin language character group of expansion, text data can be encoded and record by using the UTF-8 that is encoded as a plurality of 8 units by an one character.
When text data comprised the character of code-point value of 2 byte-sized with Unicode, text data can be encoded and record by using the UFT-16 that is encoded as a plurality of 16 units by an one character.
Information storage medium is removable type.
The CD that information storage medium can be read by the optical device of reproducer.
According to a further aspect in the invention, a kind of reproducer that is used for reproducing from information storage medium data is provided, on this information storage medium, video data is recorded, this video data is encoded and is divided into as the montage of writing down unit, and be recorded in a plurality of montages, and on this information storage medium, the text data of captions forms and can be used as graph data with stacked based on the image of video data with multilingual data, text data and montage be record dividually, this reproducer comprises: the data reproduction unit is used for from the information storage medium reading of data; Demoder is used for the video data decoding to coding; Translater is used for text data is converted to graph data; Mixer is used for graph data and video data stacked to produce image; First impact damper is used for stored video data provisionally; With second impact damper, be used to store text data.
Font data can be stored in the 3rd impact damper and can use the diagrammatic representation that is used for text data in text data and be recorded in information storage medium dividually with montage, and translater is converted to graph data by using font data with text data.
When text data is multilingual data, can be recorded in independent space for every kind of language text data, wherein, a kind of conduct is selected by the user and the text data of language that is set to a kind of language of initial reproducing language is temporarily stored in second impact damper, the font data that is used for text data is converted to graph data can be temporarily stored in the 3rd impact damper, and, side by side, when reproducing video data, text data can be converted into graph data and graph data can be output.
This equipment can comprise: controller, be used for by using the output zero-time and the concluding time of synchronizing information control text data, but recording text data on information storage medium, text data comprise: synchronizing information, by its text data be converted into based on the stacked graph data of the image of video data.
This equipment can comprise: controller, be used for by use output position information control text data with based on the stacked position of the image of video data.But recording text data on information storage medium, text data comprise: character data, it can be converted into graph data; And output position information, indication when graph data and based on the image of video data when stacked with the position of output pattern data.
But be recorded in the unit that video data on the information storage medium is divided into successively reproducing, and in the size corresponding to the restriction of all text datas of record unit, text data is recorded.But before the unit that reproduces successively reproducing, all text datas of its limited size can be stored in second impact damper, and when when reproduction period generation language changes, the caption data corresponding to language that is stored in the impact damper can be output.
Video data can be divided into can be by the unit of successively reproducing, is divided into a plurality of group of languages corresponding to the text data of a unit, is limited thereby the text data that forms a kind of captions of group of languages is recorded all text datas.But before the unit that reproduces successively reproducing, text data corresponding to the group of languages that comprises the caption data of exporting simultaneously with video data can be stored in the impact damper, and when reproduction period generation language changes, when the text data of this language during at impact damper, the text data of this language is output, and when the text data of this language is not in impact damper, be stored in the impact damper and the text data of this language can be output corresponding to the text data of the group of languages of the text data that comprises this language.
This equipment can comprise: the big minor switch of captions is used for importing the size of selecting caption data based on the user.Text data can comprise the character data that can be converted into graph data, and indication when graph data and based on the image of video data when stacked the information of the output of a plurality of graph data items can be recorded on the information storage medium.
Represent and write down the data that form text data supporting multilingual collection with Unicode, and the character conversion that translater is represented Unicode is a graph data.
On information storage medium, when the text data of captions only is used as the ASCII of basic English character group and forms as the character of one of ISO8859-1 of the Latin language character group of expansion, text data can be encoded and record by the UTF-8 that an one character is encoded as a plurality of 8 units by use, and translater can be graph data with the character conversion of being represented by UFT-8.
On information storage medium, when text data comprises the character of code-point value of 2 byte-sized with Unicode, text data can be encoded and record by the UFT-16 that an one character is encoded as a plurality of 16 units by use, and the character conversion that translater will be represented by UTF-16 is a graph data.
Information storage medium is removable type, and reproducer can reproduce the data that are recorded on the removable information storage medium.
Information storage medium is by the readable CD of the optical device of reproducer, and reproducer can reproduce the data that are recorded on the CD.
The exportable graph data of reproducer and do not reproduce the video data that is recorded on the information storage medium.
Caption data can comprise the caption data of one or more language, and translater can be converted to graph data with the text data of one or more language.
Caption data can the synchronously stacked and output subsequently with video image.
According to a further aspect in the invention, provide a kind of being used for that the recording unit of video data recording on information storage medium comprised: data writing device is used for data are write on information storage medium; Scrambler is used for video data encoding; Caption generator is used to produce the caption data that can add video data to; Central processing unit (CPU); The storer of fixed type; And impact damper.Scrambler video image is divided into as the record unit montage and to the montage compressed encoding after, video data is stored in the storer of fixed type.Caption generator produces multilingual caption data with the form of text, and caption data can be with reproduced based on the image of video data and be stored in the storer of fixed type.Impact damper is the data of store storage in the storer of fixed type provisionally.Video data and caption data that data writing device will be stored in the coding in the impact damper temporarily are recorded on the information storage medium.The coding of CPU control video data is recorded in the video data and the caption data of coding in each independent zone on the information storage medium.
This equipment can comprise: the font data generator produces the font data that is used for the text data of captions is converted to graph data.The font data generator can produce and be used for caption data is converted to the required font data of graph data, and font data can be stored in the storer of fixed type.Impact damper is the font data of store storage in the fixed type storer temporarily, data writing device can be recorded in the font data that temporarily is stored in the storer of fixed type on the information storage medium, and the generation that CPU can the control word graphic data and font data being recorded in the independent zone of information storage medium.
When text data is multilingual data, thereby CPU may command caption data is recorded in the independent space every kind of language subtitle data.
This equipment can comprise: caption generator, and by comprising the character data that can be converted into graph data and output subsequently and being used for producing caption data with the synchronous output synchronizing information of the reproduction of video image.
Caption generator can by comprise the character data that can be converted into graph data produce caption data and can export indication when graph data with based on the image of video data when stacked with the output position information of the position of output pattern data.
Caption generator can by comprise the character data that can be converted into graph data and be used to represent when graph data and based on the image of video data when stacked the information of the output of the graph data of multiple size produce text data.
But the video data of coding can be divided into the record unit of successively reproducing, thereby and caption generator can produce text data and be limited corresponding to the size of all caption datas of record unit.
But the video data of coding can be divided into the record unit of successively reproducing, and after the text data corresponding to the record unit is divided into a plurality of group of languages,, caption generator generation text data is limited thereby forming the size of the whole caption data of a group of languages.
Caption generator can produce with Unicode and form text data to support the data of multilingual character set.
When text data only was used as the ASCII of basic English character group and forms as the character of one of the ISO8859-1 of the Latin language character group of expansion, scrambler can be encoded by using the UTF-8 that is encoded as a plurality of 8 units by an one character.
When text data comprised the character of code-point value of 2 byte-sized with Unicode, scrambler was encoded by using the UFT-16 that is encoded as a plurality of 16 units by an one character.
Information storage medium is removable type.
But information storage medium CD.
According to a further aspect in the invention, provide a kind of reproduction to be stored in the method for the data on the information storage medium, comprising: read audiovisual (AV) data and text data; From text data caption view data; AV data to AV data decode and output decoder; With the AV data of mixing captions view data and decoding.
According to a further aspect in the invention, provide a kind of reproducer, comprising: reading section is used to read audiovisual (AV) data, text data and font data; Decoder section is used for AV data decode and output movement view data; Translator unit is used for from text data caption view data; And mixing portion, view data and captions view data are used to be synchronized with the movement.
According to a further aspect in the invention, provide a kind of reproducer, comprising: reading section is used to read text data and font data; Translator unit is used for from text data caption view data; Output is used to export the captions view data; With the input receiving unit, the input of caption data that is used to receive next line is with the output time of control caption data.
According to a further aspect in the invention, provide a kind of data recording and/or reproducer, comprising: storage area; Scrambler is used for audiovisual (AV) digital coding to produce the AV data of coding; Caption generator is used to produce the interpretable text data of captions; Data writing device is used for the AV data and the interpretable text data of coding are write storage area; Reading section is used to read the AV data and the interpretable text data of coding; Decoder section is used for AV data decode to coding to produce motion image data; Translator unit, but be used for from cypher text data translation captions view data; And mixing portion, be used for aggregate motion view data and captions view data to produce the motion image data that mixes.
In order to realize above and/or aspect and advantage, on information storage medium according to various embodiments of the present invention, each caption data item is not encoded with the AV data, and is recorded in the independent record space with the form of independent text data not in the AV data.In addition, on information storage medium, the independent font data that is used to translate as the caption data of form of textual data is recorded.In addition, be used for the interlocking caption data and the output information finishing the AV moving image synchronizing information of decoding processing and be used for screen output is recorded.Caption data is corresponding to the subframe data among traditional DVD.That is, on the information storage medium according to a plurality of embodiment of the present invention, following element is recorded:
1) video information is compressed the AV data (montage) that are encoded to;
2) text data of multilingual captions; With
3) be used for the font data of cypher text data.
Description of drawings
Fig. 1 is the diagrammatic sketch of the data structure of DVD;
Fig. 2 is the detailed view in VMG zone;
Fig. 3 is the detailed view in VTS zone;
Fig. 4 is the detailed view as the VOBS of video data;
Fig. 5 is the diagrammatic sketch that shows the relation between SPU and the SP_PCK;
Fig. 6 is the diagrammatic sketch of the data structure of subframe when subframe is encoded;
Fig. 7 is the diagrammatic sketch of the data structure of SP_DCSQT;
Fig. 8 is the reference figure that shows the output situation of considering the subframe data;
Fig. 9 is the block scheme according to the reproducer of the embodiment of the invention;
Figure 10 is the diagrammatic sketch that is stored in according to the data structure of the text data in the information storage medium of the embodiment of the invention;
Figure 11 is the embodiment according to the text data of the captions of the embodiment of the invention;
Figure 12 is the diagrammatic sketch of data structure of text data of language that is different from the language of Figure 11;
Figure 13 is to use the example at text of the present invention;
Figure 14 is the example that different fonts is applied to its captions;
Figure 15 is the example of the captions of demonstration after line feed;
Figure 16 is the example that shows the situation of user's effective language change when a kind of captions of language are just reproduced;
Figure 17 is the example of a plurality of group of languages of multilingual caption data and font data;
Figure 18 be display the play tabulation, play, the diagrammatic sketch of the mutual relationship of clip information and montage;
Figure 19 is the example according to bibliographic structure of the present invention;
Figure 20 is the example that display reproduction equipment is only exported the captions data conditions;
Figure 21 is that display reproduction equipment is exported the example more than a kind of situation of caption data of language simultaneously;
Figure 22 is presented at only to reproduce during the caption data, the example of the situation that the normal reproduction of video data begins from the video data corresponding to the captions line data; With
Figure 23 is the block scheme according to the recording unit of the embodiment of the invention.
Embodiment
Now, will describe embodiments of the invention in detail, its example represents that in the accompanying drawings wherein, identical label is represented same parts all the time.Below by embodiment being described with reference to the drawings to explain the present invention.
Fig. 9 is the block scheme according to the reproducer of the embodiment of the invention.
With reference to Fig. 9, reproducer comprises: reader is used for reading the AV data that are stored in information storage medium, the text data of captions and the font data of downloading; Demoder is used for the data decode to AV; Translater (renderer) is used for the cypher text file; And mixer, be used for the moving image from demoder output is made up with the caption data of exporting from translater.
In addition, this reproducer also comprises data that are used to cushion between reader and demoder and the translater and the impact damper of storing the font data of determining, and can comprise the storer (not shown) that is used to store as the default intrinsic font data of storing in advance.
As using in this, translation (rendering) comprises that all relate to caption text data is converted to graph data to be displayed on needed action on the display device.Promptly, translation comprise by be recycled and reused for from the font data of the information storage medium or the download of reading, find from intrinsic font data with text data in the font of character code coupling of each character, and the processing that this font data is converted to graph data produced graph data to form the captions image.Translation also comprises to be selected or converting colors, the suitable graph data of writing with horizontal line or vertical row of the size of selection or hand over word and generation.Specifically, when the font data that just is being used was housing font (outline font), font data was the curvilinear equation formula with the shape definition of each character.In this case, translation also comprises and being used for by handling the rasterization process (rasterizing process) that this curvilinear equation formula produces graph data.
Figure 10 is the diagrammatic sketch that is stored in according to the data structure of the text data in the information storage medium of the embodiment of the invention (that is caption data).
With reference to Figure 10, text data and AV flow point are turned up the soil and are recorded.Text data comprises synchronizing information, display region information and display font case information.Synchronizing information can be added to the data that will export with captions and can be used for captions with synchronous from the video information of AV flow data decoding in Translation Processing.The caption data that display region information indicates translation is displayed on the position on the screen.Display font case information comprises size about character in the viewing area, writes the information of the caption data of translation and arrangement, color, contrast etc. with horizontal line or vertical row.In addition, owing to multilingual every kind text data can be write, so text data also comprises the information of the language in the expression multilingual.For each language each, this so-called multi-language data can be stored in independent space, is stored in the space after perhaps can being re-used in the order with output time.
Figure 11 illustrates the text data according to the captions of the embodiment of the invention.
With reference to Figure 11, SGML is used as the text data of captions in the present embodiment.The purpose of considering use is for captions, and the mark (tag) or the element (element) of the SGML that is used for captions of minimal amount are used, and as mentioned above, the mark or the attribute that are used for synchronous and screen display can be comprised.Here, subtitle, head, meta, body, p element are shown as an example.In the present embodiment, information is shown with attribute.The attribute of use in this example is as follows:
-start: the time,, when the zero-time of the moving image that should reproduce with caption data is set to 0, should be output corresponding to the caption data of moving image at it.Time that its captions are shown with the time (HH): divide (MM): second (SS): the form of frame (FF) is represented.Time can 1/1000 second be represented for unit.In addition, if video data is the MPEG video, the time can have Presentation Time Stamp (presentation time stamp, PTS) value of the stacked and video image that is shown of captions thereon so.Usually, pts value is the count value that is operated in 27MHz or 90MHz.If pts value is used, caption data can mate with video data and be operated exactly so.
-end: the time, disappear and have and the property value of ' start ' same type at the captions of its demonstration.
-position: this indicates therein caption data with the coordinate of the left upper apex in the video area in the viewing area that is shown.
-direction: this indicates the direction of the caption data that will be shown.
-size: this indicates width or the height with the viewing area that is shown of caption data therein.If the property value of " direction " is " horizontal ", the fixing width value of caption data case is instructed to so, and if be " vertical ", the level altitude value of caption data case is instructed to so.
Among the element that uses, subtitle elements is used to indicate the root of text data, and the head element is used to comprise the meta element of processing by the information of all text data needs, the style element that does not perhaps show in the example of Figure 11.In the present embodiment, the meta element is used with the title of expression corresponding text data with the language that is used.That is, when multilingual was selected, by using the metamessage in the text data, the language text file of expectation can be selected easily.In addition, if be prepared for the different directories of every kind of language text file, language can be distinguished according to the title of text or according to directory name so.
Before video data was reproduced, Cun Chu caption data was loaded in the impact damper of reproducer like this, and along with the reproduction of video data, was converted into graph data and is caught stacked video image by the translater caption data.Therefore, for example Korean caption data is displayed in the viewing area at precise time.As described above, for text data, except that the captions character data, control information can also form or grammer write.Therefore, translater has the analyser function that is used to verify that whether text to be stored is write according to grammer.In addition, for be included in by use synchronizing information in the text with caption data with synchronous by the video image of decoder decode, exist by its be used to send or determine about the incident of the information of the playback mode of recovery time and demoder by with the passage of demoder exchange.
Figure 12 is the diagrammatic sketch for the data structure of the text data of the language outside the Korean that is different from Figure 11.
With reference to Figure 12, when video data and text data are recorded in the zones of different, to multilingual support by with caption data be attainable indignantly to video data encoding and the video data that subsequently text data of variant language added to coding.In addition, when not being stored in the caption data on the information storage medium and font data is downloaded by network or by when the additional information storage medium is loaded into reproducer, therefore, caption data is easily used in other cases with video data.
When therefore multilingual is supported, the character code that is used to text data should be determined.In an embodiment, Unicode is used.Unicode is used for expressing having more than the language in the All Around The World of 65,000 characters.According to Unicode, each character is represented by the code-point among the Unicode (code point).The character of representing various language is the group with code-point of regular successive value.Character with consecutive intervals of code-point is called as code table (code chart).In addition, Unicode supports UTF-8, UTF-16, UTF-32 to store practically or transmission character data, i.e. code-point as coded format.These forms will be represented a character by using a plurality of data item with 8 bit lengths, 16 bit lengths and 32 bit lengths respectively.
Be used for representing the ASCII character of English character and be used for having the code-point value from 0x00 to 0xFF at Unicode by the ISO8859-1 sign indicating number that the language of European countries expressed in the expansion Latin language.Japanese Hirakana character has the code-point value from 0x3040 to 0x309F.Be used for representing that modern Korean 11,172 characters have the code-point value from 0xAC00 to 0XD7AF.Here, 0x indication code point value is represented by sexadecimal number.
If caption data only comprises english character, so by using UTF-8 to carry out coding.For Korean and Japanese caption data,, can use 3 bytes to represent a character so if UTF-8 is used.If UTF-18 is used, character can be with 2 byte representations so, and are included in each also available 2 byte representation of the english character in the caption data.
Each country has the character code of its different with Unicode oneself.For example, in Korean character code group KSC5601, the Korean character has 2 bytecode point values, and english character has 1 bytecode point value.If by using the code except that Unicode but be not that each national character group produces caption data, each reproducer is understood these all character group so, thereby increases the burden of realization.
Need font data so that caption data is handled as text data.In addition, in order to support multilingual, font data is supported multilingual.Yet very difficult production has all reproducers of supporting multilingual these fonts.Therefore, in this embodiment of the present invention, only be used for using font data to be recorded in the information storage medium as caption data at the character of information storage medium, thereby in reproducer, this font data is loaded into impact damper and is used subsequently before reproducing video data.That is, reproducer links each section of caption text data and also reproduces these data subsequently with font data.The link information of caption text data and font data is recorded in the text data of captions or is recorded in the independent zone.Consider the situation of user in the reproduction period effective language change of data, reproducer loads corresponding to video data and continuous reproducible caption data and font data before reproducing, and uses these data subsequently.Here, successively reproducing is included in the video of video data and the audio frequency output and does not suspend, ends or interrupt reproducing.Usually, reproducer reproduces data by data volume is stored in video or the audio buffer, if the underflow in the impact damper of reproducer is prevented from, successively reproducing is possible so.When being read once more when changing captions by reader corresponding to the captions of video data or font data,, can not need pre-loaded so if do not take place in the underflow of this time durations video and voice data at reproduction period.
Figure 13 is to use the example of the text in this embodiment of the present invention.
With reference to Figure 13, in this embodiment of the present invention, the style element is used in the head element to use the application of CSS file layout as the font of the SGML that is used for realizing text.By using CSS, caption data can use the multiple font with different sizes and color.
In some application programs, or for some users, by the captions font inconvenience of default setting.For example, if the size of the font of captioned test is very little, the people who has poor eyesight so may feel inconvenient.Therefore, expectation is used and the people of display font to satisfy domestic consumer or to have poor eyesight when being applied to the same text file.Therefore, by allowing the user to determine font by menu, the size of font for example is used for can being used according to the table of type that user's setting is used font and had a plurality of options that can be selected by the user when information reproduction storage medium in first reproducer.
In the present invention, will explain the @ user policy that can be provided with according to user's captions font now by it.User type is one group of CSS attribute.In the present embodiment, the detailed difference of user type, promptly the degree of poor eyesight has infelicity, and therefore following only two kinds of ensuing situations will be explained:
-little: the font that is used to have the user of common eyesight; With
-big: the font that is used to have the user of bad eyesight;
As shown in figure 14, it is pre-seted or can be shown the captions that the user's different fonts that has good eyesight or have a bad eyesight is applied to it by using the @ user policy.
Reproducer also can not need use the position of being determined by caption data to use according to user's hobby and export captions by using diverse location and size with size.
Figure 15 is that the text data of the Korean captions wherein realized in Figure 11 is displayed on the example on the actual screen.
With reference to Figure 15, because by second<p〉in the screen of element representation, the width value of caption data viewing area is confirmed as 520 by " size " attribute, the caption data that demonstration can not be expressed in delegation after line feed.On the other hand, caption data only is exportable and passes through to use line feed element (br) that line feed can be selected forcibly in the viewing area.
The 3rd<p〉element is the example of wherein vertically being carried out by the demonstration of " direction " attribute caption data.
Figure 16 is the example that illustrates when the situation that user's effective language changes when just reproduced with a kind of captions of language.
With reference to Figure 16, when needs changed language, reproducer changed just reproduced caption text data (for example Korean), link is corresponding to the font data of text data, the data of the language (for example English) that translation changes, and by doing the output captions like this.If the font data of the data of captions and these data all is loaded in the impact damper, the successively reproducing of video data can easily be carried out so.Be not loaded in the impact damper if expect reformed text data or font data, these data should be loaded into impact damper so.At this moment, in the reproduction of video data, may suspend, end or interrupt.
For the multilingual conversion that do not have to suspend, ends or break of video is reproduced, the big I that is used for the data of captions and font data is restricted to the size less than each impact damper.Yet in this case, the number of languages that is supported is limited.Therefore, in present embodiment of the present invention, this problem solves by the unit that establishment is called as group of languages.
Figure 17 is the example that is used for a plurality of group of languages of multilingual caption data and font data.
With reference to Figure 17, the multilingual caption data and the font data that add a video image to are divided into a plurality of group of languages.Be limited size corresponding to the caption data of a group of languages and font data less than the size of impact damper.Comprising before the reproducing video data after the group of languages of being selected by the user or being selected by reproducer as the caption data of default language is loaded into impact damper, reproducing video data begins.When user's effective language changes, because data have been loaded into impact damper, so because caption data is included in this group of languages, language subtitle changes can not had the carrying out of ending.Yet if carry out not being included in the change of the language in this group of languages, reproducer loads the caption data and the font data of the group of languages of expectation once more so.In this case, the data of existing group of languages are all deleted.At this moment, in reproducing video data, suspend, end or interrupt and to take place.Thereafter, if effective language changes, language changes operation and is carried out once more according to this language and the relation that is loaded between the group of languages in the impact damper so.Can be recorded in the information storage medium or by considering to be stored in the data in the information storage medium and the size of the impact damper in the reproducer, reproducer is determined the information about language set arbitrarily when reproducing data about the information of group of languages.
Explain in the information of reproducing video data needs and the relation between the caption data now with reference to embodiment.
As using in this, montage is the record unit of video data, and a playlist (PlayList) and play (PlayItem) and will be used to indication and reproduce unit.
In the information storage medium according to the embodiment of the invention, AV stream is separated and is that unit is recorded with the montage.Usually, montage is recorded in the continuous space.In order to reduce its amount, AV is compressed and is recorded.Therefore, in order to reproduce the AV stream of this compression, the attribute information of compressed video data should be notified.Therefore, clip information is recorded in each montage.Clip information comprises the audio frequency and video attribute of this montage and therein about being the entrance mapping (Entry Point Map) that the information of the position of available entrance (Entry Point) is recorded at each every middle random access therein.In being widely used as the MPEG of video compression technology, the entrance is the position of the compressed I image of I picture wherein, and the entrance mapping is mainly used in and is used for finding the time of any to inquire about with the time interval after the starting point of reproducing.
Playlist is the base unit that reproduces.In the information storage medium according to present embodiment, a plurality of playlists are stored.A playlist comprises a series of a plurality of broadcast item.Playlist is corresponding to the part of montage, and more particularly, it is used with the form that is determined by reproduction zero-time in its montage and concluding time.Therefore, by using clip information, be identified corresponding to the position of this part in the actual montage of playing.
Figure 18 be display the play tabulation, play, the diagrammatic sketch of the mutual relationship of clip information and montage.
With reference to Figure 18, in present embodiment of the present invention, except that playlist, broadcast item, clip information and montage, the text data item of the captions of a plurality of each montage is recorded in the space that separates with montage.A plurality of data item of captions are linked to a montage and this link information can be recorded in the clip information.For some montages, a plurality of data item that are used for captions are linked, and for other montages, do not have data item or only a data item of captions can be linked.When playlist is reproduced, be included in broadcast item in the playlist by sequential reproduction.As a result, any one that is linked in each montage of playing item is translated and exports with a plurality of captions that are linked to this montage.Because the successively reproducing between the playlist is not guaranteed usually, the text data that is used for captions of all-links can be loaded into impact damper before reproducing playlist.In Figure 18, font data is not separated ground mark.
Usually, produce font data for each language.Therefore, be recorded in independent space for each language font data.
Figure 19 is the example according to the bibliographic structure of the embodiment of the invention.
With reference to Figure 19, in catalogue, montage, clip information, playlist, caption text data and font data are stored with the form of file and are stored in the different catalogue spaces according to type separately.As shown, the text data of captions and font file can be stored in the catalogue space that separates with video data.
The information storage medium of different embodiment according to the subject invention is an information storage medium (promptly be not fixed to reproducer, and a kind of information storage medium that only can be placed and use when data are reproduced) movably.Do not resemble fixing information storage medium such as the hard disk with high power capacity, removable information storage medium has limited capacity.In addition, the reproducer that is used to reproduce this medium usually has the impact damper of limited size and the low order function with limited performance.Therefore, video data on being recorded in removable information storage medium, only caption data and the font data that is used for caption data be recorded in information storage medium and by use these data when video data by when information storage medium reproduces, should can be minimized by pre-prepd data volume.The representative example of this movably recording medium is a CD.
On the information storage medium according to the embodiment of the invention, video data is stored in the space that separates with caption text data.If this caption text data is used for multilingual and has the font data that is used to export caption data, reproducer only loading caption data and font data in impact damper so, and subsequently, when reproducing video data, the stacked and output caption data of caption data and video image.
Figure 20 is the example that display reproduction equipment is only exported the captions data conditions.
With reference to Figure 20, can only export caption data according to the reproducer of the embodiment of the invention.That is, according to one of a plurality of specific reproduction functions, video data does not have reproduced, and only will be output to be converted into graph data with the stacked caption data of video data and to be output subsequently.In this case, caption data can be used, for example, and in order to learn foreign languages.Here, video data do not have stacked and only caption data be output.In addition, synchronizing information and positional information all are left in the basket or are not comprised, and the multirow data item output that reproducer will comprise caption data is on whole screen, and wait for user's input.After watching all output caption datas, the user will be used to show that the signal of the caption data of next line sends to the output time of reproducer with the control caption data.
Figure 21 is that display reproduction equipment is exported the example more than a kind of situation of caption data of language simultaneously.
With reference to Figure 21, as embodiment, reproducer can have the function of the caption data of exporting two or more language simultaneously when caption data comprises multilingual.At this moment, the synchronizing information of the caption data by using each language, the caption data that is displayed on the screen is selected.That is, caption data is output with the order of output zero-time, and when the output zero-time was identical, caption data was output according to language.
Only when caption data is reproduced, the function that can begin the normal reproduction of video data from the video data corresponding to captions line data item also is attainable by it.
Figure 22 only is presented at during the reproducing caption data, begins the example of situation of the normal reproduction of video data from the video data corresponding to the captions line data.
As shown in figure 22, when being used to select a kind of captions line data item, being selected once more corresponding to the recovery time of line data item, and normally reproduced corresponding to the video data of this time.
Recording unit according to the embodiment of the invention is recorded in video data and caption data on the information storage medium.
Figure 23 is the block scheme according to the recording unit of the embodiment of the invention.
With reference to Figure 23, recording unit comprises central processing unit (CPU), high capacity memory, scrambler, caption generator, character pattern generator, write device and the impact damper fixed.
Scrambler, caption generator and character pattern generator can be realized by the software on the CPU.
In addition, the video input block that is used for real-time receiving video data also can comprise.
Memory stores is as the video image of object of coding, perhaps by the video data of encoder encodes.In addition, memory stores invests the dialogue and the high capacity font data of video data.Caption generator receives the captions line data from the information of scrambler reception about the output time of captions line data item from dialogue data, produces to be used for the caption data of captions, and caption data is stored in the memory device of fixed type.Character pattern generator produces from the high capacity font data and comprises use at the font data of the character of the caption data that is used for captions and this font data is stored in the memory device of fixed type.That is, being stored in font data in the information storage medium is a high capacity font data part that is stored in the memory device of fixed type.This processing that will be stored in the data in the information storage medium with this form generation is known as editor (authoring).
If editing and processing is finished, the video data that is stored in the coding in the memory device of fixed type so is divided into the montage as the record unit, and is recorded on the information storage medium.In addition, the caption data that is used for adding to the captions of the video data that is included in montage is recorded in the independent zone.In addition, the font data that caption data need be converted to graph data is recorded in the independent zone.
But video data is divided into the reproduction unit of successively reproducing, and usually, this reproduces unit and comprises a plurality of montages.As embodiment, can be included in one and reproduce the stacked and size caption data that is output of video image in the unit and be restricted to less than the size when being used for multilingual data and all being added to caption data.On the other hand, should by be included in one and reproduce the stacked caption data of video image in the unit and be divided into group of languages, use the language change when video data is reproduced of this language set to be carried out continuously.Being included in a caption data that reproduces in the unit comprises a plurality of group of languages and is included in a caption data, the size that is used for multilingual additional data in the group of languages and be restricted to less than a size.
Caption data comprises that in fact the character code of using Unicode and the data layout of physical record can encode according to UTF-8 or UTF-16.
The video data, the caption data that is used for captions and the font data that are recorded in the memory device of fixed type are temporarily stored in impact damper and are recorded in information storage medium by write device.Thereby CPU carries out these functions of software program of each device of control and is carried out in proper order.
As mentioned above, according to the abovementioned embodiments of the present invention, the text data that is used for multilingual captions is made into text and is recorded in the space of opening with the AV flow point subsequently, thereby how different captions can be provided for the user and the record space arrangement can be carried out easily.
By collecting the character that captioned test needs, font data for this reason is generated to have minimum size and to be stored in the information storage medium individually and to be used.
Although shown and described some embodiments of the present invention, the present invention is not restricted to disclosed embodiment.In addition, it should be appreciated by those skilled in the art, under situation about not breaking away from, can change in these embodiments by the principle of the present invention of claim and its equivalent limited range and spirit.
Utilizability on the industry
The present invention can be applied in the field relevant with reproduction with the record of moving image, specifically, and in the field that multilingual text data must be provided therein when reproducing motion pictures the time.