CN100426851C - Method and apparatus for fetching volume ordinal number of TV play from digital TV broadcast - Google Patents

Method and apparatus for fetching volume ordinal number of TV play from digital TV broadcast Download PDF

Info

Publication number
CN100426851C
CN100426851C CNB2004100319225A CN200410031922A CN100426851C CN 100426851 C CN100426851 C CN 100426851C CN B2004100319225 A CNB2004100319225 A CN B2004100319225A CN 200410031922 A CN200410031922 A CN 200410031922A CN 100426851 C CN100426851 C CN 100426851C
Authority
CN
China
Prior art keywords
collection
ordinal number
information
program
ordinal
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CNB2004100319225A
Other languages
Chinese (zh)
Other versions
CN1678042A (en
Inventor
郑文涛
燕鹏举
李斌
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Panasonic Holdings Corp
Original Assignee
Matsushita Electric Industrial Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Matsushita Electric Industrial Co Ltd filed Critical Matsushita Electric Industrial Co Ltd
Priority to CNB2004100319225A priority Critical patent/CN100426851C/en
Publication of CN1678042A publication Critical patent/CN1678042A/en
Application granted granted Critical
Publication of CN100426851C publication Critical patent/CN100426851C/en
Anticipated expiration legal-status Critical
Expired - Fee Related legal-status Critical Current

Links

Images

Landscapes

  • Machine Translation (AREA)

Abstract

The present invention provides a method for extracting set ordinal number information of a television program from digital video broadcasting, which comprises the following procedures: title information and type information of the television program to be played are received; the program title indicated by the type information as a television sequence program is analyzed according to a prestored program title syntax table, and a title character string corresponding to the program title is generated; the set ordinal number information of the television sequence program is extracted from the generated title character string according to the prestored program title syntax table and a semantic interpretation function table which interprets the program title syntax table. The present invention also provides a set ordinal number device for extracting a television program.

Description

From digital television broadcasting, extract the method and apparatus of the collection ordinal number of program
Technical field
The present invention relates to the method and system of a kind of extraction by the information on services of the TV programme of television channel transmission, particularly relate to and extracting in the digital video broadcasting, by the broadcast collection ordinal number of electronic program guides, thereby remind the user to avoid missing the method and system of watching the TV sequence program of liking to the television series program that comprises broadcast of users broadcasting.
Background technology
TV series are one of most popular TV programme, and people wish to know the sequence program that switches off the TV usually in watching activity, for example the collection ordinal number information of TV series (that is which collection of the serial of being play).Usually, spectators wish whether also watch this serial after watching first collection to decide, and wish perhaps to know next rally is by which cannel broadcast.But the continuous increase of television channel number and the continuous quickening of rhythm of life have increased the weight of people and have born by the time that traditional approach obtains TV program information.
Usually, in order to obtain the listing of TV programme, need consult the TV programme hurdle of newspaper or magazine.Along with the development of digital video broadcasting (DVB), the broadcasting station can send the data that comprise vision signal, audio signal and various other additional informations.Television receiver receives these additional informations, and response user's requirement uses these information, thus make programme browse, predict, remind and means such as recommendation are achieved automatically or semi-automatic.
For example, electronic program guides (Electronic Program Guide-EPG) is implemented on the top box of digital machine, and it provides a kind of easy use, friendly interface, can fast access have wanted the mode of the TV programme of watching for the user.And the TV programme that is applied to terminal is equally filtered or commending system, according to the history that the user watches TV programme, can new program be marked, thereby may interested TV programme reduce the time that the user browses programme by providing to the user.Yet, the head that existing these means do not solve serial collects prompting and the collection ordinal number extracts problem, should see also that simultaneously existing digital video broadcasting business information digital television service standards such as (DVB-SI) is not included the collection ordinal number markup information of serial in its framework as metadata type independently.
Summary of the invention
The purpose of this invention is to provide the information on services that is intended to utilize digital television program, automatically find TV sequence program, for example, the head of serial collects and makes prompting to the user, provide the method and apparatus of collection ordinal number to make things convenient for the user to browse of serial simultaneously, be unlikely to miss the TV play of liking so that the user sees after the first collection of TV programme is reminded.The present invention includes decoding, the collection ordinal number representation model of serial, aspects such as the collection ordinal number calculating of TV programme to information on services.
In order to realize purpose of the present invention,
According to an aspect of the present invention, provide a kind of method of from digital video broadcasting, extracting the collection ordinal number information of TV programme, comprise step: receive television program titles and the type information play; Program title syntax table analysis according to storage in advance is designated as the program title of TV sequence program and the generation heading character string corresponding with described program title by type information; With the collection ordinal number information of from the heading character string of described generation, extracting described TV sequence program according to described program title syntax table of storing in advance and the semantic interpretation function table of explaining described program title syntax table.
According to another aspect of the present invention, a kind of device that extracts the collection ordinal number of TV programme is provided, comprise: the information on services decoding device, be used for the code stream that digital television broadcasting provides is decoded, therefrom detect the information on services of the collection ordinal number information that comprises TV programme; Collect the ordinal number model equipment, be used for the expression model of the collection ordinal number of stored television program; Collection ordinal number extraction element, the decoding information on services that provides according to described information on services decoding unit, the expression model of the collection ordinal number of storing in the utilization collection ordinal number model unit mates the information on services of decoding and discerns calculating, to obtain the collection ordinal number information of TV programme; And control device, be used to control the operation of each device and the required program of operation that storage is used to control each unit.
According to a further aspect of the invention, provide a kind of method of from digital video broadcasting, extracting the collection ordinal number of TV programme, comprise step: the data code flow that receives information on services; To the information on services sign indicating number decoding of reception, therefrom extract information on services, the collection ordinal number expression model of use storage mates the information on services of decoding and discerns calculating, so that isolate collection ordinal number information from information on services; When isolated collection ordinal number information is first collection, collect broadcast information to the head of user reminding TV programme; When isolated collection ordinal number not being first collection, provide the corresponding collection ordinal number information of TV programme to the user by browser interface.
In addition, the present invention also provides storage to carry out the described recording medium of program of method that is used for extracting from digital video broadcasting the collection ordinal number of TV programme.
Description of drawings
By explaining being used for below in conjunction with accompanying drawing, rather than restriction the preferred embodiments of the present invention are described in detail, and will make above-mentioned and other purpose of the present invention, feature and advantage clearer, wherein:
Fig. 1 is the configuration block diagram according to the device of the extraction TV programme collection ordinal number of the embodiment of the invention;
Fig. 2 is the flow chart according to the extraction TV programme collection ordinal number of the embodiment of the invention;
Fig. 3 is according to the schematic diagram of the embodiment of the invention by the syntax tree of the title of the TV programme of syntax analyzer output;
Fig. 4 is the configuration schematic diagram according to the top-down syntax analyzer of the embodiment of the invention; With
Fig. 5 is the configuration schematic diagram of bottom-up syntax analyzer in accordance with another embodiment of the present invention;
Fig. 6 is the configuration schematic diagram according to the semantic interpreter of the embodiment of the invention; With
Fig. 7 is the flow chart that extracts the collection ordinal number of TV programme according to the present invention.
Embodiment
Below in conjunction with accompanying drawing the device of extraction TV programme collection ordinal number of the present invention and the structure model that TV programme collection ordinal number is provided to the user are described.
Fig. 1 is the configuration block diagram according to the device of the extraction TV programme collection ordinal number of the embodiment of the invention.As shown in Figure 1, this device comprises central control unit 11, information on services decoding unit 12, and collection ordinal number model unit 13, collection ordinal number extraction unit 14, input request unit 15, first collection are reminded boundary element 16 and collection ordinal number browser interface unit 17.
Control unit 11 is the master control unit that extract the device of TV programme collection ordinal number, and the data that are used to control between each unit send, and give each unit with work allocation.Control unit 11 storages are used to control the required program of operation of each unit.As an example, control unit 11 can be realized by CPU (CPU).The code stream that 12 pairs of digital television broadcastings of information on services decoding unit provide is decoded, and therefrom detects the information on services that comprises collection ordinal number information.Collection ordinal number model unit 13 is that the modelling of the relevant collection of storage ordinal number expression formula is represented the unit.The collection ordinal number is expressed model and is made up under off-line case by the system development personnel, also can be responsible for renewal by producer when version updating.Collection ordinal number extraction unit 14 is under the control of control unit 11, according to information on services by 12 decodings of information on services decoding unit, the pattern of storing in the utilization collection ordinal number model unit 13 is mated the information on services of decoding or is discerned calculating, calculates the collection ordinal number information of TV programme.The head that input request unit 15 is accepted user input collects the request of prompting, or such as to the inquiry of collection ordinal number, and the request of other that uses and so on is sent control command by control unit 11 to units corresponding.In addition, as an example, the input request unit also can be accepted to import from remote controller.First collection reminds boundary element 16 according to the user's request that obtains from input request unit 15, and the collection ordinal number that utilizes collection ordinal number extraction unit to calculate by showing an interface, collects broadcast information with the head of at present known serial and is shown to the user.Collection ordinal number browser interface unit 17 utilizes collection ordinal number extraction unit to calculate the collection ordinal number according to the user's request that obtains from input request unit 15, shows in this unit, can carry out simultaneously by operations such as the collection ordinal number sort, searches.
The device of extraction TV programme collection ordinal number of the present invention can be integrated in the local set-top box, also can be used as separative element and is arranged in the set-top box.Collection ordinal number representation model is optionally, or renewable, for example, can pass through such as camera cable, microwave, the new model of the online download of the transmission line of satellite circuit and so on upgrades, or directly provide, and need not user's operation bidirectional by broadcast program supplier etc.
Fig. 2 shows the present invention and utilizes information on services standard in the Digital Television that the flow process of TV program information is provided.
By such as camera cable, microwave, the transmission line of satellite circuit and so on receives the data code flow of information on services.In set-top box, to the information on services sign indicating number decoding that receives, therefrom extract information on services, and utilize the collection ordinal number of storage in the collection ordinal number model unit 13 to express model the information on services of decoding is mated and discern calculating, collect ordinal number information so that from information on services, isolate.After this, judge that collection ordinal number information is the first collection or the collection ordinal number of other collection.If isolated collection ordinal number is first collection, then collects and recommend to remind the interface to collect broadcast information to the head of the relevant TV programme of user reminding by head.If isolated collection ordinal number is not first collection information, then provide the collection ordinal number information of TV programme to the user by browser interface.
The following describes the formation and the utilization of information on services.
In digital video broadcasting, information such as all videos, audio frequency, literal, picture have all become data after digitized processing, and pack according to the standard of MPEG-2, form the transmission bag of regular length (188 bytes).Then these packets are carried out multiplexingly, form to transmit code stream (TS).The corresponding TS stream of a common channel, the TS stream of a channel is by a plurality of programs and professional the composition.The business information of inserting in the transmission stream (TS) of Digital Television (SI) has been carried the required total data of electronic program guides (EPG).Business information comprises data such as being used for describing transfer system, transmission content and broadcast data stream timetable, and its helps Integrated Receive Decoder (IRD) automatic tuning, provides additional information to the user, makes IRD that alternative business can be set automatically.Business information is inserted by related standards as long as broadcast front end, the decoder of receiving terminal just can take out business information from TS, constitutes the EPG of difference in functionality.If do not have guidance information in TS stream, the terminal equipment of Digital Television can't find the code stream that needs, so in MPEG-2, defined program specific information (PSI) specially, its effect is that setting automatically and directing receiver are decoded.PSI information is inserted in the TS stream by multiplexer when multiplexing, and identifies with specific PID (Packet Identifier).
The program business information PSI that defines in the Moving Picture Experts Group-2 is the description to single code stream.PSI is made up of Program Association Table (PAT), CAT Conditional Access Table (CAT), Program Map Table (PMT) and network information table (NIT).Each table is divided into plurality of sections mapping (conversion) to transmitting transmission in the stream.PSI information is inserted in the TS stream by multiplexer when multiplexing, and identifies with specific PID (Packet Identifier).How PSI has specified from a transmission stream that carries a plurality of programs and has correctly found specific program, when receiver will receive some appointed programs, it at first obtains the pid value of the Program Map Table of this program from Program Association Table (PAT), from TS, find out the corresponding Program Map Table of pid value therewith then, from this Program Map Table, obtain the pid value of the elementary stream of this program of formation, leach elementary streams such as corresponding video, audio frequency and data according to this pid value, the decoding back is restored and is primary signal, the transmission bag of all the other PID that deletion is comprised.
DVB and relevant DVB-SI are the standards of current the most popular digital video broadcasting, also be simultaneously the digital video broadcasting standard that Europe and China are about to take, so the present invention are that the basis is designed with the DVB-SI standard promptly.Should be appreciated that application of the present invention is not limited thereto, can be based on design of the present invention, improve according to concrete using standard and do not break away from the spirit and scope of the present invention.
In the DVB-SI standard, the collection ordinal number is not included under the situation of its standard as metadata type independently, even can stipulate certain expansion or reserved field, have no reason also to expect that each different channel provider can use identical expansion or reserved field in order to express this information.Therefore must be based upon and extract collection ordinal number information on the basis of general specification and be only reliably.
9 tables have been defined in the business information (SI): 1) BAT bouquet association table (BAT); 2) SDT Service Description Table (SDT); 2) Event Information Table (EIT); 4) Running Status Table (RST); 5) TDT Time and Date Table (TDT); 6) TOT Time Offset Table (TOT); 7) ST Stuffing Table (ST); 8) select information table (SIT); 9) be interrupted information table (DIT).
Investigate the DVB-SI standard, can find that wherein available information spinner will comprise the title and the type of program, wherein title is with the event_name_char field description of short_event_descriptor descriptor in Event Information Table (Event Information Table-EIT), and a secondary type is with the content_nibble_level_1 and the content_nibble-level_2 field description of content_descriptor descriptor in the EIT table.
Though the item_char field of the text_char of short_event_descriptor descriptor or extended_event_descriptor descriptor can be held the textual description information more, that ability to express is more powerful, but collecting ordinal number extraction computing in these fields will be very complicated, so the present invention does not utilize all these more complicated descriptors.
The pattern of TV collection ordinal number
All there is TV guide table separately in each present TV station of China, perhaps publishes on the TV newspaper, perhaps delivers on its webpage.Investigate existing program notice list and can know the expression pattern of collection ordinal number by inference.The TV guide of delivering on hereinafter will the international website with Chinese Central Television (CCTV) of authority is an example, and the inductive method of collection ordinal number model is described.
The pattern sample
In on the international website of the Chinese Central Television (CCTV), collect 2 months, the program notice list of 12 television channels, investigate the title of TV play class program, have the title that collects ordinal number that has of typical meaning to be summarised in the table 1.
The typical form of presentation of collection ordinal number during table 1. exemplary program is single
Figure C20041003192200111
In the his-and-hers watches 1 analysis of the expression way of collection ordinal number as can be seen, the title sample in the table has following characteristics: 1). generally can start with a name of tv column; 2). the general colon ": " of using is as separator behind name of tv column; 3). in colon separator back, may have the sequence number of column, for example, in the table 1 the 7th, 8 " 2000-156 ", these sequence numbers often contain numeral and dash "-"; 4). the TV play name occurs after above-mentioned project, is mark with punctuation marks used to enclose the title " " " " sometimes; 5). the collection ordinal number places the last of title generally speaking, often uses bracket " O " to be comprised, and perhaps the form with " x collection " occurs; A plurality of collection ordinal numbers occur with the form of tabulation sometimes, and for example, in the table 1 the 9th " (5.6.7) ", the form with the interval occurs sometimes, for example, and " 15-17 " in the 10th; The collection ordinal number can be represented with Arabic numerals, also can represent with Chinese character; 6). the last of title has other information sometimes, such as the output country name of this TV programme, for example, in the table 1 the 2nd " (Korea Spro) " etc.; 7). some more complicated title can hold multi-level collection ordinal number information, as " 13 (2) affairs of the homicide case (2) " of the 13rd in the table 1.
If the collection ordinal number that solves typical sample shown in the table 1 with common Programming Methodology detects problem and also can realize, but the complexity of program and the possibility of makeing mistakes are bigger, and the reopening amount of sending out of program also can be very big behind the schema modification.The analytical method (or natural language understanding) that the present invention proposes the type of service language solves the extraction problem that collects ordinal number, system developer only need be write grammer (grammar) and semantic (semantic) function, and corresponding syntactic analysis (parsing) then uses general syntax analyzer (parser) program to handle.
The regular grammar of TV program information is represented
Be similar to and can come a kind of automatic language of approximate description with CFG, (regular grammar type-3grammar), can accurately describe the expression-form of program title by using more simple regular grammar.
Be the regular grammar of the program title of summarizing according to typical sample in the table 1 below.Can clearly find out, all samples all thus grammer generate, and the title that generates of grammer also meets these samples or other title that one will understand that is described example thus.
For the symbol in the grammer,, then be called nonterminal character if it does not appear at the right part of any rule; Otherwise, be called terminal character.Provide the syntactic representation and the implication thereof of TV program information below.
Punctuation mark and other: the punctuation mark and other symbol and the literal that occur in the expression program title
Colon → ': ' | ': ' one_dash → ' ' | '-' dash → one_dash|one_dash one_dash bracket_l → ' (' | ' (' | ' [' | ' ' bracket_r → ') ' | ') ' | '] ' | ' } ' bookpunc_1 → ' " ' bookpunc_r → ' " ' separator → ', ' | ‘ ' | '. ' di_1 → ' the ' di_r → ' collection '
Content character: numeral, letter and literal in the expression program title
latin→‘a’|‘b’|...|‘z’|‘A’|‘B’|...|’Z’ arabic→‘0’|‘1’|‘2’|‘3’||‘4’|‘5’|‘6’|‘7’|‘8’|‘9’ hanzi→{gb2312charset}
Arabic numerals string: the Arabic numerals in the expression program title
arabic_str→arabic|arabic_str?arabic ordinal→arabic_str
The numeric string that Chinese character is represented: the numeric string that contains in the expression program title
Hanzi_0 → ' zero ' hanzi_2_9 → ' two ' | ' three ' | ' four ' | ' five ' | ' six ' | ' seven ' | ' eight ' | ' nine ' hanzi_1_9 → ' one ' | hanzi_2_9 hanzi_10 → ' ten ' hanzi_100 → ' hundred ' ordinal → hanxi_1_9 ordinal → hanzi_1_0 ordinal → hanzi_1_0hanzi_1_9 ordinal → hanzi_2_9hanzi_10 ordinal → hanzi_2_9hanzi_10hanzi_1_9 ordinal → hanzi_1_9hanzi_100 ordinal → hanzi_1_9hanzi_100hanzi_0hanzi_1_9 ordinal → hanzi_1_9hanzi_100hanzi_1_9hanzi_10 ordinal → hanzi_1_9hanzi_100hanzi_1_9hanzi_10hanzi_1_9
Legal text-string: expression program title Chinese version character string
literal_ch→latin|arabic|hanzi|‘-‘ literal→literal_ch|literal_ch?literal
Column section: the represented column of expression program title
column_no→arabic_str|arabic_strdash?arabic?str column_sec→literal?column?colon?|literal?column?colon column_no
Collection ordinal number section: the collection ordinal number of the represented television series program of expression program title
?ordinal_list→ordinal|ordinal?ordinal_list ?ordinal_int→ordinal?dash?ordinal ?ordinal_spec→ordinal_list|ordinal_int ?ordinal_sec→ordinal_spec|bracket_l?ordinal_spec?bracket_r|di_l ?ordinal_spec?di_r
Note and other: the note and the out of Memory of the represented TV programme of expression program title
comment_sec→bracket_1literal?bracket_r
Title section: the title of expression TV programme
name_sec→literal|bookpunc_1literal?bookpunc_r
This collection is described section: expression is to the concise and to the point description of content of TV program
this_sec→name_sec|name_sec ordinal_sec|name_sec ordinal_sec?comment_sec|name_sec?comment_sec|name_sec comment_sec?ordinal_sec
Inclusive segment is described section by a plurality of collection and is constituted: the description of the content of TV program that expression is once play
content_sec→this_sec|this_secthis_sec
Program title: the title of the TV programme that expression is broadcasted
title→content_sec|column_sec?content_sec
Syntax analyzer
The following describes by analyzing the syntax analyzer that top grammatical representation formula obtains the collection ordinal number of programme content and TV programme.
Judge whether a title can pass through heading syntax mentioned above, perhaps determine by which composition (constituent) to form in the title, need to use syntax analyzer to realize by which kind of mode.The output result of syntax analyzer is exactly the syntax tree (syntactic tree) of this title.Two kinds of general CFGs (Context Free Grammar-CFG) analyzer is arranged, and a kind of is top-down (top-down) analyzer, and another kind is bottom-up (bottom-up) analyzer.
The details of these general-purpose algorithms will be described later.But the syntax tree that different syntax analyzers provides is the same.At this, only the example that provides with Fig. 3 describes.Be the syntax tree of output after the 1st sample " gold hot broadcast: the man is able and the woman is beautiful (4) " in the top table 1 analyses item by item to the content in the grammatical representation formula through syntax analyzer among Fig. 3, wherein node is a composition, the next door be labeled as its corresponding grammatical symbol; Set membership is represented in connection between the composition, and what be in relative top is father's composition, and the below is subconstiuent relatively.
The top-down parsing device
Briefly, the thinking of top-down algorithm is, from the initial symbol S (noticing that the initial symbol of grammer of top grammatical representation is title among the present invention) of grammer.Enumerate the rule in the grammer, the nonterminal character in the current state is rewritten or derived, all be rewritten into terminal character, and the part of speech of terminal character string and input sentence is till all the match is successful until all nonterminal characters.
The intermediate object program of top-down parsing device can be represented with state ((derivation tree (deductiontree)) ((table of deriving) deduction list) current location (current position)), wherein derivation tree (deduction tree) is corresponding uncompleted syntax tree, wherein the symbol of node to be matched is formed deduction list tabulation from left to right, and current position is the position of current input sentence.The characteristics of deduction tree tree are, there is child node in a node, and the leaf node of all left fraternal institute collars trees of and if only if this node all is the terminal character in the grammer.
Algorithm need be safeguarded a possibility status list (possibilities list), and its first element is current state (current state), and all the other elements are Status of Backups (backup state).Algorithm is from initial possible status list ((S) (S) 1), (S) tree representation wherein has only the tree of a S root node, (S) the S node during the S symbol among the deduction list has a pointer and (S) sets links to each other, and should initially may not contain Status of Backups by status list.Analyzer is as follows to the analytical procedure of the content in the syntactic representation:
1). if possible status list is empty, and then the algorithm failure is withdrawed from; Otherwise choose wherein first state C as current state, and with it from leaving out the status list.
2) if. the deduction list of C is an empty string, and analysis position is the end of the sentence position, and then algorithm successfully withdraws from, and this moment, the deduction tree of C was exactly complete syntax tree, with its output.Possible status list is removed, and quits a program.
3). otherwise handle respectively according to following three kinds of situations,
3a). if first symbol of deduction list is a terminal character among the C, and next incoming symbol equals this terminal character, then this terminal character is left out in deduction list, the pointer of the sensing deduction tree node to be matched that this terminal character contains is deleted simultaneously, with current
Position adds 1, and the new state that obtains is joined and may go in the status list;
3b). if first symbol of deduction list is a terminal character among the C, but next incoming symbol is not equal to this terminal character, then is left intact;
3c). if first symbol of deduction list is a nonterminal character among the C, enumerates then that all left parts are these nonterminal character rules in the grammer, this terminal character is rewritten, and all these new states are added and may go in status lists.The concrete way that generates new state is, replaces this symbol with the right part symbol string of this rule in deduction list, places before all symbols of deduction list.Utilize the pointer of this symbol among the deduction list, find node to be matched corresponding in deduction tree, the child node node that generation is the similar number of symbol with this regular right part symbol under this node, set up among the deduction list among new symbol and the deduction tree pointer between the new node in order and get in touch, at last this symbol and corresponding pointer thereof are deleted.
4). return step 1).
As can be seen, step 1) always selects the 1st state as current state, but when step 3) joins new state in the possibility status list.Two kinds of selections are arranged, a kind of is the front end (look the Status of Backups tabulation and be first-in last-out stack) that is added to the possibility status list, another kind is the rear end (look the Status of Backups tabulation and be fifo queue) that is added to the possibility status list, this just forms depth-first search and two kinds of strategies of BFS, and these two kinds of methods all can be selected for use in the present invention.
Fig. 4 shows top-down analyzer 40.Analyzer 40 comprises syntax table storage device 41, incoming symbol string buffer storage 42, analyzer controller 43 and possibility status list storage device 44.Wherein syntax table storage device 41 is deposited all r bar rules in the grammer with the form of one-dimension array.As an alternative, syntax rule also can be stored in the described collection ordinal number of Fig. 1 model unit 13.Incoming symbol string buffer storage 42 is deposited all s symbol of importing in the sentence with the form of one-dimension array, the possible status list of safeguarding when possible status list storage device 44 is deposited the analyzer operation (number is indefinite).The top-down parser controller 43 is algorithm controls parts then, it carries out above-mentioned algorithm according to the content of three storage devices, and above-mentioned three storage devices are inquired about, obtain, deleted and operation such as renewal in the moment that algorithm needs, at last the syntax tree that analyzes is exported.
The following describes the bottom-up syntax analysis device.The thinking of bottom-up algorithm is, from the symbol string of input sentence, the adjacent-symbol string summed up, and generates the left part symbol of the rule of correspondence, until final generative grammar method primary sign S (the initial symbol of the grammer in the aforementioned syntactic representation is title).
Line diagram analyzer (Chart Parser) is most typical bottom-up syntax analysis device, and it comprises following four main data structures:
1). arc of motion-active arc, refer to current expanded a part but still do not have the regular example of summing up to the end.Its method for expressing and Regularia seemingly but need to insert a round dot at the right part intersymbol, indicate next step matched position.Such as NP → ART.This arc of motion of ADJN, the next symbol to be expanded of its indication is this terminal character of ADJ.
2). composition-constituent, refer to current regular example of having summed up at last, or incoming symbol string example.
3). agenda-agenda, the new composition that obtains of summing up leaves among the agenda, till their all processed (being expanded).The top-down parser of erect image is the same, and the graphic analyses device also has two kinds of search strategies, i.e. depth-first and breadth First, when agenda is first-in last-out stack (FILO), be depth-first search, when agenda is fifo queue (FIFO), then be BFS.
4). line diagram-chart, it is a data structure of depositing current all intermediate object programs that obtain by analysis, by this mechanism, can avoid existing composition repeatedly to be summed up, and realizes sharing.
The algorithmic procedure of bottom-up syntax analysis device is described below:
1) if. do not had incoming symbol (sentence is handled), then seeking symbol in line diagram is the composition of S, if exist, then with the output of its collar tree, if do not have, then failure is analyzed in explanation, exports empty result.With all arc of motion and composition deletion, quit a program.
2). generate corresponding composition according to current incoming symbol, it is inserted agenda.
3) if. agenda is empty, then changes the 1st) step.
4). select (generally getting a first) composition C from agenda, establishing this position that is divided into is (p 1, p 2), it is left out from agenda, and join in the line diagram and go.
5). for any bar shaped such as X → CX in the syntax table 1X 2X nRule, add a new arc of motion X →.CX 1X 2X n, its position is made as (p 1, p 2), its child node tabulation is for empty.
6). for already present any bar shaped such as X → X 1X 2ο C ... X nAnd the position is at (p 0, p 1) arc of motion, add a new arc of motion X → X 1X 2C ο ... X n, its position is made as (p 0, p 2), the child node and the C of original arc of motion added the child node that becomes this new arc of motion.
7). for already present any bar shaped such as X → X 1X 2X nο C and position are at (p 0, p 1) arc of motion, sum up a new component, its position is made as (p 0, p 2), the child node and the C of original arc of motion added the child node that becomes this new component; Place agenda to go (depth-first is different with the breadth First way, and preamble is stated) this new component.
8) if. agenda is empty, then changes for the 1st step; Otherwise changeed for the 4th step, the step above repeating.
Fig. 5 shows bottom-up line diagram syntax analyzer 50.As shown in Figure 6, line diagram syntax analyzer 50 comprises syntax table storage device 51, incoming symbol string buffer storage 52, analyzer controller 53, arc of motion storage device 54, agenda storage device 55 and line diagram storage device 56.
Syntax table storage device 51 is deposited all r bar rules in the grammer with the form of one-dimension array.As an alternative, syntax rule also can be stored in the collection ordinal number model unit 13 shown in Figure 1.Incoming symbol string buffer storage 52 is deposited all s symbol of importing in the sentence with the form of one-dimension array.Composition (number is indefinite) to be expanded of a certain moment during agenda storage device 55 storage runnings.The all the components that generates when all arc of motion (number is indefinite) that generate during arc of motion storage device 54 storage runnings, 56 of line diagram storage devices are deposited operation.Bottom-up line diagram analyzer controller then is the algorithm controls parts, and it carries out above-mentioned algorithm according to the content of five storage devices, and in moment that algorithm needs.Describe algorithm and above-mentioned five storage devices are inquired about, obtain, deleted and operation such as renewal, at last the syntax tree that analyzes is exported.
Through after the aforementioned processing, the syntax tree that obtains analyzing.Syntax tree only provides the constituent structure of title, and corresponding data computation need be provided with semantic function and finish.Below the method flow of operation semantic interpretation function among the present invention in the hope of the collection numerical sequence is described.Seeking symbol in syntax tree is the node of ordinal spec, if do not find, and then algorithm failure, the result does not have the collection ordinal number.If find, then search the semantic interpretation function with this node rule of correspondence, if be empty, then failure, the result does not have the collection ordinal number; If this semantic interpretation function exists, then call this semantic interpretation function, the function return value that obtains is exactly last collection ordinal number (monodrome, interval or tabulation), with its output.
Semantic interpretation function must all be provided with at each rule place that needs, and this function is a condition with the value of each subconstiuent, provides the value of this composition.For the heading syntax example of the grammatical representation that provides previously, can write out below shown in semantic interpretation function.Wherein variable is represented the output of this function, the also i.e. value of this composition, and (i>0) then is the value of i subconstiuent.Notice that some regular semantic interpretation function is empty, then respective rule is no longer listed below.In addition, it may be noted that in the grammatical representation of front owing to write,, but separately write out in below the semantic interpretation function can not be shared when their semantic function the time so some rule gets up to be write as one because the left part symbol is identical with ' | ' merging.
The content character
arabic→‘0’|‘1’|‘2’|‘3’||‘4’|‘5’|‘6’|‘7’|‘8’|‘9’ semfunc_arabic_0{ $0=$1-‘0’;
}
The Arabic numerals string
arabic_str→arabic semfunc_arabic_str_0{ $0=$1; } arabic_str→arabic_str?arabic semfunc_arabic_str_1{ $0=$1; } ordinal→arabic_str semfunc_ordinal_0{ $0=$1; }
The numeric string that Chinese character is represented
Hanzi_0 → ' zero ' hanzi_2_9 → ' two ' semfunc_hanzi_2_9_0{ $0=2; Hanzi_2_9 → ' three ' semfunc_hanzi_2_9_1{ $0=3; Hanzi_2_9 → ' four ' semfunc_hanzi_2_9_2{ $0=4; Hanzi_2_9 → ' five ' semfunc_hanzi_2_9_3{ $0=5; Hanzi_2_9 → ' six ' semfunc_hanzi_2_9_4{ $0=6; Hanzi_2_9 → ' seven ' semfunc_hanzi_2_9_5{ $0=7; Hanzi_2_9 → ' eight ' semfunc_hanzi_2_9_6{
$0=8; Hanzi_2_9 → ' nine ' semfunc_hanzi_2_9_7{ $0=9; Hanzi_1_9 → ' one ' semfunc_hanzi_1_9_0{ $0=1; Hanzi_1_9 → hanzi_2_9 semfunc_hanzi_1_9_1{ $0=$1; Hanzi_10 → ' ten ' hanzi_100 → ' hundred ' ordinal → hanzi_1_9 semfunc_ordinal_1{ $0=$1; Ordinal → hanzi_10 semfunc_ordinal_2{ $0=10; Ordinal → hanzi_10hanzi_1_9 semfunc_ordinal_3{ $0=10+$2; Ordinal → hanzi_2_9hanzi_10
semfunc_ordinal_4{ $0=$1*10; } ordinal→hanzi_2_9hanzi_10hanzi_1_9 semfunc_ordinal_5{ $0=$1*10+$3; } ordinal→hanzi_1_9hanzi_100 semfunc_ordinal_6{ $0=$1*100; } ordinal→hanzi_1_9hanzi_100hanzi_0hanzi_1_9 semfunc_ordinal_7{ $0=$1*100+$4; } ordinal→hanzi_1_9hanzi_100hanzi_1_9hanzi_10 semfunc_ordinal_8{ $0=$1*100+$3*10; } ordinal→hanzi_1_9hanzi_100hanzi_1_9hanzi_10hanzi_1_9 semfunc_ordinal_9{ $0=$1*100+$3*10+$5; }
Collection ordinal number section
Ordinal_list → ordinal semfunc_ordinal_list_0{ $0={$1}; // { } expression set }
Ordinal_list → ordinal ordinal_list semfunc_ordinal_list_1{ $0={$1}$2; Two union of sets collection are got in // expression } ordinal int → ordinal dash ordinal semfunc_ordinal_int_0{ $0=[$1 , $3]; // [,] represents interval } ordinal_spec → ordinal_list|ordinal_int semfunc_ordinal_spec_0{ $0=$1; Ordinal_sec → ordinal_spec|bracket_lordinal_spec bracket_r|di_l ordinal_spec di_r
Fig. 6 shows the semantic function interpreter 60 that uses among the present invention.As shown in Figure 6, this semantic function interpreter comprises syntax table storage device 61, semantic function storage device 62, semantic function interpreter controller 63.Semantic function interpreter 60 is accepted the syntax tree of input and the node to be asked in the syntax tree, and 62 storages of semantic function storage device and syntax rule be semantic interpretation function one to one.Semantic interpreter controller 63 is core components, and it searches out corresponding rule according to node to be calculated in syntax table storage device 61, and finds corresponding semantic function in semantic function storage device 62, carries out semantic function then.Because semantic interpretation function generally is a recurrence, therefore in the middle of the implementation of certain semantic interpretation function, need recursively repeatedly in syntax tree, to find corresponding child node, in syntax table storage device 61, find the sub-rule of this child node correspondence, and in semantic function storage device 62, find corresponding subfunction and execution.Wait that asking the semantic function return value of node correspondence is exactly the collection ordinal number of corresponding TV programme, is exported by the semantic interpreter controller.
The method of using the present invention's proposition can extract the collection ordinal number information that TV program information comprises.Fig. 7 shows the process chart that draws the collection ordinal number.Wherein directly (reference number 11 expressions among Fig. 1) draw according to the information on services decoder for the title of program and type; Syntax analyzer can adopt above-described top-down syntax analyzer, or bottom-up syntax analyzer disposes and realizes.Can be directly according to whether there being the node corresponding (being the ordinal_spec node among the embodiment) to judge whether collection ordinal number composition in the syntax tree with the collection ordinal number.And obtain in the semantic function that heading syntax table and semantic interpretation function table can provide from above.
The flow process of extracting the collection ordinal number of TV programme according to the present invention is described below with reference to Fig. 7.At step S71, acceptance judges from program title and type information that information on services decoding unit (11 Fig. 1) provides whether program category is TV sequence program.If not the television series program, then do not collect ordinal number information, and output there is not the indication of collection ordinal number.If determine that the type of this TV programme is a TV sequence program, then program title is offered syntax analyzer, the program title that syntax analyzer is provided according to the heading syntax expression parsing of storing, the syntax tree that output is relevant at step S72.Next, judge whether comprise collection ordinal number composition in this syntax tree at step S73.If judged result then shows not collect ordinal number information for negating, and output does not have the indication of collection ordinal number.Exist collection ordinal number information if judged result shows, then syntax tree is offered semantic interpreter, flow process proceeds to step S74.Semantic interpreter calculates the collection ordinal number according to the heading syntax table and the semantic interpretation function table of storage, and output collection ordinal number information.After this, can see the collection ordinal number of this television series program by collection ordinal number browser interface shown in Figure 1.
The invention provides a kind ofly under digital television service information standard framework DVB-SI, utilize program title information to find the method and apparatus of continuous collection of drama ordinal number information automatically.The content of this method comprises the formation structure of describing title with regular grammar, uses the CFG analyzer to analyze the syntax tree of title, writes the collection ordinal number that semantic interpretation function calculates relevant composition correspondence.
Therefore it is pointed out that because which kind of structure unpredictable each television channel can provide program title with the heading syntax write of off-line can not cover all possible sentence pattern (sentence pattern) fully in theory.But title holds information its limitation is arranged, and the system development personnel also can accomplish the coverage (such as 95%) of high level, thereby reach practical requirement by market survey and analyzing in detail.Other research experience about natural language understanding system has also proved this point.
In addition, in set-top box dynamically under the prerequisite of install software, mode that also can be by version updating provides the download of new syntax to the user, extracts performance to obtain being close to 100% collection ordinal number
Method according to the collection ordinal number of extraction TV programme of the present invention can realize also can having the program of corresponding function by execution by realizing by processor by hardware.Described program can be recorded in such as floppy disk, hard disk, and CD-ROM is on the computer-readable recording medium of DVD-ROM and so on.
So far the detailed description of in conjunction with the preferred embodiments the present invention being carried out.Should be appreciated that the present invention is not limited thereto, but only be defined by the following claims that those skilled in the art can carry out various changes and improvements to embodiments of the invention without departing from the spirit and scope of the present invention.

Claims (13)

1. method of extracting the collection ordinal number information of TV programme from digital video broadcasting comprises step:
Receive television program titles and the type information play;
Program title syntax table analysis according to storage in advance is designated as the program title of TV sequence program and the generation heading character string corresponding with described program title by type information; With
According to the described program title syntax table of storage in advance and the semantic interpretation function table of the described program title syntax table of explanation, from the heading character string of described generation, extract the collection ordinal number information of described TV sequence program.
2. method according to claim 1, wherein the step of analyzing the described program title of described TV sequence program according to the program title syntax table of storage in advance further comprises: enumerate the syntax rule in the described syntax table, nonterminal character in the current state is rewritten or derived, all be rewritten into terminal character until all nonterminal characters, and the step of the whole couplings of the part of speech of the program title of terminal character string and input.
3. method according to claim 1, wherein the step of analyzing the described program title of described TV sequence program according to the program title syntax table of storage in advance further comprises: from the character string of the described program title of input, the adjacent character string is summed up, generation is corresponding to the left part symbol of the syntax rule in the described syntax table, until final generative grammar primary sign.
4. method according to claim 1, the step of wherein extracting the collection ordinal number information of described TV sequence program is included in the node of seeking indication collection ordinal number information in the described heading character string, search the semantic interpretation function of the syntax rule corresponding, utilize described semantic interpretation function to calculate the functional value of collection ordinal number information of described node as the collection ordinal number of described TV sequence program with this node.
5. device that extracts the collection ordinal number of TV programme comprises:
The information on services decoding device is used for the code stream that digital television broadcasting provides is decoded, and therefrom detects the information on services of the collection ordinal number information that comprises TV programme;
Collect the ordinal number model equipment, be used for the expression model of the collection ordinal number of stored television program;
Collection ordinal number extraction element, the decoding information on services that provides according to described information on services decoding device, the expression model of the collection ordinal number of storing in the utilization collection ordinal number model equipment mates the information on services of decoding and discerns calculating, to obtain the collection ordinal number information of TV programme; With
Control device is used to control the operation of each device and the required program of operation that storage is used to control each device.
6. device according to claim 5 comprises that also first collection reminds interface device, is used for the request according to the user, and the collection ordinal number that utilizes collection ordinal number extraction element to calculate collects broadcast information by display interface with the head of TV programme and is shown to the user.
7. device according to claim 5 also comprises collection ordinal number browser interface device, is used for the request according to the user, and the collection ordinal number that utilizes collection ordinal number extraction element to calculate is shown to the user by display interface with the collection ordinal number of TV programme.
8. device according to claim 5, wherein said collection ordinal number model equipment is stored the grammatical representation formula of described program title.
9. device according to claim 5, wherein said collection ordinal number extraction element comprises syntax analyzer, described syntax analyzer comprises:
Grammatical representation formula storage device, the grammatical representation formula that is used to store described program title;
Incoming symbol string buffer storage is used for all symbols with the program title of the form of one-dimension array storage input;
Possible status list storage device is used to store the possible status list of safeguarding when described syntax analyzer moves; With
The top-down parser controller is used for going out corresponding heading character string and output according to the content analysis of above-mentioned memory device stores.
10. device according to claim 5, wherein said collection ordinal number extraction element comprises syntax analyzer, described syntax analyzer comprises:
Incoming symbol string buffer storage is used for depositing with the form of one-dimension array all symbols of the program title of input;
The agenda storage device, composition to be expanded of a certain moment when being used for storage running;
The arc of motion storage device, all arc of motion that generate when being used for storage running;
The line diagram storage device, all the components that generates when being used for storage running; With
The analyzer controller is used to analyze the content of above-mentioned memory device stores, to analyze corresponding heading character string and output.
11. device according to claim 10, wherein said syntax analyzer comprise grammatical representation formula storage device, are used to store the grammatical representation formula of described program title.
12. according to claim 9 or 11 described devices, wherein said collection ordinal number extraction element comprises the semantic function interpreting means, described semantic function interpreting means comprises:
The semantic function storage device is used for storage and grammatical representation formula semantic interpretation function one to one;
The semantic interpreter controller, heading character string according to described syntax analyzer output, search the grammatical representation formula of grammatical representation formula memory device stores, semantic function with the correspondence of storing in the semantic function storage device, calculate described semantic function then, to obtain the collection ordinal number of corresponding semantic function return value as the TV programme of correspondence.
13. a method of extracting the collection ordinal number of TV programme from digital video broadcasting comprises step:
Receive the data code flow of information on services;
To the information on services sign indicating number decoding that receives, therefrom extract information on services,
The collection ordinal number expression model of use storage mates the information on services of decoding and discerns calculating, so that isolate collection ordinal number information from information on services;
When isolated collection ordinal number information is first collection, collect broadcast information to the head of user reminding TV programme; With
When isolated collection ordinal number is not first collection, provide the corresponding collection ordinal number information of TV programme to the user by browser interface.
CNB2004100319225A 2004-03-31 2004-03-31 Method and apparatus for fetching volume ordinal number of TV play from digital TV broadcast Expired - Fee Related CN100426851C (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CNB2004100319225A CN100426851C (en) 2004-03-31 2004-03-31 Method and apparatus for fetching volume ordinal number of TV play from digital TV broadcast

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CNB2004100319225A CN100426851C (en) 2004-03-31 2004-03-31 Method and apparatus for fetching volume ordinal number of TV play from digital TV broadcast

Publications (2)

Publication Number Publication Date
CN1678042A CN1678042A (en) 2005-10-05
CN100426851C true CN100426851C (en) 2008-10-15

Family

ID=35050321

Family Applications (1)

Application Number Title Priority Date Filing Date
CNB2004100319225A Expired - Fee Related CN100426851C (en) 2004-03-31 2004-03-31 Method and apparatus for fetching volume ordinal number of TV play from digital TV broadcast

Country Status (1)

Country Link
CN (1) CN100426851C (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102164305B (en) * 2011-01-26 2017-02-22 优视科技有限公司 Video processing method and device and mobile communication terminal

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1130843A (en) * 1994-09-29 1996-09-11 索尼公司 Program information broadcasting system, program information display method, and receiving device
JPH11298815A (en) * 1998-04-08 1999-10-29 Hitachi Ltd Receiver making it possible to display program information
CN1272281A (en) * 1998-05-29 2000-11-01 索尼公司 Information processing apparatus and method, and providing medium
US20020157097A1 (en) * 2001-04-24 2002-10-24 Williams Joseph F. What has changed on television
CN1477873A (en) * 2002-08-22 2004-02-25 Lg������ʽ���� Digital TV and method for managing program information

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1130843A (en) * 1994-09-29 1996-09-11 索尼公司 Program information broadcasting system, program information display method, and receiving device
JPH11298815A (en) * 1998-04-08 1999-10-29 Hitachi Ltd Receiver making it possible to display program information
CN1272281A (en) * 1998-05-29 2000-11-01 索尼公司 Information processing apparatus and method, and providing medium
US20020157097A1 (en) * 2001-04-24 2002-10-24 Williams Joseph F. What has changed on television
CN1477873A (en) * 2002-08-22 2004-02-25 Lg������ʽ���� Digital TV and method for managing program information

Also Published As

Publication number Publication date
CN1678042A (en) 2005-10-05

Similar Documents

Publication Publication Date Title
US10397654B2 (en) Transmission and reception apparatuses, methods, and systems for filtering content
US9177080B2 (en) Automatic segmentation of video
CN102265276B (en) Commending system based on context
CN101267518B (en) Method and system for extracting relevant information from content metadata
CN101600118B (en) Device and method for extracting audio/video content information
JP3606764B2 (en) A system for performing recording reservation or playing a recorded program from a TV program guide presented in association with file object browsing
CN101422041A (en) Internet search-based television
CN101296362A (en) Method and system for providing access to information of potential interest to a user
CN101595481A (en) Be used on electronic installation, promoting the method and system of information search
CN103354623A (en) Network terminal system and terminal device
CA2784366A1 (en) Segmentation of video according to narrative theme
CN104079993B (en) A kind of set top box upgrading method, set top box, server and system
CN103218385A (en) Server apparatus, information terminal, and program
CN104717572A (en) Film searching and sorting method, system and semantic dictionary set establishing method
CN100574421C (en) A kind of method of program searching of Digital Television
CN102291615A (en) Television program precision searching and detail viewing device and method based on one-way network
WO2000048095A1 (en) Information transfer system and apparatus for preparing electronic mail
CN107566906A (en) A kind of video comments processing method and processing device
EP1345418A2 (en) Reception apparatus
CN102256179A (en) Method and system for displaying program information of television terminal and television terminal
CN100426851C (en) Method and apparatus for fetching volume ordinal number of TV play from digital TV broadcast
JP2004274257A (en) Transmitter, receiver, and viewing history information utilization type broadcasting system in data broadcasting
US20110131598A1 (en) System and method for producing an electronic program guide for user-created content
Lukic et al. A java API interface for the search of DTV services in embedded multimedia devices
EP1069715A1 (en) Method and apparatus for data transmission

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20081015

Termination date: 20200331