EP1723555A1 - Procede et appareil de positionnement de contenu dans un programme - Google Patents

Procede et appareil de positionnement de contenu dans un programme

Info

Publication number
EP1723555A1
EP1723555A1 EP05702854A EP05702854A EP1723555A1 EP 1723555 A1 EP1723555 A1 EP 1723555A1 EP 05702854 A EP05702854 A EP 05702854A EP 05702854 A EP05702854 A EP 05702854A EP 1723555 A1 EP1723555 A1 EP 1723555A1
Authority
EP
European Patent Office
Prior art keywords
word symbol
information
stream
user
program
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Withdrawn
Application number
EP05702854A
Other languages
German (de)
English (en)
Inventor
Xin Philips Electronics China CHEN
Yongqin Philips Electronics China ZENG
Ningjiang Philips Electronics China CHEN
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Koninklijke Philips NV
Original Assignee
Koninklijke Philips Electronics NV
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Koninklijke Philips Electronics NV filed Critical Koninklijke Philips Electronics NV
Publication of EP1723555A1 publication Critical patent/EP1723555A1/fr
Withdrawn legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Systems or methods specially adapted for specific business sectors, e.g. utilities or tourism
    • G06Q50/10Services
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/70Information retrieval; Database structures therefor; File system structures therefor of video data
    • G06F16/78Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • G06F16/783Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content
    • G06F16/7844Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content using original textual content or text extracted from visual content or transcript of audio data
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/238Interfacing the downstream path of the transmission network, e.g. adapting the transmission rate of a video stream to network bandwidth; Processing of multiplex streams
    • H04N21/2387Stream processing in response to a playback request from an end-user, e.g. for trick-play

Definitions

  • the present invention relates to a method and apparatus for locating program contents , particularly to a method and apparatus for locating according to the contents of the multimedia programs.
  • a multimedia program In addition to a video stream and an audio stream , a multimedia program generally contains an image stream and/or a text stream, these streams are synchronized with each other according to particular rules and predetermined time sequence for users to enjoy.
  • the synchronized multimedia integration language (SMIL) is a popular editing language.
  • the SMIL can not only integrate the respective content streams of a multimedia program in time sequence , but also be used to manage the layout of the multimedia program being presented. While watching a multimedia program, a user sometimes needs to find a particular segment of the program.
  • the multimedia playing apparatus should be able to automatically matching analysis the contents of the video streams, so that when the Sydney Opera Theater appears, the related segment is presented to the user.
  • content locating as described above , if a user performs location manually, he/she has to perform repeatedly the search before finding the desired position of the segment, which would be time- consuming and bothersome.
  • the editing tools provide only a very limited number of titles for users to choose from , which restricts the arbitrary of the user's choices, and renders the user-based choice impossible. Therefore, a new program content locating method and apparatus is needed, which enables users to locate program contents in multimedia programs conveniently so that their individual requirements could be satisfied by obtaining any segments as they want.
  • one of the objects of the present invention is providing a new program content locating method and apparatus to overcome the defects of the prior art, which enables users to locate program contents in multimedia programs conveniently to obtain the particular segments as they want.
  • the present invention provides a method for locating content in a multimedia program, which comprising a stream with word symbol information, comprising: firstly receiving a request comprising a specific word symbol from a user; then determining a position where the specific word symbol appears in the stream with word symbol information; and finally determining other presentable information synchronous with the word symbol information at the position.
  • the other presentable information may be video information or audio information.
  • the word symbol information may exist in a text format or image format.
  • the locating method further comprises the step of obtaining the text information corresponding to the word symbol information.
  • the stream provided with word symbol information may have a layered structure. If so, the locating method further comprises the step of determining a layer containing the position where specific word symbol appear and having a particular starting position and a particular end position, so that the other finally determined presentable information has the corresponding start position and end position.
  • the present invention further provides an apparatus for locating contents in a multimedia program, which has a stream provided with word symbol information.
  • the word symbol information may exist in a text format or an image format.
  • the apparatus includes a request receiving means, a word symbol locating means and a synch-locating means.
  • the request receiving means is used to receive a request comprising a specific word symbol from a user; the word symbol locating means is used to determine the position where the specific word symbol appear in the stream provided with word symbol information; and the synch-locating means is used to determine the other presentable information that synchronizes with the word symbol information appearing at the position.
  • the other presentable information may be video information or audio information.
  • the present invention locates the position of a user required segment in a program by analyzing the stream provided with word symbol information which is included in multimedia programs, then finds corresponding video or audio segment according to the synchronization rules.
  • the streams provided with word symbol information such as text streams or image streams, contain much less a quantity of data relative to video or audio, and the analysis of text is also much simpler than that of picture or audio; therefore , the present invention has greatly reduced the complexity of searching program contents, lowered the hardware requirement , made user's operation convenient, and satisfied different needs of individual users.
  • word symbol information such as text streams or image streams
  • Figure 1 is a system block diagram of an apparatus for locating contents in a multimedia program according to an embodiment of the present invention
  • Figure 2 is a flow chart of the process for locating contents in a multimedia program according to an embodiment of the invention
  • Figure 3 is a flow chart of the process for locating contents in a multimedia program and extracting particular segments according to another embodiment of the present invention
  • the same reference numbers indicate similar or identical features and functions.
  • Figure 1 shows a system block diagram of an apparatus for locating contents in multimedia programs according to an embodiment of the present invention.
  • the apparatus 100 may be part of a multimedia program making apparatus (not shown in this figure) or a multimedia playing apparatus (not shown in this figure).
  • the apparatus 100 includes a request receiving module 120, a text locating module 130 and a synch-locating module 140.
  • the apparatus 100 further includes a content receiving module 110, a presentation module 150 and an extraction module 160.
  • Said module included in the apparatus 100 can be realized by those skilled in the art by the various existing module as long as their combination can perform the functions of the present invention.
  • the content receiving module 110 is used to receive a multimedia program, which contains a stream provided with word symbol information, such as a text stream or an image stream having the word symbol information (as slides of the auxiliary demonstration tools in existing multimedia demonstration programs, e.g., one page of a PowerPoint file, sometimes transmitted in an image format).
  • the multimedia program may come from a local storage module (not shown in the figure), such as a DVD, or from a web server (not shown in the figure).
  • Request receiving module 120 is used to receive a request, which contains specific word symbol, such as, "Sydney Opera Theater". A user hopes to find with this request in the segment on Sydney Opera Theater in the multimedia program being edited/appreciated.
  • the multimedia program includes a stream provided with word symbol information.
  • Text locating module 130 is used to determine the position of specific word symbol in the multimedia program.
  • Module 130 searches the specific word symbol, such as "Sydney Opera Theater", in the stream provided with word symbol information, and, after the specific word symbol are found, obtains the information on their positions in the programs. If the stream provided with word symbol information is an image stream, the module 130 is further used to obtain the text information corresponding to the word symbol information in the image stream.
  • Synch-locating module 140 is used to determine the other presentable information that synchronizes with the word symbol information appearing at the site.
  • FIG. 150 is used to present to the user the program contents in the particular position within a multimedia program.
  • Extracting module 160 is used to extract a particular segment from a multimedia program. In this embodiment , the particular segment may contain the particular text information.
  • Figure 2 is a flow chart of a process for locating contents in a multimedia program according to an embodiment of the present invention.
  • a multimedia program including a stream provided with word symbol information is obtained in step 210 (S210).
  • the word symbol information exists in a text format, for example, in the case of a multimedia digital television program stream, the captions exist in the data stream in a text format; in the case of a multimedia demonstration program stream , the wording contents for the demonstration exist in a text stream in a text format. If the multimedia program is relatively long, this step will not end until the entire locating process ends.
  • the multimedia program about Australia Scenery is taken as an example.
  • the program includes a text stream carrying corresponding commentary contents.
  • a request containing specific word symbol such as "Sydney Opera Theater" is received from a user (S230) ; the user expects that specific word symbol exist at certain position in the text stream and hopes to find the segment containing the specific word symbol in the multimedia program obtained in S210.
  • the specific word symbol are searched in the text stream and it is judged whether they have been found appeared at a particular site in the text stream (S230). If they have not been found, then the process informs the user that the specific word symbol are not found in this multimedia program (S234) , and the entire process comes to an end. If they have been found, the process obtains the position information about the site where they appeared (S238 ) , for example, that of the "Sydney Opera
  • the corresponding position of the specific word symbol in the video stream is determined on the basis of the particular synchronization rules of the multimedia program (S240), if the video at "01 : 03: 06" (hh: mm: ss) from the start of the program is found, the picture at the time often contains the scenery of the Sydney Opera Theater corresponding to the commentary.
  • the synchronization rules for a multimedia program can be varied, and will not be elaborated here.
  • the video contents at the particular position are presented to the user (S250), the pictures of the particular position contain the scenery of the Sydney Opera Theater that the user wants.
  • the user it is possible to present to the user all the contents of the multimedia program, such as, the video/audio, image and text at this particular position; or to present another part of them, for example, the audio only, to the user, to satisfy his individual needs.
  • the presenting process of S250 it is also possible to present the video contents in the periods of time before and/or after this particular site appears.
  • the duration of the period may be fixed a time-value by the user, or be fixed a default value by the system.
  • the user may include a starting position information and a ending position information in the request of the S220, both of which correspond to the particular appearing site expected by the user.
  • the position where the specific word symbol appear in the corresponding position of audio or image streams may be determined according to the synchronization rules. Since video, audio or even image is more complex than text in composition, the processes of analyzing and locating it are also much more complex than those of text. Thus it can be seen that the locating method developed in the present invention is much simpler than that of the prior method through audio/video. In said locating process , if the specific word symbol , such as " Sydney Opera Theater " appears many times in said text stream, when S250 presents the video contents of the particular site to the user, the user is given a chance to choose whether to keep on searching.
  • FIG. 3 is a flow chart of a process for locating contents in a multimedia program and extracting particular segment according to another embodiment of the present invention.
  • a multimedia program is obtained (S310) , which includes a stream provided with word symbol information existing in an image format.
  • word symbol information existing in an image format.
  • its demonstration slides contain word symbol information contents, and exist in an image stream in an image format.
  • Table 1 is a SMIL Script of a multimedia demonstration program including a video stream and an image stream synchronized with the video stream; said image stream includes the demonstration slides and words on the slides, and these words are in an image format.
  • Table 1 A Multimedia Demonstration Program It is seen from Table 1 that said image stream, having a layered structure, contains 9 sections: imagel , Image2 >. image3 > image4 image ⁇ image6 image7> images ⁇ I mageg.
  • Each section corresponds to one slide, that is, each section has its particular starting position and length of continuation for the reason that the video/audio generally change constantly during the demonstration process, and each slide is normally kept unchanged for a period of time. Since it is impossible to directly conduct a textual analysis of the words existing in an image format, a certain means may be used to obtain the text information corresponding to the word symbol information in the image stream (S320). This obtainment step can be performed by the existing Optical Character Recognition (OCR) technology. Then, a request containing specific word symbol is received from the user (S330); the user expects that the specific word symbol exist at one or more position in said multimedia program stream and hopes to find and extract the segments including the specific word symbol through the request.
  • OCR Optical Character Recognition
  • the specific word symbol are searched in the word symbol information of the image stream and it is judged whether the specific word symbol have been found appeared at a particular site (S340). If they are not found, then the user is informed that the specific word symbol are not found in this multimedia program (S344) , and the entire process ends. If they are, then the information appeared at the particular site (S350) is obtained. For example : these specific word symbol appear in the word symbol information of the image2, then the starting position and the duration of image2 are obtained. After that, according to the synchronization rules of the particular multimedia program, determination is made of the corresponding position in the video stream of the site where the words appear (S360). At this time, the starting position and duration of the particular segment of the corresponding video stream are the same as those of image2.
  • the original SMIL Script is modified on the basis of the obtained starting position and duration of the particular segment to obtain a new SMIL Script (S370).
  • This SMIL Script reflects only the segment found, thus making it possible to extract the user needed particular segment from the multimedia program. By selectively performing the modified SMIL Script, the user can directly browse the needed particular segment.
  • S380 further judgment can be made as to whether or not it is necessary to go on with the search (S380). If it is not, then the entire extracting process ends; if it is, the process will return to S340, then go ahead with the search from the last found particular site along the original search direction until a next segment or program the user wants to watch is found.
  • the judgment can be made by automatically judging whether the multimedia program ends or not, or a decision is made by the user by prompting him to do so.
  • said particular text information is also found in image ⁇ and image ⁇ .
  • the modified SMIL Script finally obtained is shown in Table 2.
  • the multimedia program segments corresponding to the SMIL Script contain said particular text information.
  • the stream provided with word symbol information of the multimedia program has a layered structure.
  • the layered structure can be presented as 9 parallel images arranged in sequence like the chapters of a book, that is, the respective layers can be mutually contained.
  • the present invention uses the streams provided with word symbol information contained in the multimedia program to perform the locating and since the analysis of the word symbol information is much simpler than that of the audio/video information, the present invention frees the program producers from a lot of work and reduces the complexity of the work. It enables the users to relatively easily perform locating operation with simpler and less expensive equipment. Furthermore , it also makes it possible to use voice recognition technology to convert dialogues in the audio into the text information to be used for the locating operation.

Abstract

La présente invention concerne un procédé de positionnement de contenu dans un programme multimédia comprenant un train de données contenant des informations sur les symboles des mots. Le procédé consiste à recevoir d'un utilisateur une demande contenant un symbole d'un mot spécifique, à déterminer la position courante dudit symbole du mot spécifique dans ledit train de données contenant des informations sur les symboles des mots, et à déterminer d'autres informations pouvant être présentées de manière synchronisée avec les informations sur les symboles des mots sur la position obtenue. Par comparaison avec les informations vidéo ou les informations audio, un train de données contenant des informations de symboles de mots contient une plus petite quantité de données, et l'analyse du symbole d'un mot est plus simple. La présente invention simplifie ainsi de manière significative la complexité de la recherche d'un contenu de programme, réduit les exigences de matériel, facilite les opérations de l'utilisateur, et satisfait les exigences personnelles de l'utilisateur.
EP05702854A 2004-02-24 2005-02-01 Procede et appareil de positionnement de contenu dans un programme Withdrawn EP1723555A1 (fr)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN2004100076685A CN1662053A (zh) 2004-02-24 2004-02-24 一种节目内容定位方法和装置
PCT/IB2005/050415 WO2005083592A1 (fr) 2004-02-24 2005-02-01 Procede et appareil de positionnement de contenu dans un programme

Publications (1)

Publication Number Publication Date
EP1723555A1 true EP1723555A1 (fr) 2006-11-22

Family

ID=34892100

Family Applications (1)

Application Number Title Priority Date Filing Date
EP05702854A Withdrawn EP1723555A1 (fr) 2004-02-24 2005-02-01 Procede et appareil de positionnement de contenu dans un programme

Country Status (5)

Country Link
EP (1) EP1723555A1 (fr)
JP (1) JP2007525900A (fr)
KR (1) KR20070020208A (fr)
CN (2) CN1662053A (fr)
WO (1) WO2005083592A1 (fr)

Families Citing this family (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101075233B (zh) * 2006-05-17 2012-05-02 华为技术有限公司 多媒体内容收集部件、系统及其方法
WO2008131520A1 (fr) 2007-04-25 2008-11-06 Miovision Technologies Incorporated Procédé et système pour analyser un contenu multimédia
CN101470710B (zh) * 2007-12-27 2011-01-12 Tcl集团股份有限公司 多媒体文件中的内容的定位方法
CN102955809A (zh) * 2011-08-26 2013-03-06 吴志刚 媒体文件编辑和播放的方法和系统
CN102592628A (zh) * 2012-02-15 2012-07-18 张群 一种音视频播放文件的播放控制方法
CN103226966A (zh) * 2013-04-26 2013-07-31 广东欧珀移动通信有限公司 一种可快速定位播放进度的方法及移动终端
CN104572714A (zh) * 2013-10-18 2015-04-29 英业达科技有限公司 学习影像的查询系统及其方法
CN104572716A (zh) * 2013-10-18 2015-04-29 英业达科技有限公司 影音文件播放的系统及其方法
CN104572712A (zh) * 2013-10-18 2015-04-29 英业达科技有限公司 浏览多媒体文件的系统及方法
CN103605765B (zh) * 2013-11-26 2016-11-16 电子科技大学 一种基于聚类紧凑特征的海量图像检索系统
CN105117407B (zh) * 2015-07-27 2019-03-26 电子科技大学 一种基于聚类的距离方向直方图的图像检索方法
CN105163178B (zh) * 2015-08-28 2018-08-07 北京奇艺世纪科技有限公司 一种视频播放位置定位方法和装置
CN107093336A (zh) * 2016-09-06 2017-08-25 北京新学堂网络科技有限公司 一种将电影做成阅读学习式连环画的制作方法
CN108062302B (zh) * 2016-11-08 2019-03-26 北京国双科技有限公司 一种文本信息的识别方法及装置
CN107340968B (zh) * 2017-07-18 2021-03-09 网易传媒科技(北京)有限公司 一种基于手势来播放多媒体文件的方法、设备和计算机可读存储介质
CN108989851A (zh) * 2018-08-27 2018-12-11 努比亚技术有限公司 一种视频播放方法、终端及计算机可读存储介质
CN111339323A (zh) * 2020-02-21 2020-06-26 联想(北京)有限公司 一种多媒体文件的信息处理方法及装置

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5136655A (en) * 1990-03-26 1992-08-04 Hewlett-Pacard Company Method and apparatus for indexing and retrieving audio-video data
US6430357B1 (en) * 1998-09-22 2002-08-06 Ati International Srl Text data extraction system for interleaved video data streams
US7653925B2 (en) * 1999-11-17 2010-01-26 Ricoh Company, Ltd. Techniques for receiving information during multimedia presentations and communicating the information

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
See references of WO2005083592A1 *

Also Published As

Publication number Publication date
WO2005083592A1 (fr) 2005-09-09
JP2007525900A (ja) 2007-09-06
KR20070020208A (ko) 2007-02-20
CN1922610A (zh) 2007-02-28
CN1662053A (zh) 2005-08-31

Similar Documents

Publication Publication Date Title
WO2005083592A1 (fr) Procede et appareil de positionnement de contenu dans un programme
US8374845B2 (en) Retrieving apparatus, retrieving method, and computer program product
JP4550725B2 (ja) 映像視聴支援システム
US8799945B2 (en) Information processing apparatus, information processing method, and computer program
TWI358948B (fr)
US20200126559A1 (en) Creating multi-media from transcript-aligned media recordings
US8413192B2 (en) Video content viewing apparatus
US20200126583A1 (en) Discovering highlights in transcribed source material for rapid multimedia production
US7844115B2 (en) Information processing apparatus, method, and program product
KR20090004990A (ko) 인터넷 검색 기반 텔레비전을 위한 방법, 매체 및 시스템
US7904452B2 (en) Information providing server, information providing method, and information providing system
JP2009055152A (ja) 動画作成装置、動画作成方法、およびプログラム
JP2006155384A (ja) 映像コメント入力・表示方法及び装置及びプログラム及びプログラムを格納した記憶媒体
US20120066235A1 (en) Content processing device
JP2010161722A (ja) データ処理装置、データ処理方法、及び、プログラム
US20040177317A1 (en) Closed caption navigation
JP2010220065A (ja) コンテンツ推薦装置及びコンテンツ推薦方法
US20100083314A1 (en) Information processing apparatus, information acquisition method, recording medium recording information acquisition program, and information retrieval system
US20080005100A1 (en) Multimedia system and multimedia search engine relating thereto
US20090083227A1 (en) Retrieving apparatus, retrieving method, and computer program product
CN114268829B (zh) 视频处理方法、装置、电子设备及计算机可读存储介质
JP4064902B2 (ja) メタ情報生成方法、メタ情報生成装置、検索方法および検索装置
KR102252522B1 (ko) 내용 기반 동영상 목차 자동생성 방법 및 시스템
US20080016068A1 (en) Media-personality information search system, media-personality information acquiring apparatus, media-personality information search apparatus, and method and program therefor
JP2007511858A (ja) 拡張検索機能を提供するメタ情報及びサブタイトル情報が記録された記録媒体及びその再生装置

Legal Events

Date Code Title Description
PUAI Public reference made under article 153(3) epc to a published international application that has entered the european phase

Free format text: ORIGINAL CODE: 0009012

17P Request for examination filed

Effective date: 20060925

AK Designated contracting states

Kind code of ref document: A1

Designated state(s): AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IS IT LI LT LU MC NL PL PT RO SE SI SK TR

DAX Request for extension of the european patent (deleted)
STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: THE APPLICATION HAS BEEN WITHDRAWN

18W Application withdrawn

Effective date: 20080507