WO2008130182A1 - Method and apparatus for retrieving multimedia contents - Google Patents

Method and apparatus for retrieving multimedia contents Download PDF

Info

Publication number
WO2008130182A1
WO2008130182A1 PCT/KR2008/002285 KR2008002285W WO2008130182A1 WO 2008130182 A1 WO2008130182 A1 WO 2008130182A1 KR 2008002285 W KR2008002285 W KR 2008002285W WO 2008130182 A1 WO2008130182 A1 WO 2008130182A1
Authority
WO
WIPO (PCT)
Prior art keywords
indicator
query
user query
multimedia contents
mpeg
Prior art date
Application number
PCT/KR2008/002285
Other languages
French (fr)
Inventor
Hee-Cheol Seo
Mi-Ran Choi
Hyun-Ki Kim
Myung-Gil Jang
Jeong Heo
Soo-Jong Lim
Yeo-Chan Yoon
Kyoung-Ro Yoon
Original Assignee
Electronics And Telecommunications Research Institute
Konkuk University Industrial Cooperation Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Electronics And Telecommunications Research Institute, Konkuk University Industrial Cooperation Corp filed Critical Electronics And Telecommunications Research Institute
Priority to EP08753141A priority Critical patent/EP2143027A4/en
Priority to CN2008800173376A priority patent/CN101720462B/en
Priority to JP2010506044A priority patent/JP5426533B2/en
Priority to US12/597,158 priority patent/US8577919B2/en
Publication of WO2008130182A1 publication Critical patent/WO2008130182A1/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/50Information retrieval; Database structures therefor; File system structures therefor of still image data
    • G06F16/58Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/40Information retrieval; Database structures therefor; File system structures therefor of multimedia data, e.g. slideshows comprising image and additional audio data
    • G06F16/48Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/50Information retrieval; Database structures therefor; File system structures therefor of still image data
    • G06F16/58Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • G06F16/587Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using geographical or spatial information, e.g. location
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/60Information retrieval; Database structures therefor; File system structures therefor of audio data
    • G06F16/63Querying
    • G06F16/632Query formulation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/60Information retrieval; Database structures therefor; File system structures therefor of audio data
    • G06F16/68Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/70Information retrieval; Database structures therefor; File system structures therefor of video data

Definitions

  • the present invention relates to an apparatus and method for retrieving multimedia contents; and, more particularly, to multimedia contents retrieving apparatus that can retrieve multimedia contents represented based on Moving Picture Experts Group 7 (MPEG-7) by transforming a user query into an MPEG-7 query format, and a method thereof.
  • MPEG-7 Moving Picture Experts Group 7
  • Moving Picture Experts Group 7 is an international standardization on the architectures of metadata representing multimedia information, such as image, audio and moving picture.
  • An MPEG-7 query format is used to retrieve multimedia contents represented based on the MPEG-7.
  • An MPEG-7 multimedia contents retrieving system retrieves multimedia contents related to a query inputted in an MPEG-7 query format.
  • the MPEG-7 query format defines syntaxes for retrieving MPEG-7 documents.
  • the syntaxes can represent diverse types of queries that can be used for the retrieval of MPEG-7 documents. For example, they can represent not only natural sentence-type query such as "an image with mountain” but also example-based query using a multimedia file as a query and MPEG-7 textual description-based query.
  • An embodiment of the present invention which is invented to resolve the problem, is directed to providing a Moving Picture Experts Group 7 (MPEG-7) query format that can satisfy more than two retrieval conditions within the same structure and clearly represent that different MPEG-7 documents are referred to.
  • MPEG-7 Moving Picture Experts Group 7
  • Another embodiment of the present invention is directed to providing an apparatus and method that can accurately retrieve multimedia contents by precisely analyzing the meaning of a user query in a retrieving process .
  • a method for retrieving multimedia contents which includes: representing a user query by using an indicator indicating a specific region of a Moving Picture Experts Group 7 (MPEG-7) document and a reference for referring to the indicator; analyzing a meaning of the user query represented by using the indicator and the reference to thereby produce an analysis result; and retrieving multimedia contents according to the analysis result.
  • MPEG-7 Moving Picture Experts Group 7
  • a method for processing a user query to retrieve multimedia contents which includes: receiving a query for retrieving multimedia contents from a user; representing the user query by using an indicator for indicating a specific region of an MPEG-7 document and a reference for referring to the indicator .
  • an apparatus for retrieving multimedia contents which includes: a query input unit for receiving a query for retrieving multimedia contents from a user; a query representation unit for representing the user query inputted through the query input unit by using an indicator for indicating a specific region of an MPEG-7 document and a reference for referring to the indicator; a query analysis unit for analyzing a meaning of the user query represented in the query representation unit by using the indicator and the reference to thereby produce an analysis result; and a contents retrieval unit for retrieving multimedia contents according to the analysis result.
  • a data structure for representing a user query to retrieve multimedia contents which includes: an indicator for indicating a specific region of an MPEG-7 document; and a reference for referring to the indicator.
  • the present invention described above provides an MPEG-7 query format that can satisfy more than two retrieval conditions within the same structure and clearly represent that different MPEG-7 documents are referred to. Also, since the meaning of a user query is precisely analyzed during a retrieving process, it is possible to retrieve multimedia contents that accurately agree with the user query.
  • Fig. 1 is a flowchart describing a multimedia contents retrieving method in accordance with an embodiment of the present invention.
  • Fig. 2 illustrates an extensible Markup Language (XML) schema of an indicator in accordance with an embodiment of the present invention.
  • Fig. 3 illustrates an XML schema of a reference in accordance with an embodiment of the present invention.
  • XML extensible Markup Language
  • Fig. 4 is a flowchart describing a query representation step SlO of Fig. 1 in detail.
  • Fig. 5 is a flowchart describing a query processing step S20 of Fig. 1 in detail.
  • Fig. 6 illustrates an XML schema of an indicator in accordance with another embodiment of the present invention .
  • Fig. 7 is a block view showing a structure of multimedia contents retrieving apparatus in accordance with an embodiment of the present invention.
  • Fig. 1 is a flowchart describing a multimedia contents retrieving method in accordance with an embodiment of the present invention.
  • a user query is represented as a query for retrieving multimedia contents.
  • the user query is represented using an indicator and a reference for referring to the indicator to precisely represent the meaning of the user query.
  • the indicator denotes a specific region of a Moving Picture Experts Group 7 (MPEG-7) document, and the reference is used to refer to the indicator.
  • MPEG-7 Moving Picture Experts Group 7
  • the reference is used to refer to the indicator.
  • MPEG-7 Moving Picture Experts Group 7
  • two indicators may be established for two different MPEG-7 documents, respectively, and each of the two indicators may have references to clearly represent the two different MPEG-7 documents from each other.
  • a query processor analyzes the user query represented using the indicator and references.
  • a retrieval engine retrieves multimedia contents related to the user query analyzed in the query processor and, in step S40, provides a retrieval result.
  • Fig. 2 illustrates an extensible Markup Language (XML) schema of an indicator in accordance with an embodiment of the present invention.
  • an indicator includes an indicator identification (ID) number 101, an indicator region descriptor 102, and an indicator limiting descriptor 103.
  • the indicator region descriptor 102 may include a reference 104 for referring to another indicator.
  • the indicator limiting descriptor 103 includes a part 105 describing conditions for limiting an indicator.
  • An MPEG-7 document is described in an XML format, and an indicator indicates a specific region of the MPEG- 7 document.
  • the indicator region descriptor 102 is used to designate an uppermost node of the specific region.
  • the indicator limiting descriptor 103 is used when an additional limiting condition is needed in connection with a region represented by indicator region descriptor.
  • the indicator ID number 101 is used when an indicator is referred to.
  • Table 1 shows Fig. 2 described in the format of an XML schema.
  • a "path” element is a part for describing an indicator region
  • a “selector” element is a part for describing limitation of an indicator.
  • the “id” denotes the unique number of an indicator.
  • a "ref” attribute is used.
  • "ConditionalType” is defined as a limiting condition to describe specific condition.
  • Fig. 3 illustrates an XML schema of a reference in accordance with an embodiment of the present invention.
  • An indicator may refer to a specific indicator, and it is possible to refer to a node inside a specific region which is indicated by the indicator.
  • an indicator may include a "ref” attribute for referring to a specific indicator, and represent a region related to the indicator by the attribute value.
  • XML schema related to Fig. 3 may be described as the following Table 2, where the "ref” attribute refers to the indicator and "xPathType" describes a part related to the indicator.
  • Fig. 4 is a flowchart describing a query representation step SlO of Fig. 1 in detail.
  • step S402 a query for retrieving multimedia contents is inputted from a user.
  • step S404 the inputted user query is represented as an indicator for indicating a specific region of an MPEG-7 document and a reference for referring to the indicator.
  • a query for "retrieving images whose horizontal length X vertical length is greater than 1024X768" can be represented as the following Table 3 based on the XML schema defined in the Tables 1 and 2.
  • Table 3 an indicator is referred to by using a reference “href,” and a specific part related to a region indicated by the indicator can be indicated by describing an additional path.
  • Fig. 5 is a flowchart describing a query processing step S20 of Fig. 1 in detail.
  • the meaning of the user query represented using an indicator and a reference is analyzed in the query processing step S20.
  • an XML parser parses a user query described in an XML format.
  • the indicator and the reference are processed based on a parsing result.
  • the meaning of the user query is analyzed using the processed indicator and reference.
  • references referring to the same indicator are regarded as values for referring to a value in the inside of the same region to analyze the meaning of the user query. For example, since “ ⁇ height” and “ ⁇ width” refer to "VisualCodingFramelD" in the user query, it is analyzed that the two refer to a value in the inside a region indicated by the "VisualCodingFramelD. "
  • Fig. 6 illustrates an XML schema of an indicator in accordance with another embodiment of the present invention.
  • An indicator ID number 601 is the same as the indicator ID number 101 of Fig. 2.
  • An indicator region descriptor of Fig. 6 is an optional element whereas an indicator limiting descriptor 603 is essential element, and it does not have "attribute.”
  • Fig. 6 may be described in an XML schema, which is presented in the following Table 4.
  • FIG. 7 is a block view showing a structure of multimedia contents retrieving apparatus in accordance with an embodiment of the present invention.
  • the multimedia contents retrieving apparatus 700 includes a query input unit 702, a query representation unit 704, a query analysis unit 706, a contents retrieval unit 708, and an output unit 710.
  • the query input unit 702 receives a query for retrieving multimedia contents from a user.
  • the query representation unit 704 represents the user query inputted through the query input unit 702 into an MPEG-7 query format by using an indicator indicating a specific region of an MPEG-7 document and a reference for referring to the indicator.
  • An indicator includes an indicator ID number used for a reference to refer to the indicator, a descriptor for describing limiting conditions for the region indicated by the indicator, and a descriptor for designating an uppermost node of the region indicated by the indicator.
  • the user query is represented in an XML format.
  • the query analysis unit 706 analyzes the meaning of the user query represented using the indicator and the reference in the query representation unit 704.
  • the query analysis unit 706 includes an XML parser 712 for parsing a user query, a descriptor processor 714 for processing an indicator and a reference based on the parsing result of the XML parser 712, and a meaning analyzer 716 for analyzing the meaning of the user query based on the indicator and the reference processed in the descriptor processor 714.
  • the contents retrieval unit 708 retrieves multimedia contents according to the analysis result of the user query analysis unit 706.
  • the contents retrieval unit 708 may retrieve a database 718 or search the internet 722 through a communication unit 720.
  • the database 718 may be set up inside or outside the multimedia contents retrieving apparatus 700.
  • the output unit 710 provides multimedia contents retrieved by the contents retrieval unit 708 to the user.
  • the method of the present invention described above may be realized as a program and stored in a computer-readable recording medium, such as CD-ROM, RAM, ROM, floppy disks, hard disks, magneto-optical disks and the like. Since this process can be easily implemented by those skilled in the art to which the present invention belongs, further description will not be provided herein.
  • a computer-readable recording medium such as CD-ROM, RAM, ROM, floppy disks, hard disks, magneto-optical disks and the like. Since this process can be easily implemented by those skilled in the art to which the present invention belongs, further description will not be provided herein.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Library & Information Science (AREA)
  • Mathematical Physics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)

Abstract

Disclosed is an apparatus and method for retrieving multimedia contents represented in a Moving Picture Experts Group (MPEG) 7 by transforming a user query into an MPEG-7 query format. The method for retrieving multimedia contents includes: representing a user query by using an indicator indicating a specific region of a Moving Picture Experts Group 7 (MPEG-7) document and a reference for referring to the indicator; analyzing a meaning of the user query represented by using the indicator and the reference to thereby produce an analysis result; and retrieving multimedia contents according to the analysis result. The present research can satisfy more than two retrieval conditions within the same structure in an MPEG-7 query format and it can also clearly represent that two different MPEG-7 documents are referred to. Since the meaning of a user query is analyzed accurately during retrieval process, it is possible to precisely retrieve multimedia contents.

Description

DESCRIPTION
METHOD AND APPARATUS FOR RETRIEVING MULTIMEDIA CONTENTS
TECHNICAL FIELD The present invention relates to an apparatus and method for retrieving multimedia contents; and, more particularly, to multimedia contents retrieving apparatus that can retrieve multimedia contents represented based on Moving Picture Experts Group 7 (MPEG-7) by transforming a user query into an MPEG-7 query format, and a method thereof.
This work was supported by the IT R&D program of MIC/IITA [2005-S-117-03, "Development of Intelligent Personal Media Managing Technology for Ubiquitous Environment"].
BACKGROUND ART
Moving Picture Experts Group 7 (MPEG-7) is an international standardization on the architectures of metadata representing multimedia information, such as image, audio and moving picture. An MPEG-7 query format is used to retrieve multimedia contents represented based on the MPEG-7. An MPEG-7 multimedia contents retrieving system retrieves multimedia contents related to a query inputted in an MPEG-7 query format.
The MPEG-7 query format defines syntaxes for retrieving MPEG-7 documents. The syntaxes can represent diverse types of queries that can be used for the retrieval of MPEG-7 documents. For example, they can represent not only natural sentence-type query such as "an image with mountain" but also example-based query using a multimedia file as a query and MPEG-7 textual description-based query.
While representing such diverse queries, referring to the same or different portions of an MPEG-7 document occurs frequently. To be specific, there is a case where more than one retrieval condition should be all satisfied in the same structure. For example, to retrieve moving picture segments with "mountain" and "sea", the presence of "mountain" and "sea" could be represented for one region. As for joint operation, two different MPEG-7 documents should be referred to. For this, it should be clearly represented that two different documents are referred to. Conventional MPEG-7 query formats may satisfy more than two retrieval conditions within the same architecture, but they have a shortcoming that they cannot clearly represent reference to two different MPEG- 7 documents.
DISCLOSURE TECHNICAL PROBLEM
An embodiment of the present invention, which is invented to resolve the problem, is directed to providing a Moving Picture Experts Group 7 (MPEG-7) query format that can satisfy more than two retrieval conditions within the same structure and clearly represent that different MPEG-7 documents are referred to.
Another embodiment of the present invention is directed to providing an apparatus and method that can accurately retrieve multimedia contents by precisely analyzing the meaning of a user query in a retrieving process .
Other objects and advantages of the present invention can be understood by the following description, and become apparent with reference to the embodiments of the present invention. Also, it is obvious to those skilled in the art of the present invention that the objects and advantages of the present invention can be realized by the means as claimed and combinations thereof. TECHNICAL SOLUTION
In accordance with an aspect of the present invention, there is provided a method for retrieving multimedia contents, which includes: representing a user query by using an indicator indicating a specific region of a Moving Picture Experts Group 7 (MPEG-7) document and a reference for referring to the indicator; analyzing a meaning of the user query represented by using the indicator and the reference to thereby produce an analysis result; and retrieving multimedia contents according to the analysis result.
In accordance with another aspect of the present invention, there is provided a method for processing a user query to retrieve multimedia contents, which includes: receiving a query for retrieving multimedia contents from a user; representing the user query by using an indicator for indicating a specific region of an MPEG-7 document and a reference for referring to the indicator .
In accordance with another aspect of the present invention, there is provided an apparatus for retrieving multimedia contents, which includes: a query input unit for receiving a query for retrieving multimedia contents from a user; a query representation unit for representing the user query inputted through the query input unit by using an indicator for indicating a specific region of an MPEG-7 document and a reference for referring to the indicator; a query analysis unit for analyzing a meaning of the user query represented in the query representation unit by using the indicator and the reference to thereby produce an analysis result; and a contents retrieval unit for retrieving multimedia contents according to the analysis result. In accordance with another aspect of the present invention, there is provided a data structure for representing a user query to retrieve multimedia contents, which includes: an indicator for indicating a specific region of an MPEG-7 document; and a reference for referring to the indicator.
ADVANTAGEOUS EFFECTS
The present invention described above provides an MPEG-7 query format that can satisfy more than two retrieval conditions within the same structure and clearly represent that different MPEG-7 documents are referred to. Also, since the meaning of a user query is precisely analyzed during a retrieving process, it is possible to retrieve multimedia contents that accurately agree with the user query.
BRIEF DESCRIPTION OF THE DRAWINGS
Fig. 1 is a flowchart describing a multimedia contents retrieving method in accordance with an embodiment of the present invention.
Fig. 2 illustrates an extensible Markup Language (XML) schema of an indicator in accordance with an embodiment of the present invention. Fig. 3 illustrates an XML schema of a reference in accordance with an embodiment of the present invention.
Fig. 4 is a flowchart describing a query representation step SlO of Fig. 1 in detail.
Fig. 5 is a flowchart describing a query processing step S20 of Fig. 1 in detail.
Fig. 6 illustrates an XML schema of an indicator in accordance with another embodiment of the present invention .
Fig. 7 is a block view showing a structure of multimedia contents retrieving apparatus in accordance with an embodiment of the present invention.
BEST MODE
The advantages, features and aspects of the invention will become apparent from the following description of the embodiments with reference to the accompanying drawings, which is set forth hereinafter. When it is considered that detailed description on a related art may obscure a point of the present invention, the description will not be provided herein. Hereinafter, specific embodiments of the present invention will be described with reference to the accompanying drawings.
Fig. 1 is a flowchart describing a multimedia contents retrieving method in accordance with an embodiment of the present invention.
In step SlO, a user query is represented as a query for retrieving multimedia contents. The user query is represented using an indicator and a reference for referring to the indicator to precisely represent the meaning of the user query. The indicator denotes a specific region of a Moving Picture Experts Group 7 (MPEG-7) document, and the reference is used to refer to the indicator. For example, when moving picture segments with "mountain" and "sea" is retrieved for, there is an indicator for a moving picture segment and a reference of the indicator may represent the presence of "mountain" and another reference, the presence of "sea." In subsequent joint operation, two indicators may be established for two different MPEG-7 documents, respectively, and each of the two indicators may have references to clearly represent the two different MPEG-7 documents from each other.
In step S20, a query processor analyzes the user query represented using the indicator and references. In step S30, a retrieval engine retrieves multimedia contents related to the user query analyzed in the query processor and, in step S40, provides a retrieval result.
Fig. 2 illustrates an extensible Markup Language (XML) schema of an indicator in accordance with an embodiment of the present invention. As shown in the drawing, an indicator includes an indicator identification (ID) number 101, an indicator region descriptor 102, and an indicator limiting descriptor 103. The indicator region descriptor 102 may include a reference 104 for referring to another indicator. The indicator limiting descriptor 103 includes a part 105 describing conditions for limiting an indicator.
An MPEG-7 document is described in an XML format, and an indicator indicates a specific region of the MPEG- 7 document. For this, the indicator region descriptor 102 is used to designate an uppermost node of the specific region. The indicator limiting descriptor 103 is used when an additional limiting condition is needed in connection with a region represented by indicator region descriptor. The indicator ID number 101 is used when an indicator is referred to.
The following Table 1 shows Fig. 2 described in the format of an XML schema. In the Table 1, a "path" element is a part for describing an indicator region, and a "selector" element is a part for describing limitation of an indicator. The "id" denotes the unique number of an indicator. To allow referring to other indicators within an indicator, a "ref" attribute is used. In a part limiting an indicator, "ConditionalType" is defined as a limiting condition to describe specific condition.
Table 1 <complexType name="IndicatorType"> <sequence>
<element name="Path"> <complexType> <simpleConteαt> Extension base="mpeg7:xPathType"> attribute name="ref type=NIDREF" use="optionar/> <extension> </simpleContent> <7complexType> </element>
<element name="Selector" type="mp7qf:ConditionTypef' minOccurs=B0"/> </sequence> attribute name="id" type="ID" use="required'7> </complexType>
Fig. 3 illustrates an XML schema of a reference in accordance with an embodiment of the present invention. An indicator may refer to a specific indicator, and it is possible to refer to a node inside a specific region which is indicated by the indicator.
In Fig. 3, an indicator may include a "ref" attribute for referring to a specific indicator, and represent a region related to the indicator by the attribute value. XML schema related to Fig. 3 may be described as the following Table 2, where the "ref" attribute refers to the indicator and "xPathType" describes a part related to the indicator.
Table 2
<compiexType name="FeatureNameType"> <simpleContent> extension base="mpeg7:xPathType"> attribute name-'ref ' type="IDREF" usc="optional"> </extension> </simpleContent> </complexType>
Fig. 4 is a flowchart describing a query representation step SlO of Fig. 1 in detail. In step S402, a query for retrieving multimedia contents is inputted from a user. In step S404, the inputted user query is represented as an indicator for indicating a specific region of an MPEG-7 document and a reference for referring to the indicator.
For example, a query for "retrieving images whose horizontal length X vertical length is greater than 1024X768" can be represented as the following Table 3 based on the XML schema defined in the Tables 1 and 2. In the Table 3, an indicator is referred to by using a reference "href," and a specific part related to a region indicated by the indicator can be indicated by describing an additional path.
Table 3
<mp7qf:RetrieveData>
<mp7qf:Indicator id="M7DocϊD">
<mp7qf:Path>/Mpeg7</mρ7qf:Path> </mp7qf:Indicator>
<mp7qf:Indicatorid="VisualCodingFrameID">
<mp7qf:Path ref="M7DocK>">//VisualCoding/Frame<mp7qf:Path> <Λnp7qf:Indicator>
<mp7qf:Condition> <mp7qf:ConditionBagoperator="ANI)">
<!~ target content : Image --> <mp7qf:FeatureConditionoperator="equalTo"> <mp7qf:SourceFeature ref="M7DocID">
//MediaFoπnat/Content/Name </mp7qf:SourceFeature> <mp7qhTargetConstantValuexsi:typeβ:Hnip7qf:FeatureStringTypell>
<mp7qf:value>Image</mp7qf:value> </mp7qf:TargetConstantValue> </mp7qf:FeatureCondition>
<!-- sizes are greater than or equal to 1024*768 pixels (width * height) --> <mp7qf:FeatureCondition operator="greaterThanOREqualTo"> <mp7qf: SourceFeatureExpression operator="multiply"> <mp7qf:FeatureName ref="VisualCodingFrameID">
@height
</mp7qf:FeatureName> <mp7qf:FeatureNarae ref="VisualCodingFrameID">
@width
</rap7qf:FeatureName> </mp7qf:SourceFeatureExpression> <mp7qf:TargetFeatureExpression operator="multiply"> <mp7qf:ConstantValue xsi:rype=Hmp7qf:FeatureDecimalType">
<mp7qf:value>1024</mp7qf:value> </mp7qf:ConstantValue> <mp7qf:ConstantValue xsi:type=nmp7qf:FeatureDecimalType">
<mp7qf:value>768«s/mp7qf:value> </mp7qf:ConstantValue> </mp7qf:TargetFeatureExpression> </mp7qf:FeatureCondition> </mp7qf:ConditionBag>
</mp7qf:Condition> </mp7qf:RetrieveData> Fig. 5 is a flowchart describing a query processing step S20 of Fig. 1 in detail. The meaning of the user query represented using an indicator and a reference is analyzed in the query processing step S20. First, in step S502, an XML parser parses a user query described in an XML format. Subsequently, in step S504, the indicator and the reference are processed based on a parsing result. In step S506, the meaning of the user query is analyzed using the processed indicator and reference.
In the step S504 where the indicator and the reference are processed, references referring to the same indicator are regarded as values for referring to a value in the inside of the same region to analyze the meaning of the user query. For example, since "Θheight" and "Θwidth" refer to "VisualCodingFramelD" in the user query, it is analyzed that the two refer to a value in the inside a region indicated by the "VisualCodingFramelD. "
Fig. 6 illustrates an XML schema of an indicator in accordance with another embodiment of the present invention. An indicator ID number 601 is the same as the indicator ID number 101 of Fig. 2. An indicator region descriptor of Fig. 6 is an optional element whereas an indicator limiting descriptor 603 is essential element, and it does not have "attribute." Fig. 6 may be described in an XML schema, which is presented in the following Table 4.
Table 4 <mp7qf:RetrievβData>
<mp7qf:Indicator id="M7DocϊD">
<mp7qf:Path>/Mpeg7</mp7qf:Path> </mp7qf:Indicator>
<mp7qf:Indicatorids="VisualCodingFrameID">
<mp7qf:Path re^nM7Docn)">//VisualCoding/Franie</inp7qf:Path> <ymp7qf:Indicator>
<mp7qf:Condition> <mp7qf:ConditionBag operator3" AND">
<!- target content : Image --> <mp7qf:FeatureCondition operator="equalTo"> <mp7qf:SourceFeature ref="M7DocID">
//MediaFormat/Content/Name </mp7qf:SourceFeature> <mp7qf:TargeConstantValyexsi:rypeβ"mp7qf:FeatureStringType">
<mp7qf:value>Image</mp7qf:value> </mp7qf:TargetConstantValue> <ymp7qf:FeatureCondition>
<!- sizes are greater than or equal to 1024*768 pixels (width * height) ~> <mp7qf:FeatureCondition operator="greaterThanOREqualToH> <mp7qf:SourceFeatureExpression operator="raultiply"> <mp7qf:FeatureName ref="VisualCodingFrameID">
@height
</mp7qf:FeatureName> <mp7qf:FeatureNameref="VisualCodingFrameID">
@width
</mp7qf:FeatureName> </mp7qf:SourceFeatureExpression> <mp7qf:TargetFeatureExpression operator="multiply"> <mp7qf:ConstantValue xsi:rype=Hmp7qf:FeatureDecimalType">
<mp7qf:value>1024</mp7qf:vaiue> </mρ7qf:ConstantValue> <mp7qf:ConstantValue xsi:type="mp7qf:FeatureDecimalType">
<mp7qf:value>768</mp7qf:value> </rap7qf:ConstantValue> </mp7qf:TargetFeatureExpression> </mp7qf:FeatureCondition> </mp7qf:ConditionBag>
</mp7qf:Condition> </mp7qf:RetrieveData> Fig. 7 is a block view showing a structure of multimedia contents retrieving apparatus in accordance with an embodiment of the present invention. As shown in the drawing, the multimedia contents retrieving apparatus 700 includes a query input unit 702, a query representation unit 704, a query analysis unit 706, a contents retrieval unit 708, and an output unit 710.
The query input unit 702 receives a query for retrieving multimedia contents from a user. The query representation unit 704 represents the user query inputted through the query input unit 702 into an MPEG-7 query format by using an indicator indicating a specific region of an MPEG-7 document and a reference for referring to the indicator. An indicator includes an indicator ID number used for a reference to refer to the indicator, a descriptor for describing limiting conditions for the region indicated by the indicator, and a descriptor for designating an uppermost node of the region indicated by the indicator. The user query is represented in an XML format.
The query analysis unit 706 analyzes the meaning of the user query represented using the indicator and the reference in the query representation unit 704. The query analysis unit 706 includes an XML parser 712 for parsing a user query, a descriptor processor 714 for processing an indicator and a reference based on the parsing result of the XML parser 712, and a meaning analyzer 716 for analyzing the meaning of the user query based on the indicator and the reference processed in the descriptor processor 714. The contents retrieval unit 708 retrieves multimedia contents according to the analysis result of the user query analysis unit 706. The contents retrieval unit 708 may retrieve a database 718 or search the internet 722 through a communication unit 720. The database 718 may be set up inside or outside the multimedia contents retrieving apparatus 700. The output unit 710 provides multimedia contents retrieved by the contents retrieval unit 708 to the user.
MODE FOR THE INVENTION
The method of the present invention described above may be realized as a program and stored in a computer-readable recording medium, such as CD-ROM, RAM, ROM, floppy disks, hard disks, magneto-optical disks and the like. Since this process can be easily implemented by those skilled in the art to which the present invention belongs, further description will not be provided herein.
While the present invention has been described with respect to the specific embodiments, it will be apparent to those skilled in the art that various changes and modifications may be made without departing from the spirit and scope of the invention as defined in the following claims.

Claims

WHAT IS CLAIMED IS
1. A method for retrieving multimedia contents, comprising: representing a user query by using an indicator indicating a specific region of a Moving Picture Experts Group 7 (MPEG-7) document and a reference for referring to the indicator; analyzing a meaning of the user query represented by using the indicator and the reference to thereby produce an analysis result; and retrieving multimedia contents according to the analysis result.
2. The method of claim 1, wherein the indicator includes : an indicator identification (ID) code used for the reference to refer to the indicator; and a descriptor for describing a condition limiting the region indicated by the indicator.
3. The method of claim 2, wherein the indicator further includes: a descriptor for designating an uppermost node of the region indicated by the indicator.
4. The method of claim 1, wherein the user query is described in an extensible Markup Language (XML) format in the representing a user query by using an indicator and a reference.
5. The method of claim 4, wherein the analyzing a meaning of the user query represented by using the indicator and the reference includes: parsing the user query by using an XML parser to thereby produce a parsing result; processing the indicator and the reference based on the parsing result; and analyzing a meaning of the user query by using the processed indicator and reference.
6. The method of claim 5, wherein in the processing the indicator and the reference, a value inside a same region is referred to for references referring to a same indicator.
7. A method for processing a user query to retrieve multimedia contents, comprising: receiving a query for retrieving multimedia contents from a user; representing the user query by using an indicator for indicating a specific region of an MPEG-7 document and a reference for referring to the indicator.
8. The method of claim 7, wherein the indicator includes : an indicator ID code used for the reference to refer to the indicator; and a descriptor for describing a condition limiting the region indicated by the indicator.
9. The method of claim 8, wherein the indicator includes : a descriptor for designating an uppermost node of the region indicated by the indicator.
10. The method of claim 7, wherein the user query is described in an XML format in the representing a user query by using an indicator and a reference.
11. An apparatus for retrieving multimedia contents, comprising: a query input unit for receiving a query for retrieving multimedia contents from a user; a query representation unit for representing the user query inputted through the query input unit by using an indicator for indicating a specific region of an MPEG-
7 document and a reference for referring to the indicator; a query analysis unit for analyzing a meaning of the user query represented in the query representation unit by using the indicator and the reference to thereby produce an analysis result; and a contents retrieval unit for retrieving multimedia contents according to the analysis result.
12. The apparatus of claim 11, wherein the indicator includes: an indicator ID code used for the reference to refer to the indicator; and a descriptor for describing a condition limiting the region indicated by the indicator.
13. The apparatus of claim 12, wherein the indicator further includes: a descriptor for designating an uppermost node of the region indicated by the indicator.
14. The apparatus of claim 11, wherein the user query is described in an XML format in the query representation unit.
15. The apparatus of claim 14, wherein the query analysis unit includes: an XML parser for parsing the user query to thereby produce a parsing result; a descriptor processor for processing the indicator and the reference based on the parsing result of the XML parser; a meaning analyzer for analyzing a meaning of the user query by using the processed indicator and reference which are obtained in the descriptor processor.
16. A data structure for representing a user query to retrieve multimedia contents, comprising: an indicator for indicating a specific region of an MPEG-7 document; and a reference for referring to the indicator.
17. The data structure of claim 16, wherein the indicator includes : an indicator ID code used for the reference to refer to the indicator; and a descriptor for describing a condition limiting the region indicated by the indicator.
18. The data structure of claim 17, wherein the indicator further includes: a descriptor for designating an uppermost node of the region indicated by the indicator.
19. The data structure of claim 16, wherein the data structure is described in an XML format.
PCT/KR2008/002285 2007-04-23 2008-04-23 Method and apparatus for retrieving multimedia contents WO2008130182A1 (en)

Priority Applications (4)

Application Number Priority Date Filing Date Title
EP08753141A EP2143027A4 (en) 2007-04-23 2008-04-23 Method and apparatus for retrieving multimedia contents
CN2008800173376A CN101720462B (en) 2007-04-23 2008-04-23 Method and apparatus for retrieving multimedia contents
JP2010506044A JP5426533B2 (en) 2007-04-23 2008-04-23 Method and apparatus for searching multimedia content
US12/597,158 US8577919B2 (en) 2007-04-23 2008-04-23 Method and apparatus for retrieving multimedia contents

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
KR20070039475 2007-04-23
KR10-2007-0039475 2007-04-23
KR1020080035896A KR100961444B1 (en) 2007-04-23 2008-04-18 Method and apparatus for retrieving multimedia contents
KR10-2008-0035896 2008-04-18

Publications (1)

Publication Number Publication Date
WO2008130182A1 true WO2008130182A1 (en) 2008-10-30

Family

ID=40154958

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/KR2008/002285 WO2008130182A1 (en) 2007-04-23 2008-04-23 Method and apparatus for retrieving multimedia contents

Country Status (6)

Country Link
US (1) US8577919B2 (en)
EP (1) EP2143027A4 (en)
JP (1) JP5426533B2 (en)
KR (1) KR100961444B1 (en)
CN (1) CN101720462B (en)
WO (1) WO2008130182A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10235410B2 (en) 2015-02-11 2019-03-19 Electronics And Telecommunications Research Institute Query input apparatus and method

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR102479026B1 (en) * 2017-09-27 2022-12-20 한국전자통신연구원 QUERY AND RESPONSE SYSTEM AND METHOD IN MPEG IoMT ENVIRONMENT
CN109582763B (en) * 2017-09-27 2023-08-22 韩国电子通信研究院 Answering system and method in moving picture expert group media Internet of things environment

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20070233673A1 (en) * 2006-03-29 2007-10-04 Hee-Cheol Seo Apparatus and method for searching multimedia data based on metadata

Family Cites Families (23)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6070167A (en) * 1997-09-29 2000-05-30 Sharp Laboratories Of America, Inc. Hierarchical method and system for object-based audiovisual descriptive tagging of images for information retrieval, editing, and manipulation
WO2000028467A1 (en) 1998-11-06 2000-05-18 The Trustees Of Columbia University In The City Of New York Image description system and method
US6490370B1 (en) * 1999-01-28 2002-12-03 Koninklijke Philips Electronics N.V. System and method for describing multimedia content
US6593936B1 (en) * 1999-02-01 2003-07-15 At&T Corp. Synthetic audiovisual description scheme, method and system for MPEG-7
US6411724B1 (en) * 1999-07-02 2002-06-25 Koninklijke Philips Electronics N.V. Using meta-descriptors to represent multimedia information
US6629088B1 (en) * 1999-11-30 2003-09-30 Sony Corporation Method and apparatus for measuring the quality of descriptors and description schemes
KR100739031B1 (en) 2000-03-27 2007-07-25 주식회사 큐론 Method of mpeg-7 meta data hiding and detection to retrieve multimedia for multimedia indexing retrieval system
JP3784289B2 (en) * 2000-09-12 2006-06-07 松下電器産業株式会社 Media editing method and apparatus
KR100413679B1 (en) * 2000-10-21 2003-12-31 삼성전자주식회사 Shape descriptor extracting method
US20030009472A1 (en) 2001-07-09 2003-01-09 Tomohiro Azami Method related to structured metadata
US7231394B2 (en) * 2001-07-17 2007-06-12 Sony Corporation Incremental bottom-up construction of data documents
CN1549982A (en) 2001-08-28 2004-11-24 皇家飞利浦电子股份有限公司 Automatic question formulation from a user selection in multimedia content
US7284188B2 (en) * 2002-03-29 2007-10-16 Sony Corporation Method and system for embedding MPEG-7 header data to improve digital content queries
US7664830B2 (en) * 2002-03-29 2010-02-16 Sony Corporation Method and system for utilizing embedded MPEG-7 content descriptions
CN100418088C (en) 2002-09-03 2008-09-10 富士通株式会社 Search processing system, search server, client, search processing method, program, and recording medium
JP4004521B2 (en) * 2003-02-03 2007-11-07 シャープ株式会社 Encoding apparatus and method, decoding apparatus and method, program, and recording medium
US20040267720A1 (en) * 2003-06-27 2004-12-30 Peiya Liu Query system for structured multimedia content retrieval
KR100558881B1 (en) 2003-12-27 2006-03-10 한국전자통신연구원 Apparatus and method for searching and browsing of multimedia contents
JP2005332274A (en) * 2004-05-20 2005-12-02 Toshiba Corp Data structure of metadata stream for object in dynamic image, retrieval method and reproduction method
EP1759315B1 (en) 2004-06-23 2010-06-30 Oracle International Corporation Efficient evaluation of queries using translation
JP4179262B2 (en) 2004-10-06 2008-11-12 ソニー株式会社 Information processing apparatus and method, and program
KR100780786B1 (en) 2005-02-04 2007-11-29 후지쯔 가부시끼가이샤 Search system, search server, client, search method, and recording medium
US8805868B2 (en) * 2007-08-03 2014-08-12 Electronics And Telecommunications Research Institute Apparatus and method for a query express

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20070233673A1 (en) * 2006-03-29 2007-10-04 Hee-Cheol Seo Apparatus and method for searching multimedia data based on metadata

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
ADISTAMBHA K. ET AL.: "The MPEG-7 Query Format: A New Standard in Progress for Multimedia Query by Content", ISCIT 2007, 17 October 2007 (2007-10-17) - 19 October 2007 (2007-10-19), pages 479 - 484, XP031166511 *
INTERNATIONAL ORGANISATION FOR STANDARDISATION ORGANISATION INTERNATIONALE DE NORMALISATION ISO/IEC JTC1/SC29/WG11 CODING OF MOVING PICTURES AND AUDIO, N8780, 19 JANUARY 2007 *
See also references of EP2143027A4 *

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10235410B2 (en) 2015-02-11 2019-03-19 Electronics And Telecommunications Research Institute Query input apparatus and method

Also Published As

Publication number Publication date
CN101720462B (en) 2012-11-07
KR100961444B1 (en) 2010-06-09
US8577919B2 (en) 2013-11-05
EP2143027A4 (en) 2010-05-05
JP5426533B2 (en) 2014-02-26
JP2010528347A (en) 2010-08-19
CN101720462A (en) 2010-06-02
EP2143027A1 (en) 2010-01-13
US20100131557A1 (en) 2010-05-27
KR20080095180A (en) 2008-10-28

Similar Documents

Publication Publication Date Title
US7139746B2 (en) Extended markup language (XML) indexing method for processing regular path expression queries in a relational database and a data structure thereof
US7953592B2 (en) Semantic analysis apparatus, semantic analysis method and semantic analysis program
US20070171482A1 (en) Method and apparatus for managing information, and computer program product
US20070198574A1 (en) Mpv file creating method and appartus, and storage medium therefor
US20220114211A1 (en) Video matching service to offline counterpart
US20190272452A1 (en) Methods and apparatus for identifying objects depicted in a video using extracted video frames in combination with a reverse image search engine
WO2009070327A2 (en) Method and apparatus for generation, distribution and display of interactive video content
US20090083227A1 (en) Retrieving apparatus, retrieving method, and computer program product
US8577919B2 (en) Method and apparatus for retrieving multimedia contents
US20080016068A1 (en) Media-personality information search system, media-personality information acquiring apparatus, media-personality information search apparatus, and method and program therefor
US8805868B2 (en) Apparatus and method for a query express
US7698262B2 (en) Apparatus and method for searching multimedia data based on metadata
Gibbon et al. Automated content metadata extraction services based on MPEG standards
Schallauer et al. Multimedia metadata standards
JP2007293602A (en) System and method for retrieving image and program
US20110145700A1 (en) Structured document analysis apparatus and structured document analysis method
Wattamwar et al. Multimedia explorer: Content based multimedia exploration
KR100540175B1 (en) Data management apparatus and method for reflecting MPEG-4 contents characteristic
KR100602388B1 (en) Resource Reference Method of MPEG - 21 Multimedia Framework
WO2006004284A1 (en) Mpv file creating method and apparatus, and storage medium therefor
RU2549102C2 (en) Method of determining real-time broadcast media streams and system therefor
CN100521732C (en) Processing method for comment data and STB device
Timmerer Resource Adaptation using XML within the MPEG-21 Multimedia Framework
Kim et al. A study on Implementation of XML-based Information retrieval system for video contents
KR20100059110A (en) The method of searching multimedia data by using subttitle files formatted by smil(synchronized multimedia intergration language)

Legal Events

Date Code Title Description
WWE Wipo information: entry into national phase

Ref document number: 200880017337.6

Country of ref document: CN

121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 08753141

Country of ref document: EP

Kind code of ref document: A1

WWE Wipo information: entry into national phase

Ref document number: 12597158

Country of ref document: US

WWE Wipo information: entry into national phase

Ref document number: 2010506044

Country of ref document: JP

Ref document number: 2008753141

Country of ref document: EP

NENP Non-entry into the national phase

Ref country code: DE