KR20100064136A - Multimedia data search method - Google Patents
Multimedia data search method Download PDFInfo
- Publication number
- KR20100064136A KR20100064136A KR1020080122567A KR20080122567A KR20100064136A KR 20100064136 A KR20100064136 A KR 20100064136A KR 1020080122567 A KR1020080122567 A KR 1020080122567A KR 20080122567 A KR20080122567 A KR 20080122567A KR 20100064136 A KR20100064136 A KR 20100064136A
- Authority
- KR
- South Korea
- Prior art keywords
- indexing
- search
- multimedia data
- server
- query
- Prior art date
Links
- 238000000034 method Methods 0.000 title claims description 23
- 239000002699 waste material Substances 0.000 abstract description 5
- 230000005540 biological transmission Effects 0.000 abstract 1
- 238000012545 processing Methods 0.000 description 6
- 238000004891 communication Methods 0.000 description 3
- 239000000284 extract Substances 0.000 description 3
- 238000007792 addition Methods 0.000 description 2
- 238000010586 diagram Methods 0.000 description 2
- 238000000605 extraction Methods 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 238000012552 review Methods 0.000 description 2
- 230000005236 sound signal Effects 0.000 description 2
- 238000006467 substitution reaction Methods 0.000 description 2
- 230000007423 decrease Effects 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/26—Speech to text systems
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
- G10L2015/225—Feedback of the input speech
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
The present invention does not use a separate keyword text input means when searching for a desired music, video, video on demand, etc. can be used during driving to provide a convenient driving environment, delivery of a search query, indexing server 20 ), And the transmission of the search results to the telematics terminal 30 are all performed in a short form, thereby minimizing waste of resources due to network interworking due to unnecessary waiting time.
Description
The present invention relates to a method for retrieving multimedia data, and more particularly, it is possible to provide a convenient driving environment because it can be used while driving without using a text input means for a keyword when searching for music, video, video on demand, and the like. The present invention relates to a method for retrieving multimedia data.
Recently, due to the development of multimedia technology, the usage of multimedia data in each field is increasing.
The multimedia data retrieval method according to the prior art uses a text based indexing method. The text search method is a method of searching text by inputting a desired keyword by indexing features of multimedia data into text. Such a text search method requires a driver to input by using a separate keyword text input means while driving, there is a risk of an accident, a feature is inefficiently extracted, and a considerable amount due to the limitation that does not contain all the contents of the multimedia data. Gets wrong multimedia search results.
In addition, text-based search engines require a considerable amount of time to search and their accuracy decreases as the content increases. When the multimedia service is performed through wireless communication, there is a problem that an excessive communication fee is added due to the increase of search time due to the above problem, which may cause a large burden on the user.
An object of the present invention is to provide a multimedia data retrieval method that can provide a convenient driving environment can be used during driving do not use a separate text input means for keywords when searching for desired music, video, video on demand, etc. do.
In addition, the present invention is a multimedia data that can minimize the waste of resources due to network interworking caused by unnecessary waiting time because the delivery of the search query, the search of the indexing server, the delivery of the search results to the telematics terminal is all performed in a short form. It is an object to provide a search method.
In the multimedia data retrieval method according to the present invention, a feature indexing file is generated by extracting audio characteristic values of multimedia data stored in a data server, and the feature indexing file and link information for accessing the corresponding original multimedia data are stored in an indexing server. Making; Generating a query when the driver inputs a search keyword by humming or general voice; Generating a query indexing file by dividing whether the query is a hum or a general voice and extracting audio characteristic values or lylics through digital signal processing according to a case; Providing the driver with a result of the comparison with the feature indexing file stored in the indexing server based on the query indexing file; And when the driver selects the result value corresponding to the desired multimedia data from the result values, corresponding driver multimedia data stored in the data server is provided to the driver by the link information corresponding to the result value. It includes.
In addition, the query indexing file is generated by implementing a voice-to-text function for storing the general voice as text when the query is the general voice.
The feature indexing file is a lyrics composed by directly extracting lyrics of a video including a feature index extracted through digital signal processing of music itself, a text index composed of a brief description made by operators and users, and voice information. (lylics) has an indexing array of indexes,
Forwarding of the query, searching of the indexing server, and forwarding of the result value are performed in short form,
Extracting feature data for generating a feature indexing file from an audio signal through a feature extraction method for extracting an audio feature value; And formatting and standardizing the feature data into the feature indexing file that can be entered into a search tool,
Retrieving the feature indexing file comprises implementing the feature data in a cluster by size; Extracting a group having a specific size by searching for each size; And performing a rescan using the hourly index in the group.
The present invention does not use a text input means for a keyword when searching for a desired music, video, video on demand, etc., which can be used during driving, thereby providing a convenient driving environment.
In addition, the present invention has the effect that it is possible to minimize the waste of resources due to network interworking caused by unnecessary latency because all of the delivery of the search query, the search of the indexing server, the delivery of the search results to the telematics terminal is performed in short form. have.
Hereinafter, preferred embodiments of the present invention will be described in detail with reference to the accompanying drawings. However, the invention is not limited to the embodiments described herein but may be embodied in other forms. Rather, the embodiments introduced herein are provided so that the spirit of the present invention is thoroughly and completely disclosed, and the spirit of the present invention to those skilled in the art will be fully delivered. Also, like reference numerals denote like elements throughout the specification.
1 is a block diagram showing an apparatus for retrieving multimedia data according to the present invention.
The multimedia data retrieval apparatus includes a
The
The
The
Here, multimedia data includes images, images, music, sound, etc., humming refers to a nasal music technique with a closed mouth, and a search query refers to a query that defines data in a database to search a database. do.
2 is a flowchart illustrating a method of retrieving multimedia data according to the present invention.
First, the
The feature indexing file and the link information for accessing the corresponding original multimedia data are stored in the
Afterwards, when the driver inputs a search keyword by voice, hum, or the like using an input means such as a keyboard or a microphone, a search query generated by the
The
The feature indexing file stored in the
The
In addition, the multimedia data retrieval method according to the present invention may search for multimedia data and provide it to the driver even when the driver inputs a text-based search keyword through the keyword input means. Here, the text input by the driver may search for a lylics index formed by extracting a text index composed of a description made by an operator or users, and a dialogue of a video including a voice information and lyrics of a song.
In general, in the case of a movie or a video including voice, voice information is virtually impossible to index, so if a dialogue is found in a certain part of a video, it cannot be searched using a conventional text-based or audio / video content search method.
Accordingly, the present invention implements a voice-to-text function for storing voice information of multimedia data as text. Here, the voice-to-text function is an application example of speech recognition and an algorithm for extracting text from speech information on media.
When the driver inputs a search text using keyword input means such as a touch screen, the
A search indexing file is generated by extracting audio characteristic values through digital signal processing on a search query input using a search engine.
The feature indexing file stored in the
The
The above-described multimedia data retrieval method of the present invention shows an embodiment of inputting a voice, hum or text, but can be fused to generate and implement a single query indexing file.
At this time, the feature indexing file is composed by directly extracting the feature index extracted through the signal processing of the music itself, a text index consisting of briefly written descriptions by operators and users, dialogue of the video including voice information, song lyrics, and the like. It has an indexing array composed of lylics indices. Thus, the search accuracy can be improved and various search channels can be provided. Here, the feature index may be used when not knowing the lyrics but knowing the pitch or part of the melody, and the lyrics index may be used when not knowing the pitch or melody but knowing the part or lyrics of the part.
In the present invention, both the delivery of the search query and the delivery of the search result to the
In the above-described embodiment of the present invention, if feature data is extracted from an audio signal through a feature extraction method that extracts audio feature values using a search engine, this is a raw data level. In other words, the form or data format is unstructured for search engine tools. Therefore, feature data should be formatted and standardized into feature indexing files that can be entered into search engine tools. In other words, since the feature data output through digital signal processing vary only in magnitude, they should be formatted into a feature indexing file having a specific value (for example, a value between 0 and 1).
In this way, after each feature data is formatted into a feature indexing file, feature indexing files should be arranged and assigned to a search engine to shorten the search time to ensure an appropriate level of accuracy and search time.
And since the feature indexing files are truncated by time unit, if they are simply arranged in chronological order, it takes tremendous search time to search if the characteristics of the data entered by the query are located in the latter part of the multimedia data. Therefore, the feature indexing files have their respective indexes over time, but the search time is shortened only if they are grouped into clusters of similar size, not hourly. In other words, after extracting the cluster group having similar size by searching the indexing files by size, the scope of the search result should be narrowed through re-search using the hourly index in the cluster group.
It will be apparent to those skilled in the art that various modifications, additions, and substitutions are possible, and that various modifications, additions and substitutions are possible, within the spirit and scope of the appended claims. As shown in Fig.
1 is a block diagram showing an apparatus for retrieving multimedia data according to the present invention.
2 is a flowchart illustrating a method of retrieving multimedia data according to the present invention.
<Description of the symbols for the main parts of the drawings>
10: data server
12: Multimedia Database
20: indexing server
22: Indexing Database
30: telematic unit
Claims (5)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
KR1020080122567A KR20100064136A (en) | 2008-12-04 | 2008-12-04 | Multimedia data search method |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
KR1020080122567A KR20100064136A (en) | 2008-12-04 | 2008-12-04 | Multimedia data search method |
Publications (1)
Publication Number | Publication Date |
---|---|
KR20100064136A true KR20100064136A (en) | 2010-06-14 |
Family
ID=42363857
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
KR1020080122567A KR20100064136A (en) | 2008-12-04 | 2008-12-04 | Multimedia data search method |
Country Status (1)
Country | Link |
---|---|
KR (1) | KR20100064136A (en) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US9171544B2 (en) | 2011-10-13 | 2015-10-27 | Hyundai Motor Company | System for providing a sound source information management service |
KR20160044652A (en) * | 2014-10-15 | 2016-04-26 | 현대모비스 주식회사 | Control method of avn system for vehicle using voice recognition |
-
2008
- 2008-12-04 KR KR1020080122567A patent/KR20100064136A/en not_active Application Discontinuation
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US9171544B2 (en) | 2011-10-13 | 2015-10-27 | Hyundai Motor Company | System for providing a sound source information management service |
KR20160044652A (en) * | 2014-10-15 | 2016-04-26 | 현대모비스 주식회사 | Control method of avn system for vehicle using voice recognition |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
JP5073630B2 (en) | Natural language based service selection system and method, service query system and method | |
JP5147947B2 (en) | Method and system for generating search collection by query | |
US20090100045A1 (en) | Device and method for adaptive service selection, query system and method | |
CN101276372A (en) | Apparatus and method for searching information | |
JP5600736B2 (en) | Database management method and system | |
JP2009087339A (en) | Method and device for importing/exporting ontology data | |
JP2009140477A (en) | Device and method for service proposition, system for service proposition, and device and method for service proposition based on user's favorite base | |
JP2005209214A5 (en) | ||
JP6457123B2 (en) | Search processing method and device | |
JPWO2006134682A1 (en) | Named entity extraction apparatus, method, and program | |
JPH11232192A (en) | Data processing system and method for archiving and accessing electronic message | |
US20070203874A1 (en) | System and method for managing files on a file server using embedded metadata and a search engine | |
JP2013196435A (en) | Retrieval device, retrieval method, and program | |
US7366710B2 (en) | Apparatus for retrieving and presenting digital data | |
EP2306333A1 (en) | Offline software library | |
JP4894253B2 (en) | Metadata generating apparatus and metadata generating method | |
KR20100064136A (en) | Multimedia data search method | |
JP2008129434A (en) | Voice synthesis server system | |
JP2008077353A (en) | Method for classifying keyword, server computer, and program | |
US20070150463A1 (en) | Advanced method of searching, drafting and editing of electronic files | |
KR20090089121A (en) | User providing system and method for customized information | |
KR100784068B1 (en) | Method for Changing Ring Back Tone Using Short Message and Ring Back Tone Providing System therefor | |
JPH09245046A (en) | Information retrieval device | |
JP5787794B2 (en) | Speech synthesis system, speech conversion support device, and speech conversion support method | |
KR20140123647A (en) | System for analyzing intellectual property |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
A201 | Request for examination | ||
E902 | Notification of reason for refusal | ||
E90F | Notification of reason for final refusal | ||
E601 | Decision to refuse application |