CN101021855A - Video searching system based on content - Google Patents

Video searching system based on content Download PDF

Info

Publication number
CN101021855A
CN101021855A CN 200610140832 CN200610140832A CN101021855A CN 101021855 A CN101021855 A CN 101021855A CN 200610140832 CN200610140832 CN 200610140832 CN 200610140832 A CN200610140832 A CN 200610140832A CN 101021855 A CN101021855 A CN 101021855A
Authority
CN
China
Prior art keywords
retrieval
content
cluster
search
index
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN 200610140832
Other languages
Chinese (zh)
Other versions
CN101021855B (en
Inventor
江南
苏磊
鲍东山
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
BEIJING NUFRONT SOFTWARE TECHNOLOGY Co Ltd
Original Assignee
鲍东山
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 鲍东山 filed Critical 鲍东山
Priority to CN2006101408329A priority Critical patent/CN101021855B/en
Publication of CN101021855A publication Critical patent/CN101021855A/en
Application granted granted Critical
Publication of CN101021855B publication Critical patent/CN101021855B/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Landscapes

  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)

Abstract

This invention provides a search system based on video program content characters including a search dispatch server used in analyzing and dispatching the search requests put forward by users to search in terms of a certain strategy to grade, arrange and merge the search result to feed them back to the user, an element data search server searching the element data of the video program based on the requirement of the server, a caption search server searching the XML file storing program caption texts, a video search cluster used in searching the character data of the video key frame, a phone search cluster used in searching the phonetic information of video programs including spelling series and spelling patterns and a search interface facing to users receiving request of users in terms of signed message format and feeding back the search result.

Description

The Content-based Video Retrieval system
Technical field
The present invention relates to Content-based Video Retrieval (CBR) field.Comprise tissue, the storage of video frequency feature data, the index of high dimensional feature vector and retrieval, the technology in fields such as distributed search.
Background technology
An information retrieval system generally includes the searching database of a core, search dispatching server and group of server.External then the search and the interface of information typing are provided.As shown in Figure 1.
Wherein, the information typing is to rely on the artificial mode of keying in mostly.Promptly the provider by retrieval of content in the system is entered into database offering the information typing interface of the content information user, that be used for searching for by searching system.
The core database system then mainly is responsible for the information data of storage for the usefulness of user search.
The search dispatching server is responsible for receiving, resolving user's request, and its searching request is distributed to retrieval server, to carry out actual retrieval.After result for retrieval turned back to the search dispatching server, the search dispatching server will be handled return results, as ordering, merging, screening etc.After handling it is returned to the user.Finish once search.
Different with common information retrieval system, video searching system is comparatively complicated, and the module that comprises is also more relatively.
A video searching system is made up of several big modules such as video features analysis, characteristic storage, search dispatching and Content-based Video Retrieval.Wherein, the storage of characteristic, search dispatching and video frequency searching are the nucleus modules of this type of search engine.As shown in Figure 2.
Traditional video frequency search system will obtain video frequency program for information about by manual annotation, and leaves these information in supply inquiry after this in the database usefulness.That is to say that analysis module is actual to be an operational module of being finished by hand by the people.
This mode has significant limitation.Artificial note not only expends great amount of manpower and time, and often has very big subjectivity, can not make accurate, just portrayal to video program content.Particularly especially can't accurate description to physical features such as the color of video frequency program, textures.Even voice, this class of captions are not limited by the feature because of subjective factor, also often make artificial treatment become infeasible because its data volume is huge.
For this reason, people are used for video frequency program with graphical analysis, speech analysis and captions analytical technology and handle, and are main tool with the computing machine, robotization obtain the characteristic information relevant with video program content, for content-based search provides support.
In such system, carry out analyzing and processing to the each side feature of one section video frequency program.
Aspect image, carry out the division of scene and camera lens and extract representative key frame video frequency program, and then key frame is carried out Flame Image Process, with its color, texture, shape form,, represent as vector with mathematics.
Further, also to extract high-layer semantic information in the just aforesaid characteristic,, and they also are expressed as the form of text or mathematics as the people's face that occurs in the key frame, the movement tendency of object etc. from the low-level features of key frame of video.
Aspect audio frequency, handle the people's that occurs in the video frequency program voice, background music etc. with computing machine, convert thereof into character string or have the mathematical form of certain implication.
For example,, can use the method for speech recognition, speech conversion be become the form of phonetic figure or speech figure for the people's who occurs in the video frequency program voice.
For the music that occurs in the video frequency program, also can obtain melody, the tonality feature of music by analysis to its waveform character, or the height variation characteristic of tone etc.
Aspect captions, need the Chinese character that occurs in the identification video image, and it is extracted convert character to.
The characteristic that is called as video frequency program through the data that obtain after the above means processing.The quantity of characteristic is very huge often.As, video frequency program about 30 minutes may comprise the key frame picture more than 500, and the feature of each picture often need several tens in addition the vector of dimensions up to a hundred portrayed; The voice of same one section program often need several million space preservation after the characteristic structure that changes into figure one class.
Therefore, when carrying out Content-based Video Retrieval, it is huge often to be faced with data volume, the problem that recall precision is low.Must manage to solve.Perhaps reduce the data volume of characteristic, perhaps take ad hoc base to dwindle range of search to improve retrieval rate.
Simultaneously, also there is the problem that can't accurately mate in Content-based Video Retrieval.The characteristic of depositing in search condition and the Database Systems often is not hundred-percent coupling.For example, even same individual's image occurred in the search condition image and in the database key frame picture, after these key frames were analyzed, the proper vector that obtains also can not be fully was the same with the characteristic of search condition image.But for video frequency searching, these images but are " meeting " search conditions.Therefore, should carry out the fuzzy matching strategy at the retrieval of proper vector.Need suitable retrieval and search strategy to seek the result that can satisfy condition, and obtain the degree of fuzzy matching.
At present, in the video analysis field, art of image analysis, speech analysis field and captions extract the field, and stem-winding achievement in research has all been arranged.The precision of analyzing has reached certain degree.But, still seldom the achievement in research in above-mentioned field is applied in the actual product at present at home.Combine as for achievement in research, be the Content-based Video Retrieval service, especially beyond example above-mentioned field.
The achievement of video analysis field, art of image analysis, speech analysis field and captions being extracted the field combines, and is aided with other focus technologies, is the Content-based Video Retrieval service jointly, also is faced with very big difficulty and challenge.No matter from design, still the exploitation from reality all also has considerable technological difficulties to need to solve.
Summary of the invention
The objective of the invention is to realize a system that can carry out video frequency searching based on video program content information.This system combines the achievement that video analysis field, art of image analysis, speech analysis field and captions extract the field, and is aided with other focus technologies, is the Content-based Video Retrieval service jointly.
A Content-based Video Retrieval system comprises:
A metadata retrieval server is retrieved the metadata of video frequency program according to the requirement of search dispatching server;
A captions retrieval server is used for the XML file of depositing the program captioned test is retrieved;
A video frequency searching cluster is used for the characteristic of key frame of video is retrieved;
A speech retrieval cluster is used for voice messaging to video frequency program, comprises that pinyin string and phonetic figure retrieve;
A user oriented search interface: message format by appointment receives user's searching request and returns Search Results.
Metadata table in the one described metadata retrieval server comprises program ID, programm name, director, performer, language, the place of production, Class1, type 2, file layout, file size, length, screen width, screen height, program address, the program file name is uploaded the time, last set address is uploaded state, whether must examine program level, the examination sign, the program price, attribute field is closed down in the program brief introduction; Metadata retrieval module wherein comprises one and retrieves the storing process of program, a storing process of retrieving program according to combination condition according to program ID; Metadata typing module wherein comprises the storing process that will specify metadata information to insert database table.
Comprise a database table that is used for storing captions XML file in the one described captions retrieval server, a table that is used for the storage server relevant configuration information, a storing process that reads configuration information, a storing process that is used for the XML retrieval, one is used for to the storing process of database typing XML file and the segmented index of an XML.
-described captions XML document data bank table comprises a program id field, XML filename field and XML file field.
-described server configures information table comprises parameter I D field, parameter name field and parameter value field.
-described the storing process that is used for the XML retrieval comprises keyword logical expression generator program piece and search program piece.
-described video frequency searching cluster comprises a video scene retrieval server and a video frequency searching server.
Comprise a storage scenarios key frame indexed data storehouse table in the-described video scene retrieval server, a storing process that is used for the index typing, a routine package that is used to retrieve the scene key frame.
-described scene key frame index data base table comprises the index id field, three index cluster lower bound vector fields, three index cluster upper bound vector fields, index content nested table, total digital section of key frame and index cluster ultimate range field in the index cluster.
-described index content nested table, comprise clauses and subclauses sign id field, program id field under the key frame, key frame number field, critical frame types field, scene start time field, scene concluding time field, camera lens start time field, camera lens concluding time field, key frame time point field, three key frame characteristic information vector fields.
-described video frequency searching server, comprise a database table of depositing key frame of video XML file, deposit key frame indexed data storehouse table for one, the database table of a service device configuration information, a storing process that is used for typing XML file, a routine package that is used to generate index, a routine package that is used for the search index table, a remote linkage that is used to call video scene retrieval server internal program.
The structure of the scene key frame index data base table of introducing in the structure and 9 of-described key frame of video XML document data bank table is identical, and it also comprises an index content nested table.
The structure of the index content nested table of introducing in-described index content the nested table and 10 is identical.
-described the routine package that is used to generate index comprises that one is used for adding a new images eigenwert routine package of a cluster to; A routine package that is used for by a cluster of appointed threshold value expansion; A storing process that is used to create new cluster (directory entry).
-described the routine package that is used for by a cluster of appointed threshold value expansion, it comprises the primary storage process of an expansion, a storing process that is used to calculate expansion back cluster hypermatrix principal diagonal length, one is used to calculate expansion back cluster and whether has the storing process that overlaps with existing other clusters; Wherein, the maximum permissible value of cluster hypermatrix catercorner length is made as 2.0.
-described being used for added a new images eigenwert routine package of a cluster to, and it comprises a primary storage process that is used to add, and one is used to judge whether the eigenwert of an image belongs to the storing process of certain cluster.
-described the routine package that is used for the search index table, it comprises a retrieval primary storage process, a storing process that calculates the minor increment of search condition image and certain cluster, the storing process of the minimum value of the ultimate range of a calculating search condition image and certain cluster, whether effectively one be used to judge certain cluster program segment.
-described retrieval primary storage process, wherein the maximum permissible value of distance is made as 2.0 between search condition image and the cluster.
-described the routine package that is used for retrieving the scene key frame is identical with the structure of 17 routine packages of introducing that are used for the search index table.
-described speech retrieval cluster, comprising a speech buffer storage retrieval server, an optimum retrieval server of voice and a voice phonetic figure retrieval server.
For this cover system, require it not only can the traditional retrieval of back compatible based on essential informations such as literal, also to provide following search function:
1. based on the retrieval of picture.
System user provides a pictures as search condition, may contain user's interest sight, personage or building in the picture.
Searching system will be sought in database with above-mentioned search condition picture and be complementary, and promptly matching degree reaches the key frame picture of certain thresholding, and the video frequency program fragment at these key frame places is returned to the user.
Searching system is in when retrieval, may be according to the global feature of search condition image, as the color of whole picture, texture, shape etc., retrieve.Also may be local feature,, retrieve as personage's (recognition of face) of occurring in the image, buildings, natural scene etc. according to the search condition image.Even can also retrieve according to the motion feature of object in the search condition image.
2. based on the retrieval of a video segment
System user provides a video segment as search condition, and this fragment is shorter and smaller usually, may be the propaganda film or the fragment of that target video program of user's interest.
The video segment that searching system then at first provides the user is analyzed, and extracts its key frame, database is retrieved as search condition with the characteristic of these key frames then.
Different with simple image retrieval, have certain association between each key frame images that from the video segment that the user provides, extracts, because they are from same video.So when result for retrieval is handled, consider this correlativity.Assurance returns to user's result for retrieval, is the video segment with user search condition coupling, but not a plurality of independently, uncorrelated frame.
For 1 and 2, consider the huge of key frame of video characteristic amount, set up index for characteristic.
Different with the index of traditional content of text, the video frequency feature data index will carry out index to the video feature vector of higher-dimension.The basic thought of high dimensional feature vector index is a cluster.Promptly the proper vector of " similar " is divided into a class, the number of times of comparing when retrieving the minimizing after.
3. based on the retrieval of the online voice of user.
System user uses the online one or more search keys of oral account of microphone.In client, computer software will be done simple analysis to user's voice, convert its voice signal to form that searching system needs, give searching system then and carry out actual retrieval.
Searching system will be retrieved the speech retrieval cluster after obtaining above-mentioned search condition.Find the degree of matching to reach the sound bite of certain thresholding, and the video frequency program fragment at these sound bite places is returned to the user.
Consider the degree of accuracy of the huge of voice feature data amount and retrieval, speech searching system is designed to a distributed retrieval cluster.Adopt cache policies to improve the speed of retrieval.
4. based on the retrieval of a sound bite.
System user provides one section voice document, and as the wav form, content wherein is one section voice.Computer software will be done simple analysis to the voice document that the user provides, and convert its voice signal to form that searching system needs, give searching system then and carry out actual retrieval.
Searching system will be retrieved the speech retrieval cluster after obtaining above-mentioned search condition.Find the degree of matching to reach the sound bite of certain thresholding, and the video frequency program fragment at these sound bite places is returned to the user.
Equally, consider the degree of accuracy of the huge of voice feature data amount and retrieval, speech searching system is designed to a distributed retrieval cluster.Adopt cache policies to improve the speed of retrieval.
5. based on the retrieval of caption information
For system user, different on this retrieval mode and traditional retrieval mode are not directly perceived.The user remains the manual search condition of keying in textual form.These conditions are sent to searching system.
The working method of searching system is also similar with traditional searching system.Only, the target of retrieval is a caption database, and the content in the caption database is not from artificial typing, but the result of captions analysis module processing video programs.
The quantity of video frequency program may be magnanimity, and the caption information amount in while every program is also very huge.Therefore, the content of whole caption database magnanimity especially just.To also set up index for caption information for this reason, improve recall precision.
6. integrated retrieval
For example, when the user provided the character search condition, searching system was understood integrated retrieval metadatabase, caption database, even literal is become phonetic, removed to retrieve speech database.
The present invention combines the achievement that video analysis field, art of image analysis, speech analysis field and captions extract the field, and is aided with other focus technologies, is the Content-based Video Retrieval service jointly.
Description of drawings
Accompanying drawing 1 is the structural drawing of general information searching system
Accompanying drawing 2 is Content-based Video Retrieval system module figure
Accompanying drawing 3 is the system construction drawing of Content-based Video Retrieval system
Accompanying drawing 4 is the process flow diagram of captions retrieval module
Accompanying drawing 5 is the process flow diagram of key frame of video retrieval
Embodiment
The structural drawing of this system as shown in Figure 3.System is divided into following several module.
1. metadata retrieval server;
Metadata is the Word message of manually filling in when programming, and is used for portraying the content information such as exercise question, director, performer, the place of production, brief introduction of video frequency program, and frame per second, resolution, program request expense, whether needs characteristics such as DRM checking.
This part is the artificial module that participates in of unique needs in the total system.
After manually filling these data, it is entered in the metadatabase.
Carrying out simple metadata query, or carrying out when needing the relevant information of Search Results after the content-based inquiry, all will send retrieval request, metadatabase is inquired about to the metadata retrieval server.
2. captions retrieval server;
Obtaining the captions characteristic is exactly the captioned test that occurred in the video frequency program.When captions are analyzed, with these texts and scene that the place belongs to occurs and the start and end time of camera lens saves as the XML file of specified format, and be entered in the caption database.
The retrieval module structure of captions retrieval server as shown in Figure 4.
The search condition of being sent by the search dispatching server is a character string, wherein comprises several search conditions, with specifying separator to separate.
At first, extract different search conditions, and they be connected into the logical expression of designated mode according to the requirement of later search program.Then, the video frequency program captions XML file in the caption database is filtered, the program that comprises search condition in the file is picked out according to this expression formula.At last, retrieval by window condition in the file of electing finds this condition the temporal information that the place belongs to scene and camera lens to occur.
3. video frequency searching cluster;
Huge in view of the characteristic amount of key frame of video, in order to guarantee the response time, the video frequency searching module is designed to a retrieval cluster.This cluster comprises video scene and two servers of video frequency searching.
Though video frequency feature data also is to deposit with the form of XML file,, all key frame of video can be carried out index in order to improve recall precision when input database.
Index adopts the high dimension vector index technology based on the R tree, and its basic thought is: defining the distance between two image feature datas, is a cluster with the image division of phase mutual edge distance in specified scope, promptly the image of " similar " is divided into a class.When retrieving, a search index calculates " minor increment " and " minimum value of ultimate range " between search condition and each cluster, eliminates the far cluster of those and search condition image difference according to these two eigenwerts.At last, image in the cluster that calculating is not eliminated and the distance between the search condition image, and ordering is returned.
Like this, just significantly reduce the amount of images of participation comparison and the number of times of calculating, improved the speed of retrieval.
In the two-server of video frequency searching cluster, all use above-mentioned index to represent the key frame images of video frequency program.The cooperation mode of two-server as shown in Figure 5.
(1) video scene retrieval server:
Here deposit all video scene key frame clusters.Because in a video frequency program, the scene quantity of key frames is lacked an order of magnitude than total key frame quantity, and scene key frame itself also has very strong representativeness, so, at first the scene key frame is retrieved, can improve retrieval rate like this.
(2) video frequency searching server:
Here deposit the cluster of all scenes and camera lens key frame.When only retrieving the scene key frame and be not being met necessarily required result for retrieval, retrieve all key frames, with the result of really being mated.
The matching algorithm of key frame of video is a fuzzy matching algorithm.As long as be that the matching degree of key frame in the database and search condition image reaches certain thresholding and can be accepted.
4. speech retrieval cluster;
When the voice of video frequency program are analyzed, will obtain the phonetic figure of voice, what which said figure can obtain to occur in the program by search phonetic.But, the voice messaging of a program about 30 minutes needs the above phonetic figure of 600 width of cloth to be portrayed, and the search speed of phonetic figure itself is not high yet, therefore, in order to guarantee the retrieval rate of searching system, with the speech retrieval partial design is a retrieval cluster, comprises that speech buffer storage retrieval, the retrieval of voice optimization and voice phonetic figure retrieve three retrieval servers.
This three station server has guaranteed that the user can retrieve the voice messaging of those " often accessed ", i.e. information in the speech buffer storage fast.When not having information needed in the speech buffer storage, the content in the retrieval voice optimization server is promptly retrieved from a small amount of more excellent result of voice analysis.Simultaneously, those conditions that the background program of retrieval server will use user search to cross are carried out comprehensive retrieval of off-line to voice phonetic figure, and the result that will obtain is updated in the buffer memory.Like this, just improved user's retrieval rate after this.
About the detailed description of this part, see also patent " distributing speech searching system ".
As previously described.For the optimum pinyin string of voice, voice phonetic figure, video feature vector and caption information all is that form with the XML file passes to database.
1. the core content of the optimum pinyin string XML of voice file comprises: position (start and end times of scene, camera lens, voice) appears in optimum pinyin string content, the posterior probability of optimum pinyin string, optimum pinyin string.Optimum pinyin string is the higher phonetic graph search result of posterior probability who extracts from voice phonetic figure according to certain threshold requirement.
2. the core content of voice phonetic figure XML file comprises: the node of the phonetic figure of one section voice correspondence and arc information, position (start and end times of scene, camera lens, voice) appears in phonetic figure.
3. the core content of video feature vector XML file comprises: the classification of a key frame (scene or camera lens), the color moment characteristic of key frame images, the color histogram characteristic of key frame images, the textural characteristics data of key frame images, the start and end time of scene, camera lens under the key frame, the time point of key frame.
4. the core content of captions XML file comprises: caption content (text formatting), the appearance position of captions (start and end times of scene, camera lens and this section captions).
In above-mentioned retrieval module, used following index.
1.XML segmented index.For captions and voice optimization database, all be directly to retrieve the XML file, therefore to most crucial content in the XML file, promptly voice pinyin string and captioned test are set up segmented index, to improve retrieval rate.
When insertion, renewal, delete database content, carry out synchronously above-mentioned XML segmented index.
At set intervals, carry out Optimizing operation to above-mentioned XML segmented index.This work is designed to the background job of retrieval server, every scheduling in 15 days once.
2. based on the high dimension vector index of R tree.Key frame of video characteristic (representing with the high dimension vector form) has been set up the index of setting based on R.
At present, the achievement in research based on the index technology of R tree is a lot.Native system uses the thought of " first first index ", and the key frame images that is introduced into database is preferentially set up index, and the key frame images of putting in storage is subsequently checked then they can be divided in existing which cluster.If they do not belong to any existing cluster, then go to expand successively existing cluster with them, expand successful condition and be: the cluster (being the cluster hypermatrix) that the ultimate range (being the principal diagonal length of cluster hypermatrix) in the cluster of expansion back between the key frame is no more than after appointed threshold and the expansion does not intersect with other existing clusters.
3. plain text index.Some hot spot field that comprises metadatabase, as director, performer, brief introduction etc., and the keyword of speech buffer storage database.

Claims (20)

1. Content-based Video Retrieval system is characterized in that: comprising:
A metadata retrieval server is retrieved the metadata of video frequency program according to the requirement of search dispatching server;
A captions retrieval server is used for the XML file of depositing the program captioned test is retrieved;
A video frequency searching cluster is used for the characteristic of key frame of video is retrieved;
A speech retrieval cluster is used for voice messaging to video frequency program, comprises that pinyin string and phonetic figure retrieve;
A user oriented search interface: message format by appointment receives user's searching request and returns Search Results.
2. Content-based Video Retrieval as claimed in claim 1 system, it is characterized in that: the metadata table in the metadata retrieval server comprises program ID, programm name, director, performer, language, the place of production, Class1, type 2, file layout, file size, length, screen width, screen height, program address, the program file name is uploaded the time, last set address is uploaded state, whether must examine program level, the examination sign, the program price, attribute field is closed down in the program brief introduction; Metadata retrieval module wherein comprises one and retrieves the storing process of program, a storing process of retrieving program according to combination condition according to program ID; Metadata typing module wherein comprises the storing process that will specify metadata information to insert database table.
3. Content-based Video Retrieval as claimed in claim 1 system, it is characterized in that: comprise a database table that is used for storing captions XML file in the captions retrieval server, a table that is used for the storage server relevant configuration information, a storing process that reads configuration information, a storing process that is used for the XML retrieval, one is used for to the storing process of database typing XML file and the segmented index of an XML.
4. Content-based Video Retrieval as claimed in claim 3 system, it is characterized in that: captions XML document data bank table comprises a program id field, XML filename field and XML file field.
5. Content-based Video Retrieval as claimed in claim 3 system, it is characterized in that: the server configures information table comprises parameter I D field, parameter name field and parameter value field.
6. Content-based Video Retrieval as claimed in claim 3 system is characterized in that: be used for the storing process of XML retrieval, comprise keyword logical expression generator program piece and search program piece.
7. Content-based Video Retrieval as claimed in claim 1 system, it is characterized in that: the video frequency searching cluster comprises a video scene retrieval server and a video frequency searching server.
8. Content-based Video Retrieval as claimed in claim 7 system, it is characterized in that: comprise a storage scenarios key frame indexed data storehouse table in the video scene retrieval server, a storing process that is used for the index typing, a routine package that is used to retrieve the scene key frame.
9. Content-based Video Retrieval as claimed in claim 8 system, it is characterized in that: scene key frame index data base table, comprise the index id field, three index cluster lower bound vector fields, three index cluster upper bound vector fields, total digital section of key frame and index cluster ultimate range field in the index content nested table, index cluster.
10. Content-based Video Retrieval as claimed in claim 9 system, it is characterized in that: the index content nested table comprises clauses and subclauses sign id field, program id field under the key frame, the key frame number field, the critical frame types field, scene start time field, scene concluding time field, camera lens start time field, camera lens concluding time field, key frame time point field, three key frame characteristic information vector fields.
11. Content-based Video Retrieval as claimed in claim 7 system, it is characterized in that: the video frequency searching server, comprise a database table of depositing key frame of video XML file, deposit key frame indexed data storehouse table for one, the database table of a service device configuration information, a storing process that is used for typing XML file, a routine package that is used to generate index, a routine package that is used for the search index table, a remote linkage that is used to call video scene retrieval server internal program.
12. Content-based Video Retrieval as claimed in claim 11 system is characterized in that: the structure of the scene key frame index data base table of introducing in the structure and 9 of key frame of video XML document data bank table is identical, and it also comprises an index content nested table.
13. Content-based Video Retrieval as claimed in claim 12 system is characterized in that: the structure of the index content nested table of introducing in the index content nested table and 10 is identical.
14. Content-based Video Retrieval as claimed in claim 11 system is characterized in that: be used to generate the routine package of index, comprise that one is used for adding a new images eigenwert routine package of a cluster to; A routine package that is used for by a cluster of appointed threshold value expansion; A storing process that is used to create new cluster (directory entry).
15. the Content-based Video Retrieval system that uses as claimed in claim 14, it is characterized in that: be used for routine package by a cluster of appointed threshold value expansion, it comprises the primary storage process of an expansion, a storing process that is used to calculate expansion back cluster hypermatrix principal diagonal length, one is used to calculate expansion back cluster and whether has the storing process that overlaps with existing other clusters; Wherein, the maximum permissible value of cluster hypermatrix catercorner length is made as 2.0.
16. Content-based Video Retrieval as claimed in claim 14 system, it is characterized in that: be used for adding a new images eigenwert routine package of a cluster to, it comprises a primary storage process that is used to add, and one is used to judge whether the eigenwert of an image belongs to the storing process of certain cluster.
17. Content-based Video Retrieval as claimed in claim 11 system, it is characterized in that: the routine package that is used for the search index table, it comprises a retrieval primary storage process, a storing process that calculates the minor increment of search condition image and certain cluster, the storing process of the minimum value of the ultimate range of a calculating search condition image and certain cluster, whether effectively one be used to judge certain cluster program segment.
18. Content-based Video Retrieval as claimed in claim 17 system is characterized in that: retrieval primary storage process, wherein the maximum permissible value of distance is made as 2.0 between search condition image and the cluster.
19. Content-based Video Retrieval as claimed in claim 8 system is characterized in that: the routine package that is used for retrieving the scene key frame is identical with the structure of 17 routine packages of introducing that are used for the search index table.
20. Content-based Video Retrieval as claimed in claim 1 system is characterized in that: the speech retrieval cluster, comprising a speech buffer storage retrieval server, an optimum retrieval server of voice and a voice phonetic figure retrieval server.
CN2006101408329A 2006-10-11 2006-10-11 Video searching system based on content Expired - Fee Related CN101021855B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN2006101408329A CN101021855B (en) 2006-10-11 2006-10-11 Video searching system based on content

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN2006101408329A CN101021855B (en) 2006-10-11 2006-10-11 Video searching system based on content

Publications (2)

Publication Number Publication Date
CN101021855A true CN101021855A (en) 2007-08-22
CN101021855B CN101021855B (en) 2010-04-07

Family

ID=38709623

Family Applications (1)

Application Number Title Priority Date Filing Date
CN2006101408329A Expired - Fee Related CN101021855B (en) 2006-10-11 2006-10-11 Video searching system based on content

Country Status (1)

Country Link
CN (1) CN101021855B (en)

Cited By (46)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101938619A (en) * 2009-06-30 2011-01-05 索尼公司 Information processing apparatus, method and program thereof
CN101515278B (en) * 2008-02-22 2011-01-26 鸿富锦精密工业(深圳)有限公司 Image access device and method for storing and reading images
CN102340705A (en) * 2010-07-19 2012-02-01 中兴通讯股份有限公司 System and method for obtaining key frame
CN102375846A (en) * 2010-08-20 2012-03-14 上海弘视通信技术有限公司 Method for searching subsequent alarm picture
CN102480565A (en) * 2010-11-19 2012-05-30 Lg电子株式会社 Mobile terminal and method of managing video using metadata therein
CN102650993A (en) * 2011-02-25 2012-08-29 北大方正集团有限公司 Index establishing and searching methods, devices and systems for audio-video file
CN102722510A (en) * 2011-12-06 2012-10-10 新奥特(北京)视频技术有限公司 Method for sharing heterogeneous databse of field score treatment system
CN102855317A (en) * 2012-08-31 2013-01-02 王晖 Multimode indexing method and system based on demonstration video
CN102884538A (en) * 2010-04-26 2013-01-16 微软公司 Enriching online videos by content detection, searching, and information aggregation
CN101719144B (en) * 2009-11-04 2013-04-24 中国科学院声学研究所 Method for segmenting and indexing scenes by combining captions and video image information
CN103269444A (en) * 2013-04-03 2013-08-28 深圳市龙视传媒有限公司 Method and device for automatically generating program group purchase
CN103279494A (en) * 2013-05-03 2013-09-04 吴军 Dynamic video analysis moving target retrieval system
CN103309865A (en) * 2012-03-07 2013-09-18 腾讯科技(深圳)有限公司 Method and system for realizing video source clustering
CN103605765A (en) * 2013-11-26 2014-02-26 电子科技大学 Mass image retrieval system based on cluster compactness
CN103606158A (en) * 2013-11-29 2014-02-26 深圳市龙视传媒有限公司 Preprocessing method and terminal for video clipping
CN103634626A (en) * 2012-08-24 2014-03-12 神讯电脑(昆山)有限公司 Sectional type operation guidance video automatic play system
WO2014082288A1 (en) * 2012-11-30 2014-06-05 Thomson Licensing Method and apparatus for video retrieval
CN103916687A (en) * 2013-01-07 2014-07-09 三星电子株式会社 Display apparatus and method of controlling display apparatus
CN103945281A (en) * 2014-04-29 2014-07-23 中国联合网络通信集团有限公司 Method, device and system for video transmission processing
CN104011740A (en) * 2012-01-06 2014-08-27 高通股份有限公司 Object tracking and processing
CN104023181A (en) * 2014-06-23 2014-09-03 联想(北京)有限公司 Information processing method and device
CN104376003A (en) * 2013-08-13 2015-02-25 深圳市腾讯计算机系统有限公司 Video retrieval method and device
CN104391924A (en) * 2014-11-21 2015-03-04 南京讯思雅信息科技有限公司 Mixed audio and video search method and system
WO2015096427A1 (en) * 2013-12-27 2015-07-02 中兴通讯股份有限公司 Method and system for locating and searching surveillance video
CN105868684A (en) * 2015-12-10 2016-08-17 乐视网信息技术(北京)股份有限公司 Video information acquisition method and apparatus
CN106372067A (en) * 2015-07-20 2017-02-01 联想移动通信软件(武汉)有限公司 Method and device for image data search and electronic equipment
CN103559196B (en) * 2013-09-23 2017-02-22 浙江大学 Video retrieval method based on multi-core canonical correlation analysis
CN107194015A (en) * 2017-07-07 2017-09-22 上海思依暄机器人科技股份有限公司 A kind of method and apparatus for controlling audio and video resources to play
CN107609149A (en) * 2017-09-21 2018-01-19 北京奇艺世纪科技有限公司 A kind of video locating method and device
CN107862003A (en) * 2017-10-24 2018-03-30 珠海市魅族科技有限公司 Video contents search method, apparatus, terminal and readable storage medium storing program for executing
CN107870959A (en) * 2016-09-23 2018-04-03 奥多比公司 Inquired about in response to video search and associated video scene is provided
CN108464007A (en) * 2016-04-13 2018-08-28 谷歌有限责任公司 Video metadata correlation recommendation
CN109543511A (en) * 2018-10-09 2019-03-29 广州市诚毅科技软件开发有限公司 Video frequency identifying method, system and device based on figure line mutation frame and feature calculation
CN109710844A (en) * 2018-12-20 2019-05-03 中国银行业监督管理委员会福建监管局 The method and apparatus for quick and precisely positioning file based on search engine
CN109977268A (en) * 2019-02-26 2019-07-05 南京邮电大学 A kind of video information search method
CN109982106A (en) * 2019-04-29 2019-07-05 百度在线网络技术(北京)有限公司 A kind of video recommendation method, server, client and electronic equipment
CN109982105A (en) * 2017-12-27 2019-07-05 上海文广互动电视有限公司 Content retrieval system and method for broadcast platform
CN110008378A (en) * 2019-01-28 2019-07-12 平安科技(深圳)有限公司 Corpus collection method, device, equipment and storage medium based on artificial intelligence
CN110392281A (en) * 2018-04-20 2019-10-29 腾讯科技(深圳)有限公司 Image synthesizing method, device, computer equipment and storage medium
CN111259193A (en) * 2020-01-16 2020-06-09 高新兴科技集团股份有限公司 Feature retrieval system based on clustering filtration and application method thereof
CN111930970A (en) * 2020-08-06 2020-11-13 通维数码科技(上海)有限公司 News storage and search method based on video and voice recognition
CN112836185A (en) * 2019-11-22 2021-05-25 上海哔哩哔哩科技有限公司 User authentication method and system
CN114385859A (en) * 2021-12-29 2022-04-22 北京理工大学 Multi-modal retrieval method for video content
CN114385855A (en) * 2022-03-25 2022-04-22 广州长嘉电子有限公司 Method, system and medium for recommending media content based on media database
CN115168690A (en) * 2022-09-06 2022-10-11 深圳市明源云科技有限公司 Data query method and device based on browser plug-in, electronic equipment and medium
CN117633297A (en) * 2024-01-26 2024-03-01 江苏瑞宁信创科技有限公司 Video retrieval method, device, system and medium based on annotation

Cited By (69)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101515278B (en) * 2008-02-22 2011-01-26 鸿富锦精密工业(深圳)有限公司 Image access device and method for storing and reading images
CN101938619B (en) * 2009-06-30 2012-10-10 索尼公司 Information processing apparatus, method
CN101938619A (en) * 2009-06-30 2011-01-05 索尼公司 Information processing apparatus, method and program thereof
CN101719144B (en) * 2009-11-04 2013-04-24 中国科学院声学研究所 Method for segmenting and indexing scenes by combining captions and video image information
CN102884538A (en) * 2010-04-26 2013-01-16 微软公司 Enriching online videos by content detection, searching, and information aggregation
CN102884538B (en) * 2010-04-26 2016-11-16 微软技术许可有限责任公司 Online Video is enriched by content detection, search and information aggregation
US9443147B2 (en) 2010-04-26 2016-09-13 Microsoft Technology Licensing, Llc Enriching online videos by content detection, searching, and information aggregation
CN102340705A (en) * 2010-07-19 2012-02-01 中兴通讯股份有限公司 System and method for obtaining key frame
CN102340705B (en) * 2010-07-19 2014-04-30 中兴通讯股份有限公司 System and method for obtaining key frame
CN102375846A (en) * 2010-08-20 2012-03-14 上海弘视通信技术有限公司 Method for searching subsequent alarm picture
CN102480565A (en) * 2010-11-19 2012-05-30 Lg电子株式会社 Mobile terminal and method of managing video using metadata therein
CN102480565B (en) * 2010-11-19 2014-12-10 Lg电子株式会社 Mobile terminal and method of managing video using metadata therein
CN102650993A (en) * 2011-02-25 2012-08-29 北大方正集团有限公司 Index establishing and searching methods, devices and systems for audio-video file
CN102722510A (en) * 2011-12-06 2012-10-10 新奥特(北京)视频技术有限公司 Method for sharing heterogeneous databse of field score treatment system
CN102722510B (en) * 2011-12-06 2018-12-25 新奥特(北京)视频技术有限公司 A kind of method that the heterogeneous database of on-site score processing system is shared
CN104011740A (en) * 2012-01-06 2014-08-27 高通股份有限公司 Object tracking and processing
CN104011740B (en) * 2012-01-06 2017-06-09 高通股份有限公司 Object tracking and treatment
CN103309865A (en) * 2012-03-07 2013-09-18 腾讯科技(深圳)有限公司 Method and system for realizing video source clustering
CN103309865B (en) * 2012-03-07 2017-03-22 腾讯科技(深圳)有限公司 Method and system for realizing video source clustering
CN103634626A (en) * 2012-08-24 2014-03-12 神讯电脑(昆山)有限公司 Sectional type operation guidance video automatic play system
CN102855317A (en) * 2012-08-31 2013-01-02 王晖 Multimode indexing method and system based on demonstration video
CN102855317B (en) * 2012-08-31 2016-05-04 王晖 A kind of multi-mode indexing means and system based on demonstration video
WO2014082288A1 (en) * 2012-11-30 2014-06-05 Thomson Licensing Method and apparatus for video retrieval
CN104798068A (en) * 2012-11-30 2015-07-22 汤姆逊许可公司 Method and apparatus for video retrieval
CN103916687A (en) * 2013-01-07 2014-07-09 三星电子株式会社 Display apparatus and method of controlling display apparatus
CN103269444A (en) * 2013-04-03 2013-08-28 深圳市龙视传媒有限公司 Method and device for automatically generating program group purchase
CN103279494A (en) * 2013-05-03 2013-09-04 吴军 Dynamic video analysis moving target retrieval system
CN104376003A (en) * 2013-08-13 2015-02-25 深圳市腾讯计算机系统有限公司 Video retrieval method and device
CN104376003B (en) * 2013-08-13 2019-07-05 深圳市腾讯计算机系统有限公司 A kind of video retrieval method and device
CN103559196B (en) * 2013-09-23 2017-02-22 浙江大学 Video retrieval method based on multi-core canonical correlation analysis
CN103605765B (en) * 2013-11-26 2016-11-16 电子科技大学 A kind of based on the massive image retrieval system clustering compact feature
CN103605765A (en) * 2013-11-26 2014-02-26 电子科技大学 Mass image retrieval system based on cluster compactness
CN103606158A (en) * 2013-11-29 2014-02-26 深圳市龙视传媒有限公司 Preprocessing method and terminal for video clipping
WO2015096427A1 (en) * 2013-12-27 2015-07-02 中兴通讯股份有限公司 Method and system for locating and searching surveillance video
CN103945281B (en) * 2014-04-29 2018-04-17 中国联合网络通信集团有限公司 Transmission of video processing method, device and system
CN103945281A (en) * 2014-04-29 2014-07-23 中国联合网络通信集团有限公司 Method, device and system for video transmission processing
CN104023181A (en) * 2014-06-23 2014-09-03 联想(北京)有限公司 Information processing method and device
CN104023181B (en) * 2014-06-23 2018-08-31 联想(北京)有限公司 Information processing method and device
CN104391924A (en) * 2014-11-21 2015-03-04 南京讯思雅信息科技有限公司 Mixed audio and video search method and system
CN106372067A (en) * 2015-07-20 2017-02-01 联想移动通信软件(武汉)有限公司 Method and device for image data search and electronic equipment
CN105868684A (en) * 2015-12-10 2016-08-17 乐视网信息技术(北京)股份有限公司 Video information acquisition method and apparatus
CN108464007A (en) * 2016-04-13 2018-08-28 谷歌有限责任公司 Video metadata correlation recommendation
CN107870959A (en) * 2016-09-23 2018-04-03 奥多比公司 Inquired about in response to video search and associated video scene is provided
CN107870959B (en) * 2016-09-23 2023-08-04 奥多比公司 Providing relevant video scenes in response to a video search query
CN107194015A (en) * 2017-07-07 2017-09-22 上海思依暄机器人科技股份有限公司 A kind of method and apparatus for controlling audio and video resources to play
CN107609149A (en) * 2017-09-21 2018-01-19 北京奇艺世纪科技有限公司 A kind of video locating method and device
CN107609149B (en) * 2017-09-21 2020-06-19 北京奇艺世纪科技有限公司 Video positioning method and device
CN107862003A (en) * 2017-10-24 2018-03-30 珠海市魅族科技有限公司 Video contents search method, apparatus, terminal and readable storage medium storing program for executing
CN109982105A (en) * 2017-12-27 2019-07-05 上海文广互动电视有限公司 Content retrieval system and method for broadcast platform
CN110392281A (en) * 2018-04-20 2019-10-29 腾讯科技(深圳)有限公司 Image synthesizing method, device, computer equipment and storage medium
CN110392281B (en) * 2018-04-20 2022-03-18 腾讯科技(深圳)有限公司 Video synthesis method and device, computer equipment and storage medium
CN109543511A (en) * 2018-10-09 2019-03-29 广州市诚毅科技软件开发有限公司 Video frequency identifying method, system and device based on figure line mutation frame and feature calculation
CN109710844A (en) * 2018-12-20 2019-05-03 中国银行业监督管理委员会福建监管局 The method and apparatus for quick and precisely positioning file based on search engine
CN110008378B (en) * 2019-01-28 2024-03-19 平安科技(深圳)有限公司 Corpus collection method, device, equipment and storage medium based on artificial intelligence
CN110008378A (en) * 2019-01-28 2019-07-12 平安科技(深圳)有限公司 Corpus collection method, device, equipment and storage medium based on artificial intelligence
CN109977268B (en) * 2019-02-26 2022-10-14 南京邮电大学 Video information retrieval method
CN109977268A (en) * 2019-02-26 2019-07-05 南京邮电大学 A kind of video information search method
CN109982106B (en) * 2019-04-29 2021-11-26 百度在线网络技术(北京)有限公司 Video recommendation method, server, client and electronic equipment
CN109982106A (en) * 2019-04-29 2019-07-05 百度在线网络技术(北京)有限公司 A kind of video recommendation method, server, client and electronic equipment
CN112836185A (en) * 2019-11-22 2021-05-25 上海哔哩哔哩科技有限公司 User authentication method and system
CN111259193B (en) * 2020-01-16 2023-08-25 高新兴科技集团股份有限公司 Feature retrieval system based on cluster filtering and application method thereof
CN111259193A (en) * 2020-01-16 2020-06-09 高新兴科技集团股份有限公司 Feature retrieval system based on clustering filtration and application method thereof
CN111930970A (en) * 2020-08-06 2020-11-13 通维数码科技(上海)有限公司 News storage and search method based on video and voice recognition
CN114385859A (en) * 2021-12-29 2022-04-22 北京理工大学 Multi-modal retrieval method for video content
CN114385859B (en) * 2021-12-29 2024-07-16 北京理工大学 Multi-mode retrieval method for video content
CN114385855A (en) * 2022-03-25 2022-04-22 广州长嘉电子有限公司 Method, system and medium for recommending media content based on media database
CN115168690A (en) * 2022-09-06 2022-10-11 深圳市明源云科技有限公司 Data query method and device based on browser plug-in, electronic equipment and medium
CN117633297A (en) * 2024-01-26 2024-03-01 江苏瑞宁信创科技有限公司 Video retrieval method, device, system and medium based on annotation
CN117633297B (en) * 2024-01-26 2024-04-30 江苏瑞宁信创科技有限公司 Video retrieval method, device, system and medium based on annotation

Also Published As

Publication number Publication date
CN101021855C (en)
CN101021855B (en) 2010-04-07

Similar Documents

Publication Publication Date Title
CN101021855B (en) Video searching system based on content
CN100423004C (en) Video search dispatching system based on content
CN109992645B (en) Data management system and method based on text data
CN110430476B (en) Live broadcast room searching method, system, computer equipment and storage medium
US8296797B2 (en) Intelligent video summaries in information access
CN101021857A (en) Video searching system based on content analysis
CN110888990A (en) Text recommendation method, device, equipment and medium
CN101021856B (en) Distributing speech searching system
CN101211370A (en) Content register device, content register method and content register program
CN104679875B (en) A kind of information data classification method based on digital newspaper
CN102236714A (en) Extensible markup language (XML)-based interactive application multimedia information retrieval method
KR20220130863A (en) Apparatus for Providing Multimedia Conversion Content Creation Service Based on Voice-Text Conversion Video Resource Matching
CN102457817A (en) Method and system for extracting news contents from mobile phone newspaper
CN116010552A (en) Engineering cost data analysis system and method based on keyword word library
JP2008217701A (en) Metadata providing device, metadata providing method, metadata providing program, and recording medium recording metadata providing program
CN117952203A (en) Knowledge base construction method, video automatic production method and software product
WO2010071617A1 (en) Method and apparatus for performing image processing
CN107544978A (en) A kind of content based video retrieval system method
CN116976299A (en) Advertisement generation method, device, equipment and storage medium
Messina et al. Creating rich metadata in the TV broadcast archives environment: The Prestospace project
CN114218407A (en) Content creation system based on digital automatic indexing
JP4755122B2 (en) Image dictionary generation method, apparatus, and program
KR20220130864A (en) A system for providing a service that produces voice data into multimedia converted contents
CN113468377A (en) Video and literature association and integration method
TW201142767A (en) Tool and method for creating teaching material

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
ASS Succession or assignment of patent right

Owner name: BEIJING NUFRONT NETWORK TECHNOLOGY CO., LTD.

Free format text: FORMER OWNER: BAO DONGSHAN

Effective date: 20071228

C41 Transfer of patent application or patent right or utility model
TA01 Transfer of patent application right

Effective date of registration: 20071228

Address after: A, building 16, building 1, building 8, Qinghua science park, No. 100084, Zhongguancun East Road, Beijing, Haidian District, China

Applicant after: Beijing Nufront Software Technology Co., Ltd.

Address before: A, building 16, building 1, building 8, Qinghua science park, No. 100084, Zhongguancun East Road, Beijing, Haidian District, China

Applicant before: Bao Dongshan

C14 Grant of patent or utility model
GR01 Patent grant
C17 Cessation of patent right
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20100407

Termination date: 20131011