CN106126619A - A kind of video retrieval method based on video content and system - Google Patents

A kind of video retrieval method based on video content and system Download PDF

Info

Publication number
CN106126619A
CN106126619A CN201610458196.8A CN201610458196A CN106126619A CN 106126619 A CN106126619 A CN 106126619A CN 201610458196 A CN201610458196 A CN 201610458196A CN 106126619 A CN106126619 A CN 106126619A
Authority
CN
China
Prior art keywords
video
similarity
information
result
descriptor
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201610458196.8A
Other languages
Chinese (zh)
Inventor
罗笑南
徐颂华
姜涛
林格
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
National Sun Yat Sen University
Original Assignee
National Sun Yat Sen University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by National Sun Yat Sen University filed Critical National Sun Yat Sen University
Priority to CN201610458196.8A priority Critical patent/CN106126619A/en
Publication of CN106126619A publication Critical patent/CN106126619A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/951Indexing; Web crawling techniques
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/70Information retrieval; Database structures therefor; File system structures therefor of video data
    • G06F16/78Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • G06F16/783Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Library & Information Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention discloses a kind of video retrieval method based on video content and system, wherein, described method includes: the descriptor of the search key of user with the topic set of background subject term is mated, obtains descriptor close with search key in coupling;Carry out video frequency searching according to the described descriptor close with search key, obtain the video that the described descriptor close with search key is corresponding;According to the video that the described descriptor close with search key is corresponding, obtain video essential information;According to described video essential information, described Video similarity is estimated, obtain Video similarity and estimate result;Described Video similarity is estimated result and carries out aggregative weighted appraisal, obtain Video similarity and comprehensively estimate result;Comprehensively estimate result according to acquisition Video similarity to show;In embodiments of the present invention, retrieve the inaccurate problem of content present in effective solution video frequency searching based on title, improve the experience sense of user.

Description

A kind of video retrieval method based on video content and system
Technical field
The present invention relates to video search technique area, particularly relate to a kind of video retrieval method based on video content and be System.
Background technology
Since eighties of last century the nineties, along with development and the development of community network, the network of network media technology Broadband improves constantly, and increasing information is presented in the Internet by multimedia forms such as video, audio frequency, images, many Media data presents the growth of explosion type;And in various Multi-media Materials, as video, audio frequency, text, figure, image and Animations etc., because video data enjoys liking of people with its vividness, intuitive and great affinity, in recent years, to provide Video sharing be the video website of main business be also flourish, domestic have the strange skill of love, excellent cruel, Rhizoma Solani tuber osi and a CNTV etc., abroad There are YouTube, Hulu, Yahoo, Video etc.;Large number of video sharing website enriches the audiovisual entertainment of people greatly Deng movable.
The multimedia video of magnanimity enriches the life of the people, along with increasing of video data, to these video datas There is huge technical problem in management, needs to put into a large amount of manpower and materials and video carries out classification process and adds tag processes, But these process still can do nothing to help user and retrieve the video that user needs fast and accurately.
Summary of the invention
It is an object of the invention to overcome the deficiencies in the prior art, the invention provides a kind of video based on video content Search method and system, effectively solve the retrieval inaccurate problem of content present in video frequency searching based on title, improve and use The experience sense at family.
In order to solve above-mentioned technical problem, the invention provides a kind of video retrieval method based on video content, described Method includes:
The descriptor of the search key of user with the topic set of background subject term is mated, obtains in coupling and close with retrieval The descriptor that keyword is close;
Carry out video frequency searching according to the described descriptor close with search key, obtain described close with search key Video corresponding to descriptor;
The video corresponding according to obtaining the described descriptor close with search key, obtains video essential information;
According to described video essential information, described Video similarity is estimated, obtain Video similarity and estimate result;
Described Video similarity is estimated result and carries out aggregative weighted appraisal, obtain Video similarity and comprehensively estimate result;
Comprehensively estimate result according to acquisition Video similarity to show.
Preferably, before the descriptor of the search key of user with the topic set of background subject term is mated, described side Method also includes::
The caption information of all videos in acquisition data base and audio-frequency information, believe described caption information and described audio frequency Breath is converted into the first video text message;
By described first video text message is processed, obtain described video text message descriptor;
The background document relevant to described video text message descriptor is obtained by described video text message descriptor Information, processes described background document information, obtains the first background document descriptor information;
According to described video text message descriptor and described first background document descriptor information, obtain background theme word Set.
Preferably, described video essential information at least includes the second video text message, the second background document descriptor letter Any one information in breath and video comments information.
Preferably, the obtaining step of described video comments information, including:
Described video is carried out data reptile process, obtains described video comments information.
Preferably, described according to video essential information, described Video similarity is estimated, obtain Video similarity and estimate The step of amount result, including:
According to described second video text message, the similarity of described video is estimated, obtain Video similarity and estimate Result;
According to described second background document descriptor information, the similarity of described video is estimated, obtain video similar Property estimate result;
According to described video comments information, the similarity of described video is estimated, obtain Video similarity and estimate knot Really.
Preferably, described estimating the similarity of described video according to described second video text message, acquisition regards Frequently similarity estimates the step of result, including:
The similarity of the character string according to described second video text message carries out Video similarity appraisal, obtains based on institute The Video similarity stating character string estimates result;
The similarity of the corpus according to described second visual text message carries out Video similarity appraisal, obtains based on institute The Video similarity stating corpus estimates result;
The similarity of the word content according to described second video text message carries out Video similarity appraisal, obtain based on The Video similarity of described word content estimates result.
Preferably, described according to described second background document descriptor information, the similarity of described video is estimated, Obtain Video similarity and estimate the step of result, including:
Set similarity according to described second background document descriptor information carries out Video similarity appraisal, obtain based on The Video similarity of described set estimates result;
Lexical Similarity according to described second background document descriptor information carries out Video similarity appraisal, obtain based on The Video similarity of described vocabulary estimates result.
Preferably, described according to described video comments information, the similarity of described video is estimated, obtain video phase The step of result is estimated like property, including:
Described video comments information is processed, obtains the relation letter of the video in video comments information, user, comment Breath;
According to the video in described video comments information, user, the relation information of comment, described video is carried out similarity Estimate, obtain and estimate result based on described video comments information similarity.
Preferably, described described Video similarity is estimated result carry out aggregative weighted appraisal, obtain Video similarity and combine Close the step estimating result, including:
Build Video similarity and comprehensively estimate model;
Comprehensively estimate model according to described Video similarity and described similarity appraisal result is carried out data training, obtain instruction Practice result.
Described training result is carried out aggregative weighted process, obtains aggregative weighted result;
Obtain Video similarity according to aggregative weighted result and comprehensively estimate result.
It addition, present invention also offers a kind of video frequency search system based on video content, described system includes:
Matching module: for the search key of user is matched with the topic set of background subject term, obtain and retrieve key The descriptor that word is close;
Retrieval module: for carrying out video frequency searching according to the described descriptor close with search key, obtain described and The video that descriptor that search key is close is corresponding;
Data obtaining module: for the video corresponding according to the described descriptor close with search key, obtains video Essential information;
Similarity measurement modules: for estimating described Video similarity according to described video essential information, obtains Video similarity estimates result;
Aggregative weighted module: carry out aggregative weighted appraisal for described Video similarity is estimated result, obtain video phase Result is comprehensively estimated like property;
Display module: show for comprehensively estimating result according to acquisition Video similarity.
In embodiments of the present invention, by the video in video library first being carried out pretreatment, obtain each in video library regarding Frequently background theme set, matches with video background theme set according to the search key of user, obtains descriptor, according to master Epigraph carries out Video similarity appraisal, comprehensively estimates result, root eventually through what the mode of aggregative weighted obtained Video similarity It is presented to user according to the comprehensive appraisal result of the Video similarity obtained by high to Low, completes the retrieval to video;Effective solution Certainly retrieve the inaccurate problem of content present in video frequency searching based on title, improve the experience sense of user.
Accompanying drawing explanation
In order to be illustrated more clearly that the embodiment of the present invention or technical scheme of the prior art, below will be to embodiment or existing In having technology to describe, the required accompanying drawing used is briefly described, it should be apparent that, the accompanying drawing in describing below is only this Some embodiments of invention, for those of ordinary skill in the art, on the premise of not paying creative work, it is also possible to Other accompanying drawing is obtained according to these accompanying drawings.
Fig. 1 is the method flow schematic diagram of the video retrieval method in the embodiment of the present invention;
Fig. 2 is the method flow schematic diagram that the background theme set in the embodiment of the present invention obtains;
Fig. 3 is that the Video similarity that obtains in the embodiment of the present invention comprehensively estimates the steps flow chart schematic diagram of result;
Fig. 4 is the system structure composition schematic diagram of the video frequency search system in the embodiment of the present invention;
Fig. 5 is the modular structure composition schematic diagram of the aggregative weighted module in the embodiment of the present invention.
Detailed description of the invention
Below in conjunction with the accompanying drawing in the embodiment of the present invention, the technical scheme in the embodiment of the present invention is carried out clear, complete Describe, it is clear that described embodiment is only a part of embodiment of the present invention rather than whole embodiments wholely.Based on Embodiment in the present invention, it is every other that those of ordinary skill in the art are obtained under not making creative work premise Embodiment, broadly falls into the scope of protection of the invention.
Fig. 1 is the method flow schematic diagram of the video retrieval method in the embodiment of the present invention, as it is shown in figure 1, the method bag Include:
S11: the descriptor of the search key of user with the topic set of background subject term is mated, with inspection in acquisition coupling The descriptor that rope key word is close;
S12: carry out video frequency searching according to the descriptor close with search key, obtains the master close with search key The video that epigraph is corresponding;
S13: the video corresponding according to obtaining the descriptor close with search key, obtains video essential information;
S14: estimate described Video similarity according to video essential information, obtains Video similarity and estimates result;
S15: Video similarity is estimated result and carries out aggregative weighted appraisal, obtains Video similarity and comprehensively estimates result;
S16: the comprehensive appraisal result according to obtaining Video similarity shows.
S11 is described further:
Search key is got, by get by the way of the search key that frame retrieval receives user's input Search key is mutually matched with background theme word, gets the theme close with search key according to matching result Word.
Further, before S11, Fig. 2 is that the method flow that the background theme set in the embodiment of the present invention obtains shows It is intended to, as in figure 2 it is shown, the method includes:
S111: the caption information of all videos in acquisition data base and audio-frequency information, by caption information and audio-frequency information It is converted into the first video text message;
S112: by processing the first video text message, obtains video text message descriptor;
S113: obtain the background document relevant to video text message descriptor by video text message descriptor and believe Breath, processes background document information, obtains the first background document descriptor information;
S114: according to video text message descriptor and the first background document descriptor information, obtain background theme word set Close.
S111 is described further:
First extract all videos in video library, respectively each video is carried out frame segmentation, use OCR technique to obtain every Caption information on one frame, and the caption information of the caption information of former frame with a later frame is compared, if two frames front and back The similarity of caption information is more than 80%, then it is assumed that two frame caption informations repeat mutually, only preserves the caption information of former frame;Use Such mode, till having obtained the caption information of all frames of video, obtains caption information;Video is carried out audio-frequency information Classification processes, and gets audio-frequency information, uses ASR technology to process this audio-frequency information, this audio-frequency information is converted to literary composition Word information.
Then caption information and Word message carrying out garbage character string removal process, the rule of this process can be such that
(1) if the character that comprises of character string is more than 40, then regard as garbage character string, filter out;
(2) if the sum of the alphanumeric character (or alphabetic character) in a character string is less than 50%, then regard as Garbage character string, filters out;
(3) if a character string has continuous 4 identical characters, then regard as garbage character string, filter out;
(4) for comprising only the character string of letter number, vowel and the quantity of consonant are checked, if a kind of letter Quantity less than the 10% of another kind of number of letters, then regard as garbage character string, filter out;
(5), after removing the head and the tail letter of a character string, if the kind more than two of punctuation character, then rubbish is regarded as Character string, filters out;
(6) it is all lower case when the head and the tail of a character string, if capitalization occurs in middle any position, then assert For garbage character, filter out.
Based on above rule, remove the most garbage character string in caption information and Word message all by effective mistake Filter, simultaneously for the character string of some specific formats, such as addresses of items of mail or some specific representation, uses regular expression With the mode of configuration file, these character strings are retained;Then remaining caption information and Word message are removed The merging repeated, gets visual text message.
S112 is described further:
Use KEA++ algorithm that video text message is processed, obtain the descriptor of text message;It had been embodied as Cheng Zhong, training dataset: using video text message as training data, and video text message remittance is trained;Control word Remittance table: extract the key phrase in video text message, the key phrase of different field is assigned to the control of its art Vocabulary;Descriptor generates: according to training dataset and managing terminology table, uses KEA++ algorithm to generate a learning model, adopts With this learning model, video text message is carried out descriptor prediction, generate descriptor.
S113 is described further:
According in S112 obtain to descriptor, use this descriptor to carry out literature search, get and this descriptor phase The background document information closed, uses KEA++ algorithm to carry out background document information processing acquisition background document descriptor letter Breath;In specific implementation process, training dataset: using background document information as training data, and background document information is entered Row training;Managing terminology table: extract the key phrase in background document information, the key phrase of different field is assigned to its institute The managing terminology table in genus field;Descriptor generates: according to training dataset and managing terminology table, uses KEA++ algorithm to generate one Individual learning model, uses this learning model that background document information carries out descriptor prediction, generates background document descriptor information.
S114 is described further:
The learning model produced according to KEA++ algorithm predicts the video text message descriptor and background document theme obtained Descriptor in word information is multiplied with degree of association probability and the Lucene score of video with this descriptor of prediction, obtains the highest being multiplied Result part builds descriptor set.
Wherein the code of points of Lucene be the descriptor to each documentation & info or text message and its inquiry request or The frequency of occurrences (TF) of key word, reverse documentation & info or text message frequency (IDF) documentation & info or text message and each The weight of inquiry field, and the length of the documentation & info of inquiry field or text message is relevant.
S12 is described further:
By retrieving according to the video in the topic word pair video database close with search key, thus obtain To the video that the descriptor close with search key is corresponding.
S13 is described further:
After getting, according to above-mentioned S12, the video that the descriptor close with search key is corresponding, obtain this video Video essential information;Wherein, video essential information at least includes video text message, background document descriptor information and video Review information.
Video text message and background document descriptor information are to extract in S11 is to the result after this Video processing Out, video comments information is the process processing mode using data reptile, the review information of this video that crawls, thus gets Video comments information.
S14 is described further:
According to video essential information, this video is carried out similarity appraisal, obtain Video similarity and estimate result;Because depending on Frequently essential information at least includes video text message, background document descriptor information and video comments information;In the present embodiment, It is to be respectively adopted video text message, background document descriptor information and video comments information to carry out Video similarity appraisal, obtains Take Video similarity and estimate result.
Further, according to video text message, background document descriptor information and video comments information respectively to video Carry out similarity appraisal, and the appraisal result that the similarity obtaining them respectively is estimated;It is i.e. according to video text message pair The similarity of video is estimated, and obtains Video similarity and estimates result;According to the background document descriptor information phase to video Estimate like property, obtain Video similarity and estimate result;According to video comments information, the similarity of video is estimated, obtain Take Video similarity and estimate result.
Further, according to video text message, the similarity of video is estimated, obtain Video similarity and estimate knot Fruit is divided into the similarity of character string based on video text message to carry out Video similarity appraisal, language based on video text message The similarity in material storehouse carries out the similarity of Video similarity appraisal and word content based on video text message and carries out video phase Estimate like property, thus obtain similarity and estimate result.
Wherein, the similarity of character string based on video text message is carried out between Video similarity appraisal employing character string The similarity of cosine similarity calculating character string, formula is as follows:
cos ( T i , T j ) = Σ k = 1 n W i k W j k Σ k = 1 n W i k 2 Σ k = 1 n W j k 2 ;
Wherein, TiRepresent the vector of i-th character string, wikRepresent the kth dimension of i-th character string, TjRepresent jth word The vector of symbol string, wjkThe kth dimension of expression jth character string vector, k=1 ..., n, i, j=1 ... m.
Wherein, the similarity of corpus based on video text message carries out the Video similarity appraisal similar calculation of employing set Method, formula is as follows:
D i c e ( T i , T j ) = 2 × c o m m ( T i , T j ) s i z e ( T i ) + s i z e ( T j ) ;
Wherein, TiRepresent video text message, TjRepresent corpus information, comm (Ti,Tj) represent text message and language material There is the number of identical characters string, size (T in storehouse informationi)、size(Tj) represent video text message and corpus information respectively The size of string assemble.
Wherein, the similarity of word content based on video text message carry out Video similarity estimate employing utilize Lin Algorithm calculates, and formula is as follows:
L i n ( T i , T j ) = 2 × I C ( L C S ( T i , T j ) ) I C ( T i ) + I C ( T j )
Wherein, Ti,TjIt is to be compared two character strings, LCS (Ti,Tj) it is the nearest ancestors of two character strings, IC (w) Represent the quantity of information of character string T.
Further, according to background document descriptor information, the similarity of video is estimated, obtain Video similarity Estimating result is divided into the set similarity according to background document descriptor information to carry out Video similarity appraisal, obtains based on set Video similarity estimate result;Lexical Similarity according to background document descriptor information carries out Video similarity appraisal, obtains Take Video similarity based on vocabulary and estimate result.
Wherein, carry out Video similarity appraisal according to the set similarity of background document descriptor information, obtain based on collection The Video similarity closed is estimated result and is used set Similarity Algorithm to calculate, and formula is as follows:
S i m ( S i , S j ) = Σ k h max h ( w u p ( t i k , t j h ) ) + Σ h m max k ( w u p ( t j h , t i k ) ) s i z e ( S i ) + s i z e ( S j )
Wherein, Si、SjFor two the most close descriptor set, tik、tjhIt is set S respectivelyi、SjIn descriptor, wup(tik,tjh) it is the wup similarity between two descriptor, maxh(wup(tik,tjh)) it is descriptor tikWith set SjIn The maximum of the wup similarity of all descriptor, maxk(wup(tjh,tik)) it is descriptor tjhWith set SiIn all themes The maximum of the wup similarity of word, size (S) represents the number of set.
Further, according to video comments information, the similarity of video is estimated, obtain Video similarity and estimate knot Really, it is by video comments information is processed, obtains the video in video comments information, user, the relation information of comment; According to the video in video comments information, user, the relation information of comment, video is carried out similarity appraisal, obtain based on video Review information similarity estimates result.
S15 is described further:
Use Video similarity comprehensively to estimate algorithm and build Video similarity appraisal model, comprehensively estimate according to Video similarity Similarity is estimated result and is carried out data training by amount model, obtains training result;Training result is carried out aggregative weighted process, obtains Take aggregative weighted result;Obtain Video similarity according to aggregative weighted result and comprehensively estimate result.
Further, the acquisition Video similarity during Fig. 3 is the embodiment of the present invention is comprehensively estimated the steps flow chart of result and is shown It is intended to, as it is shown on figure 3, this flow process includes:
S151: build Video similarity and comprehensively estimate model;
S152: comprehensively estimate model according to Video similarity and similarity appraisal result is carried out data training, obtain training Result;
S153: training result carries out aggregative weighted process, obtains aggregative weighted result;
S154: obtain Video similarity according to aggregative weighted result and comprehensively estimate result.
S151 is described further:
First build Adaboost algorithm, use Adaboost algorithm to carry out data training and prepare, with on [0,1] interval Real number value represents that the similarity degree of video, 0 expression differ completely, and 1 represents identical, and the biggest similarity of numerical value is the highest, right Training data is marked, and completes training data and prepares, and forms Video similarity and comprehensively estimates model.
S152 is described further:
Adaboost algorithm is utilized successively each Video similarity metric one weak recurrence learning algorithm of customization to be carried out Training, specifically includes the similarity of the comment of visual text message content, background document descriptor content and video, obtains training Result.
S153 is described further:
Correlation result after training is carried out aggregative weighted process, and its aggregative weighted is average weighted processing procedure, The weight of correlation result is all identical, obtains aggregative weighted result.
S154 is described further:
According to the size of aggregative weighted result, it is ranked up, determines the result that Video similarity is comprehensively estimated.
S16 is described further:
Comprehensively estimate result according to acquisition Video similarity to show;It is according to the comprehensive appraisal obtaining Video similarity Result sizes, is shown to user from high to low, facilitates user to check.
Fig. 4 is the system structure composition schematic diagram of the video frequency search system in the embodiment of the present invention, and as shown in Figure 4, this is System includes:
Matching module 11: for using the search key of user to match with the topic set of background subject term, obtain and retrieve The descriptor that key word is close;
Data obtaining module 12: for according to the descriptor close with search key, obtaining the video that descriptor is corresponding In video text message, background document subject information, video comments information;
Similarity measurement modules 13: for according to video text message, background document descriptor information, video comments information This Video similarity is estimated, obtains Video similarity and estimate result;
Aggregative weighted module 14: carry out aggregative weighted appraisal for this Video similarity is estimated result, obtain video phase Result is comprehensively estimated like property;
Display module 15: for being shown to this user according to the comprehensive appraisal result obtaining Video similarity by high to Low.
Preferably, this data obtaining module 12 includes:
Video acquisition unit: for carrying out video frequency searching according to the descriptor that search key is close, obtain topic word pair The video answered;
Information acquisition unit: for regarding according to video acquisition video text message, acquisition video background documentation & info, acquisition Frequently review information.
It should be noted that data obtaining module includes video acquisition unit and information acquisition unit, use video acquisition Unit carries out video frequency searching by the descriptor close with search key, obtains the video retrieved, and uses acquisition of information list These videos are processed by unit, obtain the video text message of these videos, video background documentation & info and video comments letter Breath.
Further, video acquisition unit carries out video frequency searching by the descriptor close with search key, obtains inspection The video that rope arrives, uses OCR technique and ASR technology to process these videos, the word of acquisition in information acquisition unit After information, these Word messages are carried out redundancy removal process, the Word message removing redundancy is merged, gets video Text message;According to the video text message got, this video text message is carried out KEA++ process, obtain video subject Word, uses these video subject words to carry out literature search, obtains background document information, these background document information are carried out KEA+ + and Lucene process, obtain background document descriptor information;After retrieval obtains video, use the mode that data reptile processes Video is processed, obtains the video comments information of this video.
Preferably, this information acquisition unit includes that data reptile processes subelement;
Data reptile processes subelement, for video carries out data reptile process, obtains video comments information.
Preferably, this similarity measurement modules 13 includes:
Text message estimates unit: for estimating the similarity of video according to video text message, obtains video Similarity estimates result;
Descriptor information estimates unit: for estimating the similarity of video according to background document descriptor information, Obtain Video similarity and estimate result;
Review information estimates unit: for estimating the similarity of video according to video comments information, obtains video Similarity estimates result.
It should be noted that use text message to estimate unit, the similarity of video is estimated by video text message Amount, obtains Video similarity and estimates result;Descriptor information is used to estimate unit to background document descriptor information to video Similarity is estimated, and obtains Video similarity and estimates result;Review information is used to estimate unit to video comments information to regarding The similarity of frequency is estimated, and obtains Video similarity and estimates result;Wherein, text message estimates unit, descriptor information is estimated Amount unit and review information are estimated the execution sequence of unit and are not limited, and can perform to be performed separately simultaneously.
Preferably, text information appraisal unit includes:
Character string estimates subelement: the similarity for the character string according to video text message carries out Video similarity and estimates Amount, obtains Video similarity based on character string and estimates result;
Subelement estimated in corpus: the similarity for the corpus according to visual text message carries out Video similarity and estimates Amount, obtains Video similarity based on corpus and estimates result;
Word content estimates subelement: it is similar that the similarity for the word content according to video text message carries out video Property estimate, obtains Video similarity based on word content appraisal result.
Enter it should be noted that use character string to estimate subelement according to the similarity of the character string of video text message Row Video similarity is estimated, and obtains Video similarity based on character string and estimates result;Corpus is used to estimate subelement root Carry out Video similarity appraisal according to the similarity of the corpus of visual text message, obtain Video similarity based on corpus and estimate Amount result;Use word content to estimate subelement and carry out Video similarity according to the similarity of the word content of video text message Estimate, obtain Video similarity based on word content and estimate result;Wherein, use character string to estimate subelement, corpus is estimated When quantum boxes and word content appraisal subelement carry out Video similarity appraisal, their appraisal order is unfixed, can Being to carry out or separately successively carry out simultaneously.
Preferably, descriptor information appraisal unit includes:
Subelement is estimated in set: estimate for carrying out Video similarity according to the set similarity of background document descriptor information Amount, obtains Video similarity based on set and estimates result;
Subelement estimated in vocabulary: estimates for carrying out Video similarity according to the Lexical Similarity of background document descriptor information Amount, obtains Video similarity based on vocabulary and estimates result.
Enter it should be noted that use set to estimate subelement according to the set similarity of background document descriptor information Row Video similarity is estimated, and obtains Video similarity based on set and estimates result;Vocabulary is used to estimate subelement according to the back of the body The Lexical Similarity of scape document subject word information carries out Video similarity appraisal, obtains Video similarity based on vocabulary and estimates knot Really, wherein, when using set appraisal subelement and vocabulary appraisal subelement to carry out Video similarity appraisal, their appraisal order It is unfixed, can be to carry out simultaneously or the most successively carry out.
Preferably, review information appraisal unit includes:
Review information processes subelement: for processing video comments information, obtains regarding in video comments information Frequently, the relation information of user, comment;
Estimate subelement: for video being carried out according to the video in video comments information, user, the relation information of comment Similarity is estimated, and obtains and estimates result based on video comments information similarity.
Preferably, aggregative weighted module 14 includes:
Construction unit 141: be used for building Video similarity and comprehensively estimate model;
Training unit 142: similarity appraisal result is carried out data instruction for comprehensively estimating model according to Video similarity Practice, obtain training result.
Weighting processing unit 143: for training result being carried out aggregative weighted process, obtain aggregative weighted result;
Comprehensive appraisal acquiring unit 144: comprehensively estimate knot for obtaining Video similarity according to aggregative weighted result Really.
It should be noted that use construction unit 141 to build Video similarity comprehensively estimate model;Use training unit 142 estimate model according to the Video similarity built carries out data training to Video similarity appraisal result, obtains training Result;Use weighting processing unit 143 that training result carries out aggregative weighted process, obtain aggregative weighted result;Use Comprehensive appraisal acquiring unit 144 obtains Video similarity according to aggregative weighted result and comprehensively estimates result.
Specifically, the operation principle of the system related functions module of the embodiment of the present invention can be found in the relevant of embodiment of the method Describe, repeat no more here.
In embodiments of the present invention, by the video in video library first being carried out pretreatment, obtain each in video library regarding Frequently background theme set, matches with video background theme set according to the search key of user, obtains descriptor, according to master Epigraph carries out Video similarity appraisal, comprehensively estimates result, root eventually through what the mode of aggregative weighted obtained Video similarity It is presented to user according to the comprehensive appraisal result of the Video similarity obtained by high to Low, completes the retrieval to video;Effective solution Certainly retrieve the inaccurate problem of content present in video frequency searching based on title, improve the experience sense of user.
One of ordinary skill in the art will appreciate that all or part of step in the various methods of above-described embodiment is can Completing instructing relevant hardware by program, this program can be stored in a computer-readable recording medium, storage Medium may include that read only memory (ROM, Read Only Memory), random access memory (RAM, Random Access Memory), disk or CD etc..
It addition, above a kind of based on video content the video retrieval method being provided the embodiment of the present invention and system are entered Having gone and be discussed in detail, principle and the embodiment of the present invention are set forth by specific case used herein, above enforcement The explanation of example is only intended to help to understand method and the core concept thereof of the present invention;General technology people simultaneously for this area Member, according to the thought of the present invention, the most all will change, in sum, and this explanation Book content should not be construed as limitation of the present invention.

Claims (10)

1. a video retrieval method based on video content, it is characterised in that described method includes:
The descriptor of the search key of user with the topic set of background subject term is mated, with search key in acquisition coupling Close descriptor;
Carry out video frequency searching according to the described descriptor close with search key, obtain the described master close with search key The video that epigraph is corresponding;
The video corresponding according to obtaining the described descriptor close with search key, obtains video essential information;
According to described video essential information, described Video similarity is estimated, obtain Video similarity and estimate result;
Described Video similarity is estimated result and carries out aggregative weighted appraisal, obtain Video similarity and comprehensively estimate result;
Comprehensively estimate result according to acquisition Video similarity to show.
Video retrieval method the most according to claim 1, it is characterised in that by the search key of user and background subject term Before the descriptor of topic set is mated, described method also includes:
Obtain caption information and the audio-frequency information of all videos in data base, described caption information and described audio-frequency information are converted It it is the first video text message;
By described first video text message is processed, obtain described video text message descriptor;
The background document information relevant to described video text message descriptor is obtained by described video text message descriptor, Described background document information is processed, obtains the first background document descriptor information;
According to described video text message descriptor and described first background document descriptor information, obtain background theme word set Close.
Video retrieval method the most according to claim 1, it is characterised in that described video essential information at least includes second Any one information in video text message, the second background document descriptor information and video comments information.
Video retrieval method the most according to claim 3, it is characterised in that the obtaining step of described video comments information, Including:
Described video is carried out data reptile process, obtains described video comments information.
Video retrieval method the most according to claim 1, it is characterised in that described regard described according to video essential information Frequently similarity is estimated, and obtains Video similarity and estimates the step of result, including:
According to described second video text message, the similarity of described video is estimated, obtain Video similarity and estimate knot Really;
According to described second background document descriptor information, the similarity of described video is estimated, obtain Video similarity and estimate Amount result;
According to described video comments information, the similarity of described video is estimated, obtain Video similarity and estimate result.
Video retrieval method the most according to claim 5, it is characterised in that described according to described second video text message The similarity of described video is estimated, obtains Video similarity and estimate the step of result, including:
The similarity of the character string according to described second video text message carries out Video similarity appraisal, obtains based on described word The Video similarity of symbol string estimates result;
The similarity of the corpus according to described second video text message carries out Video similarity appraisal, obtains based on institute's predicate The Video similarity in material storehouse estimates result;
The similarity of the word content according to described second video text message carries out Video similarity appraisal, obtains based on described The Video similarity of word content estimates result.
Video retrieval method the most according to claim 5, it is characterised in that described according to described second background document theme The similarity of described video is estimated by word information, obtains Video similarity and estimates the step of result, including:
Set similarity according to described second background document descriptor information carries out Video similarity appraisal, obtains based on described The Video similarity of set estimates result;
Lexical Similarity according to described second background document descriptor information carries out Video similarity appraisal, obtains based on described The Video similarity of vocabulary estimates result.
Video retrieval method the most according to claim 5, it is characterised in that described according to described video comments information to institute The similarity stating video is estimated, and obtains Video similarity and estimates the step of result, including:
Described video comments information is processed, obtains the video in video comments information, user, the relation information of comment;
According to the video in described video comments information, user, the relation information of comment, described video is carried out similarity appraisal, Obtain and estimate result based on described video comments information similarity.
Video retrieval method the most according to claim 1, it is characterised in that described to described Video similarity appraisal result Carry out aggregative weighted appraisal, obtain Video similarity and comprehensively estimate the step of result, including:
Build Video similarity and comprehensively estimate model;
Comprehensively estimate model according to described Video similarity and described similarity appraisal result is carried out data training, obtain training knot Really;
Described training result is carried out aggregative weighted process, obtains aggregative weighted result;
Obtain Video similarity according to aggregative weighted result and comprehensively estimate result.
10. a video frequency search system based on video content, it is characterised in that described system includes:
Matching module: for the descriptor of the search key of user with the topic set of background subject term being mated, obtain coupling In the descriptor close with search key;
Retrieval module: for carrying out video frequency searching according to the described descriptor close with search key, obtains described and retrieval The video that descriptor that key word is close is corresponding;
Data obtaining module: for the video corresponding according to the described descriptor close with search key, obtains video basic Information;
Similarity measurement modules: for estimating described Video similarity according to described video essential information, obtains video Similarity estimates result;
Aggregative weighted module: carry out aggregative weighted appraisal for described Video similarity is estimated result, obtain Video similarity Comprehensively estimate result;
Display module: show for comprehensively estimating result according to acquisition Video similarity.
CN201610458196.8A 2016-06-20 2016-06-20 A kind of video retrieval method based on video content and system Pending CN106126619A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201610458196.8A CN106126619A (en) 2016-06-20 2016-06-20 A kind of video retrieval method based on video content and system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201610458196.8A CN106126619A (en) 2016-06-20 2016-06-20 A kind of video retrieval method based on video content and system

Publications (1)

Publication Number Publication Date
CN106126619A true CN106126619A (en) 2016-11-16

Family

ID=57267814

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201610458196.8A Pending CN106126619A (en) 2016-06-20 2016-06-20 A kind of video retrieval method based on video content and system

Country Status (1)

Country Link
CN (1) CN106126619A (en)

Cited By (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106874443A (en) * 2017-02-09 2017-06-20 北京百家互联科技有限公司 Based on information query method and device that video text message is extracted
CN107066621A (en) * 2017-05-11 2017-08-18 腾讯科技(深圳)有限公司 A kind of search method of similar video, device and storage medium
CN107463552A (en) * 2017-07-20 2017-12-12 北京奇艺世纪科技有限公司 A kind of method and apparatus for generating video subject title
CN107798143A (en) * 2017-11-24 2018-03-13 珠海市魅族科技有限公司 A kind of information search method, device, terminal and readable storage medium storing program for executing
CN110555136A (en) * 2018-03-29 2019-12-10 优酷网络技术(北京)有限公司 Video tag generation method and device and computer storage medium
CN110602566A (en) * 2019-09-06 2019-12-20 Oppo广东移动通信有限公司 Matching method, terminal and readable storage medium
CN110598049A (en) * 2019-09-26 2019-12-20 北京字节跳动网络技术有限公司 Method, apparatus, electronic device and computer readable medium for retrieving video
CN111815645A (en) * 2020-06-23 2020-10-23 广州筷子信息科技有限公司 Method and system for cutting advertisement video picture
CN112749599A (en) * 2019-10-31 2021-05-04 北京金山云网络技术有限公司 Image enhancement method and device and server
CN112995642A (en) * 2021-02-04 2021-06-18 深圳小豆视觉科技有限公司 Naked eye 3D dual-channel data processing working method based on intelligent mobile handheld terminal
CN113688283A (en) * 2021-08-27 2021-11-23 北京奇艺世纪科技有限公司 Method and device for determining matching degree of video subtitles and electronic equipment
CN114173191A (en) * 2021-12-09 2022-03-11 上海开放大学 Multi-language question answering method and system based on artificial intelligence
CN115150660A (en) * 2022-06-09 2022-10-04 深圳市大头兄弟科技有限公司 Video editing method based on subtitles and related equipment
WO2023016175A1 (en) * 2021-08-09 2023-02-16 北京字跳网络技术有限公司 Method and apparatus for implementing search, and device

Citations (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1461142A (en) * 2003-06-30 2003-12-10 北京大学计算机科学技术研究所 Video segment searching method based on contents
CN101281520A (en) * 2007-04-05 2008-10-08 中国科学院自动化研究所 Interactive physical training video search method based on non-supervision learning and semantic matching characteristic
CN101404032A (en) * 2008-11-11 2009-04-08 清华大学 Video retrieval method and system based on contents
CN101719144A (en) * 2009-11-04 2010-06-02 中国科学院声学研究所 Method for segmenting and indexing scenes by combining captions and video image information
CN101739416A (en) * 2008-11-04 2010-06-16 未序网络科技(上海)有限公司 Method for sequencing multi-index comprehensive weight video
CN103761284A (en) * 2014-01-13 2014-04-30 中国农业大学 Video retrieval method and video retrieval system
CN103761345A (en) * 2014-02-27 2014-04-30 苏州千视通信科技有限公司 Video retrieval method based on OCR character recognition technology
CN103984772A (en) * 2014-06-04 2014-08-13 百度在线网络技术(北京)有限公司 Method and device for generating text retrieval subtitle library and video retrieval method and device
CN104683815A (en) * 2014-11-19 2015-06-03 西安交通大学 H.264 compressed domain video retrieval method based on content
CN104798068A (en) * 2012-11-30 2015-07-22 汤姆逊许可公司 Method and apparatus for video retrieval
CN105654125A (en) * 2015-12-29 2016-06-08 山东大学 Method for calculating video similarity

Patent Citations (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1461142A (en) * 2003-06-30 2003-12-10 北京大学计算机科学技术研究所 Video segment searching method based on contents
CN101281520A (en) * 2007-04-05 2008-10-08 中国科学院自动化研究所 Interactive physical training video search method based on non-supervision learning and semantic matching characteristic
CN101739416A (en) * 2008-11-04 2010-06-16 未序网络科技(上海)有限公司 Method for sequencing multi-index comprehensive weight video
CN101404032A (en) * 2008-11-11 2009-04-08 清华大学 Video retrieval method and system based on contents
CN101719144A (en) * 2009-11-04 2010-06-02 中国科学院声学研究所 Method for segmenting and indexing scenes by combining captions and video image information
CN104798068A (en) * 2012-11-30 2015-07-22 汤姆逊许可公司 Method and apparatus for video retrieval
CN103761284A (en) * 2014-01-13 2014-04-30 中国农业大学 Video retrieval method and video retrieval system
CN103761345A (en) * 2014-02-27 2014-04-30 苏州千视通信科技有限公司 Video retrieval method based on OCR character recognition technology
CN103984772A (en) * 2014-06-04 2014-08-13 百度在线网络技术(北京)有限公司 Method and device for generating text retrieval subtitle library and video retrieval method and device
CN104683815A (en) * 2014-11-19 2015-06-03 西安交通大学 H.264 compressed domain video retrieval method based on content
CN105654125A (en) * 2015-12-29 2016-06-08 山东大学 Method for calculating video similarity

Cited By (21)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106874443A (en) * 2017-02-09 2017-06-20 北京百家互联科技有限公司 Based on information query method and device that video text message is extracted
CN107066621A (en) * 2017-05-11 2017-08-18 腾讯科技(深圳)有限公司 A kind of search method of similar video, device and storage medium
CN107463552A (en) * 2017-07-20 2017-12-12 北京奇艺世纪科技有限公司 A kind of method and apparatus for generating video subject title
CN107798143A (en) * 2017-11-24 2018-03-13 珠海市魅族科技有限公司 A kind of information search method, device, terminal and readable storage medium storing program for executing
CN110555136A (en) * 2018-03-29 2019-12-10 优酷网络技术(北京)有限公司 Video tag generation method and device and computer storage medium
CN110555136B (en) * 2018-03-29 2022-07-08 阿里巴巴(中国)有限公司 Video tag generation method and device and computer storage medium
CN110602566B (en) * 2019-09-06 2021-10-01 Oppo广东移动通信有限公司 Matching method, terminal and readable storage medium
CN110602566A (en) * 2019-09-06 2019-12-20 Oppo广东移动通信有限公司 Matching method, terminal and readable storage medium
US11984140B2 (en) 2019-09-06 2024-05-14 Guangdong Oppo Mobile Telecommunications Corp., Ltd. Matching method, terminal and readable storage medium
CN110598049A (en) * 2019-09-26 2019-12-20 北京字节跳动网络技术有限公司 Method, apparatus, electronic device and computer readable medium for retrieving video
CN112749599A (en) * 2019-10-31 2021-05-04 北京金山云网络技术有限公司 Image enhancement method and device and server
CN111815645A (en) * 2020-06-23 2020-10-23 广州筷子信息科技有限公司 Method and system for cutting advertisement video picture
CN112995642A (en) * 2021-02-04 2021-06-18 深圳小豆视觉科技有限公司 Naked eye 3D dual-channel data processing working method based on intelligent mobile handheld terminal
CN112995642B (en) * 2021-02-04 2023-02-24 深圳小豆视觉科技有限公司 Naked eye 3D dual-channel data processing working method based on intelligent mobile handheld terminal
WO2023016175A1 (en) * 2021-08-09 2023-02-16 北京字跳网络技术有限公司 Method and apparatus for implementing search, and device
CN113688283A (en) * 2021-08-27 2021-11-23 北京奇艺世纪科技有限公司 Method and device for determining matching degree of video subtitles and electronic equipment
CN113688283B (en) * 2021-08-27 2023-09-05 北京奇艺世纪科技有限公司 Method and device for determining video subtitle matching degree and electronic equipment
CN114173191A (en) * 2021-12-09 2022-03-11 上海开放大学 Multi-language question answering method and system based on artificial intelligence
CN114173191B (en) * 2021-12-09 2024-03-19 上海开放大学 Multi-language answering method and system based on artificial intelligence
CN115150660A (en) * 2022-06-09 2022-10-04 深圳市大头兄弟科技有限公司 Video editing method based on subtitles and related equipment
CN115150660B (en) * 2022-06-09 2024-05-10 深圳市闪剪智能科技有限公司 Video editing method based on subtitles and related equipment

Similar Documents

Publication Publication Date Title
CN106126619A (en) A kind of video retrieval method based on video content and system
CN106874378B (en) Method for constructing knowledge graph based on entity extraction and relation mining of rule model
CN104765769B (en) The short text query expansion and search method of a kind of word-based vector
CN105528437B (en) A kind of question answering system construction method extracted based on structured text knowledge
CN104268160A (en) Evaluation object extraction method based on domain dictionary and semantic roles
CN114065758B (en) Document keyword extraction method based on hypergraph random walk
CN110134792B (en) Text recognition method and device, electronic equipment and storage medium
CN107895303B (en) Personalized recommendation method based on OCEAN model
CN112559684A (en) Keyword extraction and information retrieval method
CN107480200A (en) Word mask method, device, server and the storage medium of word-based label
CN112328800A (en) System and method for automatically generating programming specification question answers
CN109522396B (en) Knowledge processing method and system for national defense science and technology field
US11227183B1 (en) Section segmentation based information retrieval with entity expansion
Tiwari et al. Ensemble approach for twitter sentiment analysis
Shekhawat Sentiment classification of current public opinion on brexit: Naïve Bayes classifier model vs Python’s Textblob approach
CN115757819A (en) Method and device for acquiring information of quoting legal articles in referee document
Sheikh et al. Document level semantic context for retrieving OOV proper names
CN113761128A (en) Event key information extraction method combining domain synonym dictionary and pattern matching
Ye et al. Syntactic word embedding based on dependency syntax and polysemous analysis
Ning Natural Language Processing Technology Used in Artificial Intelligence Scene of Law for Human Behavior
Al-Anzi et al. A micro-word based approach for arabic sentiment analysis
Hua et al. A character-level method for text classification
CN104077419B (en) With reference to semantic method for reordering is retrieved with the long query image of visual information
Al-Sultany et al. Enriching tweets for topic modeling via linking to the wikipedia
Liu et al. Product related information sentiment-content analysis based on convolutional neural networks for the Chinese micro-blog

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20161116