CN103678694A - Method and system for establishing reverse index file of video resources - Google Patents

Info

Publication number
CN103678694A
Authority
CN
China
Prior art keywords
keyword
information
file
video
dictionary
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201310739955.4A
Other languages
Chinese (zh)
Inventor
曹坤波
郑磊
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
LeTV Cloud Computing Co Ltd
Original Assignee
LeTV Information Technology Beijing Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by LeTV Information Technology Beijing Co Ltd filed Critical LeTV Information Technology Beijing Co Ltd
Priority to CN201310739955.4A priority Critical patent/CN103678694A/en
Publication of CN103678694A publication Critical patent/CN103678694A/en
Priority to PCT/CN2014/093176 priority patent/WO2015096609A1/en
Priority to US15/101,698 priority patent/US20160306811A1/en
Pending legal-status Critical Current

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/31Indexing; Data structures therefor; Storage structures
    • G06F16/316Indexing structures
    • G06F16/319Inverted lists
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/70Information retrieval; Database structures therefor; File system structures therefor of video data
    • G06F16/78Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • G06F16/7867Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using information manually generated, e.g. tags, keywords, comments, title and artist information, manually generated time, location and usage information, user ratings

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Software Systems (AREA)
  • Library & Information Science (AREA)
  • Multimedia (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention discloses a method and system for establishing a reverse index file of video resources. The method includes the first step of carrying out word segmentation processing on video file information in a preset word segmentation mode to obtain keywords, the second step of establishing an index relation between the keywords and the video file information containing the keywords so as to establish the reverse index file of a video file. According to the method and system for establishing the reverse index file of the video resources, index efficiency on mass video data can be improved.

Description

Method and system for establishing an inverted index file of video resources
Technical Field
The present invention relates to information retrieval technology, and in particular to a method and system for establishing an inverted index file of video resources.
Background
With the development of science and technology, more and more users search for and watch videos over the Internet. Because the video information available on the Internet is abundant and is constantly changing and being updated, a variety of search engines have emerged to retrieve video information.
In a relational database system, an index is the most efficient way to retrieve data. For a video search engine covering the whole web, however, a relational database cannot meet the following special requirements:
(1) A search engine faces the massive video data of the whole web. The search engine indexes of large video websites such as LeTV, for example, cover hundreds of millions or even several hundred billion web pages, and a database system can hardly manage video data on this scale effectively.
(2) The data operations a search engine uses are simple. Generally only adding, deleting, modifying, and querying are needed, and the data has a specific format, so a simple and efficient application can be designed for these operations. A general-purpose database system supports a large, comprehensive feature set at the cost of speed and space.
(3) A search engine faces a large volume of user queries, which requires that as much of the computationally heavy work as possible be completed when the index is built, so that retrieval involves as little computation as possible. A general-purpose database system can hardly bear such a volume of user requests and cannot meet the requirements on retrieval response time and retrieval concurrency.
In summary, prior-art indexing schemes for massive video information cannot meet the requirements on data volume, response time, and efficiency, so an improved technical solution is needed to solve these problems.
Summary of the Invention
The main purpose of the present invention is to provide a method and system for establishing an inverted index file of video resources, so as to solve the prior-art problem that searching massive data is slow and inefficient.
According to one aspect of the present invention, a method for establishing an inverted index file of video resources is provided, comprising: performing word segmentation on video file information in a preset segmentation mode to obtain keywords; and establishing an index relationship between the keywords and the video file information containing them, thereby establishing the inverted index file of the video files.
The method further comprises providing a dictionary whose data sources comprise a basic dictionary, a video copyright dictionary, and user-generated content. The step of performing word segmentation on the video file information in a preset segmentation mode then comprises performing the word segmentation according to the dictionary and in the preset segmentation mode.
The segmentation modes comprise bigram segmentation, maximum matching, and statistical methods.
The step of establishing the index relationship between the keywords and the video file information containing them comprises: recording and storing index information for each keyword, the index information comprising the identifier of each video file containing the keyword, the positions at which the keyword occurs, and the frequency with which the keyword occurs; and establishing an association between each keyword and its index information.
The method further comprises collecting statistics on the retrieval results obtained from the inverted index file, and moving keywords whose search frequency exceeds a set threshold to the beginning of the inverted index file.
According to another aspect of the present invention, a system for establishing an inverted index file is also provided, comprising: a keyword acquisition module, configured to perform word segmentation on video file information in a preset segmentation mode to obtain keywords; and an inverted index establishing module, configured to establish an index relationship between the keywords and the video file information containing them, thereby establishing the inverted index file.
The system further comprises a dictionary maintenance module, configured to establish and maintain a dictionary whose data sources comprise a basic dictionary, a video copyright dictionary, and user-generated content. The keyword acquisition module performs word segmentation on the video file information according to the dictionary and in the preset segmentation mode.
The segmentation modes comprise bigram segmentation, maximum matching, and statistical methods.
The inverted index establishing module comprises: a recording module, configured to record and store index information for each keyword, the index information comprising the identifier of each video file containing the keyword, the positions at which the keyword occurs, and the frequency with which the keyword occurs; and an association establishing module, configured to establish an association between each keyword and its index information.
The system further comprises: a retrieval result statistics module, configured to collect statistics on the retrieval results obtained from the inverted index file; and a processing module, configured to move keywords whose search frequency exceeds a set threshold to the beginning of the inverted index file.
According to the technical solution of the present invention, keywords are obtained by performing word segmentation on video file information, and an index relationship is established between the keywords and the video file information containing them, thereby establishing an inverted index file. When a user searches for video files by keyword, the corresponding information can be provided quickly and accurately.
Brief Description of the Drawings
The accompanying drawings described here provide a further understanding of the present invention and form a part of this application. The illustrative embodiments of the present invention and their descriptions are used to explain the present invention and do not unduly limit it. In the drawings:
Fig. 1 is a flowchart of a method for establishing an inverted index file according to an embodiment of the present invention;
Fig. 2 is a structural block diagram of a system for establishing an inverted index file according to an embodiment of the present invention;
Fig. 3 is a structural block diagram of a system for establishing an inverted index file according to another embodiment of the present invention.
Detailed Description
An ordinary index is a forward index, which determines attribute values from a record. An inverted index instead determines the location of records from an attribute value, hence the name. The present invention is intended for the storage and retrieval of video resources on video websites that hold massive video resources. It builds an inverted index from words to documents over the documents of the whole web (video files on the Internet), so that when a user queries documents (web pages) with a keyword, the system returns the documents (web pages) that contain that keyword.
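The difference between the two index directions can be sketched in a few lines. This is only an illustration: the toy data and the names `forward_index` and `invert` are assumptions introduced here, not part of the patent.

```python
from collections import defaultdict

# Forward index: document ID -> words it contains (hypothetical toy data).
forward_index = {
    1: ["tom", "live", "guangzhou"],
    2: ["he", "live", "shanghai"],
}


def invert(forward):
    """Turn a forward index (doc -> words) into an inverted one (word -> docs)."""
    inverted = defaultdict(list)
    for doc_id in sorted(forward):
        for word in forward[doc_id]:
            # Record each document at most once per word.
            if doc_id not in inverted[word]:
                inverted[word].append(doc_id)
    return dict(inverted)


inverted_index = invert(forward_index)
print(inverted_index["live"])
```

Looking up a keyword in `inverted_index` yields the documents that contain it directly, without scanning every document.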
To make the objects, technical solutions, and advantages of the present invention clearer, the present invention is described in further detail below with reference to the drawings and specific embodiments.
According to an embodiment of the present invention, a method for establishing an inverted index file of video resources is provided. Fig. 1 is a flowchart of this method, which comprises the following steps (steps S102 to S104):
Step S102: perform word segmentation on the video file information in a preset segmentation mode to obtain keywords.
Video file information refers to the textual information a video file carries, such as its title, description, and synopsis. Word segmentation yields the keywords of this information. In general, word segmentation recombines a continuous sequence of characters into a sequence of words according to some standard. Its purpose is to analyze each document and extract the words that users are likely to query.
Depending on the language of the video file information, word segmentation can be broadly divided into Chinese word segmentation and foreign-language word segmentation (English is used as the representative below). English uses spaces as natural separators, so words can be distinguished by spaces alone. After some redundant words (such as "a" and "the") are removed, segmentation is complete, as the following example shows.
Suppose there are two files, 1 and 2. The content of file 1 is "Tom lives in Guangzhou, I live in Guangzhou too." After word segmentation, the keywords of file 1 are: [tom] [live] [guangzhou] [i] [live] [guangzhou].
The content of file 2 is "He once lived in Shanghai." After word segmentation, the keywords of file 2 are: [he] [live] [shanghai].
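The English case for the two example files can be sketched as follows. The stop-word list and the lemma table are assumptions sized to this example only (the text names just "a" and "the" as redundant words), and `tokenize_english` is a hypothetical helper name.

```python
import re

# Hypothetical stop-word list and lemma table, just large enough for the
# two example files; a real system would use fuller resources.
STOP_WORDS = {"a", "the", "in", "too", "once"}
LEMMAS = {"lives": "live", "lived": "live"}


def tokenize_english(text):
    """Lowercase, split on non-letters, drop stop words, map to base forms."""
    words = re.findall(r"[a-z]+", text.lower())
    return [LEMMAS.get(w, w) for w in words if w not in STOP_WORDS]


file1 = "Tom lives in Guangzhou, I live in Guangzhou too."
file2 = "He once lived in Shanghai."

print(tokenize_english(file1))
print(tokenize_english(file2))
```

Under these assumptions the output matches the keyword lists given above for file 1 and file 2.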
Chinese word segmentation is more complex than English word segmentation, because Chinese words are not separated by any obvious delimiter. The present invention therefore introduces a dictionary for word segmentation. In practice, the data sources of the dictionary include, but are not limited to: a basic dictionary, a video copyright dictionary, and user-generated content (UGC). The basic dictionary comprises various general-purpose dictionaries and lexicons, but the titles of video files do not strictly match dictionary entries, so a video copyright dictionary is also needed. The video copyright dictionary is built from information about copyrighted video resources and can meet the needs of segmenting video file information. UGC is original content generated or provided by users, and it supplements the dictionary with new words used on the Internet. With these dictionaries working together and complementing one another, word segmentation can yield fairly good keywords.
In addition, because of the complexity of the Chinese language, segmentation algorithms such as bigram segmentation, maximum matching, and statistical methods are also needed to resolve the ambiguities that arise during segmentation. Bigram segmentation cuts a title into segments of length 2, so that a title of length n (n characters) is split into n-1 two-character words, each of which shares one character with the next. Maximum matching includes forward maximum matching, backward maximum matching, and so on, which are not described further here.
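Bigram segmentation and forward maximum matching can be sketched as below. The sample title, the dictionary contents, and the `max_len` limit are assumptions chosen for illustration; the patent does not specify them.

```python
def bigram_segment(title):
    """Split a title of n characters into n-1 overlapping two-character words."""
    return [title[i:i + 2] for i in range(len(title) - 1)]


def forward_max_match(text, dictionary, max_len=4):
    """Greedy forward maximum matching: at each position take the longest
    dictionary word, falling back to a single character if none matches."""
    words, i = [], 0
    while i < len(text):
        for size in range(min(max_len, len(text) - i), 0, -1):
            candidate = text[i:i + size]
            if size == 1 or candidate in dictionary:
                words.append(candidate)
                i += size
                break
    return words


# Hypothetical title ("video search engine") and dictionary.
title = "视频搜索引擎"
dictionary = {"视频", "搜索", "引擎", "索引", "搜索引擎"}

print(bigram_segment(title))
print(forward_max_match(title, dictionary))
```

For this six-character title, bigram segmentation produces five overlapping words, including the spurious "频搜", while forward maximum matching prefers the longest dictionary entry at each position.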
Preferably, after the video file information has been segmented by a method such as bigram segmentation, maximum matching, or a statistical method, the words produced by the segmentation are checked against the dictionary to determine whether they are accurate.
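A minimal sketch of this verification step, assuming the dictionary is simply the union of the three sources and that a candidate counts as accurate when it appears there. All set contents and the name `verify_words` are hypothetical.

```python
# Hypothetical contents for the three dictionary sources named in the text.
BASIC_WORDS = {"视频", "搜索", "引擎"}
COPYRIGHT_TITLES = {"搜索引擎"}
UGC_WORDS = {"索引"}

# Assumption: the working dictionary is the union of all sources.
DICTIONARY = BASIC_WORDS | COPYRIGHT_TITLES | UGC_WORDS


def verify_words(candidates, dictionary):
    """Split candidate words into those found in the dictionary and the rest."""
    accepted = [w for w in candidates if w in dictionary]
    rejected = [w for w in candidates if w not in dictionary]
    return accepted, rejected


# Candidates as a bigram pass over "视频搜索引擎" would produce them.
accepted, rejected = verify_words(["视频", "频搜", "搜索", "索引", "引擎"], DICTIONARY)
print(accepted)
print(rejected)
```

Here the check discards "频搜", a bigram that straddles two real words, and keeps the rest as keywords.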
Step S104: establish an index relationship between the keywords and the video file information containing them, thereby establishing the inverted index file of the video files.
After word segmentation has produced the keywords, each keyword is stored in the inverted index file together with the identifier (ID) of the corresponding file. Once all files have been analyzed, the keywords are sorted and merged in keyword order, and the frequency with which each keyword occurs in each file is counted. The index file may also contain other index information, for example: the document count, which indicates how many files a keyword occurs in; the total frequency, which indicates how many times a keyword occurs across all files; and the frequency, which indicates how many times a keyword occurs in a single file. In this way an association is established between each keyword and its index information.
Continuing the example above, Table 1 shows each keyword and its corresponding index information. That is, each keyword together with its "frequency of occurrence" and "position of occurrence" information forms the final index structure.
Table 1
Keyword     Document number [frequency of occurrence]   Position of occurrence
guangzhou   1[2]                                        3,6
he          2[1]                                        1
i           1[1]                                        4
live        1[2],2[1]                                   2,5,2
shanghai    2[1]                                        3
tom         1[1]                                        1
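The structure in Table 1 can be reproduced with a short sketch. The in-memory layout (keyword mapped to a per-document list of 1-based positions) and the function names are assumptions made for illustration; the patent does not prescribe a storage format.

```python
from collections import defaultdict


def build_inverted_index(docs):
    """docs maps a document ID to its keyword list (after segmentation).
    Returns keyword -> {doc_id: [1-based positions]}, sorted by keyword."""
    index = defaultdict(dict)
    for doc_id, keywords in docs.items():
        for position, keyword in enumerate(keywords, start=1):
            index[keyword].setdefault(doc_id, []).append(position)
    return dict(sorted(index.items()))


def keyword_stats(postings):
    """Document count and total frequency for one keyword's postings."""
    return {
        "doc_count": len(postings),
        "total_freq": sum(len(p) for p in postings.values()),
    }


docs = {
    1: ["tom", "live", "guangzhou", "i", "live", "guangzhou"],
    2: ["he", "live", "shanghai"],
}
index = build_inverted_index(docs)

# Print one row per keyword in the same layout as Table 1.
for keyword, postings in index.items():
    entries = ",".join(f"{d}[{len(p)}]" for d, p in postings.items())
    positions = ",".join(str(x) for p in postings.values() for x in p)
    print(keyword, entries, positions)
```

The per-file frequency is the length of each position list, so it does not need to be stored separately in this sketch; `keyword_stats` derives the document count and total frequency mentioned in the text.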
Once the inverted index file has been established according to the above embodiment, a user enters a query condition, the inverted index file is scanned to obtain a set of candidate files, and video files are output according to given requirements. This achieves fast and accurate retrieval of video resources and meets the storage and retrieval requirements of massive video resources.
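A query against such an index might look like the sketch below. The text does not say how multiple keywords are combined or how results are ordered, so the AND semantics and the ordering by document ID are assumptions, as is the sample index.

```python
def search(index, query_keywords):
    """Return the sorted IDs of documents that contain every query keyword."""
    candidate_sets = [set(index.get(k, {})) for k in query_keywords]
    if not candidate_sets:
        return []
    # Assumption: a document must match all keywords (AND semantics).
    return sorted(set.intersection(*candidate_sets))


# Hypothetical index: keyword -> {doc_id: [positions]}.
index = {
    "guangzhou": {1: [3, 6]},
    "live": {1: [2, 5], 2: [2]},
    "shanghai": {2: [3]},
}

print(search(index, ["live"]))
print(search(index, ["live", "shanghai"]))
```

Each keyword lookup touches only that keyword's entry, which is why the heavy work can be done at index-building time rather than at query time.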
In practice, searches for video resources tend to come in bursts. For example, when a popular video (such as a film, TV series, or variety show) is released or a hot event (such as a news event) occurs, a large number of search requests arrive within a short time. In this case, statistics are collected on the retrieval results obtained from the inverted index file, and keywords whose search frequency exceeds a set threshold are moved to the beginning of the inverted index file to improve retrieval efficiency.
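The reordering can be sketched as follows, treating the index file as an ordered list of entries. The entry layout, the threshold value, and the choice to keep the relative order within the hot and non-hot groups are assumptions not stated in the text.

```python
from collections import Counter


def promote_hot_keywords(entries, search_counts, threshold):
    """entries: ordered (keyword, postings) pairs as laid out in the index file.
    Keywords searched more than `threshold` times are moved to the front;
    relative order inside each group is preserved."""
    hot = [e for e in entries if search_counts[e[0]] > threshold]
    rest = [e for e in entries if search_counts[e[0]] <= threshold]
    return hot + rest


# Hypothetical index entries and search log.
entries = [
    ("guangzhou", {1: [3, 6]}),
    ("live", {1: [2, 5], 2: [2]}),
    ("shanghai", {2: [3]}),
]
search_counts = Counter(["shanghai"] * 5 + ["live"] * 2)

reordered = promote_hot_keywords(entries, search_counts, threshold=3)
print([keyword for keyword, _ in reordered])
```

With a threshold of 3, only "shanghai" qualifies as hot, so a sequential scan of the file reaches it first.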
According to an embodiment of the present invention, a system for establishing an inverted index file is also provided. As shown in Fig. 2, the system comprises at least a keyword acquisition module 10 and an inverted index establishing module 20. The structure and connections of each module are described in detail below.
The keyword acquisition module 10 is configured to perform word segmentation on video file information in a preset segmentation mode to obtain keywords. The segmentation modes include, but are not limited to, bigram segmentation, maximum matching, and statistical methods.
The inverted index establishing module 20 is coupled to the keyword acquisition module 10 and is configured to establish an index relationship between the keywords and the video file information containing them, thereby establishing the inverted index file.
Referring to Fig. 3, in one embodiment of the present invention the system further comprises a dictionary maintenance module 30, configured to establish and maintain a dictionary whose data sources include, but are not limited to, a basic dictionary, a video copyright dictionary, and user-generated content.
On this basis, the keyword acquisition module 10 performs word segmentation on the video file information according to the dictionary and in the preset segmentation mode.
Still referring to Fig. 3, the inverted index establishing module 20 further comprises a recording module 210 and an association establishing module 220, wherein:
The recording module 210 is configured to record and store index information for each keyword, the index information comprising the identifier of each video file containing the keyword, the positions at which the keyword occurs, and the frequency with which the keyword occurs. The association establishing module 220 is coupled to the recording module 210 and is configured to establish an association between each keyword and its index information.
In one embodiment of the present invention, the system for establishing an inverted index file further comprises: a retrieval result statistics module (not shown), configured to collect statistics on the retrieval results obtained from the inverted index file; and a processing module (not shown), configured to move keywords whose search frequency exceeds a set threshold to the beginning of the inverted index file, thereby improving retrieval efficiency.
The operating steps of the method of the present invention correspond to the structural features of the system and may be cross-referenced, so they are not repeated one by one.
In summary, according to the technical solution of the present invention, keywords are obtained by performing word segmentation on video file information, and an index relationship is established between the keywords and the video file information containing them, thereby establishing an inverted index file. When a user searches for video files by keyword, the corresponding information can be provided quickly and accurately.
The foregoing describes only embodiments of the present invention and is not intended to limit it. Those skilled in the art may make various modifications and variations to the present invention. Any modification, equivalent replacement, or improvement made within the spirit and principles of the present invention shall fall within the scope of its claims.

Claims (10)

1. A method for establishing an inverted index file of video resources, characterized by comprising:
performing word segmentation on video file information in a preset segmentation mode to obtain keywords;
establishing an index relationship between the keywords and the video file information containing the keywords, thereby establishing the inverted index file of the video files.
2. The method according to claim 1, characterized by further comprising:
providing a dictionary, the data sources of the dictionary comprising: a basic dictionary, a video copyright dictionary, and user-generated content;
wherein the step of performing word segmentation on the video file information in a preset segmentation mode comprises: performing word segmentation on the video file information according to the dictionary and in the preset segmentation mode.
3. The method according to claim 1 or 2, characterized in that the segmentation modes comprise: bigram segmentation, maximum matching, and statistical methods.
4. The method according to claim 1, characterized in that the step of establishing the index relationship between the keywords and the video file information containing the keywords comprises:
recording and storing index information for each keyword, the index information comprising: the identifier of each video file containing the keyword, the positions at which the keyword occurs, and the frequency with which the keyword occurs;
establishing an association between each keyword and its index information.
5. The method according to claim 1, characterized by further comprising:
collecting statistics on the retrieval results obtained from the inverted index file, and moving keywords whose search frequency exceeds a set threshold to the beginning of the inverted index file.
6. A system for establishing an inverted index file, characterized by comprising:
a keyword acquisition module, configured to perform word segmentation on video file information in a preset segmentation mode to obtain keywords;
an inverted index establishing module, configured to establish an index relationship between the keywords and the video file information containing the keywords, thereby establishing the inverted index file.
7. The system according to claim 6, characterized by further comprising:
a dictionary maintenance module, configured to establish and maintain a dictionary, the data sources of the dictionary comprising: a basic dictionary, a video copyright dictionary, and user-generated content;
wherein the keyword acquisition module performs word segmentation on the video file information according to the dictionary and in the preset segmentation mode.
8. The system according to claim 6 or 7, characterized in that the segmentation modes comprise: bigram segmentation, maximum matching, and statistical methods.
9. The system according to claim 6, characterized in that the inverted index establishing module comprises:
a recording module, configured to record and store index information for each keyword, the index information comprising: the identifier of each video file containing the keyword, the positions at which the keyword occurs, and the frequency with which the keyword occurs;
an association establishing module, configured to establish an association between each keyword and its index information.
10. The system according to claim 6, characterized by further comprising:
a retrieval result statistics module, configured to collect statistics on the retrieval results obtained from the inverted index file;
a processing module, configured to move keywords whose search frequency exceeds a set threshold to the beginning of the inverted index file.
CN201310739955.4A 2013-12-26 2013-12-26 Method and system for establishing reverse index file of video resources Pending CN103678694A (en)

Priority Applications (3)

Application Number Priority Date Filing Date Title
CN201310739955.4A CN103678694A (en) 2013-12-26 2013-12-26 Method and system for establishing reverse index file of video resources
PCT/CN2014/093176 WO2015096609A1 (en) 2013-12-26 2014-12-05 Method and system for creating inverted index file of video resource
US15/101,698 US20160306811A1 (en) 2013-12-26 2014-12-05 Method and system for creating inverted index file of video resource

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201310739955.4A CN103678694A (en) 2013-12-26 2013-12-26 Method and system for establishing reverse index file of video resources

Publications (1)

Publication Number Publication Date
CN103678694A true CN103678694A (en) 2014-03-26

Family

ID=50316238

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201310739955.4A Pending CN103678694A (en) 2013-12-26 2013-12-26 Method and system for establishing reverse index file of video resources

Country Status (1)

Country Link
CN (1) CN103678694A (en)

Cited By (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2015096609A1 (en) * 2013-12-26 2015-07-02 乐视网信息技术(北京)股份有限公司 Method and system for creating inverted index file of video resource
CN104933120A (en) * 2015-06-04 2015-09-23 无锡天脉聚源传媒科技有限公司 Keyword setting method and device for video album
CN104978402A (en) * 2015-06-04 2015-10-14 无锡天脉聚源传媒科技有限公司 Keyword setting method and apparatus of video album
CN104978401A (en) * 2015-06-04 2015-10-14 无锡天脉聚源传媒科技有限公司 Keyword setting method and apparatus of video album
CN105005576A (en) * 2015-03-27 2015-10-28 合一信息技术(北京)有限公司 System and method for searching similar users of video website
CN106156155A (en) * 2015-04-15 2016-11-23 厦门简帛信息科技有限公司 A kind of method and system that e-book resource is provided
CN106874443A (en) * 2017-02-09 2017-06-20 北京百家互联科技有限公司 Based on information query method and device that video text message is extracted
CN107704628A (en) * 2017-10-31 2018-02-16 福建中金在线信息科技有限公司 Data retrieval method, index relative method for building up and server
WO2018113673A1 (en) * 2016-12-23 2018-06-28 北京奇虎科技有限公司 Method and apparatus for pushing search result of variety show query
CN109299466A (en) * 2018-10-22 2019-02-01 中国船舶工业综合技术经济研究院 A kind of document retrieval method and system towards science and techniques of defence field
CN109783444A (en) * 2018-12-26 2019-05-21 亚信科技(中国)有限公司 Multichannel file index method, device, computer equipment and storage medium
CN110825913A (en) * 2019-09-03 2020-02-21 上海擎测机电工程技术有限公司 Professional word extraction and part-of-speech tagging method
CN112541115A (en) * 2020-12-02 2021-03-23 创盛视联数码科技(北京)有限公司 Method for recommending teaching video, electronic equipment and computer readable medium
CN114707007A (en) * 2022-06-07 2022-07-05 苏州大学 Image text retrieval method and device and computer storage medium

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20080059420A1 (en) * 2006-08-22 2008-03-06 International Business Machines Corporation System and Method for Providing a Trustworthy Inverted Index to Enable Searching of Records
CN102201001A (en) * 2011-04-29 2011-09-28 西安交通大学 Fast retrieval method based on inverted technology
CN103428525A (en) * 2013-07-22 2013-12-04 华中科技大学 Online inquiry and play control method and system for network videos and television programs

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20080059420A1 (en) * 2006-08-22 2008-03-06 International Business Machines Corporation System and Method for Providing a Trustworthy Inverted Index to Enable Searching of Records
CN102201001A (en) * 2011-04-29 2011-09-28 西安交通大学 Fast retrieval method based on inverted technology
CN103428525A (en) * 2013-07-22 2013-12-04 华中科技大学 Online inquiry and play control method and system for network videos and television programs

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
匡振国 等: "一种基于Lucene的影片搜索引擎的研究和应用", 《计算机工程与应用》 *
郑榕增 等: "基于Lucene 的中文倒排索引技术的研究", 《计算机技术与发展》 *

Cited By (18)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2015096609A1 (en) * 2013-12-26 2015-07-02 乐视网信息技术(北京)股份有限公司 Method and system for creating inverted index file of video resource
CN105005576B (en) * 2015-03-27 2018-03-09 合一信息技术(北京)有限公司 A kind of video website similar users search system and method
CN105005576A (en) * 2015-03-27 2015-10-28 合一信息技术(北京)有限公司 System and method for searching similar users of video website
CN106156155A (en) * 2015-04-15 2016-11-23 厦门简帛信息科技有限公司 A kind of method and system that e-book resource is provided
CN104978401A (en) * 2015-06-04 2015-10-14 无锡天脉聚源传媒科技有限公司 Keyword setting method and apparatus of video album
CN104978401B (en) * 2015-06-04 2019-07-02 无锡天脉聚源传媒科技有限公司 A kind of the keyword setting method and device of video album
CN104978402A (en) * 2015-06-04 2015-10-14 无锡天脉聚源传媒科技有限公司 Keyword setting method and apparatus of video album
CN104933120A (en) * 2015-06-04 2015-09-23 无锡天脉聚源传媒科技有限公司 Keyword setting method and device for video album
WO2018113673A1 (en) * 2016-12-23 2018-06-28 北京奇虎科技有限公司 Method and apparatus for pushing search result of variety show query
CN106874443A (en) * 2017-02-09 2017-06-20 北京百家互联科技有限公司 Based on information query method and device that video text message is extracted
CN107704628A (en) * 2017-10-31 2018-02-16 福建中金在线信息科技有限公司 Data retrieval method, index relative method for building up and server
CN109299466A (en) * 2018-10-22 2019-02-01 中国船舶工业综合技术经济研究院 A kind of document retrieval method and system towards science and techniques of defence field
CN109299466B (en) * 2018-10-22 2023-07-07 中国船舶工业综合技术经济研究院 Document retrieval method and system oriented to national defense science and technology field
CN109783444A (en) * 2018-12-26 2019-05-21 亚信科技(中国)有限公司 Multichannel file index method, device, computer equipment and storage medium
CN110825913A (en) * 2019-09-03 2020-02-21 上海擎测机电工程技术有限公司 Professional word extraction and part-of-speech tagging method
CN112541115A (en) * 2020-12-02 2021-03-23 创盛视联数码科技(北京)有限公司 Method for recommending teaching video, electronic equipment and computer readable medium
CN114707007A (en) * 2022-06-07 2022-07-05 苏州大学 Image text retrieval method and device and computer storage medium
CN114707007B (en) * 2022-06-07 2022-08-30 苏州大学 Image text retrieval method and device and computer storage medium

Similar Documents

Publication Publication Date Title
CN103678694A (en) Method and system for establishing reverse index file of video resources
US11580176B2 (en) Search infrastructure
Ali et al. Comparison between SQL and NoSQL databases and their relationship with big data analytics
CN110489445B (en) Rapid mass data query method based on polymorphic composition
CN107818115B (en) Method and device for processing data table
US8244767B2 (en) Composite locality sensitive hash based processing of documents
US20040205044A1 (en) Method for storing inverted index, method for on-line updating the same and inverted index mechanism
TW201530328A (en) Method and device for constructing NoSQL database index for semi-structured data
CN109857898A (en) A kind of method and system of mass digital audio-frequency fingerprint storage and retrieval
CN111563095B (en) HBase-based data retrieval device
WO2015096609A1 (en) Method and system for creating inverted index file of video resource
CN106294695A (en) A kind of implementation method towards the biggest data search engine
CN105631003A (en) Intelligent index establishing, inquiring and maintaining method supporting mass data classification and counting
US9262511B2 (en) System and method for indexing streams containing unstructured text data
US20080010238A1 (en) Index having short-term portion and long-term portion
CN111367991B (en) MongoDB data real-time synchronization method and system based on message queue
JP2019512124A (en) Method and apparatus for archiving database generating index information, search method and apparatus for archived database including index information
CN103714158A (en) Vertical search method and system for video websites
CN114139040A (en) Data storage and query method, device, equipment and readable storage medium
CN112100197B (en) Quasi-real-time log data analysis and statistics method based on Elasticissearch
CN111782663A (en) Aggregation index structure and aggregation index method for improving aggregation query efficiency
Zhou et al. Adaptive subspace symbolization for content-based video detection
CN103699659A (en) Method and system for managing word library of video resources
CN112883143A (en) Elasticissearch-based digital exhibition searching method and system
CN103678697A (en) Reverse index storage method and system thereof

Legal Events

Date Code Title Description
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C41 Transfer of patent application or patent right or utility model
TA01 Transfer of patent application right

Effective date of registration: 20151228

Address after: Room six, building 19, building 68, No. 100089 South Road, Haidian District, Beijing

Applicant after: LETV CLOUD COMPUTING CO., LTD.

Address before: Room six, building 19, building 68, No. 100089 South Road, Haidian District, Beijing

Applicant before: LeTV Information Technology (Beijing) Co., Ltd.

RJ01 Rejection of invention patent application after publication

Application publication date: 20140326
