CN104598550B - A kind of update method and device of Internet video index - Google Patents

A kind of update method and device of Internet video index Download PDF

Info

Publication number
CN104598550B
CN104598550B CN201410854832.XA CN201410854832A CN104598550B CN 104598550 B CN104598550 B CN 104598550B CN 201410854832 A CN201410854832 A CN 201410854832A CN 104598550 B CN104598550 B CN 104598550B
Authority
CN
China
Prior art keywords
index
database
data
index database
newly
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201410854832.XA
Other languages
Chinese (zh)
Other versions
CN104598550A (en
Inventor
李顺龙
周益
祝美莲
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing QIYI Century Science and Technology Co Ltd
Original Assignee
Beijing QIYI Century Science and Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing QIYI Century Science and Technology Co Ltd filed Critical Beijing QIYI Century Science and Technology Co Ltd
Priority to CN201410854832.XA priority Critical patent/CN104598550B/en
Publication of CN104598550A publication Critical patent/CN104598550A/en
Application granted granted Critical
Publication of CN104598550B publication Critical patent/CN104598550B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/70Information retrieval; Database structures therefor; File system structures therefor of video data
    • G06F16/74Browsing; Visualisation therefor
    • G06F16/748Hypervideo
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/70Information retrieval; Database structures therefor; File system structures therefor of video data
    • G06F16/78Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually

Abstract

The update method and device indexed the present invention provides a kind of Internet video, the method includes:The second index database is created in the index server including the first index database, first index database has the first default alias that mark externally provides index service;During second index database creates inverted index, newly-increased index data is received;Newly-increased index data is added respectively in first index database and second index database, for newly-increased index data addition newly-increased mark of the addition in second index database;After the completion of second index database creates inverted index structure, the first default alias that first index database has is deleted, and the first default alias is added for second index database;Corresponding inverted index is created according to the index data that the newly-increased identifier lookup increases newly in second index database, and according to newly-increased index data.

Description

A kind of update method and device of Internet video index
Technical field
The present invention relates to Internet video fields, more particularly to a kind of update method of Internet video index, and, it is a kind of The updating device of Internet video index.
Background technology
In search system, the reconstruction for periodically carrying out data directory is needed for the retrieval of video data, since data volume is huge It is big so index reconstruction process can be long, in order to not influence user experience, need smooth to be indexed cutting for data It changes.Editor is required for the update being indexed, the real-time of data update to want very high any modification of program data, compiles It collects operation to complete, the modification of data will can be retrieved after refresh page.
Existing search system, the switching for full dose index, is all read and write abruption mostly, can accomplish a search clothes Business, a full dose index service after the completion of each timing full dose index construct, carry out service switching.The process of switching need by New index externally provides search service, and after there is no problem, old index service stops simultaneously deleting old index new index, The opening newly indexed is externally searched for and preheating this process of caching generally may require that time of several seconds, thus using making on line At influence.Also, for these reasons, carried out if the switching that full dose indexes all is put into the low peak period that system uses, It can not timely update.
Invention content
The present invention provides the update methods and device of a kind of Internet video index to be kept away with improving the efficiency of index switching Exempt from index switching and generates harmful effect to being used on line.
To solve the above-mentioned problems, the invention discloses a kind of update methods of Internet video index, including:
The second index database is created in the index server including the first index database, first index database has mark pair Outer the first default alias that index service is provided;
During second index database creates inverted index, newly-increased index data is received;
Newly-increased index data is added respectively in first index database and second index database, for addition in institute State the newly-increased newly-increased mark of index data addition in the second index database;
After the completion of second index database creates inverted index structure, first preset first index database has Alias is deleted, and adds the first default alias for second index database;
According to the index data that the newly-increased identifier lookup increases newly in second index database, and according to newly-increased index The corresponding inverted index of data creation.
Preferably, after creating the second index database in the index server including the first index database described, the side Method further includes:
The second default alias is added for second index database;
It is described to include for second index database addition, the first default alias:
The second default alias name modifications that second index database is had are the described first default alias.
Preferably, described to include in second index database establishment inverted index:
Index data is added in second index database, and the corresponding row's of falling rope is created according to the index data added Draw.
Preferably, the inverted index includes at least one keyword extracted from the index data and the index The mapping relations of data, it is described to include according to the corresponding inverted index of the index data added establishment:
Word segmentation processing is carried out to the index data of addition, obtains multiple keywords that the index data includes;
Establish the mapping relations of multiple keywords and the index data that participle obtains.
Preferably, the process that inverted index is created in second index database further includes:
The frequency of the word frequency and index data appearance of the corresponding multiple keywords of the index data is counted, and is added Into the inverted index of establishment;
Wherein, the inverted index is created by memory, and disk is written and is preserved.
Preferably, the first default alias that first index database has is deleted described, and is directed to second rope After drawing the library addition first default alias, the method further includes:
Delete first index database.
The invention also discloses a kind of updating devices of Internet video index, including:
Index database establishes module, described for creating the second index database in the index server including the first index database First index database has the first default alias that mark externally provides index service;
Index creation module, for creating inverted index in second index database;
Receiving module is indexed, for during second index database creates inverted index, receiving newly-increased index Data;
Add module is indexed, for adding newly-increased index respectively in first index database and second index database Data, for newly-increased index data addition newly-increased mark of the addition in second index database;
Alias changes module, is used for after the completion of second index database creates inverted index structure, by first rope Draw the library has first default alias to delete, and the first default alias is added for second index database;
Index creation module, the index number for being increased newly in second index database according to the newly-increased identifier lookup According to, and corresponding inverted index is created according to newly-increased index data.
Preferably, described device further includes:
The second default alias is added for second index database;
The alias changes module, is described specifically for the second default alias name modifications for having second index database First default alias.
Preferably, the index creation module, specifically for adding index data in second index database, and according to The index data added creates corresponding inverted index.
Preferably, the inverted index includes at least one keyword extracted from the index data and the index The mapping relations of data, the index creation module include:
Submodule is segmented, carries out word segmentation processing for the index data to addition, obtaining the index data includes Multiple keywords;
Mapping relations setting up submodule, the mapping for establishing multiple keywords and the index data that participle obtains are closed System.
Compared with the background art, the present invention includes following advantages:
In search system, it is typically necessary the establishment of the carry out full dose index data of timing, it is time-consuming longer, to making on line It is influenced with causing, the present invention is mainly a wound of the switching flow of the old and new's index database during full dose builds and indexes Newly, it by the thinking of space for time, is called by the alias and asynchronous thread of index and completes alias change technology, and searched for The time of system creation alias is very short, as long as several milliseconds, therefore index switching flow using the present invention can accomplish full dose structure When indexing, completes to take over seamlessly new and old index database in Millisecond other time, be indexed switching any time not Can to used on line and the usage experience of user generate harmful effect.
Description of the drawings
Fig. 1 is a kind of update method flow chart of Internet video index of the embodiment of the present invention;
Fig. 2 be the embodiment of the present invention Internet video index update method an example in establish showing for inverted index It is intended to;
Fig. 3 is an exemplary schematic diagram of the update method of the Internet video index of the embodiment of the present invention;
Fig. 4 is a kind of updating device structure diagram of Internet video index of the embodiment of the present invention.
Specific implementation mode
In order to make the foregoing objectives, features and advantages of the present invention clearer and more comprehensible, below in conjunction with the accompanying drawings and specific real Applying mode, the present invention is described in further detail.
Using the search system of background technology, the switching for full dose index is substantially and creates a new rope first Draw library, then batch structure index, after the completion of full dose index construct, old index service is stopped, then by new index database pair Outer offer service.The time needed during this can be long, stops the time of old search service, in addition new search clothes Business externally provides the time of retrieval, along with some preheating times etc. of search caching, it may be desirable to which the several seconds stops search The time of service.
In view of this, the present invention proposes a kind of flow of new index switching.Mainly use a kind of index alias Technology and asynchronous thread technology take over seamlessly new and old index database when capable of accomplishing full dose structure index.
Referring to Fig.1, it illustrates a kind of flow chart of the update method of Internet video index, institutes described in the embodiment of the present invention The method of stating can specifically include:
Step 101 creates the second index database, the first index database tool in the index server including the first index database Standby mark externally provides the first default alias of index service.
First index database is currently used old index database, and the second index database is an interim index database, for first Index database addition mark externally provides the first default alias of index service, and search system is by identifying that the first default alias is looked into Look for the corresponding library that index server is provided.The naming rule of index database can be " library name-random number-current time ", the present invention This is not limited.
Step 102, second index database create inverted index during, receive newly-increased index data.
During video production, the information content of program is huge, and Search Requirement is more complicated, while the reality of retrieval service When property is more demanding, therefore realizes a search service using the technology of inverted index, to meet complicated and quasi real time search Demand.
Inverted index needs the value according to attribute to search record in practical application.Each single item in this concordance list All include an attribute value and the address respectively recorded with the attribute value.Due to not determining attribute value by recording, The position of record, thus referred to as inverted index are determined by attribute value.Inverted index is a kind of indexing means, is used to be stored in The mapping of storage location of some word in a document or one group of document under full-text search.Establish inverted index needs first It to be segmented according to specified participle mode, generate word lexicon, then establish word to the mapping relations of document and some systems Count information.Due to storing attribute value in index to the mapping relations of document so the system queries speed established by inverted index It spends quickly, but herewith as it can be seen that the time that inverted index is established can be long.
It is described to include in second index database establishment inverted index in the embodiment of the present invention:
Index data is added in second index database, and the corresponding row's of falling rope is created according to the index data added Draw.
It is further preferred that the inverted index includes at least one keyword extracted from the index data and institute The mapping relations of index data are stated, it is described to include according to the corresponding inverted index of the index data added establishment:
Word segmentation processing is carried out to the index data of addition, obtains multiple keywords that the index data includes;
Establish the mapping relations of multiple keywords and the index data that participle obtains.
In concrete implementation, the process that inverted index is created in second index database further includes:
The frequency of the word frequency and index data appearance of the corresponding multiple keywords of the index data is counted, and is added Into the inverted index of establishment.
In concrete implementation, the inverted index can be created by memory, and disk is written and is preserved.
Step 103 adds newly-increased index data respectively in first index database and second index database, for Newly-increased index data addition newly-increased mark of the addition in second index database.
The time that full dose builds data can be long, during full dose builds and indexes, more if there is index data Newly, index data can be added in the index that service is being provided by index upgrade thread, while being also needed in the second rope Draw one index data of addition in library, it is new to add a newly-increased mark (a such as preset attribute field) to this index data Increase mark newly to create during full dose indexes for indicating the data.
Step 104, second index database create inverted index structure after the completion of, first index database is had First default alias is deleted, and adds the first default alias for second index database.
After second index database newly-built in the index server including the first index database, the method is also wrapped It includes:The second default alias is added for second index database.
Correspondingly, above-mentioned steps can be, the second alias name modifications that second index database is had are described first pre- Set name.
Step 105, the index data increased newly in second index database according to the newly-increased identifier lookup, and according to new The index data of increasing creates corresponding inverted index.
In the embodiment of the present invention, it is preferable that the first default alias that first index database has is deleted described, and After adding the first default alias for second index database, the method further includes:
Delete first index database.
In search system, it is typically necessary the establishment of the carry out full dose index data of timing, it is time-consuming longer, to making on line It is influenced with causing, through the above steps it is found that the present invention is mainly the old and new's index database during full dose builds and indexes One innovation of switching flow is called by the alias and asynchronous thread of index and is completed not by the thinking of space for time Name change technology, and the time that search system Makes Alias is very short, as long as several milliseconds, therefore index using the present invention switching stream When journey can accomplish full dose structure index, complete to take over seamlessly new and old index database in Millisecond other time, any time Be indexed switching all will not to used on line and the usage experience of user generate harmful effect.
It should be noted that for embodiment of the method above-mentioned, for simple description, therefore it is all expressed as a series of Combination of actions, but those skilled in the art should understand that, the present invention is not limited by the described action sequence, because according to According to the present invention, certain steps can be performed in other orders or simultaneously.Next, those skilled in the art should also know that, Embodiment described in this description belongs to preferred embodiment, and involved action is not necessarily essential to the invention.
To make those skilled in the art more fully understand the embodiment of the present invention, below by way of a specific example to this hair The update method of Internet video index described in bright embodiment illustrates.
Fig. 2 be the embodiment of the present invention Internet video index update method an example in establish showing for inverted index It is intended to.
During video production, need to retrieve the data such as program, since data volume is huge, so using It is developed based on this search engines increased income of elasticsearch.The mistake that inverted index is established in search system Journey probably includes following several steps:
Step 1 prepares the document for needing to create index.
Step 2 segments document.
Word after step 3, participle builds dictionary, while establishing the mapping relations of word and document.
Step 4 counts some relevant informations, word frequency, document frequency etc..
Step 5, index are established complete in memory, and disk is written.
In these steps, step 2 and 3 is all than relatively time-consuming.Since data volume is huge, the time of index full dose structure Can be long, the full dose structure that most of search system is indexed all by the way of the switching of the old and new's index database, i.e., new Index full dose structure complete before, old index database externally provides always service.Existing index switching mode will be old due to needs Index service is closed, and then new index database is opened to the outside world, so index switching time is long.
With reference to figure 3, an exemplary schematic diagram of the update method of the Internet video index of the embodiment of the present invention is shown, It can specifically include following steps:
The present invention is to reach taking over seamlessly for the old and new's index database, it is proposed that a kind of flow of new index switching.Mainly adopt With a kind of technology and asynchronous thread technology of index alias.Detailed process is as follows:
Step 1 creates an interim index database in index server, and the naming rule of index database is that " library name-is random The alias of number-current time ", old index database is video, and the alias of new index database is:Alias:video- 1403506581424。
Step 2 adds index data in the interim index database newly created, carries out the structure of full dose data.
The time that step 3, full dose build data can be long, during full dose builds and indexes, if there is index number According to update, that index upgrade thread may require that adds index data in the index for providing service, also needs to simultaneously A data is added in interim index database, an attribute field is added to this data to indicate that the data is in full dose rope It is newly created during drawing.The title IndexName of newly-increased index data:video-1403506581424- 20140623145621430。
The temporary library structure that step 4, full dose index finishes.
Step 5 deletes the alias of index database being served.
Temporary library is changed to externally provide the alias (Alias of service by step 6, asynchronous thread:video- 1403506581424 → video), new index database starts external offer service.
Step 7 above all after the completion deletes old index database.
Updating the data in step 8, processing full dose structure Index process.
It is tested through practical application, during the index full dose of present invention structure, external search service is only in step 5 It can stop with when 6, and the time that search system Makes Alias is very short, as long as several milliseconds, therefore index using the present invention When switching flow can accomplish full dose structure index, new and old index database is taken over seamlessly.
Explanation based on above method embodiment, the present invention also provides the updating device of corresponding Internet video index is real Example is applied, to realize the content described in above method embodiment.
With reference to Fig. 4, it illustrates a kind of structure diagram of the updating device of Internet video index described in the embodiment of the present invention, Including:
Index database establishes module 201, for creating the second index database, institute in the index server including the first index database It states the first index database and has the first default alias that mark externally provides index service;
Index creation module 202, for creating inverted index in second index database;
Receiving module 203 is indexed, for during second index database creates inverted index, receiving newly-increased rope Argument evidence;
Add module 204 is indexed, for addition to be newly-increased respectively in first index database and second index database Index data, for newly-increased index data addition newly-increased mark of the addition in second index database;
Alias changes module 205, is used for after the completion of second index database creates inverted index structure, by described first The first default alias that index database has is deleted, and adds the first default alias for second index database;
Index creation module 206, the index for being increased newly in second index database according to the newly-increased identifier lookup Data, and corresponding inverted index is created according to newly-increased index data.
In the embodiment of the present invention, it is preferable that described device further includes:
The second default alias is added for second index database;
The alias changes module, is described specifically for the second default alias name modifications for having second index database First default alias.
In the embodiment of the present invention, it is preferable that the index creation module, specifically for being added in second index database Index data, and corresponding inverted index is created according to the index data added.
In the embodiment of the present invention, it is preferable that the inverted index include extracted from the index data it is at least one The mapping relations of keyword and the index data, the index creation module include:
Submodule is segmented, carries out word segmentation processing for the index data to addition, obtaining the index data includes Multiple keywords;
Mapping relations setting up submodule, the mapping for establishing multiple keywords and the index data that participle obtains are closed System;
In the embodiment of the present invention, it is preferable that the index creation module further includes:
Information Statistics submodule, the word frequency for counting the corresponding multiple keywords of the index data and the index number According to the frequency of appearance, and it is added in the inverted index of establishment;
Wherein, the inverted index is created by memory, and disk is written and is preserved.
In the embodiment of the present invention, it is preferable that described device further includes:
Removing module, for deleting first index database.
In search system, it is typically necessary the establishment of the carry out full dose index data of timing, it is time-consuming longer, to making on line It is influenced with causing, the present invention is mainly a wound of the switching flow of the old and new's index database during full dose builds and indexes Newly, it by the thinking of space for time, is called by the alias and asynchronous thread of index and completes alias change technology, realized Big data full dose index takes over seamlessly so that the switching time of full dose index completes in millisecond rank, carries out rope any time Draw switching all will not to used on line and the usage experience of user generate harmful effect.
For the updating device embodiment of above-mentioned Internet video index, since it is basically similar to the method embodiment, So description is fairly simple, the part explanation of related place embodiment of the method shown in Figure 1.
Each embodiment in this specification is described in a progressive manner, the highlights of each of the examples are with The difference of other embodiment, the same or similar parts between the embodiments can be referred to each other.
It would have readily occurred to a person skilled in the art that be:The arbitrary combination application of above-mentioned each embodiment is all feasible, therefore Arbitrary combination between above-mentioned each embodiment is all embodiment of the present invention, but this specification exists as space is limited, This is not just detailed one by one.
The present invention can be used in numerous general or special purpose computing system environments or configuration.Such as:Personal computer, service Device computer, handheld device or portable device, laptop device, multicomputer system, microprocessor-based system, top set Box, programmable consumer-elcetronics devices, network PC, minicomputer, mainframe computer including any of the above system or equipment Distributed computing environment etc..
The present invention can describe in the general context of computer-executable instructions executed by a computer, such as program Module.Usually, program module includes routines performing specific tasks or implementing specific abstract data types, program, object, group Part, data structure etc..The present invention can also be put into practice in a distributed computing environment, in these distributed computing environments, by Task is executed by the connected remote processing devices of communication network.In a distributed computing environment, program module can be with In the local and remote computer storage media including storage device.
In the present invention, " component ", " device ", " system " etc. refer to the related entities applied to computer, such as hardware, firmly Combination, software or software in execution of part and software etc..In detail, for example, component can with but be not limited to run on place Manage process, processor, object, executable component, execution thread, program and/or the computer of device.In addition, running on server On application program or shell script, server can be component.One or more components can be in the process and/or line of execution Cheng Zhong, and component can be localized and/or be distributed between two or multiple stage computers on one computer, and can be by Various computer-readable medium operations.Component can also be according to the signal with one or more data packets, for example, coming from one Pass through signal and other systems with another component interaction in local system, distributed system, and/or network in internet and hand over The signal of mutual data is communicated by locally and/or remotely process.
Finally, it is to be noted that, herein, relational terms such as first and second and the like be used merely to by One entity or operation are distinguished with another entity or operation, without necessarily requiring or implying these entities or operation Between there are any actual relationship or orders.Moreover, the terms "include", "comprise", include not only those elements, and And further include other elements that are not explicitly listed, or further include for this process, method, article or equipment institute it is intrinsic Element.In the absence of more restrictions, the element limited by sentence " including ... ", it is not excluded that wanted including described There is also other identical elements in the process, method, article or equipment of element.
Moreover, "and/or" above indicate both to have contained herein " and " relationship, also contain the relationship of "or", In:If option A and option b be " and " relationship, then it represents that can include option A and option b simultaneously in certain embodiment;If Option A and the relationship that option b is "or", then it represents that can include individually option A in certain embodiment, or include individually option b.
It should be understood by those skilled in the art that, the embodiment of the present invention can be provided as method, system or computer program Product.Therefore, complete hardware embodiment, complete software embodiment or reality combining software and hardware aspects can be used in the present invention Apply the form of example.Moreover, the present invention can be used in one or more wherein include computer usable program code computer The computer program production implemented in usable storage medium (including but not limited to magnetic disk storage, CD-ROM, optical memory etc.) The form of product.
The present invention be with reference to according to the method for the embodiment of the present invention, the flow of equipment (system) and computer program product Figure and/or block diagram describe.It should be understood that can be realized by computer program instructions every first-class in flowchart and/or the block diagram The combination of flow and/or box in journey and/or box and flowchart and/or the block diagram.These computer programs can be provided Instruct the processor of all-purpose computer, special purpose computer, Embedded Processor or other programmable data processing devices to produce A raw machine so that the instruction executed by computer or the processor of other programmable data processing devices is generated for real The device for the function of being specified in present one flow of flow chart or one box of multiple flows and/or block diagram or multiple boxes.
These computer program instructions, which may also be stored in, can guide computer or other programmable data processing devices with spy Determine in the computer-readable memory that mode works so that instruction generation stored in the computer readable memory includes referring to Enable the manufacture of device, the command device realize in one flow of flow chart or multiple flows and/or one box of block diagram or The function of being specified in multiple boxes.
These computer program instructions also can be loaded onto a computer or other programmable data processing device so that count Series of operation steps are executed on calculation machine or other programmable devices to generate computer implemented processing, in computer or The instruction executed on other programmable devices is provided for realizing in one flow of flow chart or multiple flows and/or block diagram one The step of function of being specified in a box or multiple boxes.
Although preferred embodiments of the present invention have been described, it is created once a person skilled in the art knows basic Property concept, then additional changes and modifications can be made to these embodiments.So it includes excellent that the following claims are intended to be interpreted as It selects embodiment and falls into all change and modification of the scope of the invention.
Above to a kind of more new clothes of the update method and Internet video index of Internet video index provided by the present invention It sets, is described in detail, principle and implementation of the present invention are described for specific case used herein, above The explanation of embodiment is merely used to help understand the method and its core concept of the present invention;Meanwhile for the general skill of this field Art personnel, according to the thought of the present invention, there will be changes in the specific implementation manner and application range, in conclusion this Description should not be construed as limiting the invention.

Claims (7)

1. a kind of update method of Internet video index, which is characterized in that including:
The second index database is created in the index server including the first index database, first index database has mark and externally carries For the first default alias of index service;
During second index database creates inverted index, newly-increased index data is received;
Wherein, described to include in second index database establishment inverted index:Index data is added in second index database, And corresponding inverted index is created according to the index data added;Second index database is used to carry out the structure of full dose data It builds;
Newly-increased index data is added respectively in first index database and second index database, for addition described the The newly-increased newly-increased mark of index data addition in two index databases;
After the completion of second index database creates inverted index structure, the first default alias that first index database is had It deletes, and the first default alias is added for second index database;
Delete first index database;
According to the index data that the newly-increased identifier lookup increases newly in second index database, and according to newly-increased index data Create corresponding inverted index.
2. according to the method described in claim 1, it is characterized in that, described in the index server including the first index database After newly-built second index database, the method further includes:
The second default alias is added for second index database;
It is described to include for second index database addition, the first default alias:
The second default alias name modifications that second index database is had are the described first default alias.
3. according to the method described in claim 1, it is characterized in that, the inverted index includes being extracted from the index data At least one keyword and the index data mapping relations, it is described according to the index data that is added create it is corresponding fall Row indexes:
Word segmentation processing is carried out to the index data of addition, obtains multiple keywords that the index data includes;
Establish the mapping relations of multiple keywords and the index data that participle obtains.
4. according to the method described in claim 3, it is characterized in that, the mistake for creating inverted index in second index database Journey further includes:
The frequency of the word frequency and index data appearance of the corresponding multiple keywords of the index data is counted, and is added to wound In the inverted index built;
Wherein, the inverted index is created by memory, and disk is written and is preserved.
5. a kind of updating device of Internet video index, which is characterized in that including:
Index database establishes module, in the index server including the first index database create the second index database, described first Index database has the first default alias that mark externally provides index service;
Index creation module, for creating inverted index in second index database;Wherein, described to be created in second index database Building inverted index includes:Index data is added in second index database, and is created and corresponded to according to the index data added Inverted index;Second index database is used to carry out the structure of full dose data;
Receiving module is indexed, for during second index database creates inverted index, receiving newly-increased index data;
Add module is indexed, for adding newly-increased index number respectively in first index database and second index database According to for newly-increased index data addition newly-increased mark of the addition in second index database;
Alias changes module, is used for after the completion of second index database creates inverted index structure, by first index database The first default alias having is deleted, and adds the first default alias for second index database;
Removing module, for deleting first index database;
Index creation module, the index data for being increased newly in second index database according to the newly-increased identifier lookup, and Corresponding inverted index is created according to newly-increased index data.
6. device according to claim 5, which is characterized in that described device further includes:
The second default alias is added for second index database;
The alias changes module, is described first specifically for the second default alias name modifications for having second index database Default alias.
7. device according to claim 5, which is characterized in that the inverted index includes being extracted from the index data At least one keyword and the index data mapping relations, the index creation module includes:
Submodule is segmented, for carrying out word segmentation processing to the index data of addition, obtain that the index data includes is multiple Keyword;
Mapping relations setting up submodule, the mapping relations for establishing multiple keywords and the index data that participle obtains.
CN201410854832.XA 2014-12-31 2014-12-31 A kind of update method and device of Internet video index Active CN104598550B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201410854832.XA CN104598550B (en) 2014-12-31 2014-12-31 A kind of update method and device of Internet video index

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201410854832.XA CN104598550B (en) 2014-12-31 2014-12-31 A kind of update method and device of Internet video index

Publications (2)

Publication Number Publication Date
CN104598550A CN104598550A (en) 2015-05-06
CN104598550B true CN104598550B (en) 2018-09-25

Family

ID=53124335

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201410854832.XA Active CN104598550B (en) 2014-12-31 2014-12-31 A kind of update method and device of Internet video index

Country Status (1)

Country Link
CN (1) CN104598550B (en)

Families Citing this family (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106959951A (en) * 2016-01-08 2017-07-18 北京国双科技有限公司 The treating method and apparatus of database
CN107133350A (en) * 2017-05-25 2017-09-05 努比亚技术有限公司 Data-updating method, mobile terminal and storage medium based on search engine
CN110019200B (en) * 2017-09-30 2023-05-09 阿里巴巴集团控股有限公司 Index establishing and using method and device
CN109857752A (en) * 2019-01-25 2019-06-07 北京炎黄新星网络科技有限公司 A kind of index database update method and device
CN110515953A (en) * 2019-08-29 2019-11-29 百度在线网络技术(北京)有限公司 Querying method, device, equipment and the storage medium of data
CN112597191A (en) * 2020-12-29 2021-04-02 拉卡拉支付股份有限公司 Data processing method, data processing apparatus, electronic device, storage medium, and program product
CN115563375A (en) * 2022-09-29 2023-01-03 北京海泰方圆科技股份有限公司 Document index updating method, device, equipment and medium

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101072205A (en) * 2007-06-21 2007-11-14 腾讯科技(深圳)有限公司 Chat information searching method and system
CN101136016A (en) * 2006-09-01 2008-03-05 北大方正集团有限公司 Indexes on-line updating method of full text retrieval system
CN101246500A (en) * 2008-03-27 2008-08-20 腾讯科技(深圳)有限公司 Retrieval system and method for implementing data fast indexing
CN101520800A (en) * 2009-03-27 2009-09-02 华中科技大学 Cryptogram-based safe full-text indexing and retrieval system
CN102073726A (en) * 2011-01-11 2011-05-25 百度在线网络技术(北京)有限公司 Search engine system and structured data import method for search engine system
CN102103602A (en) * 2009-12-17 2011-06-22 腾讯科技(深圳)有限公司 System and method for increasing retrieval speed
CN102968478A (en) * 2012-11-19 2013-03-13 天津书生投资有限公司 Indexing and searching method

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8909615B2 (en) * 2011-08-30 2014-12-09 Open Text S.A. System and method of managing capacity of search index partitions

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101136016A (en) * 2006-09-01 2008-03-05 北大方正集团有限公司 Indexes on-line updating method of full text retrieval system
CN101072205A (en) * 2007-06-21 2007-11-14 腾讯科技(深圳)有限公司 Chat information searching method and system
CN101246500A (en) * 2008-03-27 2008-08-20 腾讯科技(深圳)有限公司 Retrieval system and method for implementing data fast indexing
CN101520800A (en) * 2009-03-27 2009-09-02 华中科技大学 Cryptogram-based safe full-text indexing and retrieval system
CN102103602A (en) * 2009-12-17 2011-06-22 腾讯科技(深圳)有限公司 System and method for increasing retrieval speed
CN102073726A (en) * 2011-01-11 2011-05-25 百度在线网络技术(北京)有限公司 Search engine system and structured data import method for search engine system
CN102968478A (en) * 2012-11-19 2013-03-13 天津书生投资有限公司 Indexing and searching method

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
"Elasticsearch索引重建(Rebuild)";既然2015;《http://blog.csdn.net/changong28/article/details/38491185》;20140811;第1-2页 *

Also Published As

Publication number Publication date
CN104598550A (en) 2015-05-06

Similar Documents

Publication Publication Date Title
CN104598550B (en) A kind of update method and device of Internet video index
US11645183B1 (en) User interface for correlation of virtual machine information and storage information
CN105740303B (en) The method and device of improved object storage
CN106970958B (en) A kind of inquiry of stream file and storage method and device
KR102311032B1 (en) Database Synchronization
US10353874B2 (en) Method and apparatus for associating information
US9201700B2 (en) Provisioning computer resources on a network
JP5791149B2 (en) Computer-implemented method, computer program, and data processing system for database query optimization
CN109033109B (en) Data processing method and system
US10083031B2 (en) Cognitive feature analytics
US11288287B2 (en) Methods and apparatus to partition a database
CN109086434B (en) Knowledge aggregation method and system based on theme map
CN115335821B (en) Offloading statistics collection
CN110188100A (en) Data processing method, device and computer storage medium
CN109101575A (en) Calculation method and device
CN108319608A (en) The method, apparatus and system of access log storage inquiry
CN109344226A (en) A kind of index data update method and device
WO2015168988A1 (en) Data index creation method and device, and computer storage medium
Wang et al. High volumes of event stream indexing and efficient multi-keyword searching for cloud monitoring
KR101955376B1 (en) Processing method for a relational query in distributed stream processing engine based on shared-nothing architecture, recording medium and device for performing the method
CN107894942B (en) Method and device for monitoring data table access amount
US9286349B2 (en) Dynamic search system
US20150178075A1 (en) Enhancing understandability of code using code clones
US8214336B2 (en) Preservation of digital content
EP3559797A1 (en) Meta-join and meta-group-by indexes for big data

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant