CN101833587A - Network video searching system - Google Patents

Network video searching system

Publication number
CN101833587A
Authority
CN
China
Prior art keywords
submodule
links
unit
video
database
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN 201010186145
Other languages
Chinese (zh)
Inventor
蒋兴浩
孙锬锋
傅光磊
李荣杰
冯冰
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shanghai Jiaotong University
Original Assignee
Shanghai Jiaotong University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shanghai Jiaotong University
Priority to CN 201010186145
Publication of CN101833587A
Legal status: Pending

Landscapes

  • Information Transfer Between Computers (AREA)

Abstract

The invention discloses a network video searching system in the technical field of network applications. It comprises a data storage module, a service processing module and a user interface module. The data storage module comprises a network video crawler submodule and a database submodule. The service processing module comprises a search interface submodule, a database operation submodule, a network video hot word analysis submodule, a user interest model management submodule, a network video address detection submodule and a system configuration management submodule. The system provides users with a multi-level network video search service, uses each user's interests to recommend network videos proactively, provides a network video download function, and can analyse network video hot words. In addition, the system administrator can manage the system effectively through the configurable system configuration management submodule. The time needed for a network video search is greatly reduced and accuracy is improved.

Description

Network video searching system
Technical field
The present invention relates to a system in the technical field of network applications, specifically a network video searching system.
Background technology
With the continuous development of Internet technology, network video has become one of the more widely used media for transmitting information. Mainstream video websites keep emerging: well-known domestic ones include Youku and Tudou, and well-known foreign ones include YouTube. Some multi-service portal websites also run their own video services for playing media content such as news. Besides providing a search function to users, a search engine is in essence a tool for the statistical analysis of data. The early Internet was based on text, whereas network video has now become a major information carrier, so search engines need statistical analysis methods suited to video as a medium.
A search of the existing literature found the 2007 Beijing Jiaotong University master's thesis "Design and Implementation of a Search Engine Based on Network Video Services" (classification number TP393.09). The thesis discusses a search engine system for network video services whose architecture comprises three modules: an information crawling module, an information indexing module and an information search module. The information crawling module uses a web spider to grab information from video websites, analyses and extracts it, and stores the final information in a database. The information indexing module reads the attribute information of videos from the database, applies Chinese word segmentation and uses Lucene to generate an index file. The information search module comprises a user interface and an index lookup component: the user interface accepts the keywords entered by the user and returns the search results to the user, while the index lookup component searches the index file by the user's keywords and sorts the results as required. Although the thesis discusses how to implement a network video search engine, the search system it designs offers limited functionality. It can only search for network videos, its searches take a long time, and it provides neither a multi-level search interface nor network video hot word analysis, user interest management or network video address detection.
Summary of the invention
The objective of the present invention is to overcome the above shortcomings of the prior art by providing a network video searching system. By analysing network video data and the behaviour of users searching for network videos, the invention realises a multifunctional, intelligent network video search and monitoring system. Its advantages are user personalisation, downloadable network videos, multi-level video search, flexible configuration of system operation, and statistics on network video hot topics.
The present invention is achieved by the following technical solutions:
The present invention comprises a data storage module, a service processing module and a user interface module, wherein: the data storage module is connected to the service processing module and transmits data information and processing information; the service processing module is connected to the user interface module and transmits search request information and search result information.
The data storage module obtains and stores network video data. It comprises a network video crawler submodule and a database submodule, wherein: the network video crawler submodule is connected to the database submodule and transmits network video data; the network video crawler submodule is connected to the service processing module and transmits module operation configuration information; the database submodule is connected to the service processing module and transmits database operation requests and the data returned by database operations.
The network video crawler submodule comprises a page download unit, a page content analysis unit and a video information extraction unit, wherein: the page download unit is connected to the page content analysis unit and transmits the page data of the video websites to be crawled; the page content analysis unit is connected to the video information extraction unit and transmits the video-related information found in the analysed page; the video information extraction unit is connected to the database submodule and transmits the exact video information extracted from the page.
The service processing module comprises a search interface submodule, a database operation submodule, a network video hot word analysis submodule, a user interest model management submodule, a network video address detection submodule and a system configuration management submodule, wherein: the search interface submodule is connected to the user interface module and transmits search request information and search result information; the database operation submodule is connected to the search interface submodule and transmits search condition information and search return information; the database operation submodule is connected to the network video hot word analysis submodule and transmits database operation information and hot word analysis results; the database operation submodule is connected to the user interest model management submodule and transmits database operation information and user interest model updates; the database operation submodule is connected to the network video address detection submodule and transmits database operation information and network video address information; the system configuration management submodule is connected to the user interest model management submodule, the network video hot word analysis submodule, the network video address detection submodule, the database operation submodule and the network video crawler submodule, and transmits operation configuration information to each of them; and the database operation submodule is connected to the data storage module and transmits database storage information.
The database operation submodule comprises a database retrieval unit, a database insertion unit, a database deletion unit, a database update unit and a database view update unit, wherein: the database retrieval unit is connected to the data storage module and transmits database retrieval statements and the returned result data; the database insertion unit is connected to the data storage module and transmits commands that add data; the database deletion unit is connected to the data storage module and transmits commands that delete data; the database update unit is connected to the data storage module and transmits commands that update data; and the database view update unit is connected to the data storage module and transmits commands that update database views.
The search interface submodule comprises a user search condition receiving unit, a user search result returning unit, a search condition processing unit and a search execution unit, wherein: the user search condition receiving unit is connected to the search condition processing unit and transmits the user's original search conditions; the search condition processing unit is connected to the search execution unit and transmits the search instructions produced by system processing; the search execution unit is connected to the database operation submodule and transmits database operation information; and the user search result returning unit is connected to the database operation submodule and transmits the data returned by database operations.
The network video hot word analysis submodule comprises a video title extraction unit, a video title word segmentation unit, a title word clustering unit, a title vocabulary statistics unit and a hot vocabulary update unit, wherein: the video title extraction unit is connected to the video title word segmentation unit and transmits the set of video titles to be analysed; the video title word segmentation unit is connected to the title word clustering unit and transmits the words that make up the video titles; the title word clustering unit is connected to the title vocabulary statistics unit and transmits the clustered vocabulary space; the title vocabulary statistics unit is connected to the hot vocabulary update unit and transmits the set of words with a high occurrence rate; the hot vocabulary update unit is connected to the database operation submodule and transmits database commands that add data; and the video title extraction unit is connected to the database operation submodule and transmits the database statements that obtain video titles.
The user interest model management submodule comprises a user behaviour representation unit, a user interest model update unit and a user interest recommendation unit, wherein: the user behaviour representation unit is connected to the user interest model update unit and transmits information identifying the user's search behaviour; the user interest model update unit is connected to the database operation submodule and transmits the database statements that update the user's interests; and the user interest recommendation unit is connected to the database operation submodule and transmits the database statements that extract the videos to recommend to the user.
The network video address detection submodule comprises a packet capture unit, a packet content analysis unit and a file address extraction unit, wherein: the packet capture unit is connected to the packet content analysis unit and transmits the captured HTTP (HyperText Transfer Protocol) packets; the packet content analysis unit is connected to the file address extraction unit and transmits the HTTP packets that contain a file address; and the file address extraction unit is connected to the database operation submodule and transmits the database update statements that update the video information.
The system configuration management submodule comprises a network video crawler configuration unit, a network video address detection configuration unit, a user interest model management configuration unit, a network video hot vocabulary analysis configuration unit and an expired data deletion unit, wherein: the network video crawler configuration unit is connected to the data storage module and transmits changes to the crawler operation configuration; the user interest model management configuration unit is connected to the user interest model management submodule and transmits module operation configuration information; the network video hot vocabulary analysis configuration unit is connected to the network video hot word analysis submodule and transmits module operation configuration information; the network video address detection configuration unit is connected to the network video address detection submodule and transmits module operation configuration information; and the expired data deletion unit is connected to the database operation submodule and transmits the database statements that delete video information.
Compared with the prior art, the beneficial effects of the present invention are as follows: it provides users with a multi-level network video search service; it can proactively and intelligently offer network videos matched to the user's interests; it provides a network video download function; it can analyse network video hot vocabulary; the system administrator can manage the system effectively through the configurable system configuration management submodule; and the time needed for a network video search is greatly reduced while accuracy is improved.
Description of drawings
Fig. 1 is a schematic diagram of the composition and connections of the system of the present invention.
Detailed description
The system of the present invention is further described below with reference to the accompanying drawing. This embodiment is implemented on the premise of the technical solution of the present invention, and a detailed implementation and concrete operating process are given, but the protection scope of the present invention is not limited to the following embodiment.
Embodiment
As shown in Fig. 1, this embodiment comprises a data storage module, a service processing module and a user interface module, wherein: the data storage module is connected to the service processing module and transmits data information and processing information; the service processing module is connected to the user interface module and transmits search request information and search result information.
The data storage module obtains and stores network video data. It comprises a network video crawler submodule and a database submodule, wherein: the network video crawler submodule is connected to the database submodule and transmits network video data; the network video crawler submodule is connected to the service processing module and transmits module operation configuration information; the database submodule is connected to the service processing module and transmits database operation requests and the data returned by database operations.
The network video crawler submodule comprises a page download unit, a page content analysis unit and a video information extraction unit, wherein: the page download unit is connected to the page content analysis unit and transmits the page data of the video websites to be crawled; the page content analysis unit is connected to the video information extraction unit and transmits the video-related information found in the analysed page; the video information extraction unit is connected to the database submodule and transmits the exact video information extracted from the page.
In this embodiment the data that the system provides to the user for each network video comprise: (1) the picture of the video, (2) the title of the video, (3) the source website of the video, (4) the click rate of the video, (5) the download address of the video and (6) the playback link address of the video. Because page layouts differ widely between video websites, a separate network video crawler must be designed for each website, and before a crawler is designed the web page element structure of the target website must be analysed. The network video crawler submodule crawls video data according to the way each video website organises its own pages, including the website's own category structure and popularity ranking. The crawled network video data are stored in the relevant tables of the server's database, and the related views are updated at the same time. Because the crawling is focused, the crawled information must be precise. Given that the page structures of current network video websites all differ, the implementation builds one crawler for each mainstream network video website, and together these form a crawler pool.
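The crawler pipeline described above (page download, page content analysis, video information extraction) can be sketched as follows. This is only an illustration under stated assumptions: the HTML layout, the `data-plays` attribute, the `VideoRecord` fields and the `example.com` addresses are hypothetical, and the page download step is replaced by a literal sample page. A real crawler would need one such parser per target website, and the download address (item 5) would be filled in later by the address detection submodule.

```python
from dataclasses import dataclass
from html.parser import HTMLParser


@dataclass
class VideoRecord:
    title: str
    play_url: str
    thumbnail_url: str
    play_count: int
    source_site: str


class VideoListParser(HTMLParser):
    """Page content analysis for one hypothetical site layout:
    <a class="video" href=... data-plays=...><img src=... alt=...></a>"""

    def __init__(self, source_site):
        super().__init__()
        self.source_site = source_site
        self.records = []
        self._current = None  # attributes of the <a class="video"> being parsed

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "a" and attrs.get("class") == "video":
            self._current = attrs
        elif tag == "img" and self._current is not None:
            # Video information extraction: build one exact record.
            self.records.append(VideoRecord(
                title=attrs.get("alt", ""),
                play_url=self._current.get("href", ""),
                thumbnail_url=attrs.get("src", ""),
                play_count=int(self._current.get("data-plays", "0")),
                source_site=self.source_site,
            ))

    def handle_endtag(self, tag):
        if tag == "a":
            self._current = None


def crawl_page(html, source_site):
    """Analyse one already-downloaded page and return its video records."""
    parser = VideoListParser(source_site)
    parser.feed(html)
    return parser.records


# Stand-in for the output of the page download unit.
SAMPLE_PAGE = """
<div><a class="video" href="http://example.com/v/1" data-plays="1200">
<img src="http://example.com/t/1.jpg" alt="Cup final highlights"></a></div>
"""
records = crawl_page(SAMPLE_PAGE, "example")
```

Keeping one parser class per website mirrors the crawler pool idea: each site's layout knowledge stays in its own class while the record format is shared.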
In this embodiment the database submodule is built with SQL Server 2005. A separate video data table is designed for each network video website, and all these tables share the same structure: the primary key of the table, the playback address of the network video, the title of the network video, the link address of the displayed picture, the play count of the network video, the remote address of the network video file, and the time at which the network video data were last updated. A table is also needed for the user interest model, with the following structure: the user ID (the primary key of the table), the user's click rate for each video category, the user's click rate for each video website, and the keywords the user searches for frequently. An event log table is set up for the operation of the whole system and records the events that occur while the system runs. Its structure is: the event ID (the primary key of the table), the event content, the event source and the time the event occurred. A view is set up for each category of network video, and whenever a video data table is updated the corresponding view must be refreshed. Views are set up per category per video website. For example, for sports videos on the Youku site a view named youku_sport_view can be created.
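A minimal sketch of this table and view layout is given below. It uses Python's built-in `sqlite3` as a stand-in for SQL Server 2005, so types and syntax are only approximate. The column names and the `category` column are assumptions (the text lists no category field, but some such field is needed to define a per-category view), and only the view name `youku_sport_view` comes from the text.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE youku_video (
    id            INTEGER PRIMARY KEY,
    play_url      TEXT NOT NULL,      -- playback address
    title         TEXT NOT NULL,
    thumbnail_url TEXT,               -- link address of the displayed picture
    play_count    INTEGER DEFAULT 0,
    file_url      TEXT,               -- remote address of the video file
    updated_at    TEXT,
    category      TEXT                -- assumed column, needed for category views
);
CREATE TABLE event_log (
    event_id    INTEGER PRIMARY KEY,
    content     TEXT,
    source      TEXT,
    occurred_at TEXT
);
CREATE VIEW youku_sport_view AS
    SELECT * FROM youku_video WHERE category = 'sport';
""")

insert_sql = (
    "INSERT INTO youku_video (play_url, title, play_count, category) "
    "VALUES (?, ?, ?, ?)"
)
conn.execute(insert_sql, ("http://example.com/v/1", "Cup final highlights", 1200, "sport"))
conn.execute(insert_sql, ("http://example.com/v/2", "Evening news", 300, "news"))

sport_titles = [row[0] for row in conn.execute("SELECT title FROM youku_sport_view")]
```

In SQLite a view is evaluated at query time, so no explicit refresh is needed here. The refresh step in the text would apply if the views were materialised.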
The service processing module comprises a search interface submodule, a database operation submodule, a network video hot word analysis submodule, a user interest model management submodule, a network video address detection submodule and a system configuration management submodule, wherein: the search interface submodule is connected to the user interface module and transmits search request information and search result information; the database operation submodule is connected to the search interface submodule and transmits search condition information and search return information; the database operation submodule is connected to the network video hot word analysis submodule and transmits database operation information and hot word analysis results; the database operation submodule is connected to the user interest model management submodule and transmits database operation information and user interest model updates; the database operation submodule is connected to the network video address detection submodule and transmits database operation information and network video address information; the system configuration management submodule is connected to the user interest model management submodule, the network video hot word analysis submodule, the network video address detection submodule, the database operation submodule and the network video crawler submodule, and transmits operation configuration information to each of them; and the database operation submodule is connected to the data storage module and transmits database storage information.
The database operation submodule comprises a database retrieval unit, a database insertion unit, a database deletion unit, a database update unit and a database view update unit, wherein: the database retrieval unit is connected to the data storage module and transmits database retrieval statements and the returned result data; the database insertion unit is connected to the data storage module and transmits commands that add data; the database deletion unit is connected to the data storage module and transmits commands that delete data; the database update unit is connected to the data storage module and transmits commands that update data; and the database view update unit is connected to the data storage module and transmits commands that update database views.
In this embodiment the database operation submodule is built with ADO.NET, a library well suited to database operations. Implementing the submodule requires the following function interfaces: (1) adding data to the relevant table in the database; (2) deleting data from the relevant table in the database; (3) modifying the data of the relevant table in the database; (4) retrieving the relevant data from the database. The parameters of every function interface must be considered in the implementation. For the function that adds data, for example, one must decide which table the data are added to and what data are added.
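The four function interfaces can be sketched as a small wrapper class. The embodiment uses ADO.NET against SQL Server. The sketch below substitutes Python's `sqlite3` purely for illustration, and the `VideoStore` class, its method names and the single fixed `video` table are hypothetical. A fuller version would also take the target table as a parameter, as the text notes.

```python
import sqlite3


class VideoStore:
    """Hypothetical wrapper exposing add, delete, update and retrieve."""

    def __init__(self, conn):
        self.conn = conn
        conn.execute(
            "CREATE TABLE IF NOT EXISTS video ("
            "id INTEGER PRIMARY KEY, title TEXT, play_count INTEGER)"
        )

    def add(self, title, play_count):
        # Interface 1: add data; returns the new primary key.
        cur = self.conn.execute(
            "INSERT INTO video (title, play_count) VALUES (?, ?)",
            (title, play_count),
        )
        return cur.lastrowid

    def delete(self, video_id):
        # Interface 2: delete data.
        self.conn.execute("DELETE FROM video WHERE id = ?", (video_id,))

    def update_play_count(self, video_id, play_count):
        # Interface 3: modify data.
        self.conn.execute(
            "UPDATE video SET play_count = ? WHERE id = ?",
            (play_count, video_id),
        )

    def retrieve(self, min_plays=0):
        # Interface 4: retrieve data, most played first.
        return self.conn.execute(
            "SELECT id, title, play_count FROM video "
            "WHERE play_count >= ? ORDER BY play_count DESC",
            (min_plays,),
        ).fetchall()


store = VideoStore(sqlite3.connect(":memory:"))
first = store.add("Cup final highlights", 1200)
second = store.add("Evening news", 300)
store.update_play_count(second, 2000)
store.delete(first)
remaining = store.retrieve()
```

Parameterised statements (`?` placeholders) keep user-supplied search text out of the SQL string itself, which matters once the search interface feeds this layer.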
The search interface submodule comprises a user search condition receiving unit, a user search result returning unit, a search condition processing unit and a search execution unit, wherein: the user search condition receiving unit is connected to the search condition processing unit and transmits the user's original search conditions; the search condition processing unit is connected to the search execution unit and transmits the search instructions produced by system processing; the search execution unit is connected to the database operation submodule and transmits database operation information; and the user search result returning unit is connected to the database operation submodule and transmits the data returned by database operations.
This embodiment provides four kinds of search interface: (1) search by website, where the search interface passes a specific website name to the database operation submodule, which retrieves and returns the matching video data; (2) search by category, where the search interface passes a specific category name to the database operation submodule, which retrieves and returns the matching video data; (3) search by keyword, where the search interface passes specific keywords to the database operation submodule, which retrieves and returns the matching video data; (4) advanced search, which combines the website name, category name, keyword, play count and number of results to return, and so yields more precise results.
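One way to serve all four interfaces is a single condition object in which unused fields stay empty, so that the first three searches are special cases of the advanced search. The sketch below is an assumption about how the search condition processing unit might turn conditions into a database statement. The `SearchCondition` class, the `video` table and its column names are hypothetical.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class SearchCondition:
    site: Optional[str] = None
    category: Optional[str] = None
    keyword: Optional[str] = None
    min_plays: Optional[int] = None
    limit: Optional[int] = None


def build_query(cond):
    """Return (sql, params) for a hypothetical `video` table."""
    clauses, params = [], []
    if cond.site is not None:
        clauses.append("site = ?")
        params.append(cond.site)
    if cond.category is not None:
        clauses.append("category = ?")
        params.append(cond.category)
    if cond.keyword is not None:
        clauses.append("title LIKE ?")
        params.append(f"%{cond.keyword}%")
    if cond.min_plays is not None:
        clauses.append("play_count >= ?")
        params.append(cond.min_plays)
    sql = "SELECT title, play_url FROM video"
    if clauses:
        sql += " WHERE " + " AND ".join(clauses)
    sql += " ORDER BY play_count DESC"
    if cond.limit is not None:
        sql += " LIMIT ?"
        params.append(cond.limit)
    return sql, params


# Interface 1 (search by website) and interface 4 (advanced search).
by_site = build_query(SearchCondition(site="youku"))
advanced = build_query(
    SearchCondition(site="youku", category="sport", keyword="cup",
                    min_plays=1000, limit=10)
)
```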
The network video hot word analysis submodule comprises a video title extraction unit, a video title word segmentation unit, a title word clustering unit, a title vocabulary statistics unit and a hot vocabulary update unit, wherein: the video title extraction unit is connected to the video title word segmentation unit and transmits the set of video titles to be analysed; the video title word segmentation unit is connected to the title word clustering unit and transmits the words that make up the video titles; the title word clustering unit is connected to the title vocabulary statistics unit and transmits the clustered vocabulary space; the title vocabulary statistics unit is connected to the hot vocabulary update unit and transmits the set of words with a high occurrence rate; the hot vocabulary update unit is connected to the database operation submodule and transmits database commands that add data; and the video title extraction unit is connected to the database operation submodule and transmits the database statements that obtain video titles.
In this embodiment the title of each video is treated as a vector of words, and the words with the highest occurrence rate are then found. Hot words change over time, so this process must run at regular intervals to keep the hot word ranking up to date. Through the statistical analysis of video hot words, the system can monitor the network video hot topics of a given period. Through the database operation submodule, the network video hot word analysis submodule retrieves the titles of the videos with the highest play counts in the network video database and then analyses them as follows: (1) retrieve the videos with the highest click rate from the database (the number is configurable); (2) segment the titles of these videos so that each title becomes a word vector; (3) cluster the title word vectors; (4) within each cluster, count the related words with a high frequency of occurrence; (5) delete high-frequency stop words; (6) update the hot word list. Hot word analysis runs periodically, and after each run the network video hot word ranking in the system is updated.
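The counting part of this flow (steps 2, 4 and 5) can be sketched as below. Several simplifications are assumptions made for brevity: titles are split on whitespace, whereas Chinese titles would need a proper word segmenter; the clustering of step 3 is skipped, so all titles are treated as one cluster; and the stop word list is purely illustrative.

```python
from collections import Counter

STOP_WORDS = {"the", "a", "of", "in", "and"}  # illustrative list only


def segment(title):
    """Turn a title into a list of words (whitespace split as a stand-in)."""
    return title.lower().split()


def hot_words(titles, top_n=3, stop_words=STOP_WORDS):
    """Return the top_n (word, count) pairs over all titles."""
    counts = Counter()
    for title in titles:
        # Count each word once per title so that a single repetitive
        # title cannot dominate, and drop stop words before counting.
        counts.update(set(segment(title)) - stop_words)
    return counts.most_common(top_n)


titles = [
    "World Cup final highlights",
    "World Cup goals of the week",
    "Cup final penalty shootout",
]
top_word = hot_words(titles, top_n=1)
top_three = dict(hot_words(titles, top_n=3))
```

Running this periodically over the most-played titles and writing the result back through the database layer would correspond to step 6.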
The user interest model management submodule builds an interest model for each user from that user's search behaviour. When the user next logs in, the system recommends network videos likely to interest the user according to the user's interest model. The submodule comprises a user behaviour representation unit, a user interest model update unit and a user interest recommendation unit, wherein: the user behaviour representation unit is connected to the user interest model update unit and transmits information identifying the user's search behaviour; the user interest model update unit is connected to the database operation submodule and transmits the database statements that update the user's interests; and the user interest recommendation unit is connected to the database operation submodule and transmits the database statements that extract the videos to recommend to the user.
The user interest model management submodule works as follows: (1) the user opens the platform page; (2) if the user has a search history, the platform identifies the user by the token held by the client and recommends network videos according to that user's interest model in the database; (3) if the user has no search history, a new interest model is created for the user and a token is set in a cookie on the client; (4) every video search by the user modifies that user's interest model on the server; (5) user interest models that have not changed for a long time are deleted, to keep the size of the data in the database under control. The user interest model stores user interest information in the database in tree form. The model has two top-level classes, video website and video category, each with sub-classes beneath it, for example Youku under video website or sports under video category. Each sub-class stores a weight that reflects the user's search behaviour, and recommended videos can be selected according to these weights.
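The tree-shaped model with per-sub-class weights can be sketched with nested dictionaries. The text does not specify how weights are updated or how recommendations are scored, so the unit increment per search and the match-count scoring used here are assumptions, as are the `InterestModel` class and the video dictionary fields.

```python
from collections import defaultdict


class InterestModel:
    """Two top-level branches, each mapping a sub-class to a weight."""

    def __init__(self):
        self.tree = {
            "site": defaultdict(float),
            "category": defaultdict(float),
        }

    def record_search(self, site=None, category=None, weight=1.0):
        # Step 4 of the workflow: every search adjusts the model.
        if site:
            self.tree["site"][site] += weight
        if category:
            self.tree["category"][category] += weight

    def top(self, branch):
        weights = self.tree[branch]
        return max(weights, key=weights.get) if weights else None


def recommend(model, videos, limit=5):
    """Rank videos by how many of the user's top interests they match,
    breaking ties by play count."""
    site, category = model.top("site"), model.top("category")

    def score(video):
        return (video["site"] == site) + (video["category"] == category)

    ranked = sorted(videos, key=lambda v: (score(v), v["plays"]), reverse=True)
    return ranked[:limit]


model = InterestModel()
model.record_search(site="youku", category="sport")
model.record_search(site="youku", category="news")
model.record_search(site="tudou", category="sport")

videos = [
    {"title": "Evening news", "site": "tudou", "category": "news", "plays": 900},
    {"title": "Cup final", "site": "youku", "category": "sport", "plays": 500},
    {"title": "Cooking show", "site": "youku", "category": "food", "plays": 700},
]
picks = [v["title"] for v in recommend(model, videos, limit=2)]
```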
The remote address of submodule detection network video file is surveyed in described Internet video address, thereby provide the function of user's download Internet video, comprise: the packet acquiring unit, packet content analytic unit and file address extraction unit, wherein: the packet acquiring unit links to each other with the packet content analytic unit and transmits the HTTP packet that grasps, the packet content analytic unit links to each other with the file address extraction unit and transmits the HTTP packet that contains the file address, and the file address extraction unit links to each other with the database manipulation submodule and transmits the database update action statement of upgrading video information.
The network video address detection submodule works as follows: it obtains the playback page address of a network video and opens that playback page, while a WinPcap-based packet capture program captures the HTTP request packets sent from the local machine to the video server; analyzing the request content in the packet header yields the remote address of the network video file. The module obtains the playback address of the network video through the database operation submodule and, after processing, stores the file address in the corresponding table, again through the database operation submodule.
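The header-analysis step described above can be illustrated with a small Python sketch that takes the raw bytes of one captured HTTP request and rebuilds the requested file address from the request line and the `Host` header. The capture itself (WinPcap in this embodiment) is not shown. The function name `extract_video_url` and the list of file extensions are assumptions made for illustration; the source does not say how video requests are recognized.

```python
def extract_video_url(raw_request: bytes, extensions=(".flv", ".mp4", ".f4v")):
    """Return the absolute URL of a captured HTTP GET for a video file, or None.

    `extensions` is an assumed heuristic for recognizing video files.
    """
    # Keep only the header section and split it into lines.
    head = raw_request.split(b"\r\n\r\n", 1)[0].decode("iso-8859-1")
    lines = head.split("\r\n")

    # Request line: METHOD SP request-target SP HTTP-version
    parts = lines[0].split(" ")
    if len(parts) != 3 or parts[0] != "GET":
        return None
    target = parts[1]

    # Collect header fields; names are case-insensitive.
    headers = {}
    for line in lines[1:]:
        name, sep, value = line.partition(":")
        if sep:
            headers[name.strip().lower()] = value.strip()

    # Ignore requests whose path does not look like a video file.
    path = target.split("?", 1)[0].lower()
    if not path.endswith(extensions):
        return None

    # Absolute-form targets already contain the host.
    if target.startswith("http://"):
        return target
    host = headers.get("host")
    return f"http://{host}{target}" if host else None
```

The returned address would then be written to the video table through the database operation submodule.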
The system configuration management submodule comprises a network video crawler configuration unit, a network video address detection configuration unit, a user interest model management configuration unit, a network video hot word analysis configuration unit, and a stale data deletion unit, wherein: the network video crawler configuration unit is connected to the data storage module and transmits crawler operating configuration changes; the user interest model management configuration unit is connected to the user interest model management submodule and transmits module operating configuration information; the network video hot word analysis configuration unit is connected to the network video hot word analysis submodule and transmits module operating configuration information; the network video address detection configuration unit is connected to the network video address detection submodule and transmits module operating configuration information; and the stale data deletion unit is connected to the database operation submodule and transmits database operation statements that delete video information.
In this embodiment, the configuration parameters of each module are recorded in an XML file. For the multi-level search interface submodule, the websites and categories for which search interfaces are provided can be configured. For the network video address detection submodule, whether it runs or stops, and which network videos are probed first, can be configured. For the network video hot word analysis submodule, whether it runs or stops, and its run interval, can be configured. For the user interest model management submodule, whether it is enabled, and the deletion time, can be configured. For the network video crawler submodule, which crawlers start, which crawlers stop, and the crawl start time and crawl interval can be configured. The system configuration management submodule writes changes to the file according to the system administrator's requirements. Each submodule runs in the background of the operating system as a service; once the configuration changes, the system sends a message to the corresponding service through service management, and the service restarts with the new configuration parameters.
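As a rough illustration of this XML-based configuration, the Python sketch below parses a configuration file and works out which services would need a restart message after a change. The element and attribute names (`crawler`, `hot_word_analysis`, `interval_hours`, and so on) and the site name `SiteB` are hypothetical; the source does not give the actual file schema.

```python
import xml.etree.ElementTree as ET

# Hypothetical configuration file; the real schema is not given in the source.
SAMPLE_CONFIG = """
<config>
  <crawler site="Yoqoo" enabled="true" start="02:00" interval_hours="24"/>
  <crawler site="SiteB" enabled="false" start="03:00" interval_hours="24"/>
  <hot_word_analysis enabled="true" interval_hours="12"/>
  <interest_model enabled="true" delete_after_days="90"/>
  <address_detection enabled="false"/>
</config>
"""


def as_bool(text):
    return text.strip().lower() == "true"


def load_config(xml_text):
    """Parse the XML text into a dict keyed by section (one per service)."""
    root = ET.fromstring(xml_text.strip())
    config = {"crawler": {}}
    for node in root.findall("crawler"):
        config["crawler"][node.get("site")] = {
            "enabled": as_bool(node.get("enabled")),
            "start": node.get("start"),
            "interval_hours": int(node.get("interval_hours")),
        }
    hot = root.find("hot_word_analysis")
    config["hot_word_analysis"] = {
        "enabled": as_bool(hot.get("enabled")),
        "interval_hours": int(hot.get("interval_hours")),
    }
    interest = root.find("interest_model")
    config["interest_model"] = {
        "enabled": as_bool(interest.get("enabled")),
        "delete_after_days": int(interest.get("delete_after_days")),
    }
    config["address_detection"] = {
        "enabled": as_bool(root.find("address_detection").get("enabled")),
    }
    return config


def changed_sections(old, new):
    """Names of the sections whose settings differ, i.e. services to restart."""
    return sorted(name for name in new if old.get(name) != new[name])
```

Comparing the old and new configuration section by section keeps the restart message targeted at the affected service only, which matches the per-service restart behavior described above.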
In this embodiment, when the system has an interest model for the current user, it retrieves video data from the database submodule according to that model and passes the data to the user interface module, which displays it on a web page, so that the user can see which topics are currently popular among network videos. If the system has no interest model for the user, it retrieves the most popular network video data from the database submodule and passes it to the user interface module for display on a web page.
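The two branches of this recommendation logic can be sketched as a single Python function. The record fields (`site`, `category`, `plays`), the use of play count as the popularity measure, and the scoring rule (sum of the matching weights) are assumptions for illustration only; the source states only that videos are chosen from the interest model when one exists and from the most popular videos otherwise.

```python
def recommend(videos, interest_weights=None, limit=3):
    """Pick videos by interest weights if available, else by popularity.

    videos: list of dicts with "title", "site", "category", "plays" (assumed fields).
    interest_weights: mapping from a site or category name to a weight, or None.
    """
    if interest_weights:
        scored = []
        for video in videos:
            score = (interest_weights.get(video["site"], 0.0)
                     + interest_weights.get(video["category"], 0.0))
            if score > 0:
                scored.append((score, video))
        # Highest interest score first; popularity breaks ties.
        scored.sort(key=lambda pair: (-pair[0], -pair[1]["plays"]))
        return [video for _, video in scored[:limit]]
    # No interest model for this user: fall back to the most popular videos.
    return sorted(videos, key=lambda v: -v["plays"])[:limit]
```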
The system of this embodiment was applied to five major Chinese video websites: (1) Yoqoo (Youku); (2) Tudou; (3) Ku6; (4) 6Rooms; (5) 56 Video. The numbers of videos crawled were 5625 from Yoqoo, 3403 from Tudou, 1495 from Ku6, 2355 from 6Rooms, and 3320 from 56 Video. Crawling focused mainly on four classes of video: (1) information; (2) sports; (3) animation; (4) entertainment. The numbers of videos were 3309 for information, 3160 for sports, 2696 for animation, and 3173 for entertainment. The number of videos found and the time consumed under different search interfaces and search parameters are shown in Table 1. Because this embodiment builds the relevant indexes and views, searching for videos by website and by category takes very little time, and qualifying videos can be searched for and downloaded accurately.
Table 1

| Search interface | Search parameter | Videos found | Time consumed (s) |
|---|---|---|---|
| Source website | Youku | 400 | 0.09 |
| Source website | Tudou | 400 | 0.06 |
| Source website | 6Rooms | 400 | 0.07 |
| Source website | Ku6 | 400 | 0.10 |
| Source website | 56 Video | 400 | 0.07 |
| Video category | All | 500 | 0.13 |
| Video category | Information | 500 | 0.17 |
| Video category | Sports | 500 | 0.11 |
| Video category | Entertainment | 500 | 0.18 |
| Video category | Animation | 500 | 0.15 |
| Keyword search | "NBA" | 153 | 0.08 |
| Advanced search | Youku + sports + NBA + 200 play count + 200 recycle times | 22 | 0.09 |
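The search interfaces measured in Table 1 (by source website, by category, by keyword, and combined) can be pictured with a small SQLite sketch. The table layout, column names, index names, and the `search` helper are assumptions introduced here; the source says only that indexes and views were built, not which database or schema was used.

```python
import sqlite3


def build_db(rows):
    """Create an in-memory video table with indexes on site and category."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE video (id INTEGER PRIMARY KEY, title TEXT, "
        "site TEXT, category TEXT, plays INTEGER)"
    )
    # Indexes keep the per-website and per-category lookups cheap.
    conn.execute("CREATE INDEX idx_video_site ON video(site)")
    conn.execute("CREATE INDEX idx_video_category ON video(category)")
    conn.executemany(
        "INSERT INTO video (title, site, category, plays) VALUES (?, ?, ?, ?)",
        rows,
    )
    return conn


def search(conn, site=None, category=None, keyword=None, min_plays=None, limit=500):
    """Combine whichever search conditions were supplied into one query."""
    clauses, params = [], []
    if site is not None:
        clauses.append("site = ?")
        params.append(site)
    if category is not None:
        clauses.append("category = ?")
        params.append(category)
    if keyword is not None:
        clauses.append("title LIKE ?")
        params.append(f"%{keyword}%")
    if min_plays is not None:
        clauses.append("plays >= ?")
        params.append(min_plays)
    where = (" WHERE " + " AND ".join(clauses)) if clauses else ""
    sql = f"SELECT title FROM video{where} ORDER BY id LIMIT ?"
    params.append(limit)
    return [row[0] for row in conn.execute(sql, params)]
```

Passing only `site` or only `category` corresponds to the first two interfaces in the table, `keyword` to the keyword search, and several arguments together to the advanced search.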

Claims (8)

1. A network video search system, characterized by comprising a data storage module, a service processing module, and a user interface module, wherein: the data storage module is connected to the service processing module and transmits data information and processing information; and the service processing module is connected to the user interface module and transmits search request information and search result information;
the data storage module acquires and stores network video data and comprises a network video crawler submodule and a database submodule, wherein: the network video crawler submodule is connected to the database submodule and transmits network video data information; the network video crawler submodule is connected to the service processing module and transmits operating configuration information; and the database submodule is connected to the service processing module and transmits database operation request information and database operation return data;
the service processing module comprises a search interface submodule, a database operation submodule, a network video hot word analysis submodule, a user interest model management submodule, a network video address detection submodule, and a system configuration management submodule, wherein: the search interface submodule is connected to the user interface module and transmits search request information and search result information; the database operation submodule is connected to the search interface submodule and transmits search condition information and search return information; the database operation submodule is connected to the network video hot word analysis submodule and transmits database operation messages and hot word analysis result information; the database operation submodule is connected to the user interest model management submodule and transmits database operation messages and user interest model update information; the database operation submodule is connected to the network video address detection submodule and transmits database operation messages and network video address information; the system configuration management submodule is connected to the user interest model management submodule and transmits operating configuration information; the system configuration management submodule is connected to the network video hot word analysis submodule and transmits operating configuration information; the system configuration management submodule is connected to the network video address detection submodule and transmits operating configuration information; the system configuration management submodule is connected to the database operation submodule and transmits operating configuration information; the system configuration management submodule is connected to the network video crawler submodule and transmits operating configuration information; and the database operation submodule is connected to the data storage module and transmits database storage information.
2. The network video search system according to claim 1, characterized in that the network video crawler submodule comprises a page download unit, a page content analysis unit, and a video information extraction unit, wherein: the page download unit is connected to the page content analysis unit and transmits the page data of the video websites to be crawled; the page content analysis unit is connected to the video information extraction unit and transmits the relevant video information obtained by analyzing the pages; and the video information extraction unit is connected to the database submodule and transmits the precise video information extracted from the pages.
3. The network video search system according to claim 1, characterized in that the database operation submodule comprises a database data retrieval unit, a database data addition unit, a database data deletion unit, a database data update unit, and a database view update unit, wherein: the database data retrieval unit is connected to the data storage module and transmits database retrieval statements and returned result data; the database data addition unit is connected to the data storage module and transmits database add-data commands; the database data deletion unit is connected to the data storage module and transmits database delete-data commands; the database data update unit is connected to the data storage module and transmits database update-data commands; and the database view update unit is connected to the data storage module and transmits database update-view commands.
4. The network video search system according to claim 1, characterized in that the search interface submodule comprises a user search condition receiving unit, a user search result return unit, a search condition processing unit, and a search execution unit, wherein: the user search condition receiving unit is connected to the search condition processing unit and transmits the user's original search condition information; the search condition processing unit is connected to the search execution unit and transmits the search instruction information produced by system processing; the search execution unit is connected to the database operation submodule and transmits database operation messages; and the user search result return unit is connected to the database operation submodule and transmits database operation return data.
5. The network video search system according to claim 1, characterized in that the network video hot word analysis submodule comprises a video title extraction unit, a video title word segmentation unit, a title vocabulary clustering unit, a title vocabulary statistics unit, and a hot vocabulary update unit, wherein: the video title extraction unit is connected to the video title word segmentation unit and transmits the set of video titles to be analyzed; the video title word segmentation unit is connected to the title vocabulary clustering unit and transmits the words that make up the video titles; the title vocabulary clustering unit is connected to the title vocabulary statistics unit and transmits the clustered vocabulary space; the title vocabulary statistics unit is connected to the hot vocabulary update unit and transmits the set of words with a high occurrence rate; the hot vocabulary update unit is connected to the database operation submodule and transmits database operation commands that add data; and the video title extraction unit is connected to the database operation submodule and transmits database operation statements that obtain video titles.
6. The network video search system according to claim 1, characterized in that the user interest model management submodule comprises a user behavior representation unit, a user interest model update unit, and a user interest recommendation unit, wherein: the user behavior representation unit is connected to the user interest model update unit and transmits user search behavior identification information; the user interest model update unit is connected to the database operation submodule and transmits database operation statements that update the user interest; and the user interest recommendation unit is connected to the database operation submodule and transmits database operation statements that extract recommended video information for the user.
7. The network video search system according to claim 1, characterized in that the network video address detection submodule comprises a packet acquisition unit, a packet content analysis unit, and a file address extraction unit, wherein: the packet acquisition unit is connected to the packet content analysis unit and transmits the captured HTTP packets; the packet content analysis unit is connected to the file address extraction unit and transmits the HTTP packets that contain a file address; and the file address extraction unit is connected to the database operation submodule and transmits database update statements that update the video information.
8. The network video search system according to claim 1, characterized in that the system configuration management submodule comprises a network video crawler configuration unit, a network video address detection configuration unit, a user interest model management configuration unit, a network video hot word analysis configuration unit, and a stale data deletion unit, wherein: the network video crawler configuration unit is connected to the data storage module and transmits crawler operating configuration changes; the user interest model management configuration unit is connected to the user interest model management submodule and transmits module operating configuration information; the network video hot word analysis configuration unit is connected to the network video hot word analysis submodule and transmits module operating configuration information; the network video address detection configuration unit is connected to the network video address detection submodule and transmits module operating configuration information; and the stale data deletion unit is connected to the database operation submodule and transmits database operation statements that delete video information.
CN 201010186145 2010-05-28 2010-05-28 Network video searching system Pending CN101833587A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN 201010186145 CN101833587A (en) 2010-05-28 2010-05-28 Network video searching system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN 201010186145 CN101833587A (en) 2010-05-28 2010-05-28 Network video searching system

Publications (1)

Publication Number Publication Date
CN101833587A true CN101833587A (en) 2010-09-15

Family

ID=42717656

Family Applications (1)

Application Number Title Priority Date Filing Date
CN 201010186145 Pending CN101833587A (en) 2010-05-28 2010-05-28 Network video searching system

Country Status (1)

Country Link
CN (1) CN101833587A (en)

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101021852A (en) * 2006-10-10 2007-08-22 鲍东山 Video search dispatching system based on content

Non-Patent Citations (4)

* Cited by examiner, † Cited by third party
Title
《中国优秀硕士学位论文全文数据库》 20080531 任严 基于网络视频业务的搜索引擎的设计与实现 第15页,33-43页 1-8 , 2 *
《中国科技信息》 20070630 任严等 基于网络视频的搜索引擎的设计与实现 第120-121页 1-8 , 第11期 2 *
《信息技术》 20060731 刘春祥等 基于MVC模式的网络视频检索系统设计与实现 第7-10,第37页 1-8 , 第7期 2 *
《计算机工程与应用》 20050331 费洪晓等 基于词频统计的中文分词的研究 第67-68,100页 1-8 , 第7期 2 *

Cited By (25)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102625158A (en) * 2011-08-10 2012-08-01 苏州闻道网络科技有限公司 Video management system
CN103179441A (en) * 2011-12-21 2013-06-26 腾讯科技(深圳)有限公司 Method and server for playing contents
US9400831B2 (en) 2011-12-27 2016-07-26 Alibaba Group Holding Limited Providing information recommendations based on determined user groups
CN103186539A (en) * 2011-12-27 2013-07-03 阿里巴巴集团控股有限公司 Method and system for confirming user groups, inquiring information and recommending
CN103186539B (en) * 2011-12-27 2016-07-27 阿里巴巴集团控股有限公司 A kind of method and system determining user group, information inquiry and recommendation
CN102630049A (en) * 2011-12-31 2012-08-08 上海聚力传媒技术有限公司 Method for determining interest degree of user about playing video and equipment thereof
CN102630049B (en) * 2011-12-31 2014-12-10 上海聚力传媒技术有限公司 Method for determining interest degree of user about playing video and equipment thereof
CN102760058A (en) * 2012-04-05 2012-10-31 中国人民解放军国防科学技术大学 Massive software project sharing method oriented to large-scale collaborative development
CN102760058B (en) * 2012-04-05 2015-03-11 中国人民解放军国防科学技术大学 Massive software project sharing method oriented to large-scale collaborative development
CN103020212A (en) * 2012-12-07 2013-04-03 合一网络技术(北京)有限公司 Method and device for finding hot videos based on user query logs in real time
CN106909638A (en) * 2012-12-07 2017-06-30 合网络技术(北京)有限公司 A kind of method and apparatus for finding hot video in real time based on user's inquiry log
CN103020212B (en) * 2012-12-07 2017-05-10 合一网络技术(北京)有限公司 Method and device for finding hot videos based on user query logs in real time
CN103501470A (en) * 2013-10-17 2014-01-08 珠海迈科电子科技有限公司 Network data screening method and device
CN103605773A (en) * 2013-11-27 2014-02-26 乐视网信息技术(北京)股份有限公司 Multimedia file searching method and device
CN103699661A (en) * 2013-12-26 2014-04-02 乐视网信息技术(北京)股份有限公司 Method and system for acquiring data of video resources
CN104980770A (en) * 2014-04-09 2015-10-14 杭州迪普科技有限公司 Method and device for downloading video data contents
CN105025369A (en) * 2015-06-30 2015-11-04 北京奇艺世纪科技有限公司 Method and device for determining recommended resources in channel combination
CN105025369B (en) * 2015-06-30 2018-07-17 北京奇艺世纪科技有限公司 Recommend the method and device of resource in a kind of determining combiner channel
CN105893559A (en) * 2016-03-31 2016-08-24 北京奇艺世纪科技有限公司 Data pushing method and device
CN106453348A (en) * 2016-10-31 2017-02-22 南京邮电大学 Login authentication method based on user interest in social network
CN106453348B (en) * 2016-10-31 2019-11-15 南京邮电大学 Based on the login authentication method of user interest in social networks
CN108399223A (en) * 2018-02-12 2018-08-14 北京奇艺世纪科技有限公司 A kind of data capture method, device and electronic equipment
CN109951739A (en) * 2019-03-27 2019-06-28 北京市博汇科技股份有限公司 Video traffic processing method, device and electronic equipment
CN109951739B (en) * 2019-03-27 2021-06-08 北京市博汇科技股份有限公司 Video service processing method and device and electronic equipment
CN113297450A (en) * 2021-05-24 2021-08-24 华北科技学院(中国煤矿安全技术培训中心) Crawler method, system, medium and electronic device based on fuzzy comprehensive evaluation method

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C02 Deemed withdrawal of patent application after publication (patent law 2001)
WD01 Invention patent application deemed withdrawn after publication

Open date: 20100915