CN101517574B - Illegal contents auto-searching system and method using access/search application on internet - Google Patents

Illegal contents auto-searching system and method using access/search application on internet

Info

Publication number
CN101517574B
Authority
CN
China
Prior art keywords
search
illegal contents
access
script
download
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CN2007800356457A
Other languages
Chinese (zh)
Other versions
CN101517574A (en)
Inventor
郑彗源
李骏硕
徐泳浩
俞元英
徐庸硕
李相光
李善和
金元谦
吴元根
李诚晥
李承宰
尹英锡
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Electronics and Telecommunications Research Institute ETRI
Original Assignee
Electronics and Telecommunications Research Institute ETRI
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Electronics and Telecommunications Research Institute ETRI
Publication of CN101517574A
Application granted
Publication of CN101517574B
Expired - Fee Related
Anticipated expiration

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90 Details of database functions independent of the retrieved data types
    • G06F16/95 Retrieval from the web
    • G06F16/951 Indexing; Web crawling techniques
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F17/00 Digital computing or data processing equipment or methods, specially adapted for specific functions

Abstract

The invention provides a system and method for automatically searching for illegal contents on the Internet using an access/search application. The system includes: a keyword input unit for receiving keywords related to contents from a search client and managing the keywords by keyword group; a script file editing unit for extracting, from configuration information of the access/search application, a window class identification (ID) with which the access/search application can be controlled, and editing a script file for searching/downloading illegal contents based on the extracted window class ID and access information; a script file automating unit for controlling the access/search application, according to the edited script file for searching/downloading illegal contents, to automatically search for and download illegal contents related to the keyword group on the Internet; and an illegal contents information storing unit for storing information on the illegal contents searched for and downloaded using the access/search application.

Description

System and method for automatically searching for illegal contents on the Internet using an access/search application
Technical field
The present invention relates to a system and method for automatically searching for illegal contents on the Internet using an access/search application. More particularly, the present invention relates to an illegal contents auto-searching system, and a method thereof, that edits a script file for searching/downloading illegal contents based on configuration information of an access/search application for Internet sites such as peer-to-peer (P2P) services, network hard disks, and web pages, and controls the access/search application based on the script file, thereby automatically searching for illegal contents in P2P services and network hard disks.
Background art
In the current digital rights management (DRM) environment, even when content is delivered to a user legally, the user can illegally copy image, audio, and video content with capture tools or by hacking, without degrading the sound or picture quality. When a user distributes illegally copied content on the Internet through a peer-to-peer (P2P) service or a network hard disk service without authorization, other users can use the illegal content without purchasing it. The distribution of illegal digital content has seriously affected the digital content industry. Thus, technology for preventing illegal copying is needed.
Although illegal activity must be tracked and punished to prevent the illegal copying of digital content, with conventional technology it is very difficult to search for and download illegally copied content and to track the distributor of illegal content. One conventional method collects content online from web pages composed of simple Hypertext Markup Language (HTML) and from services that use public protocols, such as e-donkey and bitTorrent, but the functionality of this method is limited. In addition, there is a problem that some of the information cannot be obtained.
Because a P2P or network hard disk service provides a private communication protocol and interface in a premium common service environment used by general users, there is a problem that it is difficult to search for content automatically in the P2P or network hard disk service.
Conventional technologies for preventing and tracking the illegal distribution of digital content include an image tracking and searching technology that, based on a web page searcher, a content-based feature point extractor, and a content-based search engine, searches and classifies the feature points and metadata of images published on the Internet, thereby finding copyright infringement on the Internet.
However, this conventional technology is limited to web page searching, and the target content is limited to images. In addition, searching for illegally copied content across the different public P2P services and network hard disks of each service provider is not performed automatically but manually.
Another conventional technology uses a client agent to prevent and track illegal copying and unauthorized distribution.
This technology can establish a system that tracks illegal works on the Internet or a P2P sharing network by means of an illegal work monitoring server, which monitors illegal works on the Internet or the P2P sharing network, and an illegal work tracking program distributed from the illegal work monitoring server, thereby effectively controlling and monitoring illegal works distributed on the Internet. In addition, the illegal work monitoring system can be maintained effectively by paying an illegal work tracking fee, such as mileage points or an appropriate reward, to clients who monitor illegal works.
However, this conventional technology can search only simple web pages and public P2P services. Because each application has a different interface and a different transport protocol, there is a problem that searching in a closed P2P service or network hard disk provided as a premium common service, and tracking distributor information, must be performed manually.
Summary of the invention
Technical problem
Therefore, an object of the present invention is to provide a system and method for automatically searching for illegal contents on the Internet using an access/search application. The system edits a script file for searching/downloading illegal contents according to configuration information of the access/search application for Internet sites such as peer-to-peer (P2P) services, network hard disks, and web pages, and controls the access/search application based on the script file, thereby automatically searching for illegal contents in P2P services and network hard disks.
Other objects and advantages of the present invention will be understood from the following description and will become more apparent from the embodiments of the invention set forth below. It will also be apparent that the objects and advantages of the present invention can easily be realized by the means defined in the claims and combinations thereof.
Technical solution
According to one aspect of the present invention, there is provided an illegal contents auto-searching system using an access/search application on the Internet. The system comprises: a keyword input unit that receives keywords related to content from a search client and manages the keywords by keyword group; a script file editing unit that extracts, from configuration information of the access/search application, a window class identification (ID) with which the access/search application can be controlled, and edits a script file for searching/downloading illegal contents based on the extracted window class ID and access information; a script file automation unit that controls the access/search application to automatically search for and download, on the Internet, illegal contents related to the keyword group according to the edited script file for searching/downloading illegal contents; and an illegal contents information storage unit that stores information on the illegal contents searched for and downloaded using the access/search application.
According to another aspect of the present invention, there is provided an illegal contents auto-searching method using an access/search application on the Internet, the method comprising the steps of: a) extracting, from configuration information of the access/search application, a window class identification (ID) for controlling the access/search application; b) editing a script file for searching/downloading illegal contents based on the extracted window class ID and access information used to access Internet sites through the access/search application; c) receiving keywords related to the content from a search client and managing the keywords by group; and d) controlling the access/search application to automatically search for and download, on the Internet, illegal contents related to the keyword group according to the edited script file for searching/downloading illegal contents.
An object of the present invention is to automatically search for illegal contents in closed P2P services and network hard disks on the Internet.
Illegal contents information is searched for automatically by editing a script file based on configuration information of an access/search application for the Internet sites to be searched, such as P2P services, network hard disks, and web pages, and by controlling the access/search application according to the edited script file to search for and download illegal contents related to the keyword group.
When a search request client, such as a distribution server, an individual who owns a copyright and content, or a service provider, requests tracking of illegally copied content, the present invention runs a content service program on the Internet, such as a closed P2P or network hard disk access program, performs searches automatically according to keyword groups, and downloads information related to illegally copied content including image, audio, and video. In addition, the present invention includes a distributor information tracking module for collecting distributor information.
Advantageous effects
In conventional technology, only simple searches are performed on simple web pages or public P2P services, and content searches and distributor identification in paid P2P services or network hard disks are performed manually.
In contrast, the present invention searches for illegal contents based on a script that can control a closed access/search application, and extracts the distributor information of the contents found, so that illegal contents and distributor information can be searched for automatically.
In addition, when the access/search application is a web-page-type network hard disk, for example, when content is stored on a bulletin board of a web page, the present invention calls the access/search application as a script, analyzes the page source with a web page analysis block, and obtains link information from the analysis result. Thus, even when position information changes, the present invention can search easily without correcting the script.
The present invention extracts distributor information from inside window messages by hooking the window messages transmitted within the access/search application. Thus, when distributor information cannot otherwise be extracted, the present invention can forcibly extract the distributor information and control the access/search application.
By automatically searching for content in online services and determining whether the content is illegal, the present invention can reduce the labor and budget required to prevent illegal copying and can encourage the prevention of illegal content copying.
Brief description of the drawings
The above and other objects and features of the present invention will become apparent from the following description of preferred embodiments given in conjunction with the accompanying drawings, in which:
Fig. 1 shows a system for automatically searching for illegal contents on the Internet using an access/search application according to an embodiment of the present invention;
Fig. 2 is a block diagram illustrating the script file editing block of Fig. 1 according to an embodiment of the present invention;
Fig. 3 is a block diagram illustrating the script file automation block of Fig. 1 according to an embodiment of the present invention; and
Fig. 4 is a flowchart illustrating a method of searching for keywords by group, within the method for automatically searching for illegal contents on the Internet based on an access/search application, according to an embodiment of the present invention.
Detailed description of the embodiments
Other objects and advantages of the present invention will become apparent from the following description of the embodiments with reference to the accompanying drawings, so that those skilled in the art to which the invention pertains can easily implement the technical concept and scope of the invention. In addition, where a detailed description of related art could obscure the gist of the invention, that description is omitted. Hereinafter, preferred embodiments of the present invention are described in detail with reference to the accompanying drawings.
Fig. 1 shows a system for automatically searching for illegal contents on the Internet using an access/search application according to an embodiment of the present invention.
The illegal contents auto-searching system 10 using an access/search application on the Internet includes an information management block 12, a keyword input block 13, a script file editing block 14, a script file automation block 15, and a content storage block 16.
The access/search application 11 performs access, search, and download functions for the user so that content can be stored and shared on public peer-to-peer (P2P) sites, general web pages, and closed P2P/network hard disk sites.
The information management block 12 manages the configuration information of the access/search application and manages the access information used to access Internet sites through the access/search application 11.
The access/search application information management unit 121 manages the configuration information of the access/search application. That is, the access/search application information management unit 121 can input, correct, and delete the configuration information of the access/search application 11, and records and manages the kind and details of the access/search service.
The access information management unit 122 manages the access information used to access Internet sites through the access/search application 11. The access information includes the accessor ID, the password, and the program version, installation location, and executable file of the access/search application used as the client program.
The keyword input block 13 receives keywords corresponding to content from a content search request client, such as an individual who owns a copyright and content or a service provider, and manages the input keywords by group. The keyword input block 13 adds keywords to, or deletes keywords from, the keyword groups used for searching content.
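The group-based keyword management described above can be pictured with a small sketch. The class name `KeywordGroups` and its methods are hypothetical and are not taken from the patent; the sketch only assumes that keywords are kept in named groups, in insertion order, and that keywords can be added and deleted.

```python
from collections import OrderedDict


class KeywordGroups:
    """Hypothetical container that keeps search keywords organized by group."""

    def __init__(self):
        # Group name -> list of keywords, in the order they were added.
        self._groups = OrderedDict()

    def add(self, group, keyword):
        """Add a keyword to a group, creating the group if needed."""
        keywords = self._groups.setdefault(group, [])
        if keyword not in keywords:  # ignore duplicates within a group
            keywords.append(keyword)

    def delete(self, group, keyword):
        """Delete a keyword from a group if it is present."""
        keywords = self._groups.get(group, [])
        if keyword in keywords:
            keywords.remove(keyword)

    def groups(self):
        """Return the group names in insertion order."""
        return list(self._groups)

    def keywords(self, group):
        """Return a copy of the keywords in one group."""
        return list(self._groups.get(group, []))
```

Keeping insertion order is a design choice of the sketch: it makes the later group-by-group search deterministic.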
Fig. 2 is a block diagram illustrating the script file editing block of Fig. 1.
The script file editing block 14 includes a window class ID extraction unit 21, a window command input unit 22, a script command input unit 23, and a script editing unit 24.
The script file editing block 14 extracts the window class ID for controlling the access/search application from the configuration information of the access/search application, and edits the script file for searching/downloading illegal contents based on the extracted window class ID and the access information managed in the information management block 12.
The window class ID extraction unit 21 extracts the window class ID for controlling the access/search application 11 from the configuration information of the access/search application managed in the information management block 12, so that a window message can be sent directly to the window class and a command can be delivered directly to the window class corresponding to the extracted window class ID.
The window command input unit 22 receives a window command, which is used to send a command directly to the window class based on the window class ID extracted by the window class ID extraction unit 21.
The script command input unit 23 receives a script command that automatically performs window input.
The script editing unit 24 edits the script file for searching/downloading illegal contents based on the window class ID extracted by the window class ID extraction unit 21, the window command transferred from the window command input unit 22, the script command transferred from the script command input unit 23, and the access information.
A script file is divided into macros, window commands, and program commands. A macro is a script for automatically performing window input such as keyboard input or mouse clicks. A window command is a script for sending a command directly to a window class. A program command is a script for copying files, running an external program, and forcibly terminating an external program.
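The three script categories can be modeled as a simple tagged record. In the sketch below, the command names come from Table 1 of this description, but the assignment of each command to a category is an assumption made for illustration; the patent does not state the mapping. The names `Kind`, `KIND_BY_COMMAND`, and `ScriptLine` are hypothetical.

```python
from dataclasses import dataclass
from enum import Enum


class Kind(Enum):
    MACRO = "macro"      # simulated keyboard or mouse input
    WINDOW = "window"    # command sent directly to a window class
    PROGRAM = "program"  # file operations and external programs


# Assumed mapping from Table 1 command names to the three categories.
KIND_BY_COMMAND = {
    "MOUSELCLICK": Kind.MACRO,
    "MOUSELDCLICK": Kind.MACRO,
    "KEYPRESS": Kind.MACRO,
    "SENLETTERS": Kind.MACRO,
    "LVSELECTITEM": Kind.WINDOW,
    "BTNCLICK": Kind.WINDOW,
    "SETTEXT": Kind.WINDOW,
    "EXECUTEEXE": Kind.PROGRAM,
    "DELETEFILE": Kind.PROGRAM,
    "MOVEFILE": Kind.PROGRAM,
}


@dataclass(frozen=True)
class ScriptLine:
    """One line of a script file: a command name plus its arguments."""
    command: str
    args: tuple = ()

    @property
    def kind(self):
        # Raises KeyError for commands outside the assumed mapping.
        return KIND_BY_COMMAND[self.command]
```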
Fig. 3 is a block diagram illustrating the script file automation block of Fig. 1 according to an embodiment of the present invention.
The script file automation block 15 includes a script file analysis unit 31, a script file operation unit 32, an automatic keyword input unit 33, a web page analysis unit 34, a file download control unit 35, a distributor information extraction unit 36, and an add-on unit 37.
The script file automation block 15 controls the access/search application, according to the edited script file for searching/downloading illegal contents, to automatically search for and download illegal contents related to the keyword group on the Internet.
The script file analysis unit 31 analyzes the script file for searching/downloading illegal contents edited in the script file editing block 14 and converts it into a script operation file for searching/downloading illegal contents. That is, the script file analysis unit 31 analyzes the script file stored in American Standard Code for Information Interchange (ASCII) format, converts the analyzed script file into a script operation file, and stores the script operation file in memory. The script operation file is in the form of bytecode to be run by the script file operation unit 32.
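The conversion from an ASCII script to an in-memory bytecode form might look like the following minimal sketch. The script syntax (one command per line, whitespace-separated arguments, quoted strings, `#` comments) and the numeric opcode values are assumptions for illustration; the patent does not specify them. Only the command names are taken from Table 1.

```python
import shlex

# Assumed numeric opcodes, assigned in the order the commands appear in Table 1.
OPCODES = {name: code for code, name in enumerate([
    "START", "HALT", "GOTO", "LOOP", "LABEL", "FALSEBRANCH",
    "MOUSELCLICK", "MOUSELDCLICK", "KEYPRESS", "SENLETTERS",
    "LVSELECTITEM", "BTNCLICK", "SETTEXT",
    "EXECUTEEXE", "DELETEFILE", "MOVEFILE",
])}


def compile_script(text):
    """Translate ASCII script text into a list of (opcode, args) tuples."""
    program = []
    for lineno, raw in enumerate(text.splitlines(), start=1):
        # shlex handles quoted arguments and strips '#' comments.
        tokens = shlex.split(raw, comments=True)
        if not tokens:
            continue  # blank line or comment-only line
        name, *args = tokens
        if name not in OPCODES:
            raise ValueError(f"line {lineno}: unknown command {name!r}")
        program.append((OPCODES[name], tuple(args)))
    return program
```

Rejecting unknown commands at this stage means that a malformed script fails before the access/search application is touched.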
The script file operation unit 32 runs the script operation file for searching/downloading illegal contents converted by the script file analysis unit 31, and controls the access/search application 11 to automatically search for and download, on the Internet, illegal contents related to the keywords input group by group by the automatic keyword input unit 33.
The script file operation unit 32 reads the script operation file in bytecode form that the script file analysis unit 31 stored in memory, and executes the commands corresponding to the script operation file. The commands in bytecode form are shown in Table 1 below.
Table 1
Command         Description
START           Start point of the program
HALT            End of the program
GOTO            Move to a given position
LOOP            Loop statement
LABEL           Position label used by the FALSEBRANCH and GOTO codes
FALSEBRANCH     Move to the corresponding position when the condition evaluates to false
LABEL           Marked position used as a jump target by the FALSEBRANCH and GOTO codes
MOUSELCLICK     Click the left mouse button
MOUSELDCLICK    Double-click the left mouse button
KEYPRESS        Press a specific key
SENLETTERS      Press several keys in sequence
LVSELECTITEM    Select a specific item in a list view
BTNCLICK        Click the window button corresponding to the class ID
SETTEXT         Insert text into an edit box
EXECUTEEXE      Run the target file
DELETEFILE      Delete the target file
MOVEFILE        Move the target file
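How the control-flow commands of Table 1 could drive execution is sketched below. This is an illustrative interpreter, not the patented implementation: it assumes instructions are `(name, args)` tuples, that LABEL and the jump commands take a label name as their first argument, and that the condition tested by FALSEBRANCH is supplied by the caller. LOOP is left out because the description does not define its semantics. Window and file commands are delegated to caller-supplied callables, so nothing here touches a real window.

```python
def run(program, condition, actions):
    """Execute (name, args) instructions and return the executed action names.

    condition: callable returning the truth value tested by FALSEBRANCH (assumed).
    actions:   dict mapping command names to callables that receive the args.
    """
    # Resolve every label to the index of its LABEL instruction.
    labels = {args[0]: i for i, (name, args) in enumerate(program)
              if name == "LABEL"}
    pc, trace = 0, []
    while pc < len(program):
        name, args = program[pc]
        if name == "HALT":
            break
        if name == "GOTO":
            pc = labels[args[0]]
            continue
        if name == "FALSEBRANCH":
            if not condition():
                pc = labels[args[0]]  # jump only when the condition is false
                continue
        elif name not in ("START", "LABEL"):
            actions[name](*args)      # e.g. SETTEXT, BTNCLICK, KEYPRESS
            trace.append(name)
        pc += 1
    return trace
```

A script that repeats an action while a condition holds can then be written as LABEL, FALSEBRANCH, action, GOTO, LABEL, HALT.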
The automatic keyword input unit 33 loads the keywords managed by group from the keyword input block 13 and inputs them automatically. That is, the automatic keyword input unit 33 obtains the keyword groups managed in the keyword input block 13 and, for each group, loads each keyword in the keyword group into the keyword window. Thus, the access/search application 11 run by the script file operation unit 32 can automatically search for all of the keywords.
When the access/search application 11 is a web-page-type network hard disk, for example, when content is posted and stored or linked on a bulletin board of a web page, the web page analysis unit 34 analyzes the web page source and obtains and searches the link information that contains the illegal contents.
The script file operation unit 32 alone can control different web pages through scripts. However, when the access/search application 11 is of the web page type, there is a difficulty in that the script file for searching/downloading must be corrected according to the user environment (for example, various changes in position information or layout). Therefore, the script file automation block 15 performs the search through the web page analysis unit 34, and performs the content download part by calling the script file for searching/downloading.
Because it is difficult to correct the script every time the position information changes, the web page analysis unit 34 calls the script so that the search can be performed easily.
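Obtaining link information from the page source, rather than from fixed screen positions, can be illustrated with the standard-library HTML parser. This is a minimal sketch under the assumption that the links of interest are ordinary `<a href>` anchors; the class and function names are hypothetical, and a real bulletin board would likely need site-specific filtering.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class LinkCollector(HTMLParser):
    """Collects absolute URLs from the anchor tags of a page."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            href = dict(attrs).get("href")
            if href:
                # Resolve relative links against the page URL.
                self.links.append(urljoin(self.base_url, href))


def extract_links(page_source, base_url):
    """Return every link found in the page source, in document order."""
    collector = LinkCollector(base_url)
    collector.feed(page_source)
    return collector.links
```

Because the links are read from the markup, a change in page layout does not require the automation script to be edited, which is the benefit the passage above describes.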
The file download control unit 35 downloads a part of the content found by the access/search application 11 that may be identified as illegal content, and identifies the illegal content from that partial content. Downloading only part of the content improves search and download efficiency. Once the partial content found by the access/search application 11 under the control of the script file operation unit 32 can be identified as illegal, the file download control unit 35 stops the download and copies the downloaded illegal content.
In combination with a feature-point-based content identification technology, the meta information of the content can be checked quickly and accurately using only partial content.
With the file download control unit 35, the traffic can be reduced to one tenth of the total network traffic. Accordingly, the search speed can be improved correspondingly.
The distributor information extraction unit 36 extracts the distributor information of illegal contents from the Internet site on which the illegal contents found by the access/search application 11 are published. The distributor information extraction unit 36 extracts information about the actual distributor of the published content, so that the content distributor can be held responsible and conclusive evidence of the distribution of illegal contents can be obtained. The information about the illegal contents distributor that can be collected by the distributor information extraction unit 36 includes the service name, user IP, user ID, collection date, screen captures of the corresponding content and its download, and content metadata such as the title, author name, album name, and download URL.
When information arises from which the distributor information of illegal contents cannot be extracted, or which cannot be controlled, by the script operation file for searching/downloading illegal contents, or when the access/search application 11 needs to be controlled precisely, the add-on unit 37 can extract the distributor information or control the program by hooking the window messages sent from inside the access/search application 11. The add-on unit 37 is programmed additionally for each access/search application 11, with hard-coded assembly language, and performs forced tracking.
The illegal contents information storage block 16 stores the illegal contents information collected in the script file automation block 15. The illegal contents information includes the illegal contents themselves, the collected illegal contents information, and the distributor information of the illegal contents. The illegal contents information also includes link information (in the case of a web-page-type network hard disk, the link that contains the illegal contents) and illegal contents metadata such as the file name, file size, and format.
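A storage block of this kind could be backed by a small relational table. The sketch below uses SQLite from the Python standard library; the table name, column names, and the choice of (file name, file size) as the duplicate key are assumptions made for illustration, not details from the patent.

```python
import sqlite3


def open_store(path=":memory:"):
    """Open (or create) the store; defaults to an in-memory database."""
    conn = sqlite3.connect(path)
    conn.execute(
        """CREATE TABLE IF NOT EXISTS illegal_content (
               file_name      TEXT,
               file_size      INTEGER,
               file_format    TEXT,
               link           TEXT,
               distributor_id TEXT,
               UNIQUE (file_name, file_size)
           )"""
    )
    return conn


def record(conn, file_name, file_size, file_format, link, distributor_id):
    """Insert one row; return False if the same file was already recorded."""
    cur = conn.execute(
        "INSERT OR IGNORE INTO illegal_content VALUES (?, ?, ?, ?, ?)",
        (file_name, file_size, file_format, link, distributor_id),
    )
    conn.commit()
    return cur.rowcount == 1
```

The boolean result lets a caller skip a download when the same file has already been recorded.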
Fig. 4 is a flowchart illustrating a method of searching for keywords by group, within the method for automatically searching for illegal contents on the Internet based on an access/search application, according to an embodiment of the present invention.
The script file automation block 15 runs the access/search application 11 by means of the script operation file for searching/downloading illegal contents, and a keyword window is created in the running access/search application 11. The automatic keyword input unit 33 loads keywords and inserts them automatically into the keyword window of the access/search application 11.
The automatic keyword insertion process includes searching for illegal contents for each keyword group and storing the distributor information of the illegal contents found in a database (DB). The search process is repeated for each group and ends after the last group has been searched.
The automatic keyword input unit 33 loads the first keyword group at step S402 and inserts the first keyword of that group into the keyword window of the access/search application 11 at step S404. The access/search application 11 searches for content related to the inserted keyword at step S406 and downloads the illegal contents found one by one at step S408. At step S410, the distributor information extraction unit 36 collects the distributor information that can be extracted from the Internet site providing the access/search service.
As described above, the file download control unit 35 controls the download so that only a part of the content is downloaded, and the content can be identified from that part alone by the feature-point-based content identification technology. When a part of the file has been downloaded, the downloaded file is copied at step S412. Subsequently, at step S414, the file download is stopped and the file in the folder is deleted. This process is repeated for the next file. At step S416, the information on the collected illegal contents is recorded in the illegal contents storage block 16 to avoid downloading the same file again.
At step S418, it is checked whether the downloaded file is the last file found. When it is not the last file found, the logic flow returns to step S408 and the above process is repeated. When it is the last file found, it is checked at step S420 whether the keyword is the last keyword. When the keyword is not the last keyword, the next keyword is inserted at step S422 and the logic flow returns to step S406. When the keyword is the last keyword, it is checked at step S424 whether the keyword group is the last keyword group. When the keyword group is not the last keyword group, the next keyword group is loaded at step S426 and the logic flow returns to step S404. When the keyword group is the last keyword group, the files found are processed at step S428.
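The flow of Fig. 4 amounts to three nested loops over keyword groups, keywords, and search results. The sketch below expresses that structure with caller-supplied callables standing in for the access/search application; all function and parameter names are hypothetical, and the step numbers in the comments indicate which part of the flowchart each line is meant to correspond to.

```python
def search_all(keyword_groups, search, download_part, collect_distributor, store):
    """Run the group -> keyword -> file loops and return the number of files stored.

    keyword_groups: dict mapping a group name to a list of keywords.
    The remaining arguments are callables supplied by the caller (assumed).
    """
    seen = set()   # files already recorded, so they are not downloaded again
    stored = 0
    for group, keywords in keyword_groups.items():        # S402, S424, S426
        for keyword in keywords:                          # S404, S420, S422
            for hit in search(keyword):                   # S406, S418
                if hit in seen:
                    continue
                sample = download_part(hit)               # S408, S412, S414
                distributor = collect_distributor(hit)    # S410
                store(group, keyword, hit, sample, distributor)  # S416
                seen.add(hit)
                stored += 1
    return stored  # post-processing of the results (S428) is left to the caller
```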
As described in detail above, the technology of the present invention can be implemented as a program and stored in a computer-readable recording medium such as a CD-ROM, RAM, ROM, floppy disk, hard disk, or magneto-optical disk. Because those skilled in the art can easily carry out this process, no further description is provided here.
The present application contains subject matter related to Korean Patent Application No. 2006-0069970, filed with the Korean Intellectual Property Office on July 25, 2006, the entire contents of which are incorporated herein by reference.
Although the present invention has been described with reference to certain preferred embodiments, it will be apparent to those skilled in the art that various changes and modifications can be made without departing from the scope of the invention as defined in the claims.

Claims (13)

1. automatic search system of illegal contents of using access/search application on the internet comprises:
The key word input media, it receives the key word relevant with content from the search client, and manages described key word according to keyword group;
The script file editing device, it extracts the window class sign ID that can control this access/search application from the configuration information of access/search application, and based on the window class ID that is extracted be used for being used for by the visit information editor of this access/search application access internet website the script file of search/download illegal contents;
The script file automation equipment, it controls described access/search application, with according to the script file of being edited that is used for the search/download illegal contents, the search illegal contents relevant with keyword group automatically on the internet with download; With
The illegal contents information that illegal contents information memory storage, its storage use described access/search application search and download.
2. the system as claimed in claim 1 also comprises:
Apparatus for management of information, it manages the configuration information of described access/search application, and manages described visit information.
3. The system as claimed in claim 1, wherein the script file editing device comprises:
a window class ID extraction unit, which extracts, from the configuration information of the access/search application, the window class ID capable of controlling the access/search application;
a window command input unit, which receives a window command for sending a command directly to a window class based on the extracted window class ID;
a script command input unit, which receives a script command for automatically performing window input; and
a script editing unit, which edits the script file for searching for and downloading illegal contents according to the extracted window class ID, the input window command, the input script command, and the access information.
4. The system as claimed in claim 3, wherein the script file automation device comprises:
a script file analysis unit, which analyzes the edited script file for searching for and downloading illegal contents and converts the script file into a script operation file for searching for and downloading illegal contents;
an automatic keyword input unit, which loads and automatically inputs the keywords that come from the keyword input device and are managed by group; and
a script file operating unit, which runs the converted script operation file for searching for and downloading illegal contents and controls the access/search application so that it automatically searches for and downloads, on the Internet and group by group, illegal contents related to the input keywords.
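As a rough illustration of the analysis, keyword-loading, and operation steps recited in claim 4, the following Python sketch parses a small script into an operation list and expands it once per keyword in each group. The script syntax (`TYPE`, `CLICK`), the `{keyword}` placeholder, the window class names, and all function names are hypothetical; the claims do not specify a script format, and the sketch only builds the command sequence rather than driving a real application window.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Operation:
    """One step of the script operation file (hypothetical structure)."""
    command: str       # e.g. "TYPE" or "CLICK"
    window_class: str  # window class ID the command is addressed to
    argument: str      # free-form argument, may contain "{keyword}"


def parse_script(text: str) -> list[Operation]:
    """Convert script text into operations, skipping blank lines and comments."""
    operations = []
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#"):
            continue
        parts = line.split(maxsplit=2)
        if len(parts) < 2:
            raise ValueError(f"malformed script line: {raw!r}")
        command, window_class = parts[0].upper(), parts[1]
        argument = parts[2] if len(parts) == 3 else ""
        operations.append(Operation(command, window_class, argument))
    return operations


def expand_for_groups(operations, keyword_groups):
    """Yield (group, keyword, operation) with the keyword substituted into each step."""
    for group, keywords in keyword_groups.items():
        for keyword in keywords:
            for op in operations:
                filled = op.argument.replace("{keyword}", keyword)
                yield group, keyword, Operation(op.command, op.window_class, filled)


# Hypothetical script for one access/search application.
SCRIPT = """
# type the keyword into the search box, then press the search button
TYPE  SearchEdit   {keyword}
CLICK SearchButton
"""

# Hypothetical keyword groups as managed by the keyword input device.
KEYWORD_GROUPS = {"movies": ["title-a", "title-b"], "music": ["song-x"]}

plan = list(expand_for_groups(parse_script(SCRIPT), KEYWORD_GROUPS))
```

Keeping the parsed operations separate from the keyword expansion mirrors the split in the claim between the script file analysis unit and the automatic keyword input unit; an actual implementation would pass each expanded operation to whatever mechanism sends commands to the window class.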
5. The system as claimed in claim 4, wherein the script file automation device further comprises:
a distributor information extraction unit, which extracts distributor information of the illegal contents from the Internet site distributing the illegal contents found by the script file operating unit;
a file download control unit, which downloads a part of the contents based on a feature point of the illegal contents found by the search, and identifies the contents as illegal contents;
a web page analysis unit, which, when the access/search application is a web-page-type network hard disk, analyzes the source of the web page and obtains link information containing the illegal contents; and
an additional unit, which, when the distributor information of the illegal contents is not extracted according to the script operation file for searching for and downloading illegal contents, hooks the window messages transmitted from inside the access/search application and extracts the distributor information.
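The web page analysis unit of claim 5 can be pictured with a short Python sketch that scans page source for anchors and keeps those whose link text mentions a search keyword. Matching on link text is an assumption made for illustration, as are the class name `LinkCollector`, the helper `find_content_links`, and the example URLs; the claim only states that the page source is analyzed to obtain link information.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class LinkCollector(HTMLParser):
    """Collect (link text, absolute URL) pairs whose text contains a keyword."""

    def __init__(self, base_url, keywords):
        super().__init__()
        self.base_url = base_url
        self.keywords = [k.lower() for k in keywords]
        self.links = []
        self._href = None  # href of the anchor currently being read, if any
        self._text = []    # text fragments seen inside that anchor

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            self._href = dict(attrs).get("href")
            self._text = []

    def handle_data(self, data):
        if self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            text = "".join(self._text).strip()
            if any(k in text.lower() for k in self.keywords):
                # Resolve relative links against the page URL.
                self.links.append((text, urljoin(self.base_url, self._href)))
            self._href = None


def find_content_links(page_source, base_url, keywords):
    """Return links in page_source whose text mentions any of the keywords."""
    collector = LinkCollector(base_url, keywords)
    collector.feed(page_source)
    collector.close()
    return collector.links


# Made-up page source standing in for a web-page-type network hard disk listing.
PAGE = """
<html><body>
<a href="/file?id=1">Title-A full movie</a>
<a href="/help">Help</a>
<a href="http://example.com/file?id=2">title-a part 2</a>
</body></html>
"""

links = find_content_links(PAGE, "http://webhard.example/list", ["title-a"])
```

Only the standard library is used here; a real listing page would likely need site-specific handling of pagination and login state, which the sketch leaves out.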
6. The system as claimed in claim 5, wherein the access/search application accesses at least one of a public peer-to-peer (P2P) service, a web-page-type network hard disk, a closed P2P application, and a closed network hard disk, and searches for contents.
7. The system as claimed in claim 5, wherein the distributor information comprises a distributor IP, a distributor ID, the name of the access/search service used by the distributor, metadata of the illegal contents, the collection date of the distributor information, the illegal contents found by the search, and a captured screen of the search/download of the illegal contents.
8. An illegal contents auto-searching method using an access/search application on the Internet, comprising the steps of:
a) extracting, from configuration information of the access/search application, a window class identification (ID) for controlling the access/search application;
b) editing a script file for searching for and downloading illegal contents, based on the extracted window class ID and on access information used to access an Internet site through the access/search application;
c) receiving keywords related to contents from a search client, and managing the keywords by group; and
d) controlling the access/search application so as to automatically search for and download, on the Internet, illegal contents related to the keyword groups according to the edited script file for searching for and downloading illegal contents.
9. The method as claimed in claim 8, wherein step b) comprises:
b1) receiving a window command for sending a command directly to a window class according to the extracted window class ID;
b2) receiving a script command for automatically performing window input; and
b3) editing the script file for searching for and downloading illegal contents based on the extracted window class ID, the input window command, the input script command, and the access information.
10. The method as claimed in claim 8, wherein step d) comprises:
d1) analyzing the script file for searching for and downloading illegal contents that was edited in the script file editor, and converting the script file into a script operation file for searching for and downloading illegal contents;
d2) loading and automatically inputting the keywords managed by group in step c); and
d3) running the converted script operation file for searching for and downloading illegal contents, and controlling the access/search application so as to automatically search for and download, on the Internet and group by group, illegal contents related to the input keywords.
11. The method as claimed in claim 10, wherein step d) further comprises the steps of:
d4) extracting distributor information of the illegal contents from the Internet site distributing the illegal contents of step d3);
d5) downloading a part of the contents based on a feature point of the illegal contents found by the search, and identifying the contents as illegal contents;
d6) when the access/search application is a web-page-type network hard disk, analyzing the source of the web page and obtaining link information containing the illegal contents; and
d7) when the distributor information of the illegal contents is not extracted according to the script operation file for searching for and downloading illegal contents, hooking the window messages transmitted from inside the access/search application and extracting the distributor information.
12. The method as claimed in claim 11, wherein the access/search application accesses at least one of a public peer-to-peer (P2P) service, a web-page-type network hard disk, a closed P2P, and a closed network hard disk, and searches for contents.
13. The method as claimed in claim 11, wherein the distributor information comprises a distributor IP, a distributor ID, the name of the access/search service used by the distributor, metadata of the illegal contents, the collection date of the distributor information, the illegal contents found by the search, and a captured screen of the search/download of the illegal contents.
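The distributor information enumerated in claims 7 and 13 maps naturally onto a flat record. The Python sketch below is one possible layout, assuming a dataclass whose field names and types are chosen here for illustration; the claims list the items but do not prescribe a storage format, and the values shown are placeholders, not collected data.

```python
from dataclasses import dataclass, asdict
from datetime import date


@dataclass
class DistributorInfo:
    """One stored record of distributor information (hypothetical field names)."""
    distributor_ip: str
    distributor_id: str
    service_name: str          # access/search service used by the distributor
    content_metadata: dict     # metadata of the illegal contents
    collection_date: date      # date the distributor information was collected
    found_content: str         # e.g. file name of the contents found by the search
    capture_screen_path: str   # where the search/download screen capture is saved


record = DistributorInfo(
    distributor_ip="192.0.2.10",            # documentation-range address
    distributor_id="example-user",          # placeholder ID
    service_name="example-webhard",         # placeholder service name
    content_metadata={"title": "title-a", "size_bytes": 1024},
    collection_date=date(2007, 1, 29),
    found_content="title-a.avi",
    capture_screen_path="captures/title-a.png",
)

# Convert to a plain dictionary, e.g. before writing it to a database row.
row = asdict(record)
```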
CN2007800356457A 2006-07-25 2007-01-29 Illegal contents auto-searching system and method using access/search application on internet Expired - Fee Related CN101517574B (en)

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
KR1020060069970 2006-07-25
KR10-2006-0069970 2006-07-25
KR1020060069970A KR100805819B1 (en) 2006-07-25 2006-07-25 System and Method for Auto-Searching of Illegal Contents in the P2P/Webhard Service
PCT/KR2007/000495 WO2008013351A1 (en) 2006-07-25 2007-01-29 Illegal contents auto-searching system and method using access/search application on internet

Publications (2)

Publication Number Publication Date
CN101517574A (en) 2009-08-26
CN101517574B (en) 2011-10-26

Family

ID=38981646

Family Applications (1)

Application Number Title Priority Date Filing Date
CN2007800356457A Expired - Fee Related CN101517574B (en) 2006-07-25 2007-01-29 Illegal contents auto-searching system and method using access/search application on internet

Country Status (3)

Country Link
KR (1) KR100805819B1 (en)
CN (1) CN101517574B (en)
WO (1) WO2008013351A1 (en)

Families Citing this family (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR101016119B1 (en) * 2008-10-08 2011-02-17 주식회사 케이티 Method for providing individual service information of internet site and terminal therefor
CN101729853B (en) * 2009-11-13 2011-05-18 深圳创维-Rgb电子有限公司 System, method, device and installation for filtering programs
CN101968807A (en) * 2010-10-15 2011-02-09 北京思在信息技术有限责任公司 Content retrieval method and device
KR101653686B1 (en) 2015-12-17 2016-09-09 주식회사 비디 Service flow providing method, service flow providing server performing the same and storage medium storing the same
KR20170101624A (en) * 2016-02-29 2017-09-06 (주)엠더블유스토리 System for monitoring digital contents and method for processing thereof
KR102331338B1 (en) 2020-07-07 2021-11-25 주식회사 에이아이스페라 Apparatus, method and program for providing information related to distribution of illegal contents on peer-to-peer network

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1461562A (en) * 2001-02-20 2003-12-10 皇家菲利浦电子有限公司 Broadcast and processing of meta-information associated with content material
CN1655500A (en) * 2004-02-11 2005-08-17 微软公司 Desynchronized fingerprinting method and system for digital multimedia data

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR20000036731A (en) * 2000-03-27 2000-07-05 이영만 Computer-controlled content playback device made impossible to duplicate
US6748375B1 (en) * 2000-09-07 2004-06-08 Microsoft Corporation System and method for content retrieval
KR20030013814A (en) * 2001-08-09 2003-02-15 권오석 A system and method for searching a contents included non-text type data
KR20030015742A (en) * 2001-08-17 2003-02-25 주식회사 비즈모델라인 System for tracking down illegal copies and distribution of digital contents
KR100747147B1 (en) * 2005-10-05 2007-08-07 문종섭 A Peer to Peer system which provides benefit to all of content provider, operator of the network and distributor and provides securities in the network

Also Published As

Publication number Publication date
WO2008013351A1 (en) 2008-01-31
CN101517574A (en) 2009-08-26
KR20080010028A (en) 2008-01-30
KR100805819B1 (en) 2008-02-21

Similar Documents

Publication Publication Date Title
US11886402B2 (en) Systems, methods, and media for dynamically generating informational content
CN102737109B (en) The method and apparatus of the label of generating media content
JP4681720B2 (en) Electronic document management method and management system
US8290938B2 (en) Document management techniques to account for user-specific patterns in document metadata
Li et al. Here's what I did: Sharing and reusing web activity with ActionShot
CN101853300B (en) Method and system for identifying and evaluating video downloading service website
CN104766014A (en) Method and system used for detecting malicious website
CN101517574B (en) Illegal contents auto-searching system and method using access/search application on internet
US20150178476A1 (en) System and method of monitoring font usage
US20090100154A1 (en) Automatically instrumenting a set of web documents
US8972374B2 (en) Content acquisition system and method of implementation
CN101443751A (en) Method and apparatus for an application crawler
AU2009238294A1 (en) Data transformation based on a technical design document
US9069771B2 (en) Music recognition method and system based on socialized music server
CN103678487A (en) Method and device for generating web page snapshot
KR20170101624A (en) System for monitoring digital contents and method for processing thereof
US20060184573A1 (en) Information processing apparatus, information pocessing method, and computer program
US9356845B1 (en) System and method for audience segment profiling and targeting
KR101888866B1 (en) Method and apparatus for distributing contents using copyright protection
CN111859076B (en) Data crawling method, device, computer equipment and computer readable storage medium
US20120023133A1 (en) Document searching system and method
JP5224839B2 (en) Document management system, document management apparatus, document management method, and program
TWI680666B (en) Method and system for identifying users on internet
JP2020126465A (en) Detector, detecting method, and detecting program
KR102399489B1 (en) Server, system, method for artist id integrated management

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
C17 Cessation of patent right
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20111026

Termination date: 20140129