CN115442118A - Information acquisition system and method for web service in anonymous communication system - Google Patents

Information acquisition system and method for web service in anonymous communication system Download PDF

Info

Publication number
CN115442118A
CN115442118A CN202211060121.6A CN202211060121A CN115442118A CN 115442118 A CN115442118 A CN 115442118A CN 202211060121 A CN202211060121 A CN 202211060121A CN 115442118 A CN115442118 A CN 115442118A
Authority
CN
China
Prior art keywords
communication system
request
anonymous communication
mode
module
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202211060121.6A
Other languages
Chinese (zh)
Inventor
李海龙
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Yang Qiongyun
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Individual filed Critical Individual
Priority to CN202211060121.6A priority Critical patent/CN115442118A/en
Publication of CN115442118A publication Critical patent/CN115442118A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L63/00Network architectures or network communication protocols for network security
    • H04L63/04Network architectures or network communication protocols for network security for providing a confidential data exchange among entities communicating through data packet networks
    • H04L63/0407Network architectures or network communication protocols for network security for providing a confidential data exchange among entities communicating through data packet networks wherein the identity of one or more communicating identities is hidden
    • H04L63/0421Anonymous communication, i.e. the party's identifiers are hidden from the other party or parties, e.g. using an anonymizer
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L63/00Network architectures or network communication protocols for network security
    • H04L63/04Network architectures or network communication protocols for network security for providing a confidential data exchange among entities communicating through data packet networks
    • H04L63/0428Network architectures or network communication protocols for network security for providing a confidential data exchange among entities communicating through data packet networks wherein the data content is protected, e.g. by encrypting or encapsulating the payload
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/01Protocols
    • H04L67/02Protocols based on web technology, e.g. hypertext transfer protocol [HTTP]
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L69/00Network arrangements, protocols or services independent of the application payload and not provided for in the other groups of this subclass
    • H04L69/08Protocols for interworking; Protocol conversion

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Security & Cryptography (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Computer Hardware Design (AREA)
  • Computing Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • Computer And Data Communications (AREA)

Abstract

An information acquisition system aiming at web service in an anonymous communication system comprises an acquisition module and an agent module; the acquisition module acquires contents provided by the target web service in a browser control mode; the proxy module accesses an anonymous communication system used by a target web service in a port mapping or protocol conversion mode; a method of collecting information using an information collection system for web services within an anonymous communication system, comprising the steps of: s1, receiving an external task by using an acquisition module, analyzing the task into a specific request and sending the specific request to an agent module; and S2, receiving the request of the acquisition module by using the proxy module, and acquiring the requested resource through the anonymous communication system bridge.

Description

Information acquisition system and method for web service in anonymous communication system
Technical Field
The invention belongs to the technical field of information acquisition systems, and particularly relates to an information acquisition system and method for web services in an anonymous communication system.
Background
With the development of Internet technology, various decentralization and anonymous communication means are continuously appearing and upgrading, and an anonymous communication system is an overlay network which is built on the Internet and comprehensively utilizes a plurality of technologies such as data forwarding, content encryption, flow confusion and the like to hide communication entity relationships and contents. Anonymous communication is a privacy protection technology that hides communication contents and relationships by taking measures such as data forwarding, content encryption, traffic confusion, and the like. In order to improve the anonymity of communication, the data forwarding links are generally formed by multi-hop encryption proxy service nodes, and all the nodes form an anonymous communication system (or anonymous communication network). The anonymous communication system is essentially an overlay network for providing anonymous communication service, can provide Internet anonymous access function for ordinary users to hide network communication sources and targets, and provides a hidden service mechanism for service providers to realize anonymous network service deployment. As a core function of an anonymous communication system, a hidden service mechanism can ensure that an anonymous service cannot track and locate, usually by using a multi-hop reverse proxy or by hiding the real address of a service provider through resource sharing storage.
Because the anonymous communication system has the characteristics of difficult node discovery, difficult service positioning, difficult user monitoring, difficult communication relationship confirmation and the like, the anonymous communication system is utilized to hide the true identity and bring about the phenomenon of anonymous abuse, namely malicious or even network criminal activities. Like Tor, other dark network systems such as I2P, freenet, zeroNet and the like are difficult to automatically collect information resources in large scale due to the network topology characteristics.
Disclosure of Invention
In order to overcome the defects of the prior art, the invention aims to provide an information acquisition system and method for web service in an anonymous communication system, solves the technical problem that information resources in the anonymous communication system are difficult to acquire, and has the characteristics of high stability and strong information resource acquisition real-time performance.
In order to achieve the purpose, the invention adopts the technical scheme that:
the information acquisition system for the web service in the anonymous communication system comprises an acquisition module, an agent module and the anonymous communication system; the acquisition module acquires contents provided by the target web service in a browser control mode; the agent module is accessed to the used anonymous communication system in a port mapping or protocol conversion mode; the anonymous communication system is generally an overlay network built on the Internet and comprehensively utilizing multiple technologies of data forwarding, content encryption, traffic confusion and point-to-point communication to hide communication entity relations and contents, is used for providing an Internet anonymous access function for ordinary users to hide network communication sources and targets of the ordinary users, and provides a hidden service mechanism for a service provider to realize anonymous network service deployment.
The acquisition module comprises:
1) A browser for accessing a web service;
2) And the browser controller controls the browser in a mode of simulating input or interacting with the browser API to realize the extraction of the screenshot or the text information.
3) And the task receiver receives and calls the browser controller to process the external task in a message transmission mode through the message queue, the API interface and the database.
3. The information collection system for web services within an anonymous communication system as set forth in claim 1, wherein said proxy module comprises:
1) The anonymous communication system bridge bridges the common network and the anonymous communication system, so that the anonymous communication system can be accessed by the bridge;
2) The request receiver is used for receiving the request of the user in a socks agency mode, a VPN mode and a port forwarding mode, and forwarding the request through the bridge, so that the user can access the special network converter without installing or using a special network access device, and the request of the receiver is processed.
A method of collecting information using an information collection system for web services within an anonymous communication system, comprising the steps of:
s1, receiving an external task by using an acquisition module, analyzing the task into a specific request and sending the specific request to an agent module;
and S2, receiving the request of the acquisition module by using the proxy module, and acquiring the requested resource through the anonymous communication system bridge.
The step S1 further comprises the following steps:
s1.1, receiving an information acquisition task of an external system or a user in a message transmission mode through a message queue, an API (application program interface) and a database;
step S1.2, analyzing the task, calling a browser controller, controlling the browser to analyze the task into a specific request, and then initiating the request to an agent module;
s1.3, the waiting agent module returns a response of the anonymous communication system to the request;
and S2.4, storing the acquired information in a screen capture mode or a mode of storing HTML (hypertext markup language) as an image after rendering according to the setting.
The step S2 further comprises the following steps:
s2.1, a request receiving module receives a request of an acquisition module in a socks proxy mode, an http proxy mode, a VPN mode and the like;
step S2.2, the request receiving module forwards the request to a corresponding anonymous communication system bridge according to system setting by judging the characteristics of the access point, the access mode, the request content and the like of the received request;
and step S2.3, the anonymous communication system bridge returns the response of the anonymous communication system.
Compared with the prior art, the invention has the beneficial effects that:
1) The system can be directly accessed to the anonymous communication system through the agent module; 2) The data acquisition of a common browser is controlled in a mode of simulating input or interacting with a browser API, so that the problems of resource acquisition and evidence collection of an anonymous communication system are solved.
Drawings
FIG. 1 is a schematic block diagram of the system of the present invention.
FIG. 2 is a flow chart of the method of the present invention.
Detailed Description
The present invention will be described in further detail with reference to the following examples and the accompanying drawings.
Referring to fig. 1, the information acquisition system for web services in an anonymous communication system includes an acquisition module, an agent module, and an anonymous communication system; the acquisition module acquires contents provided by the target web service in a browser control mode; the agent module is accessed to the used anonymous communication system in a port mapping or protocol conversion mode; the anonymous communication system is generally an overlay network built on the Internet and comprehensively utilizing multiple technologies of data forwarding, content encryption, traffic confusion and point-to-point communication to hide communication entity relations and contents, is used for providing an Internet anonymous access function for ordinary users to hide network communication sources and targets of the ordinary users, and provides a hidden service mechanism for a service provider to realize anonymous network service deployment.
The acquisition module comprises:
1) A browser for accessing a web service;
2) And the browser controller controls the browser in a mode of simulating input or interacting with the browser API to realize the extraction of the screenshot or the text information.
3) And the task receiver receives and calls the browser controller to process the external task in a message transmission mode through the message queue, the API interface and the database.
3. The information collection system for web services within an anonymous communication system as set forth in claim 1, wherein said proxy module comprises:
1) The anonymous communication system bridge bridges the common network and the anonymous communication system, so that the anonymous communication system can be accessed by the bridge;
2) The request receiver is used for receiving the request of the user in a locks agency, VPN and port forwarding mode, and forwarding the request through the bridge, so that the user can access the special network converter without installing or using a special network access device, and the request of the receiver is processed.
A method of collecting information using an information collection system for web services within an anonymous communication system, comprising the steps of:
s1, receiving an external task by using an acquisition module, analyzing the task into a specific request and sending the specific request to an agent module;
and S2, receiving the request of the acquisition module by using the proxy module, and acquiring the requested resource through the anonymous communication system bridge.
The step S1 further comprises the following steps:
s1.1, receiving an information acquisition task of an external system or a user in a message transmission mode through a message queue, an API (application program interface) and a database;
step S1.2, analyzing the task, calling a browser controller, controlling the browser to analyze the task into a specific request, and then initiating the request to an agent module;
s1.3, the waiting agent module returns a response of the anonymous communication system to the request;
and S2.4, storing the acquired information in a screen capture mode or a mode of storing HTML (hypertext markup language) as an image after rendering according to the setting.
The step S2 further comprises the following steps:
s2.1, a request receiving module receives a request of an acquisition module in a socks proxy mode, an http proxy mode and a VPN mode;
step S2.2, the request receiving module forwards the request to a corresponding anonymous communication system bridge according to system setting by judging the characteristics of the access point, the access mode, the request content and the like of the received request;
and step S2.3, the anonymous communication system bridge returns the response of the anonymous communication system.
Examples
S1, receiving an external task by using an acquisition module, analyzing the task into a specific request and sending the specific request to an agent module;
and S2, receiving the request of the acquisition module by using the proxy module, and acquiring the requested resource through the anonymous communication system bridge.
The step S1 further comprises the following steps:
step S1.1, acquiring a resource grabbing task A by monitoring a message queue A;
s1.2, obtaining specific task content mainly comprising resource URL, request head, request mode and request load by analyzing the task A, for example, URl obtained by analyzing is
Figure DEST_PATH_IMAGE001
When the request head is null, the request mode is GET and the request load is null, calling a browser controller to control the browser to set the request URl to be null
Figure DEST_PATH_IMAGE002
Setting a request head to be null, setting a request mode to be GET, setting a request load to be null, and then initiating the request;
s1.3, the waiting agent module returns a response of the anonymous communication system to the request;
and S2.4, waiting for the browser to render the response, wherein a screenshot mode is adopted in the embodiment, and the rendering result is persistently stored.
The step S2 further comprises the following steps:
step S2.1, in the example, a socks proxy mode is adopted, the request of the acquisition module is received, and the URl received in the step S1.2 is
Figure DEST_PATH_IMAGE003
The request head is empty, the request mode is GET, and the request load is empty;
s2.2, judging through a system preset rule, forwarding the request to an anonymous communication system T by the anonymous communication system bridge, and returning a response result by the anonymous communication system T;
and step S2.3, the anonymous communication system bridge returns the response result.

Claims (6)

1. The information acquisition system for the web service in the anonymous communication system is characterized by comprising an acquisition module and an agent module; the acquisition module acquires contents provided by the target web service in a browser control mode; the agent module is accessed to the used anonymous communication system in a port mapping or protocol conversion mode; the anonymous communication system is generally an overlay network built on the Internet and comprehensively utilizing multiple technologies of data forwarding, content encryption, traffic confusion and point-to-point communication to hide communication entity relations and contents, is used for providing an Internet anonymous access function for ordinary users to hide network communication sources and targets of the ordinary users, and provides a hidden service mechanism for a service provider to realize anonymous network service deployment.
2. The information gathering system for web services within an anonymous communication system as set forth in claim 1, wherein the gathering module comprises:
1) A browser for accessing a web service;
2) The browser controller controls the browser in a mode of simulating input or interacting with a browser API (application program interface) to realize extraction of screenshot or text information;
3) And the task receiver receives and calls the browser controller to process the external task in a message transmission mode through the message queue, the API interface and the database.
3. The information collection system for web services within an anonymous communication system as set forth in claim 1, wherein said proxy module comprises:
1) The anonymous communication system bridge bridges the common network and the anonymous communication system, so that the anonymous communication system can be accessed by the bridge;
2) The request receiver is used for receiving the request of the user in a locks agency, VPN and port forwarding mode, and forwarding the request through the bridge, so that the user can access the special network converter without installing or using a special network access device, and the request of the receiver is processed.
4. A method for collecting information using an information collection system for web services in an anonymous communication system, comprising the steps of:
s1, receiving an external task by using an acquisition module, analyzing the task into a specific request and sending the specific request to an agent module;
and S2, receiving the request of the acquisition module by using the proxy module, and acquiring the requested resource through the anonymous communication system bridge.
5. The method for collecting information using an information collecting system for web services in an anonymous communication system as set forth in claim 4, wherein said step S1 further comprises the steps of:
step S1.1, receiving an information acquisition task of an external system or a user in a message queue, API interface and database transmission mode;
s1.2, analyzing the task, calling a browser controller, controlling the browser to analyze the task into a specific request, and initiating the request to an agent module;
s1.3, the waiting agent module returns a response of the anonymous communication system to the request;
and S2.4, storing the acquired information in a screen capture mode or a mode of storing HTML (hypertext markup language) as an image after rendering according to the setting.
6. The method for collecting information using an information collecting system for web services in an anonymous communication system as set forth in claim 4, wherein said step S2 further comprises the steps of:
s2.1, a request receiving module receives a request of an acquisition module in a socks proxy mode, an http proxy mode, a VPN mode and the like;
step S2.2, the request receiving module forwards the request to a corresponding anonymous communication system bridge according to system setting by judging the characteristics of the access point, the access mode, the request content and the like of the received request;
and step S2.3, the anonymous communication system bridge returns the response of the anonymous communication system.
CN202211060121.6A 2022-09-01 2022-09-01 Information acquisition system and method for web service in anonymous communication system Pending CN115442118A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202211060121.6A CN115442118A (en) 2022-09-01 2022-09-01 Information acquisition system and method for web service in anonymous communication system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202211060121.6A CN115442118A (en) 2022-09-01 2022-09-01 Information acquisition system and method for web service in anonymous communication system

Publications (1)

Publication Number Publication Date
CN115442118A true CN115442118A (en) 2022-12-06

Family

ID=84245248

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202211060121.6A Pending CN115442118A (en) 2022-09-01 2022-09-01 Information acquisition system and method for web service in anonymous communication system

Country Status (1)

Country Link
CN (1) CN115442118A (en)

Similar Documents

Publication Publication Date Title
US20130246504A1 (en) Method for subscribing to notification, apparatus and system
US8914461B2 (en) Method and device for editing web contents by URL conversion
CN103716326A (en) Resource access method and URG
CN108418847B (en) Network traffic caching system, method and device
CN102055813A (en) Access controlling method for network application and device thereof
CN104426985B (en) Show the method, apparatus and system of webpage
CN100512158C (en) Network measuring system structure and realizing method thereof
CN102932473B (en) The method of cross operator access to netwoks Web content
CN105635073A (en) Access control method and device and network access equipment
EP2529533A1 (en) Method, apparatus and system for intercepted triggering of execution of internet services
CN107040429A (en) A kind of method of testing and system of port forwarding performance
JP2018537777A (en) Internet of things information system
CN104040538A (en) Internet application interaction method, device and system
CN107645543B (en) Method and system applied to cache server HTTP non-80 cache port service
CN112261111A (en) Method and system for realizing cross-domain access of browser in application program
CN106411819A (en) Method and apparatus for recognizing proxy Internet protocol address
US20170111430A1 (en) Methods for website version control using bucket cookies
CN115442118A (en) Information acquisition system and method for web service in anonymous communication system
CN104065766B (en) One kind bypass caching domain name analytic method
CN112367296B (en) Service control method and device
CN109981738B (en) Cloud server suitable for application of narrow-band Internet of things
CN113660177A (en) Flow control method, device and system and readable storage medium
JP2004094411A (en) Roaming system of thin-client having transparent working environment in wide area network and method therefor
CN113765972A (en) Data request response method, device, system, server and storage medium
Hou et al. A dynamic correlation method of fragmented web resources

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
TA01 Transfer of patent application right
TA01 Transfer of patent application right

Effective date of registration: 20230712

Address after: 838000 Guanghui Community, Gaochang District, Turpan City, Xinjiang

Applicant after: Yang Qiongyun

Address before: 710000 North Shenyang New Century South District, Yanta District, Xi'an City, Shaanxi Province

Applicant before: Li Hailong