CN112182338A - Monitoring method and device for hosting platform - Google Patents

Monitoring method and device for hosting platform Download PDF

Info

Publication number
CN112182338A
CN112182338A CN202011205947.8A CN202011205947A CN112182338A CN 112182338 A CN112182338 A CN 112182338A CN 202011205947 A CN202011205947 A CN 202011205947A CN 112182338 A CN112182338 A CN 112182338A
Authority
CN
China
Prior art keywords
monitoring
item
module
project
search
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202011205947.8A
Other languages
Chinese (zh)
Inventor
郭广鑫
董佳涵
王小虎
任天宇
王超
李群
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
State Grid Corp of China SGCC
State Grid Beijing Electric Power Co Ltd
Original Assignee
State Grid Corp of China SGCC
State Grid Beijing Electric Power Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by State Grid Corp of China SGCC, State Grid Beijing Electric Power Co Ltd filed Critical State Grid Corp of China SGCC
Priority to CN202011205947.8A priority Critical patent/CN112182338A/en
Publication of CN112182338A publication Critical patent/CN112182338A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/951Indexing; Web crawling techniques
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/955Retrieval from the web using information identifiers, e.g. uniform resource locators [URL]
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F21/00Security arrangements for protecting computers, components thereof, programs or data against unauthorised activity
    • G06F21/60Protecting data
    • G06F21/62Protecting access to data via a platform, e.g. using keys or access control rules
    • G06F21/6218Protecting access to data via a platform, e.g. using keys or access control rules to a system of files or objects, e.g. local or distributed file system or database
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F21/00Security arrangements for protecting computers, components thereof, programs or data against unauthorised activity
    • G06F21/60Protecting data
    • G06F21/64Protecting data integrity, e.g. using checksums, certificates or signatures
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F8/00Arrangements for software engineering
    • G06F8/70Software maintenance or management
    • G06F8/71Version control; Configuration management

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • General Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • General Physics & Mathematics (AREA)
  • Physics & Mathematics (AREA)
  • Software Systems (AREA)
  • Computer Security & Cryptography (AREA)
  • Computer Hardware Design (AREA)
  • General Health & Medical Sciences (AREA)
  • Bioethics (AREA)
  • Health & Medical Sciences (AREA)
  • Data Mining & Analysis (AREA)
  • Information Transfer Between Computers (AREA)

Abstract

The invention discloses a method and a device for monitoring a pipe supporting platform. Wherein, the method comprises the following steps: acquiring a retrieval keyword; crawling a project set corresponding to the retrieval keyword from the hosting platform; monitoring the selected items in the item set. The method and the device solve the technical problem that sensitive information in the hosting platform is easy to leak in the related technology.

Description

Monitoring method and device for hosting platform
Technical Field
The invention relates to the field of a hosting platform, in particular to a method and a device for monitoring the hosting platform.
Background
Most companies currently use GitHub for code hosting and versioning. Since the GitHub is a managed platform facing to open source and private software projects, if the configuration file code of the project contains sensitive server and database information, the information is easy to be utilized by hackers and is finally attacked to cause information leakage.
In view of the above problems, no effective solution has been proposed.
Disclosure of Invention
The embodiment of the invention provides a method and a device for monitoring a hosting platform, which are used for at least solving the technical problem that sensitive information in the hosting platform is easy to leak in the related technology.
According to an aspect of an embodiment of the present invention, there is provided a method for monitoring a pipe pallet, including: acquiring a retrieval keyword; crawling a project set corresponding to the retrieval keyword from the hosting platform; monitoring the selected items in the item set.
Optionally, the crawling of the collection of items corresponding to the retrieval keyword from the hosting platform includes: obtaining a search module selected from a hosting platform, wherein the search module comprises one of: the system comprises a warehouse module, a code module, a comment module, a question module, a market module, a main body module, a wiki module and a user module; a collection of items is crawled from a search module.
Optionally, in a case that the selected search module is a code module, the entered login information of the hosting platform is acquired.
Optionally, monitoring the selected items in the item set includes: under the condition that a monitoring instruction is detected in the item set, determining a selected item corresponding to the monitoring instruction; downloading project data of the selected project from the hosting platform, and caching the project data to the local; the selected project is monitored based on the project data.
Optionally, monitoring the selected item based on the item data includes: acquiring file search information, wherein the file search information comprises: file name and file suffix; acquiring file data corresponding to file searching information from the project data; it is determined whether there is a leak in the project data based on the file data.
Optionally, before monitoring the selected item based on the item data, the method further comprises: acquiring an updating task interval and a project interval one by one; under the condition that a synchronization instruction is received, synchronizing the selected items based on the updating task interval and the item-by-item interval; and monitoring the selected project based on the synchronized project data.
Optionally, after crawling the collection of items corresponding to the retrieval keyword from the hosting platform, the method further includes: under the condition that a viewing instruction is detected in the item set, determining a first item corresponding to the viewing instruction; outputting the item data of the first item.
Optionally, after crawling the collection of items corresponding to the retrieval keyword from the hosting platform, the method further includes: determining a second item corresponding to the export instruction when the export instruction is detected in the item set; and exporting the project data of the second project to a preset type of file.
According to another aspect of the embodiments of the present invention, there is also provided a method for monitoring a pipe supporting platform, including: displaying the received search keyword; and displaying a monitoring result, wherein the monitoring result is a result generated by monitoring the selected item in the item set, and the item set is a set of items corresponding to the retrieval keyword and crawled from the hosting platform.
According to another aspect of the embodiments of the present invention, there is also provided a monitoring device for a pipe supporting platform, including: the acquisition module is used for acquiring search keywords; the crawling module is used for crawling a project set corresponding to the retrieval keyword from the hosting platform; and the monitoring module is used for monitoring the selected items in the item set.
According to another aspect of the embodiments of the present invention, there is also provided a monitoring device for a pipe supporting platform, including: the first display module is used for displaying the received search keywords; and the second display module is used for displaying a monitoring result, wherein the monitoring result is a result generated by monitoring the selected item in the item set, and the item set is a set of items corresponding to the retrieval keyword and crawled from the hosting platform.
According to another aspect of the embodiments of the present invention, there is also provided a computer-readable storage medium, where the computer-readable storage medium includes a stored program, and when the program runs, the apparatus where the computer-readable storage medium is located is controlled to execute the above monitoring method for a managed platform.
According to another aspect of the embodiments of the present invention, a processor is further provided, where the processor is configured to execute a program, where the program executes the monitoring method for the managed platform described above.
In the embodiment of the invention, after the retrieval keywords are obtained, the item set corresponding to the retrieval keywords is crawled from the hosting platform, and the selected items in the item set are monitored, so that the purpose of monitoring by the hosting platform is realized. It is easy to notice that the local full-text detailed retrieval function can be performed by searching the appointed keyword information and synchronizing the concerned items to the local in time, so that the effects of timely, effectively and comprehensively monitoring the items in the hosting platform and preventing the sensitive information from being leaked are achieved, and the technical problem that the sensitive information in the hosting platform is easy to leak in the related technology is solved.
Drawings
The accompanying drawings, which are included to provide a further understanding of the invention and are incorporated in and constitute a part of this application, illustrate embodiment(s) of the invention and together with the description serve to explain the invention without limiting the invention. In the drawings:
FIG. 1 is a flow chart of a method of monitoring a hosted platform according to an embodiment of the invention;
FIG. 2a is a schematic diagram of an alternative related term search interface, according to an embodiment of the present invention;
FIG. 2b is a diagram illustrating a search result displayed in an alternative related-word search interface according to an embodiment of the present invention;
FIG. 2c is a schematic illustration of an alternative menu of items according to an embodiment of the present invention;
FIG. 2d is a schematic diagram of an alternative download queue interface, according to an embodiment of the present invention;
FIG. 2e is a schematic diagram of an alternative local monitoring interface according to an embodiment of the present invention;
FIG. 2f is a diagram illustrating a search for item content in an alternative local monitoring interface, according to an embodiment of the present invention;
FIG. 2g is a schematic illustration of an alternative quick-view window according to an embodiment of the present invention;
FIG. 3 is a flow diagram of another method of monitoring a hosted platform according to an embodiment of the invention;
FIG. 4 is a schematic view of a monitoring device for a pipelayer platform according to an embodiment of the present invention;
fig. 5 is a schematic diagram of another monitoring device for hosting a platform according to an embodiment of the invention.
Detailed Description
In order to make the technical solutions of the present invention better understood, the technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
It should be noted that the terms "first," "second," and the like in the description and claims of the present invention and in the drawings described above are used for distinguishing between similar elements and not necessarily for describing a particular sequential or chronological order. It is to be understood that the data so used is interchangeable under appropriate circumstances such that the embodiments of the invention described herein are capable of operation in sequences other than those illustrated or described herein. Furthermore, the terms "comprises," "comprising," and "having," and any variations thereof, are intended to cover a non-exclusive inclusion, such that a process, method, system, article, or apparatus that comprises a list of steps or elements is not necessarily limited to those steps or elements expressly listed, but may include other steps or elements not expressly listed or inherent to such process, method, article, or apparatus.
First, the following technical names or technical terms in the embodiments of the present invention are explained as follows:
web crawlers: not only are important components of search engines, but also are indispensable tools for big data analysis. The basic workflow of the web crawler is as follows: firstly, selecting a part of carefully selected seed URL; adding a seed URL (Uniform Resource Locator) into a task queue; taking out URLs to be captured from the URL queue to be captured, analyzing a Domain Name System (DNS), obtaining an IP address of a host, downloading webpages corresponding to the URLs, storing the webpages into a downloaded webpage library, and putting the URLs into the captured URL queue; analyzing the URLs in the captured URL queue, analyzing other URLs in the captured URL queue, and putting the URLs into a URL queue to be captured so as to enter the next cycle; analyzing the downloaded web page and analyzing the required data; and if the data is persistent, storing the data in the database.
The crawling strategy of the crawler, in the crawler system, a URL queue to be crawled is an important part. It is also an important question in what order the URLs in the URL queue to be fetched are arranged, since this involves fetching that page first, and then fetching that page. The method for determining the arrangement order of the URLs is called a capture strategy. The program combines factors such as a crawler principle, a crawler strategy, a GitHub website structure and the like, and achieves the effect of crawling the GitHub search results.
The Git protocol: which may be a content addressable file system, on top of which a version control system user interface is provided, is a very powerful and easy to use tool. Completely subverts the loop of the traditional version control and makes new design for the realization mode of each link. Git is more like a small file system, but it also provides many powerful tools on this basis, not just a simple VCS.
Full text retrieval: the document retrieval can be realized by utilizing Windows API (CLR interface), the document retrieval is carried out by specifying the name, suffix and the like of the searched document, and the required document information is finally screened out by comparing the name and the suffix.
Example 1
In accordance with an embodiment of the present invention, there is provided a method of hosting platform monitoring, it being noted that the steps illustrated in the flowchart of the figure may be performed in a computer system such as a set of computer-executable instructions and that, while a logical order is illustrated in the flowchart, in some cases the steps illustrated or described may be performed in an order different than here.
Fig. 1 is a flowchart of a method for monitoring a pipelayer platform according to an embodiment of the present invention, as shown in fig. 1, the method includes the following steps:
step S102, obtaining a search keyword.
The search keyword in the above step may be a keyword input by the user.
In an alternative embodiment, for the purpose of monitoring the hosted platform, a monitoring tool may be provided to provide an interactive interface to the user, and as shown in fig. 2a, the user may enter the keyword "sgcc" to be searched in the input box of the search keyword of the interface, and start the search process by clicking the "start search" button.
And step S104, crawling the item set corresponding to the search keyword from the hosting platform.
The hosting platform in the above steps may be a GitHub platform, but is not limited thereto, and may be any platform capable of implementing code hosting and version control.
In an alternative embodiment, the retrieval process may make an online query to GitHub to crawl the corresponding project information, such as project owner, project name, development language, acquisition star count and project description, through web crawler principles. As shown in fig. 2b, after the user enters the keyword "sgcc," the corresponding set of items can be crawled from the hosting platform and the searched set of items can be displayed on the interactive interface.
It should be noted that if the Network condition is bad, a Virtual Private Network (VPN) or a proxy access method can be used to provide better use effect. In addition, the GitHub can be effectively shielded by a built-in anti-shielding module.
And step S106, monitoring the selected items in the item set.
In an optional embodiment, a user can retrieve a plurality of items which accord with retrieval keywords through a retrieval mode to obtain an item set, further select the items to be monitored, perform text retrieval on the selected items, and judge whether the files have leakage or not through searching key sensitive characters in the retrieved files, connecting IP addresses of a database and the like, so as to achieve the purpose of GibHub monitoring.
According to the embodiment of the invention, after the retrieval keywords are acquired, the item set corresponding to the retrieval keywords is crawled from the hosting platform, and the selected items in the item set are monitored, so that the purpose of monitoring by the hosting platform is realized. It is easy to notice that the local full-text detailed retrieval function can be performed by searching the appointed keyword information and synchronizing the concerned items to the local in time, so that the effects of timely, effectively and comprehensively monitoring the items in the hosting platform and preventing the sensitive information from being leaked are achieved, and the technical problem that the sensitive information in the hosting platform is easy to leak in the related technology is solved.
Optionally, in the foregoing embodiment of the present invention, crawling the item set corresponding to the search keyword from the hosting platform includes: obtaining a search module selected from a hosting platform, wherein the search module comprises one of: the system comprises a warehouse module, a code module, a comment module, a question module, a market module, a main body module, a wiki module and a user module; a collection of items is crawled from a search module.
When keyword retrieval is performed by using the GitHub, eight modules, such as warehouses, codes, Commits, Issues, marketes, Topics, Wikis, Users and the like, are displayed, and the search contents of the four modules, namely warehouses, codes, submissions and questions, are the most abundant. Therefore, in the embodiment of the invention, the four modules of warehouse, code, submission and question can be searched, the searched content is listed, and the searched content can be derived to facilitate future management.
In an alternative embodiment, as shown in FIG. 2a, the user may select an appropriate search module as the selected search module in a drop-down menu of "retrieve items" displayed on the interactive interface, e.g., the user selects the search module "warehouse-report". So that items that match the key search terms can be searched from the module.
Optionally, in the foregoing embodiment of the present invention, in a case that the selected search module is a code module, the entered login information of the hosted platform is obtained.
In an alternative embodiment, when the user selects a code module to search, the user may enter a GitHub account in the interface shown in FIG. 2a, such as entering the user name "e 3315110@ nwyt" and the password "Gh 7ujm8 ik".
Optionally, in the above embodiment of the present invention, monitoring the selected item in the item set includes: under the condition that a monitoring instruction is detected in the item set, determining a selected item corresponding to the monitoring instruction; downloading project data of the selected project from the hosting platform, and caching the project data to the local; the selected project is monitored based on the project data.
The monitoring instruction in the above steps can be generated by the user clicking a right key on the item to be monitored, and selecting a download item and monitoring a button.
In an alternative embodiment, after retrieving the item set that matches the retrieval keyword, the user may view each item in the item set in the interactive interface as shown in fig. 2b, and when the user needs to monitor one of the items, for example, the user wants to monitor the item with the item name "SGCC-2013 — Edge", the user may right-click on the item, pop up the menu as shown in fig. 2c, and click on the "download item and monitor" button in the menu for monitoring the item. After the user selects the downloaded item and monitors, the user may jump to the download queue module, as shown in fig. 2d, may invoke the Gif module, download the item content of the item selected by the user, and download to the gitubroot directory to form a local item cache, which facilitates the query and synchronization of the local monitoring module.
Optionally, in the foregoing embodiment of the present invention, monitoring the selected item based on the item data includes: acquiring file search information, wherein the file search information comprises: file name and file suffix; acquiring file data corresponding to file searching information from the project data; it is determined whether there is a leak in the project data based on the file data.
In an alternative embodiment, when monitoring the item selected by the user is needed, the "local monitoring" interface shown in fig. 2e may be entered, the node tree of the item may be displayed on the left side, and the left node tree is double-clicked, the search box is opened, and the interface shown in fig. 2f is displayed. The user may search for a specific file under the item in the interface, the user may enter a file name in the search content box and select a file suffix in the file suffix selection box, or enter a file suffix name in another suffix input box, and after clicking the "start search" button, information of the searched file, such as file name, rank, matching content, creation time, modification time, path, and the like, may be displayed in the interface.
It should be noted that, the office file and the database type file, including word document, doc, docx suffix file, excel document, xlsxx, xls suffix file, access database, mdb suffix file, connection database configuration file, xml suffix file, the key sensitive characters in the file are searched, and the connection database IP address is connected, and whether the sensitive file is divulged or not is judged according to the above contents.
Optionally, in the above embodiment of the present invention, before monitoring the selected item based on the item data, the method further includes: acquiring an updating task interval and a project interval one by one; under the condition that a synchronization instruction is received, synchronizing the selected items based on the updating task interval and the item-by-item interval; and monitoring the selected project based on the synchronized project data.
The synchronization command in the above step may be to keep the latest directory synchronization display effect by clicking a "refresh node list" button near the node tree on the left side of the interface as shown in fig. 2 e.
In an alternative embodiment, as shown in fig. 2e, the user may set the update task interval and the project-by-project interval on the right side of the interface, for example, set the update task interval to 10 minutes and the project-by-project interval to 10 seconds, and then click the "start polling synchronization" button to start synchronizing the timing of the selected project cached locally with the GitHub, so as to implement synchronous update, and after the synchronization is set, the purpose of project monitoring may be implemented by double-clicking the left node tree.
Optionally, in the above embodiment of the present invention, after crawling the collection of items corresponding to the retrieval keyword from the hosting platform, the method further includes: under the condition that a viewing instruction is detected in the item set, determining a first item corresponding to the viewing instruction; outputting the item data of the first item.
The viewing instruction in the above steps may be generated by the user by clicking or double clicking the first item.
In an alternative embodiment, after the item set is searched, as shown in fig. 2g, the user may click the first item pop-up window to quickly view the item information of the first item, or may double click the first item to open the first item in the browser for a detailed query.
Optionally, in the above embodiment of the present invention, after crawling the collection of items corresponding to the retrieval keyword from the hosting platform, the method further includes: determining a second item corresponding to the export instruction when the export instruction is detected in the item set; and exporting the project data of the second project to a preset type of file.
The export instruction in the above step may be generated by the user clicking the second item and clicking a "export this page to Excel" button in a pop-up menu. The preset type in the above step may be an Excel type, but is not limited thereto, and may be determined according to actual needs.
In an alternative embodiment, when the user needs to export a certain item, the user can right click on the item, pop up the menu shown in fig. 2c, and click on the "export this page to Excel" button in the menu, so that the item data of the item can be exported to the Excel file.
Through the scheme, the invention provides a Github monitoring self-checking auxiliary tool, which adopts an interactive interface design, sub-modules are arranged in a TabPage-by-TabPage mode, each Tab page has different listed functions, and a program mainly comprises three modules of related word retrieval, download queue and local monitoring. And appointing retrieval keyword information through different module UI interfaces, clicking to start crawling Github data, selecting a corresponding monitoring item by right clicking to cache to the local, and realizing the functions of local synchronous updating and local searching.
The program comprises a plurality of modules of input and output, program configuration, flow execution, search result analysis processing, downloading of designated items, local item monitoring and specific file searching. All modules are linked in a code coupling mode, and stable execution of the program is ensured.
By using the tool, specified keyword information can be searched on Github, the concerned items can be synchronized to the local in time, and the local full-text detailed retrieval function can be performed. Sensitive contents in the Github contents can be effectively and comprehensively monitored in time through the software, so that the sensitive information is prevented from being leaked, and the adverse effect on the enterprise image is avoided.
Example 2
There is also provided, in accordance with an embodiment of the present invention, a method for hosting platform monitoring, it being noted that the steps illustrated in the flowchart of the figure may be performed in a computer system such as a set of computer-executable instructions and that, although a logical order is illustrated in the flowchart, in some cases the steps illustrated or described may be performed in an order different than here.
Fig. 3 is a flowchart of another monitoring method for a hosting platform according to an embodiment of the present invention, as shown in fig. 3, the method includes the following steps:
step S302, displaying the received search keywords.
The search keyword in the above step may be a keyword input by the user.
And step S304, displaying a monitoring result, wherein the monitoring result is a result generated by monitoring the selected item in the item set, and the item set is a set of items corresponding to the retrieval keyword and crawled from the hosting platform.
The hosting platform in the above steps may be a GitHub platform, but is not limited thereto, and may be any platform capable of implementing code hosting and version control.
Optionally, in the above embodiment of the present invention, before displaying the received search keyword, the method further includes: displaying a search module selected from the hosting platform, wherein the search module comprises one of: the system comprises a warehouse module, a code module, a comment module, a question module, a market module, a main body module, a wiki module and a user module; the display crawls a collection of items from a search module.
Optionally, in the above embodiment of the present invention, in a case that the selected search module is a code module, the method further includes: and displaying the input login information of the managed platform.
Optionally, in the above embodiment of the present invention, before displaying the monitoring result, the method further includes: under the condition that a monitoring instruction is detected in the item set, displaying a selected item corresponding to the monitoring instruction; displaying project data of the selected project downloaded from the hosting platform in a downloading interface, wherein the project data are cached locally; and displaying the monitoring result.
Optionally, in the above embodiment of the present invention, before displaying the monitoring result, the method further includes: displaying file search information, wherein the file search information includes: file name and file suffix; and displaying file data corresponding to file search information acquired from the project data, wherein the monitoring result is determined based on the file data and is used for representing whether the project data has leakage or not.
Optionally, in the above embodiment of the present invention, before displaying the monitoring result, the method further includes: displaying an update task interval and a project-by-project interval; and displaying a synchronization result in the case of receiving the synchronization instruction, wherein the synchronization result is obtained by synchronizing the selected items on the basis of the updating task interval and the item-by-item interval.
Optionally, in the above embodiment of the present invention, before displaying the monitoring result, the method further includes: and displaying the item data of the first item corresponding to the viewing instruction when the viewing instruction is detected in the item set.
Optionally, in the above embodiment of the present invention, before displaying the monitoring result, the method further includes: and displaying an export result of a second item corresponding to the export instruction when the export instruction is detected in the item set, wherein the item data of the second item is exported to a preset type of file.
Example 3
According to an embodiment of the present invention, a monitoring apparatus for a hosting platform is provided, where the apparatus may perform the monitoring method for a hosting platform in embodiment 1, and a specific implementation and a preferred application scenario are the same as those in embodiment 1, and are not described herein again.
Fig. 4 is a schematic view of a monitoring device for a pipe pallet according to an embodiment of the present invention, as shown in fig. 4, the device including:
and an obtaining module 42, configured to obtain the search keyword.
And a crawling module 44, configured to crawl the item set corresponding to the search keyword from the hosting platform.
And a monitoring module 46 for monitoring the selected items in the item set.
Optionally, in the above embodiment of the present invention, the crawling module includes: an obtaining unit, configured to obtain a search module selected from the hosting platform, where the search module includes one of: the system comprises a warehouse module, a code module, a comment module, a question module, a market module, a main body module, a wiki module and a user module; and the crawling unit is used for crawling the item set from the searching module.
Optionally, in the above embodiment of the present invention, the obtaining module is further configured to obtain, when the selected search module is a code module, the input login information of the hosting platform.
Optionally, in the foregoing embodiment of the present invention, the monitoring module includes: the determining unit is used for determining a selected item corresponding to the monitoring instruction under the condition that the monitoring instruction is detected in the item set; the download unit is used for downloading the project data of the selected project from the hosting platform and caching the project data to the local; and the monitoring unit is used for monitoring the selected item based on the item data.
Optionally, in the foregoing embodiment of the present invention, the monitoring unit includes: a first obtaining subunit, configured to obtain file search information, where the file search information includes: file name and file suffix; the second acquisition subunit is used for acquiring file data corresponding to the file search information from the project data; a determining subunit operable to determine whether there is a leak in the item data based on the file data.
Optionally, in the above embodiment of the present invention, the apparatus further includes: the acquisition module is also used for acquiring an update task interval and a project interval one by one; the synchronization module is used for synchronizing the selected items based on the updating task interval and the item-by-item interval under the condition of receiving the synchronization instruction; the monitoring module is also used for monitoring the selected project based on the synchronized project data.
Optionally, in the above embodiment of the present invention, the apparatus further includes: the first determining module is used for determining a first item corresponding to the viewing instruction under the condition that the viewing instruction is detected in the item set; and the output module is used for outputting the project data of the first project.
Optionally, in the above embodiment of the present invention, the apparatus further includes: the second determining module is used for determining a second item corresponding to the export instruction under the condition that the export instruction is detected in the item set; and the export module is used for exporting the project data of the second project to the file of the preset type.
Example 4
According to an embodiment of the present invention, there is also provided a monitoring apparatus for a hosting platform, where the apparatus may perform the monitoring method for a hosting platform in embodiment 2, and a specific implementation and a preferred application scenario are the same as those in embodiment 1, and are not described herein again.
Fig. 5 is a schematic diagram of another monitoring apparatus for hosting a platform according to an embodiment of the present invention, as shown in fig. 5, the apparatus includes:
and a first display module 52, configured to display the received search keyword.
And a second display module 54, configured to display a monitoring result, where the monitoring result is a result generated by monitoring a selected item in an item set, and the item set is a set of items corresponding to the search keyword and crawled from the hosting platform.
Optionally, in the above embodiment of the present invention, the apparatus further includes: a third display module for displaying the search module selected from the hosting platform, wherein the search module comprises one of the following: the system comprises a warehouse module, a code module, a comment module, a question module, a market module, a main body module, a wiki module and a user module; and the fourth display module is used for displaying the item set crawled from the search module.
Optionally, in the above embodiment of the present invention, the apparatus further includes: and the fifth display module is used for displaying the input login information of the hosting platform under the condition that the selected search module is the code module.
Optionally, in the above embodiment of the present invention, the apparatus further includes: the sixth display module is used for displaying the selected item corresponding to the monitoring instruction under the condition that the monitoring instruction is detected in the item set; the seventh display module is used for displaying the item data of the selected item downloaded from the hosting platform in the downloading interface, wherein the item data is cached locally; the second display module is also used for displaying the monitoring result.
Optionally, in the above embodiment of the present invention, the apparatus further includes: an eighth display module, configured to display file search information, where the file search information includes: file name and file suffix; and the ninth display module is used for displaying the file data corresponding to the file search information acquired from the project data, wherein the monitoring result is determined based on the file data and is used for representing whether the project data has leakage or not.
Optionally, in the above embodiment of the present invention, the apparatus further includes: a tenth display module for displaying the update task interval and the item-by-item interval; and the eleventh display module is used for displaying a synchronization result under the condition that the synchronization instruction is received, wherein the synchronization result is obtained by synchronizing the selected items on the basis of the updating task interval and the item-by-item interval.
Optionally, in the above embodiment of the present invention, the apparatus further includes: and the twelfth display module is used for displaying the item data of the first item corresponding to the viewing instruction under the condition that the viewing instruction is detected in the item set.
Optionally, in the above embodiment of the present invention, the apparatus further includes: and the thirteenth display module is used for displaying an export result of a second item corresponding to the export instruction when the export instruction is detected in the item set, wherein the item data of the second item is exported to a preset type of file.
Example 5
According to an embodiment of the present invention, there is also provided a computer-readable storage medium, where the computer-readable storage medium includes a stored program, and when the program runs, the apparatus where the computer-readable storage medium is located is controlled to execute the monitoring method for the managed platform in embodiments 1 and 2.
Example 6
According to an embodiment of the present invention, there is further provided a processor, where the processor is configured to execute a program, where the program executes the monitoring method for the managed platform in embodiments 1 and 2.
The above-mentioned serial numbers of the embodiments of the present invention are merely for description and do not represent the merits of the embodiments.
In the above embodiments of the present invention, the descriptions of the respective embodiments have respective emphasis, and for parts that are not described in detail in a certain embodiment, reference may be made to related descriptions of other embodiments.
In the embodiments provided in the present application, it should be understood that the disclosed technology can be implemented in other ways. The above-described embodiments of the apparatus are merely illustrative, and for example, the division of the units may be a logical division, and in actual implementation, there may be another division, for example, multiple units or components may be combined or integrated into another system, or some features may be omitted, or not executed. In addition, the shown or discussed mutual coupling or direct coupling or communication connection may be an indirect coupling or communication connection through some interfaces, units or modules, and may be in an electrical or other form.
The units described as separate parts may or may not be physically separate, and parts displayed as units may or may not be physical units, may be located in one place, or may be distributed on a plurality of units. Some or all of the units can be selected according to actual needs to achieve the purpose of the solution of the embodiment.
In addition, functional units in the embodiments of the present invention may be integrated into one processing unit, or each unit may exist alone physically, or two or more units are integrated into one unit. The integrated unit can be realized in a form of hardware, and can also be realized in a form of a software functional unit.
The integrated unit, if implemented in the form of a software functional unit and sold or used as a stand-alone product, may be stored in a computer readable storage medium. Based on such understanding, the technical solution of the present invention may be embodied in the form of a software product, which is stored in a storage medium and includes instructions for causing a computer device (which may be a personal computer, a server, or a network device) to execute all or part of the steps of the method according to the embodiments of the present invention. And the aforementioned storage medium includes: a U-disk, a Read-Only Memory (ROM), a Random Access Memory (RAM), a removable hard disk, a magnetic or optical disk, and other various media capable of storing program codes.
The foregoing is only a preferred embodiment of the present invention, and it should be noted that, for those skilled in the art, various modifications and decorations can be made without departing from the principle of the present invention, and these modifications and decorations should also be regarded as the protection scope of the present invention.

Claims (13)

1. A method for monitoring a pipe supporting platform is characterized by comprising the following steps:
acquiring a retrieval keyword;
crawling an item set corresponding to the retrieval keyword from a hosting platform;
and monitoring the selected items in the item set.
2. The method of claim 1, wherein crawling the set of items corresponding to the search keyword from a hosting platform comprises:
obtaining a search module selected from the hosting platform, wherein the search module comprises one of: the system comprises a warehouse module, a code module, a comment module, a question module, a market module, a main body module, a wiki module and a user module;
the set of items is crawled from the search module.
3. The method of claim 2, wherein if the selected search module is the code module, obtaining the entered login information of the hosting platform.
4. The method of claim 1, wherein monitoring selected items in the collection of items comprises:
under the condition that a monitoring instruction is detected in the item set, determining the selected item corresponding to the monitoring instruction;
downloading project data of the selected project from the hosting platform, and caching the project data to the local;
and monitoring the selected project based on the project data.
5. The method of claim 4, wherein monitoring the selected item based on the item data comprises:
acquiring file search information, wherein the file search information comprises: file name and file suffix;
acquiring file data corresponding to the file searching information from the project data;
determining whether there is a leak in the project data based on the file data.
6. The method of claim 4, wherein prior to monitoring the selected item based on the item data, the method further comprises:
acquiring an updating task interval and a project interval one by one;
under the condition that a synchronization instruction is received, synchronizing the selected project based on the updating task interval and the project-by-project interval;
and monitoring the selected project based on the synchronized project data.
7. The method of claim 1, wherein after crawling the set of items corresponding to the search keyword from a hosting platform, the method further comprises:
under the condition that a viewing instruction is detected in the item set, determining a first item corresponding to the viewing instruction;
outputting the item data of the first item.
8. The method of claim 1, wherein after crawling the set of items corresponding to the search keyword from a hosting platform, the method further comprises:
determining a second item corresponding to a lead-out instruction when the lead-out instruction is detected in the item set;
and exporting the project data of the second project to a preset type of file.
9. A method for monitoring a pipe supporting platform is characterized by comprising the following steps:
displaying the received search keyword;
displaying a monitoring result, wherein the monitoring result is a result generated by monitoring a selected item in a set of items, and the set of items is a set of items corresponding to the retrieval keyword and crawled from a hosting platform.
10. A monitoring device for a pipelaying platform, comprising:
the acquisition module is used for acquiring search keywords;
the crawling module is used for crawling the item set corresponding to the retrieval keyword from the hosting platform;
and the monitoring module is used for monitoring the selected items in the item set.
11. A monitoring device for a pipelaying platform, comprising:
the first display module is used for displaying the received search keywords;
and the second display module is used for displaying a monitoring result, wherein the monitoring result is a result generated by monitoring a selected item in an item set, and the item set is a set of items corresponding to the retrieval keyword and crawled from the hosting platform.
12. A computer-readable storage medium, comprising a stored program, wherein when the program runs, the program controls a device on which the computer-readable storage medium is located to execute the monitoring method of the hosting platform according to any one of claims 1 to 9.
13. A processor, configured to execute a program, wherein the program runtime performs the monitoring method of the managed platform according to any one of claims 1 to 9.
CN202011205947.8A 2020-11-02 2020-11-02 Monitoring method and device for hosting platform Pending CN112182338A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202011205947.8A CN112182338A (en) 2020-11-02 2020-11-02 Monitoring method and device for hosting platform

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202011205947.8A CN112182338A (en) 2020-11-02 2020-11-02 Monitoring method and device for hosting platform

Publications (1)

Publication Number Publication Date
CN112182338A true CN112182338A (en) 2021-01-05

Family

ID=73917737

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202011205947.8A Pending CN112182338A (en) 2020-11-02 2020-11-02 Monitoring method and device for hosting platform

Country Status (1)

Country Link
CN (1) CN112182338A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115640324A (en) * 2022-12-23 2023-01-24 深圳开源互联网安全技术有限公司 Information query method, device, terminal equipment and storage medium

Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107291902A (en) * 2017-06-23 2017-10-24 中国人民解放军国防科学技术大学 Automatic marking method is checked in a kind of popular contribution based on hybrid classification technology
CN107977218A (en) * 2017-11-23 2018-05-01 武汉斗鱼网络科技有限公司 Code update method, device, electronic equipment and readable storage medium storing program for executing
CN108459874A (en) * 2018-03-05 2018-08-28 中国人民解放军国防科技大学 Code automatic summarization method integrating deep learning and natural language processing
CN108694324A (en) * 2017-04-06 2018-10-23 腾讯科技(深圳)有限公司 A kind of information leakage monitoring method and device
CN109361713A (en) * 2018-12-17 2019-02-19 深信服科技股份有限公司 Internet risk monitoring and control method, apparatus, equipment and storage medium
CN109710299A (en) * 2018-12-14 2019-05-03 平安普惠企业管理有限公司 A kind of open source class libraries monitoring method, device, equipment and computer storage medium
CN110990836A (en) * 2019-12-18 2020-04-10 南京富士通南大软件技术有限公司 Code leakage detection system and method based on natural language processing technology
CN111753149A (en) * 2020-06-28 2020-10-09 深圳前海微众银行股份有限公司 Sensitive information detection method, device, equipment and storage medium
CN115658067A (en) * 2022-10-21 2023-01-31 深圳开源互联网安全技术有限公司 Leakage code retrieval method and device and computer readable storage medium

Patent Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108694324A (en) * 2017-04-06 2018-10-23 腾讯科技(深圳)有限公司 A kind of information leakage monitoring method and device
CN107291902A (en) * 2017-06-23 2017-10-24 中国人民解放军国防科学技术大学 Automatic marking method is checked in a kind of popular contribution based on hybrid classification technology
CN107977218A (en) * 2017-11-23 2018-05-01 武汉斗鱼网络科技有限公司 Code update method, device, electronic equipment and readable storage medium storing program for executing
CN108459874A (en) * 2018-03-05 2018-08-28 中国人民解放军国防科技大学 Code automatic summarization method integrating deep learning and natural language processing
CN109710299A (en) * 2018-12-14 2019-05-03 平安普惠企业管理有限公司 A kind of open source class libraries monitoring method, device, equipment and computer storage medium
CN109361713A (en) * 2018-12-17 2019-02-19 深信服科技股份有限公司 Internet risk monitoring and control method, apparatus, equipment and storage medium
CN110990836A (en) * 2019-12-18 2020-04-10 南京富士通南大软件技术有限公司 Code leakage detection system and method based on natural language processing technology
CN111753149A (en) * 2020-06-28 2020-10-09 深圳前海微众银行股份有限公司 Sensitive information detection method, device, equipment and storage medium
CN115658067A (en) * 2022-10-21 2023-01-31 深圳开源互联网安全技术有限公司 Leakage code retrieval method and device and computer readable storage medium

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115640324A (en) * 2022-12-23 2023-01-24 深圳开源互联网安全技术有限公司 Information query method, device, terminal equipment and storage medium

Similar Documents

Publication Publication Date Title
US7672938B2 (en) Creating search enabled web pages
US7062475B1 (en) Personalized multi-service computer environment
US10713324B2 (en) Search results for native applications
JP7330891B2 (en) System and method for direct in-browser markup of elements in Internet content
CN103744853B (en) The method and device of Research of Search Engine Website Snapshot System information is provided
CN106528657A (en) Control method and device for browser skipping to application program
US20110225482A1 (en) Managing and generating citations in scholarly work
US10943063B1 (en) Apparatus and method to automate website user interface navigation
US20050262052A1 (en) Web research tool
CN105745644A (en) Establishment of state representation of a web page represented in a web browser
US8219934B2 (en) Method and code module for facilitating navigation between webpages
CN105283843A (en) Embeddable media content search widget
CN112612943A (en) Asynchronous processing framework-based data crawling method with automatic testing function
JP6517437B2 (en) Automated information acquisition
CN103617225B (en) A kind of associating web pages searching method and system
CN112182338A (en) Monitoring method and device for hosting platform
US20130132820A1 (en) Web browsing tool delivering relevant content
US20160299985A1 (en) Method for accessing last search
US8131752B2 (en) Breaking documents
US20130311860A1 (en) Identifying Referred Documents Based on a Search Result
Neeli et al. Automated data mining from web servers using perl script
Salama “Down With Regression!”–Generating Test Suites for the Web
CN112784186A (en) Link address opening method and device, storage medium and electronic equipment
EP1415243B1 (en) Dynamic content based assisted information browsing
Horch et al. Automated Elicitation of Functional User Requirements for Supporting Cloud Service Search

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination