CN115146712A - Internet of things asset identification method, device, equipment and storage medium - Google Patents

Internet of things asset identification method, device, equipment and storage medium Download PDF

Info

Publication number
CN115146712A
CN115146712A CN202210680492.8A CN202210680492A CN115146712A CN 115146712 A CN115146712 A CN 115146712A CN 202210680492 A CN202210680492 A CN 202210680492A CN 115146712 A CN115146712 A CN 115146712A
Authority
CN
China
Prior art keywords
asset
internet
things
classification
keywords
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202210680492.8A
Other languages
Chinese (zh)
Other versions
CN115146712B (en
Inventor
章小敏
李勇
万志宇
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Topsec Technology Co Ltd
Beijing Topsec Network Security Technology Co Ltd
Beijing Topsec Software Co Ltd
Original Assignee
Beijing Topsec Technology Co Ltd
Beijing Topsec Network Security Technology Co Ltd
Beijing Topsec Software Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Topsec Technology Co Ltd, Beijing Topsec Network Security Technology Co Ltd, Beijing Topsec Software Co Ltd filed Critical Beijing Topsec Technology Co Ltd
Priority to CN202210680492.8A priority Critical patent/CN115146712B/en
Publication of CN115146712A publication Critical patent/CN115146712A/en
Application granted granted Critical
Publication of CN115146712B publication Critical patent/CN115146712B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/279Recognition of textual entities
    • G06F40/284Lexical analysis, e.g. tokenisation or collocates
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16YINFORMATION AND COMMUNICATION TECHNOLOGY SPECIALLY ADAPTED FOR THE INTERNET OF THINGS [IoT]
    • G16Y20/00Information sensed or collected by the things
    • G16Y20/30Information sensed or collected by the things relating to resources, e.g. consumed power
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D10/00Energy efficient computing, e.g. low power processors, power management or thermal management

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Computing Systems (AREA)
  • Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • General Health & Medical Sciences (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The disclosure relates to an asset identification method, device, equipment and storage medium of the Internet of things, wherein the method comprises the following steps: acquiring asset information of equipment of the Internet of things, extracting keywords of the asset information, and performing vectorization processing on the asset information to generate vectorized data corresponding to the equipment of the Internet of things; matching the keywords with an asset classification library, if the keywords do not exist in the asset classification library, performing dimensionality reduction on the vectorized data, performing multi-classification on the vectorized data after the dimensionality reduction, and determining a classification result of the Internet of things equipment. According to the technical scheme, labor cost and time cost generated by the asset identification and classification of the Internet of things can be reduced, and the accuracy and intelligence of the asset identification and classification of the Internet of things are improved.

Description

Internet of things asset identification method, device, equipment and storage medium
Technical Field
The disclosure relates to the technical field of internet of things, and in particular relates to an internet of things asset identification method, device, equipment and storage medium.
Background
With the popularization of the internet of everything and the development of artificial intelligence technology, more and more internet of things equipment and services are exposed in the internet, the safety problem of the internet of things is concerned, and the asset identification of the internet of things is a necessary and key part in the safety of the internet of things.
The traditional asset identification method is manually dominant, labor cost and time cost are high, and the clustering algorithm is applied to the internet of things asset identification method generally combined with artificial intelligence, so that the accuracy of the internet of things asset identification method is required to be further improved.
Disclosure of Invention
In order to solve the technical problems or at least partially solve the technical problems, the present disclosure provides an internet of things asset identification method, apparatus, device and storage medium.
In a first aspect, an embodiment of the present disclosure provides an asset identification method for an internet of things, including:
acquiring asset information of the equipment of the Internet of things;
extracting keywords of the asset information, and performing vectorization processing on the asset information to generate vectorized data corresponding to the Internet of things equipment;
matching the keywords with an asset classification library, and if the keywords do not exist in the asset classification library, performing dimension reduction processing on the vectorization data;
and performing multi-classification on the vectorized data subjected to the dimensionality reduction processing, and determining a classification result of the Internet of things equipment.
Optionally, the extracting the keywords of the asset information, and performing vectorization processing on the asset information to generate vectorized data corresponding to the internet of things device includes:
extracting effective information of the asset information to generate text data;
performing word segmentation on the text data, and determining keywords of the text data through a TF-IDF algorithm; and
and vectorizing the text data to generate the vectorized data.
Optionally, after matching the keyword with the asset classification library, the method further includes:
and if the keywords exist in the asset classification library, performing tagging and unification processing on the asset information, and storing the asset information into the asset classification library.
Optionally, the multi-classifying the vectorized data after the dimensionality reduction processing, and determining a classification result of the internet of things device, includes:
inputting the vectorized data subjected to the dimensionality reduction into a Boosting algorithm model for processing, and determining a category corresponding to the Internet of things equipment;
and if the category is the asset category contained in the asset classification library, performing tagging and unification processing on the asset information, and storing the asset information into the asset classification library.
Optionally, after determining the category corresponding to the internet of things device, the method further includes:
determining that the category is not target data of the asset category contained in the asset classification library in the vectorization data after the dimension reduction processing;
clustering the target data through a clustering algorithm to generate a clustering result;
and calibrating the clustering result to store the clustering result in the asset classification library.
Optionally, the method further comprises: and updating the asset classification library according to the classification result of the Internet of things equipment.
Optionally, the obtaining asset information of the internet of things device includes:
sending a network protocol communication request to detect the equipment of the Internet of things;
and responding to the returned response information, and determining the asset information of the equipment of the Internet of things.
In a second aspect, an embodiment of the present disclosure provides an internet of things asset identification device, including:
the acquisition module is used for acquiring asset information of the Internet of things equipment;
the processing module is used for extracting keywords of the asset information, vectorizing the asset information and generating vectorized data corresponding to the Internet of things equipment;
the matching module is used for matching the keywords with an asset classification library, and if the keywords do not exist in the asset classification library, performing dimension reduction processing on the vectorized data;
and the classification module is used for carrying out multi-classification on the vectorization data subjected to the dimensionality reduction processing and determining a classification result of the Internet of things equipment.
In a third aspect, an embodiment of the present disclosure provides an electronic device, including: a processor; a memory for storing the processor-executable instructions; the processor is configured to read the executable instructions from the memory and execute the instructions to implement the method for identifying assets in the internet of things according to the first aspect.
In a fourth aspect, the disclosed embodiments provide a computer-readable storage medium, where the storage medium stores a computer program, and the computer program, when executed by a processor, implements the method for identifying an asset in the internet of things according to the first aspect.
Compared with the prior art, the technical scheme provided by the embodiment of the disclosure has the following advantages: the method comprises the steps of extracting keywords of asset information by obtaining the asset information of the equipment of the Internet of things, and carrying out vectorization processing on the asset information to generate vectorization data corresponding to the equipment of the Internet of things; matching the keywords with an asset classification library, if the keywords do not exist in the asset classification library, performing dimensionality reduction on the vectorized data, performing multi-classification on the vectorized data after the dimensionality reduction, and determining a classification result of the Internet of things equipment. Therefore, the Internet of things asset identification classification is realized based on a natural language processing technology, a dimensionality reduction algorithm and an integrated learning multi-classification algorithm, the labor cost and the time cost generated by the Internet of things asset identification classification are reduced, and the accuracy and the intelligence of the Internet of things asset identification classification are improved.
Drawings
The accompanying drawings, which are incorporated in and constitute a part of this specification, illustrate embodiments consistent with the present disclosure and together with the description, serve to explain the principles of the disclosure.
In order to more clearly illustrate the embodiments or technical solutions in the prior art of the present disclosure, the drawings used in the description of the embodiments or prior art will be briefly described below, and it is obvious for those skilled in the art that other drawings can be obtained according to the drawings without inventive exercise.
Fig. 1 is a schematic flowchart of an asset identification method of the internet of things according to an embodiment of the present disclosure;
fig. 2 is a schematic diagram of another method for identifying assets of the internet of things according to the embodiment of the disclosure;
fig. 3 is a schematic structural diagram of an asset identification device of the internet of things according to an embodiment of the present disclosure;
fig. 4 is a schematic structural diagram of an electronic device according to an embodiment of the present disclosure.
Detailed Description
In order that the above objects, features and advantages of the present disclosure may be more clearly understood, aspects of the present disclosure will be further described below. It should be noted that the embodiments and features of the embodiments of the present disclosure may be combined with each other without conflict.
In the following description, numerous specific details are set forth in order to provide a thorough understanding of the present disclosure, but the present disclosure may be practiced otherwise than as described herein; it is to be understood that the embodiments disclosed in the specification are only a few embodiments of the present disclosure, and not all embodiments.
Fig. 1 is a schematic flow diagram of an internet of things asset identification method according to an embodiment of the present disclosure, where the method according to the embodiment of the present disclosure may be executed by an internet of things asset identification device, and the device may be implemented by software and/or hardware and may be integrated on any electronic device with computing capability, such as a user terminal, e.g., a smart phone, a tablet computer, and the like.
As shown in fig. 1, an asset identification method of an internet of things provided by an embodiment of the present disclosure may include:
step 101, asset information of the internet of things equipment is obtained.
The method can be applied to asset identification and classification scenes of the Internet of things equipment. The internet of things equipment comprises but is not limited to smart home equipment, smart cars, sensor equipment, medical equipment and the like.
In this embodiment, a network protocol communication request may be sent to detect the internet of things device, and then, in response to the returned response information, asset information of the internet of things device is determined. Specifically, an active identification module is arranged, the active identification module detects assets on the internet in a mode of actively sending a network protocol communication request, and collects asset information of the internet of things equipment through response information returned by the request, the technology applied by the active identification module includes but is not limited to Nmap, and the acquired asset information includes multi-dimensional data.
And 102, extracting keywords of asset information, performing vectorization processing on the asset information, and generating vectorized data corresponding to the Internet of things equipment.
In this embodiment, after the asset information of the internet of things device is acquired, vectorization processing is performed on the collected asset information, including extracting effective information such as attributes and tag contents in a message, and then performing word segmentation, keyword extraction and text data vectorization processing on the effective information.
As an example, the extracting keywords of the asset information, and performing vectorization processing on the asset information to generate vectorized data corresponding to the internet of things device includes: extracting effective information from the asset information to generate text data; performing word segmentation on the text data, and determining keywords of the text data through a TF-IDF (term frequency-inverse text frequency index) algorithm; and vectorizing the text data to generate the vectorized data.
In this example, effective information related to asset identification is extracted by a natural language processing technique to generate corresponding text data, and then keywords of the text data are extracted to perform matching according to the keywords. For example, the keyword may be a business name, a device model number, an asset type, and the like.
In the field of natural language processing, text is represented as a vector capable of expressing text semantics through vectorization processing of text data, the text data vectorization processing mode includes but is not limited to a statistical-based method, a neural network-based method, and the like, and for example, vectorization data of text data may be generated by using a vectorization algorithm word2 vec.
And 103, matching the keywords with the asset classification library, and if the keywords do not exist in the asset classification library, performing dimension reduction on the quantitative data.
In this embodiment, the asset classification library includes a plurality of keywords, and the keywords of the text data are matched with the keywords in the asset classification library, and in an embodiment of the present disclosure, after the keywords are matched with the asset classification library, the method further includes: and if the keywords exist in the asset classification library, performing tagging and unification processing on the asset information, and storing the asset information into the asset classification library. The method comprises the steps of performing tagging and unification processing on text data to achieve inventory processing of Internet of things equipment, as an example, determining keywords of the text data to comprise an XX enterprise for the first Internet of things equipment, matching the XX enterprise with an asset classification library, performing inventory processing on the first Internet of things equipment if the XX enterprise also exists in the asset classification library, performing tagging and unification processing on the text data, and updating the asset classification library according to the processed text data.
In this embodiment, in the case that the keyword does not exist in the asset classification library, dimension reduction processing is performed on the vector quantization data. In an internet of things equipment asset identification scene, as the internet of things asset information dimension increases, a dimension disaster is caused, and the dimension disaster refers to a phenomenon that, in a problem related to vector calculation, the calculation amount is exponentially multiplied with the increase of the dimension, and in the case of the dimension disaster, a clustering algorithm increases a large amount of calculation and brings a problem of data accuracy.
And 104, performing multi-classification on the vectorization data subjected to the dimensionality reduction processing, and determining a classification result of the Internet of things equipment.
In this embodiment, the vectorized data after the dimensionality reduction is used as an input of the classification model, and the classification output by the classification model includes the asset type in the current asset classification library, so as to determine the classification result of the internet of things device, optionally, the internet of things device is subjected to warehousing processing according to the classification result of the internet of things device, and the asset classification library is updated.
As an example, the multi-classifying the vectorized data after the dimension reduction processing, and determining the classification result of the internet of things device includes: inputting the vectorized data subjected to the dimensionality reduction processing into a Boosting algorithm model for processing, and determining the category corresponding to the Internet of things equipment; and if the category is the asset category contained in the asset classification library, performing tagging and unification treatment on the asset information, and storing the asset information into the asset classification library.
In this example, the classification model is trained according to sample data, the output of the classification model is the asset type, the input is vectorization data, and optionally, the sample data is constructed according to the vectorization data of the internet of things devices stored in the asset classification library and the asset type to train the classification model.
In this example, the categories output by the model include a first asset category included in the asset classification library and a second asset category not included in the asset classification library, and optionally, after determining the category corresponding to the internet of things device, if the category corresponding to the internet of things device is the first asset category, storing the Internet of things equipment, and if the category corresponding to the Internet of things equipment is the second asset category, further calibrating the Internet of things equipment, namely determining target data of which the category is not the asset category contained in the asset classification library in vectorized data after dimension reduction processing; clustering the target data through a clustering algorithm to generate a clustering result; and calibrating the clustering result to store the clustering result in the asset classification library. The clustering algorithm includes, but is not limited to, a KMeans algorithm, and clusters the partial data to obtain asset type information of different types, and then, based on the clustered asset type information, the new asset type or the existing asset type is determined by calibration, and based on the calibration result, the asset classification library is updated.
For example, referring to fig. 2, an active recognition module, a vectorization module, a matching module, a dimension reduction module, an integrated classification module, a clustering module, and a manual intervention module are provided. The method comprises the steps of actively discovering a model through an Internet of things gateway, actively detecting Internet of things asset equipment in interconnection through Nmap, and carrying out preliminary data cleaning on detected response data. Vectorizing any Internet of things equipment, extracting keywords from the data by using a TF-IDF algorithm, matching the extracted keywords in a current asset classification library, and storing the current Internet of things equipment and updating the asset classification library if the extracted keywords are matched; if the data is not matched, performing data dimension reduction processing on the vectorized data, processing the vectorized data after dimension reduction by using an integrated learning Boosting algorithm, comparing the obtained classification result with the categories in the asset classification library, further clustering the data which are not successfully compared, and displaying the processed result on a gateway interface including the classified and unclassified data. Further, on the gateway interface, classified assets are verified based on further information provided by the customer, unclassified assets are recalibrated, and a library process and asset class library updates, including but not limited to, updates of keywords and asset types in the library, are performed. The above steps are repeatedly executed, and the accuracy and the automation of the asset classification of the Internet of things are improved.
According to the technical scheme of the embodiment of the disclosure, asset information of the equipment of the Internet of things is obtained, keywords of the asset information are extracted, vectorization processing is carried out on the asset information, and vectorization data corresponding to the equipment of the Internet of things are generated; matching the keywords with an asset classification library, if the keywords do not exist in the asset classification library, performing dimensionality reduction on the vectorized data, performing multi-classification on the vectorized data after the dimensionality reduction, and determining a classification result of the Internet of things equipment. Therefore, asset identification and classification are realized based on a natural language processing technology, a dimensionality reduction algorithm and an integrated learning multi-classification algorithm, labor cost and time cost generated by asset identification and classification of the Internet of things are reduced, and accuracy and intelligence of asset identification and classification of the Internet of things are improved.
Fig. 3 is a schematic structural diagram of an internet of things asset identification device provided in an embodiment of the present disclosure, and as shown in fig. 3, the internet of things asset identification device includes: an acquisition module 31, a processing module 32, a matching module 33, and a classification module 34.
The acquiring module 31 is configured to acquire asset information of the internet of things device;
the processing module 32 is configured to extract keywords of the asset information, perform vectorization processing on the asset information, and generate vectorized data corresponding to the internet of things device;
a matching module 33, configured to match the keyword with an asset classification library, and if the keyword does not exist in the asset classification library, perform dimension reduction processing on the vectorized data;
the classification module 34 is configured to perform multi-classification on the vectorized data after the dimension reduction processing, and determine a classification result of the internet of things device.
In an embodiment of the present disclosure, the processing module 32 is specifically configured to: extracting effective information of the asset information to generate text data; performing word segmentation on the text data, and determining keywords of the text data through a TF-IDF algorithm; and vectorizing the text data to generate the vectorized data.
In one embodiment of the present disclosure, the apparatus further comprises: and the first library storage module is used for performing labeling and unification processing on the asset information and storing the asset information into the asset classification library if the keywords exist in the asset classification library.
In one embodiment of the present disclosure, classification module 34 is specifically configured to: inputting the vectorized data subjected to the dimensionality reduction processing into a Boosting algorithm model for processing, and determining the category corresponding to the Internet of things equipment; and if the category is the asset category contained in the asset classification library, performing tagging and unification treatment on the asset information, and storing the asset information into the asset classification library.
In an embodiment of the present disclosure, the classification module 34 is specifically configured to: determining that the category is not target data of the asset category contained in the asset classification library in the vectorization data after the dimension reduction processing; clustering the target data through a clustering algorithm to generate a clustering result; and calibrating the clustering result to store the clustering result in the asset classification library.
In one embodiment of the present disclosure, the apparatus further comprises: and the updating module is used for updating the asset classification library according to the classification result of the Internet of things equipment.
In an embodiment of the present disclosure, the obtaining module 31 is specifically configured to: sending a network protocol communication request to detect the equipment of the Internet of things; and responding to the returned response information, and determining the asset information of the equipment of the Internet of things.
The asset identification device of the internet of things provided by the embodiment of the disclosure can execute any asset identification method of the internet of things provided by the embodiment of the disclosure, and has corresponding functional modules and beneficial effects of the execution method. Reference may be made to the description of any method embodiment of the disclosure that may not be described in detail in the embodiments of the apparatus of the disclosure.
Fig. 4 is a schematic structural diagram of an electronic device according to an embodiment of the present disclosure. As shown in fig. 4, the electronic device 600 includes one or more processors 601 and memory 602.
The processor 601 may be a Central Processing Unit (CPU) or other form of processing unit having data processing capabilities and/or instruction execution capabilities, and may control other components in the electronic device 600 to perform desired functions.
Memory 602 may include one or more computer program products that may include various forms of computer-readable storage media, such as volatile memory and/or non-volatile memory. Volatile memory can include, for example, random Access Memory (RAM), cache memory (cache), and/or the like. The non-volatile memory may include, for example, read Only Memory (ROM), a hard disk, flash memory, and the like. One or more computer program instructions may be stored on a computer-readable storage medium and executed by processor 601 to implement the methods of the embodiments of the present disclosure above and/or other desired functionality. Various contents such as an input signal, a signal component, a noise component, etc. may also be stored in the computer-readable storage medium.
In one example, the electronic device 600 may further include: an input device 603 and an output device 604, which are interconnected by a bus system and/or other form of connection mechanism (not shown). The input device 603 may also include, for example, a keyboard, a mouse, and the like. The output device 604 may output various information including the determined distance information, direction information, and the like to the outside. The output devices 604 may include, for example, a display, speakers, a printer, and a communication network and its connected remote output devices, among others.
Of course, for simplicity, only some of the components of the electronic device 600 relevant to the present disclosure are shown in fig. 4, and components such as buses, input/output interfaces, and the like are omitted. In addition, electronic device 600 may include any other suitable components depending on the particular application.
In addition to the methods and apparatus described above, embodiments of the present disclosure may also be a computer program product comprising computer program instructions that, when executed by a processor, cause the processor to perform any of the methods provided by embodiments of the present disclosure.
The computer program product may write program code for performing operations for embodiments of the present disclosure in any combination of one or more programming languages, including an object oriented programming language such as Java, C + + or the like and conventional procedural programming languages, such as the "C" programming language or similar programming languages. The program code may execute entirely on the user's computing device, partly on the user's device, as a stand-alone software package, partly on the user's computing device and partly on a remote computing device, or entirely on the remote computing device or server.
Furthermore, embodiments of the present disclosure may also be a computer readable storage medium having stored thereon computer program instructions that, when executed by a processor, cause the processor to perform any of the methods provided by embodiments of the present disclosure.
A computer-readable storage medium may employ any combination of one or more readable media. The readable medium may be a readable signal medium or a readable storage medium. A readable storage medium may include, for example, but not limited to, an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device, or a combination of any of the foregoing. More specific examples (a non-exhaustive list) of the readable storage medium include: an electrical connection having one or more wires, a portable disk, a hard disk, a Random Access Memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or flash memory), an optical fiber, a portable compact disc read-only memory (CD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the foregoing.
It is noted that, in this document, relational terms such as "first" and "second," and the like, are used solely to distinguish one entity or action from another entity or action without necessarily requiring or implying any actual such relationship or order between such entities or actions. Also, the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus. Without further limitation, an element defined by the phrases "comprising a," "8230," "8230," or "comprising" does not exclude the presence of additional like elements in a process, method, article, or apparatus that comprises the element.
The foregoing are merely exemplary embodiments of the present disclosure, which enable those skilled in the art to understand or practice the present disclosure. Various modifications to these embodiments will be readily apparent to those skilled in the art, and the generic principles defined herein may be applied to other embodiments without departing from the spirit or scope of the disclosure. Thus, the present disclosure is not intended to be limited to the embodiments shown herein but is to be accorded the widest scope consistent with the principles and novel features disclosed herein.

Claims (10)

1. An asset identification method of the Internet of things is characterized by comprising the following steps:
acquiring asset information of the equipment of the Internet of things;
extracting keywords of the asset information, and performing vectorization processing on the asset information to generate vectorized data corresponding to the Internet of things equipment;
matching the keywords with an asset classification library, and if the keywords do not exist in the asset classification library, performing dimension reduction processing on the vectorization data;
and performing multi-classification on the vectorized data subjected to the dimensionality reduction processing, and determining a classification result of the Internet of things equipment.
2. The method of claim 1, wherein the extracting keywords of the asset information and vectorizing the asset information to generate vectorized data corresponding to the internet of things device comprises:
extracting effective information from the asset information to generate text data;
performing word segmentation on the text data, and determining keywords of the text data through a TF-IDF algorithm; and
and vectorizing the text data to generate the vectorized data.
3. The method of claim 1, after matching the keyword to an asset classification library, further comprising:
and if the keywords exist in the asset classification library, performing tagging and unification processing on the asset information, and storing the asset information into the asset classification library.
4. The method of claim 1, wherein the multi-classifying the vectorized data after the dimension reduction processing to determine the classification result of the internet of things device comprises:
inputting the vectorized data subjected to the dimensionality reduction processing into a Boosting algorithm model for processing, and determining the category corresponding to the Internet of things equipment;
and if the category is the asset category contained in the asset classification library, performing tagging and unification treatment on the asset information, and storing the asset information into the asset classification library.
5. The method of claim 4, after determining the class to which the Internet of things device corresponds, further comprising:
determining that the category is not target data of an asset category contained in the asset classification library in the vectorization data after the dimension reduction processing;
clustering the target data through a clustering algorithm to generate a clustering result;
and calibrating the clustering result to store the clustering result in the asset classification library.
6. The method of any one of claims 1-5, further comprising:
and updating the asset classification library according to the classification result of the Internet of things equipment.
7. The method of claim 1, wherein the obtaining asset information of the internet of things device comprises:
sending a network protocol communication request to detect the equipment of the Internet of things;
and responding to the returned response information, and determining the asset information of the equipment of the Internet of things.
8. An internet of things asset identification device, comprising:
the acquisition module is used for acquiring asset information of the Internet of things equipment;
the processing module is used for extracting keywords of the asset information, vectorizing the asset information and generating vectorized data corresponding to the Internet of things equipment;
the matching module is used for matching the keywords with an asset classification library, and if the keywords do not exist in the asset classification library, performing dimension reduction processing on the vectorized data;
and the classification module is used for performing multi-classification on the vectorized data subjected to the dimensionality reduction processing and determining a classification result of the Internet of things equipment.
9. An electronic device, comprising:
a processor;
a memory for storing the processor-executable instructions;
the processor is configured to read the executable instructions from the memory and execute the instructions to implement the method for identifying assets in the internet of things as claimed in any one of the claims 1 to 7.
10. A computer-readable storage medium, wherein the storage medium stores a computer program, and the computer program when executed by a processor implements the method for identifying assets in the internet of things as claimed in any one of claims 1 to 7.
CN202210680492.8A 2022-06-15 2022-06-15 Internet of things asset identification method, device, equipment and storage medium Active CN115146712B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202210680492.8A CN115146712B (en) 2022-06-15 2022-06-15 Internet of things asset identification method, device, equipment and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202210680492.8A CN115146712B (en) 2022-06-15 2022-06-15 Internet of things asset identification method, device, equipment and storage medium

Publications (2)

Publication Number Publication Date
CN115146712A true CN115146712A (en) 2022-10-04
CN115146712B CN115146712B (en) 2023-04-28

Family

ID=83407854

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202210680492.8A Active CN115146712B (en) 2022-06-15 2022-06-15 Internet of things asset identification method, device, equipment and storage medium

Country Status (1)

Country Link
CN (1) CN115146712B (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116541578A (en) * 2023-05-16 2023-08-04 三峡科技有限责任公司 Asset digital multidimensional management method and system
CN117194947A (en) * 2023-08-16 2023-12-08 惠州市庆展科技有限公司 Smart home equipment characteristic determining method and system
CN117493777A (en) * 2023-12-29 2024-02-02 成都秦川物联网科技股份有限公司 Ultrasonic flowmeter data cleaning method, system and device based on Internet of things

Citations (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104376406A (en) * 2014-11-05 2015-02-25 上海计算机软件技术开发中心 Enterprise innovation resource management and analysis system and method based on big data
CN109033284A (en) * 2018-07-12 2018-12-18 国网福建省电力有限公司 The power information operational system database construction method of knowledge based map
CN109933783A (en) * 2019-01-31 2019-06-25 华融融通(北京)科技有限公司 A kind of essence of a contract method of non-performing asset operation field
CN110021439A (en) * 2019-03-07 2019-07-16 平安科技(深圳)有限公司 Medical data classification method, device and computer equipment based on machine learning
CN110245693A (en) * 2019-05-30 2019-09-17 北京理工大学 In conjunction with the key message infrastructure assets recognition methods of mixing random forest
CN110781955A (en) * 2019-10-24 2020-02-11 中国银联股份有限公司 Method and device for classifying label-free objects and detecting nested codes and computer-readable storage medium
CN110825941A (en) * 2019-10-17 2020-02-21 北京天融信网络安全技术有限公司 Content management system identification method, device and storage medium
US20200169574A1 (en) * 2018-11-27 2020-05-28 General Electric Company Framework to develop cyber-physical system behavior-based monitoring
US20200210907A1 (en) * 2018-12-26 2020-07-02 Accenture Global Solutions Limited Utilizing econometric and machine learning models to identify analytics data for an entity
CN111897962A (en) * 2020-07-27 2020-11-06 绿盟科技集团股份有限公司 Internet of things asset marking method and device
CN113095078A (en) * 2021-05-08 2021-07-09 中国工商银行股份有限公司 Associated asset determination method and device and electronic equipment
CN114491049A (en) * 2022-02-17 2022-05-13 金陵科技学院 Office system asset allocation method based on information management

Patent Citations (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104376406A (en) * 2014-11-05 2015-02-25 上海计算机软件技术开发中心 Enterprise innovation resource management and analysis system and method based on big data
CN109033284A (en) * 2018-07-12 2018-12-18 国网福建省电力有限公司 The power information operational system database construction method of knowledge based map
US20200169574A1 (en) * 2018-11-27 2020-05-28 General Electric Company Framework to develop cyber-physical system behavior-based monitoring
US20200210907A1 (en) * 2018-12-26 2020-07-02 Accenture Global Solutions Limited Utilizing econometric and machine learning models to identify analytics data for an entity
CN109933783A (en) * 2019-01-31 2019-06-25 华融融通(北京)科技有限公司 A kind of essence of a contract method of non-performing asset operation field
CN110021439A (en) * 2019-03-07 2019-07-16 平安科技(深圳)有限公司 Medical data classification method, device and computer equipment based on machine learning
CN110245693A (en) * 2019-05-30 2019-09-17 北京理工大学 In conjunction with the key message infrastructure assets recognition methods of mixing random forest
CN110825941A (en) * 2019-10-17 2020-02-21 北京天融信网络安全技术有限公司 Content management system identification method, device and storage medium
CN110781955A (en) * 2019-10-24 2020-02-11 中国银联股份有限公司 Method and device for classifying label-free objects and detecting nested codes and computer-readable storage medium
CN111897962A (en) * 2020-07-27 2020-11-06 绿盟科技集团股份有限公司 Internet of things asset marking method and device
CN113095078A (en) * 2021-05-08 2021-07-09 中国工商银行股份有限公司 Associated asset determination method and device and electronic equipment
CN114491049A (en) * 2022-02-17 2022-05-13 金陵科技学院 Office system asset allocation method based on information management

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
王家驹等: "基于多维尺度分析和改进K-means的台户关系辨识方法", 《电气自动化》 *

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116541578A (en) * 2023-05-16 2023-08-04 三峡科技有限责任公司 Asset digital multidimensional management method and system
CN117194947A (en) * 2023-08-16 2023-12-08 惠州市庆展科技有限公司 Smart home equipment characteristic determining method and system
CN117493777A (en) * 2023-12-29 2024-02-02 成都秦川物联网科技股份有限公司 Ultrasonic flowmeter data cleaning method, system and device based on Internet of things
CN117493777B (en) * 2023-12-29 2024-03-15 成都秦川物联网科技股份有限公司 Ultrasonic flowmeter data cleaning method, system and device based on Internet of things

Also Published As

Publication number Publication date
CN115146712B (en) 2023-04-28

Similar Documents

Publication Publication Date Title
CN108153901B (en) Knowledge graph-based information pushing method and device
CN107705066B (en) Information input method and electronic equipment during commodity warehousing
CN109522557B (en) Training method and device of text relation extraction model and readable storage medium
CN115146712B (en) Internet of things asset identification method, device, equipment and storage medium
US11170179B2 (en) Systems and methods for natural language processing of structured documents
CN109471944B (en) Training method and device of text classification model and readable storage medium
CN110069709B (en) Intention recognition method, device, computer readable medium and electronic equipment
US20240029086A1 (en) Discovery of new business openings using web content analysis
CN113255370B (en) Industry type recommendation method, device, equipment and medium based on semantic similarity
CN109947903B (en) Idiom query method and device
JP2020091865A (en) Metadata-based api attribute extraction
CN111125566B (en) Information acquisition method and device, electronic equipment and storage medium
CN114416998A (en) Text label identification method and device, electronic equipment and storage medium
CN111949785A (en) Query statement management method and device, readable storage medium and electronic device
CN116739626A (en) Commodity data mining processing method and device, electronic equipment and readable medium
CN108959289B (en) Website category acquisition method and device
CN110489740B (en) Semantic analysis method and related product
US20230376692A1 (en) Technical document issues scanner
CN112069304A (en) Question answering method, device, server and storage medium for insurance business
CN112100364A (en) Text semantic understanding method and model training method, device, equipment and medium
CN112015773B (en) Knowledge base retrieval method and device, electronic equipment and storage medium
CN114385918A (en) Message pushing method and device, computer equipment and storage medium
CN113901817A (en) Document classification method and device, computer equipment and storage medium
CN110516717B (en) Method and apparatus for generating image recognition model
CN113837836A (en) Model recommendation method, device, equipment and storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant