CN117077067A - Information system automatic deployment planning method based on intelligent matching - Google Patents

Information system automatic deployment planning method based on intelligent matching Download PDF

Info

Publication number
CN117077067A
CN117077067A CN202311346131.0A CN202311346131A CN117077067A CN 117077067 A CN117077067 A CN 117077067A CN 202311346131 A CN202311346131 A CN 202311346131A CN 117077067 A CN117077067 A CN 117077067A
Authority
CN
China
Prior art keywords
dimensional data
dimension
data
isolated
ith
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202311346131.0A
Other languages
Chinese (zh)
Other versions
CN117077067B (en
Inventor
王建宏
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Yakang Wanwei Information Technology Co ltd
Original Assignee
Beijing Yakang Wanwei Information Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Yakang Wanwei Information Technology Co ltd filed Critical Beijing Yakang Wanwei Information Technology Co ltd
Priority to CN202311346131.0A priority Critical patent/CN117077067B/en
Publication of CN117077067A publication Critical patent/CN117077067A/en
Application granted granted Critical
Publication of CN117077067B publication Critical patent/CN117077067B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • G06F18/243Classification techniques relating to the number of classes
    • G06F18/2433Single-class perspective, e.g. one-against-all classification; Novelty detection; Outlier detection
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/22Matching criteria, e.g. proximity measures
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • G06F18/243Classification techniques relating to the number of classes
    • G06F18/24323Tree-organised classifiers
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N20/00Machine learning
    • G06N20/20Ensemble learning

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Evolutionary Computation (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • General Engineering & Computer Science (AREA)
  • Artificial Intelligence (AREA)
  • Evolutionary Biology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Software Systems (AREA)
  • Medical Informatics (AREA)
  • Computing Systems (AREA)
  • Mathematical Physics (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)
  • Image Analysis (AREA)

Abstract

The invention relates to the technical field of data management, in particular to an information system automatic deployment planning method based on intelligent matching, which comprises the following steps: constructing a plurality of isolated trees of all high-dimensional data by utilizing an isolated forest anomaly detection algorithm; calculating the isolated characteristic of each high-dimensional data in each dimension according to the isolated tree; obtaining a dimension sequence of each high-dimensional data according to the isolated features; calculating the suitability degree of partial dimension in the dimension sequence of each high-dimensional data as the characteristic dimension of each high-dimensional data; obtaining all feature dimensions of each high-dimensional data according to the suitability degree; obtaining all concerned data in each dimension according to all characteristic dimensions of each high-dimensional data; storing all the concerned data in each dimension in each storage group to obtain a threshold value of each storage group; and matching the high-dimensional data to be searched according to the threshold value of each storage group. The invention accelerates the speed of accurately matching the multi-dimensional data to the optimal result and improves the matching efficiency.

Description

Information system automatic deployment planning method based on intelligent matching
Technical Field
The invention relates to the technical field of data management, in particular to an information system automatic deployment planning method based on intelligent matching.
Background
The information system based on intelligent matching utilizes an artificial intelligence technology, and provides more personalized, accurate and efficient information acquisition experience by understanding user requirements and intelligently matching the user requirements with information resources.
The requirement of matching optimal results in multidimensional data is often met when intelligent matching is carried out, each dimension is usually matched in each data in the prior art, and the problem of the method is that multiple times of matching are needed, and the matching efficiency is quite low.
Therefore, how to quickly and accurately match to the optimal result in the multidimensional data is a technical problem to be solved.
Disclosure of Invention
In order to solve the problems, the invention provides an intelligent matching-based automatic information system deployment planning method, which comprises the following steps:
acquiring all high-dimensional data required to be stored in an information system based on intelligent matching;
constructing a plurality of isolated trees of all high-dimensional data by utilizing an isolated forest anomaly detection algorithm; calculating the isolated characteristic of each high-dimensional data in each dimension according to the isolated tree; obtaining a dimension sequence of each high-dimensional data according to the isolated features; calculating the suitability degree of partial dimension in the dimension sequence of each high-dimensional data as the characteristic dimension of each high-dimensional data;
obtaining all feature dimensions of each high-dimensional data according to the suitability degree; obtaining all concerned data in each dimension according to all characteristic dimensions of each high-dimensional data; storing all the concerned data in each dimension in each storage group to obtain a threshold value of each storage group;
and matching the high-dimensional data to be searched according to the threshold value of each storage group.
Further, the calculating the isolated feature of each high-dimensional data in each dimension comprises the following specific steps:
according to the strong feature isolation tree and the weak feature isolation tree of each high-dimensional data in each dimension, calculating the isolated feature of each high-dimensional data in each dimension, wherein a specific calculation formula is as follows:
in the method, in the process of the invention,representing isolated features of the ith high dimensional data in the jth dimension,/for example>Representing the number of strong feature orphan trees of the ith data in the jth dimension, N representing the total number of all orphan trees, then +.>For the number of weak feature isolation trees of the ith data in the jth dimension, +.>Representing the number of high-dimensional data points of the ith data in the child node of the jth dimension in the weak feature isolation tree of the ith data in the jth dimension, and M represents the total number of all the high-dimensional data.
Further, the method for acquiring the strong feature isolated tree and the weak feature isolated tree of each high-dimensional data in each dimension is specifically as follows:
in any one of the isolated trees, if the ith high-dimensional data is isolated data under the child node of the jth dimension, taking the isolated tree as a strong characteristic isolated tree of the ith high-dimensional data under the jth dimension; otherwise, the tree is used as a weak feature isolation tree of the ith high-dimensional data in the jth dimension.
Further, the step of obtaining the dimension sequence of each high-dimensional data according to the isolated features comprises the following specific steps:
and for any one high-dimensional data, arranging all dimensions according to the sequence from large to small of the isolated features of the high-dimensional data in each dimension to obtain a dimension sequence of the high-dimensional data.
Further, the calculating the suitability degree of the partial dimension in the dimension sequence of each high-dimensional data as the characteristic dimension of each high-dimensional data includes the following specific steps:
in the method, in the process of the invention,representing the appropriateness of the first r dimensions in the sequence of dimensions of the ith high-dimensional data as characteristic dimensions of the ith high-dimensional data,/for>Isolated indicators respectively representing the 1 st dimension and the r th dimension of the ith high-dimensional data in the dimension sequence of the ith high-dimensional data, T represents the total number of all dimensions,/>An exponential function based on a natural constant is represented.
Further, the obtaining all feature dimensions of each high-dimensional data comprises the following specific steps:
for the ith high-dimensional data, calculating the former r in the dimension sequence of the ith high-dimensional data) And taking the dimension as the suitability degree of the characteristic dimension of each high-dimensional data, taking r corresponding to the maximum value as the number of the characteristic dimension of the ith high-dimensional data, taking the first r dimensions in the dimension sequence of the ith high-dimensional data as the characteristic dimension of the ith high-dimensional data, and acquiring all the characteristic dimensions of each high-dimensional data.
Further, the obtaining all the data of interest in each dimension includes the following specific steps:
and if the jth dimension is the characteristic dimension of the ith high-dimensional data, taking the ith high-dimensional data as the concerned data in the jth dimension, and acquiring all concerned data in each dimension.
Further, the storing all the data of interest in each dimension in each storage group to obtain the threshold value of each storage group includes the following specific steps:
all the data of interest in each dimension are stored together as one storage group, and the range consisting of the minimum value and the maximum value of the data value of all the data of interest in each dimension is used as the threshold value of each storage group.
Further, the matching of the high-dimensional data to be searched according to the threshold value of each storage group comprises the following specific steps:
matching the high-dimensional data to be searched with each high-dimensional data in the preferential matching group of the high-dimensional data to be searched, and if the matched high-dimensional data does not exist, sequentially matching the high-dimensional data to be searched with each high-dimensional data in each group to be searched of the high-dimensional data to be searched until the high-dimensional data matched with the high-dimensional data to be searched is obtained.
Further, the method for acquiring the priority matching group of the high-dimensional data to be searched and the packet to be searched specifically includes the following steps:
obtaining high-dimensional data to be searched, judging whether the value of the high-dimensional data to be searched in each dimension is in the threshold value of the storage grouping corresponding to each dimension, and if the value of the high-dimensional data to be searched in each dimension is in the threshold value of the storage grouping corresponding to the dimension, taking the storage grouping as the group to be searched of the high-dimensional data to be searched, obtaining the intersection of all groups to be searched of the high-dimensional data to be searched, and taking the intersection as the priority matching group of the high-dimensional data to be searched.
The technical scheme of the invention has the beneficial effects that: according to the method, the isolated characteristics of each high-dimensional data in each dimension are calculated according to the isolated tree, all characteristic dimensions of each high-dimensional data are obtained according to the suitability degree of a part of dimensions in a dimension sequence of the high-dimensional data as characteristic dimensions of each high-dimensional data, all concerned data in each dimension are obtained according to all characteristic dimensions of each high-dimensional data, all concerned data in each dimension are stored in each storage group, the threshold value of each storage group is obtained, when the high-dimensional data to be searched are matched, the high-dimensional data are matched with each high-dimensional data in a preferential matching group of the high-dimensional data to be searched, the speed of accurately matching the high-dimensional data to an optimal result is increased, and the matching efficiency is improved.
Drawings
In order to more clearly illustrate the embodiments of the invention or the technical solutions in the prior art, the drawings that are required in the embodiments or the description of the prior art will be briefly described, it being obvious that the drawings in the following description are only some embodiments of the invention, and that other drawings may be obtained according to these drawings without inventive effort for a person skilled in the art.
FIG. 1 is a flow chart of a method for automatically deploying and planning an information system based on intelligent matching.
Detailed Description
In order to further describe the technical means and effects adopted by the invention to achieve the preset aim, the following is a detailed description of specific implementation, structure, characteristics and effects thereof of an intelligent matching-based information system automatic deployment planning method according to the invention with reference to the accompanying drawings and preferred embodiments. In the following description, different "one embodiment" or "another embodiment" means that the embodiments are not necessarily the same. Furthermore, the particular features, structures, or characteristics of one or more embodiments may be combined in any suitable manner.
Unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs.
The following specifically describes a specific scheme of the intelligent matching-based information system automatic deployment planning method.
Referring to fig. 1, a flowchart of a method for automatically deploying a data transmission module of a planning method for an information system based on intelligent matching according to an embodiment of the present invention is shown, where the method includes:
s001, obtain all high-dimensional data that needs to be stored in the intelligent matching-based information system.
In particular, all high-dimensional data that needs to be stored in an intelligent matching based information system is acquired, wherein the high-dimensional data includes several dimensions.
S002, constructing a plurality of isolated trees of all high-dimensional data by utilizing an isolated forest anomaly detection algorithm; calculating the isolated characteristic of each high-dimensional data in each dimension according to the isolated tree; obtaining a dimension sequence of each high-dimensional data according to the isolated features; calculating the suitability degree of partial dimension in the dimension sequence of each high-dimensional data as the characteristic dimension of each high-dimensional data; obtaining all feature dimensions of each high-dimensional data according to the suitability degree; obtaining all concerned data in each dimension according to all characteristic dimensions of each high-dimensional data; all data of interest in each dimension is stored in each storage group, and the threshold value of each storage group is obtained.
It should be noted that, in order to quickly and accurately match to the optimal result in the multidimensional data, considering that each high-dimensional data has its dimension different from other high-dimensional data, it is necessary to store the high-dimensional data in groups according to different dimensions, so that all the high-dimensional data stored in each group is distributed in a certain range on the dimension corresponding to each group, and in this embodiment, it is considered that each high-dimensional data is stored below the dimension in which it is located, and when matching is performed, matching is performed in the storage group below the dimension corresponding to the high-dimensional data to be searched.
1. And constructing a plurality of isolated trees of all high-dimensional data by using an isolated forest anomaly detection algorithm.
Specifically, constructing a plurality of isolated trees of all high-dimensional data by using an isolated forest anomaly detection algorithm, dividing all the high-dimensional data into two sub-nodes by each isolated tree through segmentation values of different dimensions in sequence until the depth of the isolated tree reaches the maximum depth or the number of samples in poles is smaller than or equal to the minimum number of split samples, repeating the steps until a specified number of isolated trees are constructed, wherein the maximum depth, the minimum number of split samples and the specified number are parameters in the isolated forest anomaly detection algorithm, and setting the parameters in the isolated forest anomaly detection algorithm as default values.
It should be noted that, the isolated forest anomaly detection algorithm is an unsupervised learning algorithm based on anomaly detection, and identifies outliers in the data set by constructing outliers in random binary search tree isolated data, and the isolated forest anomaly detection algorithm is in the prior art and will not be described here again.
2. And calculating the isolated characteristic of each high-dimensional data in each dimension according to the isolated tree.
It should be noted that, the isolated forest anomaly detection algorithm sequentially divides all the high-dimensional data by the division values of different dimensions, so as to construct an isolated tree, and similar high-dimensional data are divided together in each dimension, so that the constructed isolated tree can represent the specificity of each high-dimensional data in each dimension, and if the less the high-dimensional data are divided together in a certain dimension, the more specific the high-dimensional data are in the dimension, and correspondingly, the more the high-dimensional data can be represented in the dimension.
Specifically, in any one of the isolated trees, if the ith high-dimensional data is isolated data under the child node of the jth dimension, the isolated tree is used as a strong characteristic isolated tree of the ith high-dimensional data under the jth dimension; otherwise, the tree is used as a weak feature isolation tree of the ith high-dimensional data in the jth dimension.
Further, according to the strong feature isolation tree and the weak feature isolation tree of each high-dimensional data in each dimension, calculating the isolated feature of each high-dimensional data in each dimension, wherein the specific calculation formula is as follows:
in the method, in the process of the invention,representing isolated features of the ith high dimensional data in the jth dimension,/for example>Representing the number of strong feature orphan trees of the ith data in the jth dimension, N representing the total number of all orphan trees, then +.>For the number of weak feature isolation trees of the ith data in the jth dimension, +.>Representing the number of high-dimensional data points of the ith data in the child node of the jth dimension in the weak feature isolation tree of the ith data in the jth dimension, and M represents the total number of all the high-dimensional data.
It should be noted that, if the ith high-dimensional data is isolated data under the child node of the jth dimension, the ith high-dimensional data is far away from other high-dimensional data in the jth dimension, so that the more such isolated trees are, the jth dimension can represent the ith high-dimensional data, and the larger the isolated feature of the ith high-dimensional data in the jth dimension is; if the ith high-dimensional data is not isolated data under the child node of the jth dimension, at this time, the smaller the number of the high-dimensional data points of the ith data in the child node of the jth dimension is, the more the ith high-dimensional data is far away from other high-dimensional data in the jth dimension, the jth dimension can represent the ith high-dimensional data, and the greater the isolated feature of the ith high-dimensional data in the jth dimension is.
3. Obtaining a dimension sequence of each high-dimensional data according to the isolated features; calculating the suitability degree of partial dimension in the dimension sequence of each high-dimensional data as the characteristic dimension of each high-dimensional data; all feature dimensions of each high-dimensional data are obtained according to the fitness level.
It should be noted that, considering that each high-dimensional data may be specific in multiple dimensions, it is necessary to obtain the feature dimension of each high-dimensional data according to the distribution of isolated features of each high-dimensional data in all dimensions.
Specifically, for any one high-dimensional data, all dimensions are arranged according to the order of the isolated features of the high-dimensional data in each dimension from large to small, and a dimension sequence of the high-dimensional data is obtained.
Further, the first r dimensions in the dimension sequence of each high-dimensional data are calculated to be suitable degrees of the characteristic dimension of each high-dimensional data, and a specific calculation formula is as follows:
in the method, in the process of the invention,representing the appropriateness of the first r dimensions in the sequence of dimensions of the ith high-dimensional data as characteristic dimensions of the ith high-dimensional data,/for>Isolated indicators respectively representing the 1 st dimension and the r th dimension of the ith high-dimensional data in the dimension sequence of the ith high-dimensional data, T represents the total number of all dimensions,/>An exponential function based on a natural constant is represented.
It should be noted that the number of the substrates,the smaller the isolated index distribution representing the first r dimensions in the dimension sequence of the ith high-dimensional data, the more consistent the first r dimensions can all represent the characteristics of the ith high-dimensional data, the more suitable the first r dimensions are as the characteristic dimensions of the ith high-dimensional data, and the greater the suitability degree of the first r dimensions in the dimension sequence of the ith high-dimensional data as the characteristic dimensions of the ith high-dimensional data is>The larger the dimension sequence of the ith high-dimensional data, the greater the suitability of the first r dimensions as feature dimensions of the ith high-dimensional data.
Further, for the ith high-dimensional data, calculating the former r in the dimension sequence of the ith high-dimensional data) The dimension is taken as the suitability degree of the characteristic dimension of each high-dimensional data, r corresponding to the maximum value is taken as the number of characteristic dimensions of the ith high-dimensional data, and the first r dimensions in the dimension sequence of the ith high-dimensional data are taken as the number of characteristic dimensions of the ith high-dimensional dataAnd acquiring all feature dimensions of each piece of high-dimensional data.
4. And according to all feature dimensions of each high-dimensional data, obtaining all concerned data in each dimension, storing all concerned data in each dimension in each storage group, and obtaining a threshold value of each storage group.
Specifically, if the jth dimension is a feature dimension of the ith high-dimensional data, taking the ith high-dimensional data as the data of interest in the jth dimension, and acquiring all the data of interest in each dimension.
Further, all the data of interest in each dimension are stored together as one storage group, and a range of the minimum value and the maximum value of the data value of all the data of interest in each dimension is used as a threshold value of each storage group.
S003, matching the high-dimensional data to be searched according to the threshold value of each storage group.
Specifically, for the high-dimensional data to be searched, judging whether the value of the high-dimensional data to be searched in each dimension is in the threshold value of the storage group corresponding to each dimension, if the value of the high-dimensional data to be searched in each dimension is in the threshold value of the storage group corresponding to the dimension, taking the storage group as the group to be searched of the high-dimensional data to be searched, and obtaining the intersection of all groups to be searched of the high-dimensional data to be searched as the priority matching group of the high-dimensional data to be searched.
Further, matching the high-dimensional data to be searched with each high-dimensional data in the priority matching group of the high-dimensional data to be searched, and if the matched high-dimensional data does not exist, sequentially matching the high-dimensional data to be searched with each high-dimensional data in each group to be searched of the high-dimensional data to be searched until the high-dimensional data matched with the high-dimensional data to be searched is obtained.
It should be noted that, if the values of the high-dimensional data to be searched in the plurality of dimensions are within the threshold value of the storage group in the dimension, the high-dimensional data to be searched in the plurality of dimensions has specificity, and the high-dimensional data in the intersection of the storage groups corresponding to the dimensions has specificity in the plurality of dimensions, the probability that the high-dimensional data to be searched can be matched with the high-dimensional data in the intersection of the storage groups corresponding to the dimensions is larger, so that the high-dimensional data to be searched is matched with each high-dimensional data in the priority matching group of the high-dimensional data to be searched first, the speed of accurately matching the high-dimensional data to the optimal result can be increased, and the matching efficiency is improved.
The above description is only of the preferred embodiments of the present invention and is not intended to limit the invention, but any modifications, equivalent substitutions, improvements, etc. within the principles of the present invention should be included in the scope of the present invention.

Claims (10)

1. An intelligent matching-based automatic information system deployment planning method is characterized by comprising the following steps:
acquiring all high-dimensional data required to be stored in an information system based on intelligent matching;
constructing a plurality of isolated trees of all high-dimensional data by utilizing an isolated forest anomaly detection algorithm; calculating the isolated characteristic of each high-dimensional data in each dimension according to the isolated tree; obtaining a dimension sequence of each high-dimensional data according to the isolated features; calculating the suitability degree of partial dimension in the dimension sequence of each high-dimensional data as the characteristic dimension of each high-dimensional data;
obtaining all feature dimensions of each high-dimensional data according to the suitability degree; obtaining all concerned data in each dimension according to all characteristic dimensions of each high-dimensional data; storing all the concerned data in each dimension in each storage group to obtain a threshold value of each storage group;
and matching the high-dimensional data to be searched according to the threshold value of each storage group.
2. The method for automatically deploying and planning an information system based on intelligent matching according to claim 1, wherein the calculating the isolated feature of each high-dimensional data in each dimension comprises the following specific steps:
according to the strong feature isolation tree and the weak feature isolation tree of each high-dimensional data in each dimension, calculating the isolated feature of each high-dimensional data in each dimension, wherein a specific calculation formula is as follows:
in the method, in the process of the invention,representing isolated features of the ith high dimensional data in the jth dimension,/for example>Representing the number of strong feature orphan trees of the ith data in the jth dimension, N representing the total number of all orphan trees, then +.>For the number of weak feature isolation trees of the ith data in the jth dimension, +.>Representing the number of high-dimensional data points of the ith data in the child node of the jth dimension in the weak feature isolation tree of the ith data in the jth dimension, and M represents the total number of all the high-dimensional data.
3. The method for automatically deploying and planning an information system based on intelligent matching according to claim 2, wherein the method for acquiring the strong feature isolated tree and the weak feature isolated tree of each high-dimensional data in each dimension is specifically as follows:
in any one of the isolated trees, if the ith high-dimensional data is isolated data under the child node of the jth dimension, taking the isolated tree as a strong characteristic isolated tree of the ith high-dimensional data under the jth dimension; otherwise, the tree is used as a weak feature isolation tree of the ith high-dimensional data in the jth dimension.
4. The method for automatically deploying and planning an information system based on intelligent matching according to claim 1, wherein the step of obtaining the dimension sequence of each high-dimensional data according to the isolated features comprises the following specific steps:
and for any one high-dimensional data, arranging all dimensions according to the sequence from large to small of the isolated features of the high-dimensional data in each dimension to obtain a dimension sequence of the high-dimensional data.
5. The method for automatically deploying and planning an information system based on intelligent matching according to claim 1, wherein the calculating the suitability of a part of dimensions in the sequence of dimensions of each high-dimensional data as the characteristic dimensions of each high-dimensional data comprises the following specific steps:
in the method, in the process of the invention,representing the appropriateness of the first r dimensions in the sequence of dimensions of the ith high-dimensional data as characteristic dimensions of the ith high-dimensional data,/for>Isolated indicators respectively representing the 1 st dimension and the r th dimension of the ith high-dimensional data in the dimension sequence of the ith high-dimensional data, T represents the total number of all dimensions,/>An exponential function based on a natural constant is represented.
6. The method for automatically deploying and planning an information system based on intelligent matching according to claim 1, wherein the steps of obtaining all feature dimensions of each high-dimensional data comprise the following specific steps:
for the ith high-dimensional data, calculating the former r in the dimension sequence of the ith high-dimensional data) And taking the dimension as the suitability degree of the characteristic dimension of each high-dimensional data, taking r corresponding to the maximum value as the number of the characteristic dimension of the ith high-dimensional data, taking the first r dimensions in the dimension sequence of the ith high-dimensional data as the characteristic dimension of the ith high-dimensional data, and acquiring all the characteristic dimensions of each high-dimensional data.
7. The method for automatically deploying and planning an information system based on intelligent matching according to claim 1, wherein the steps of obtaining all the data of interest in each dimension comprise the following steps:
and if the jth dimension is the characteristic dimension of the ith high-dimensional data, taking the ith high-dimensional data as the concerned data in the jth dimension, and acquiring all concerned data in each dimension.
8. The method for automatically deploying and planning an information system based on intelligent matching according to claim 1, wherein the steps of storing all the data of interest in each dimension in each storage group to obtain the threshold value of each storage group comprise the following specific steps:
all the data of interest in each dimension are stored together as one storage group, and the range consisting of the minimum value and the maximum value of the data value of all the data of interest in each dimension is used as the threshold value of each storage group.
9. The method for automatically deploying and planning an information system based on intelligent matching according to claim 1, wherein the matching of the high-dimensional data to be searched according to the threshold value of each storage group comprises the following specific steps:
matching the high-dimensional data to be searched with each high-dimensional data in the preferential matching group of the high-dimensional data to be searched, and if the matched high-dimensional data does not exist, sequentially matching the high-dimensional data to be searched with each high-dimensional data in each group to be searched of the high-dimensional data to be searched until the high-dimensional data matched with the high-dimensional data to be searched is obtained.
10. The method for automatically deploying and planning an information system based on intelligent matching according to claim 9, wherein the method for acquiring the priority matching group and the group of the high-dimensional data to be searched specifically comprises the following steps:
obtaining high-dimensional data to be searched, judging whether the value of the high-dimensional data to be searched in each dimension is in the threshold value of the storage grouping corresponding to each dimension, and if the value of the high-dimensional data to be searched in each dimension is in the threshold value of the storage grouping corresponding to the dimension, taking the storage grouping as the group to be searched of the high-dimensional data to be searched, obtaining the intersection of all groups to be searched of the high-dimensional data to be searched, and taking the intersection as the priority matching group of the high-dimensional data to be searched.
CN202311346131.0A 2023-10-18 2023-10-18 Information system automatic deployment planning method based on intelligent matching Active CN117077067B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202311346131.0A CN117077067B (en) 2023-10-18 2023-10-18 Information system automatic deployment planning method based on intelligent matching

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202311346131.0A CN117077067B (en) 2023-10-18 2023-10-18 Information system automatic deployment planning method based on intelligent matching

Publications (2)

Publication Number Publication Date
CN117077067A true CN117077067A (en) 2023-11-17
CN117077067B CN117077067B (en) 2023-12-22

Family

ID=88706556

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202311346131.0A Active CN117077067B (en) 2023-10-18 2023-10-18 Information system automatic deployment planning method based on intelligent matching

Country Status (1)

Country Link
CN (1) CN117077067B (en)

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110991552A (en) * 2019-12-12 2020-04-10 支付宝(杭州)信息技术有限公司 Isolated forest model construction and prediction method and device based on federal learning
CN111275547A (en) * 2020-03-19 2020-06-12 重庆富民银行股份有限公司 Wind control system and method based on isolated forest
CN112434208A (en) * 2020-12-03 2021-03-02 百果园技术(新加坡)有限公司 Training of isolated forest and identification method and related device of web crawler of isolated forest
CN113420073A (en) * 2021-08-23 2021-09-21 平安科技(深圳)有限公司 Abnormal sample detection method based on improved isolated forest and related equipment
CN114398685A (en) * 2021-10-21 2022-04-26 海南火链科技有限公司 Government affair data processing method and device, computer equipment and storage medium
CN114781688A (en) * 2022-03-21 2022-07-22 广东电网有限责任公司广州供电局 Method, device, equipment and storage medium for identifying abnormal data of business expansion project
WO2023169098A1 (en) * 2022-03-10 2023-09-14 东南大学 Isolation forest-based method for diagnosing open-circuit fault of modular multilevel converter

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110991552A (en) * 2019-12-12 2020-04-10 支付宝(杭州)信息技术有限公司 Isolated forest model construction and prediction method and device based on federal learning
WO2021114821A1 (en) * 2019-12-12 2021-06-17 支付宝(杭州)信息技术有限公司 Isolation forest model construction and prediction method and device based on federated learning
CN111275547A (en) * 2020-03-19 2020-06-12 重庆富民银行股份有限公司 Wind control system and method based on isolated forest
CN112434208A (en) * 2020-12-03 2021-03-02 百果园技术(新加坡)有限公司 Training of isolated forest and identification method and related device of web crawler of isolated forest
CN113420073A (en) * 2021-08-23 2021-09-21 平安科技(深圳)有限公司 Abnormal sample detection method based on improved isolated forest and related equipment
CN114398685A (en) * 2021-10-21 2022-04-26 海南火链科技有限公司 Government affair data processing method and device, computer equipment and storage medium
WO2023169098A1 (en) * 2022-03-10 2023-09-14 东南大学 Isolation forest-based method for diagnosing open-circuit fault of modular multilevel converter
CN114781688A (en) * 2022-03-21 2022-07-22 广东电网有限责任公司广州供电局 Method, device, equipment and storage medium for identifying abnormal data of business expansion project

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
闻明;窦晨;: "孤立森林算法在智能监控中的应用", 科技创新导报, no. 14 *

Also Published As

Publication number Publication date
CN117077067B (en) 2023-12-22

Similar Documents

Publication Publication Date Title
CN110059181B (en) Short text label method, system and device for large-scale classification system
CN110019647B (en) Keyword searching method and device and search engine
US6792163B2 (en) Method and apparatus for searching, browsing and summarizing moving image data using fidelity of tree-structured moving image hierarchy
CN107291785A (en) A kind of data search method and device
CN113868366B (en) Streaming data-oriented online cross-modal retrieval method and system
CN108304409B (en) Carry-based data frequency estimation method of Sketch data structure
CN104142946A (en) Method and system for aggregating and searching service objects of same type
CN110516118A (en) A kind of character string matching method, equipment and computer storage medium
CN104092618A (en) Peer-to-peer network traffic feature selection method based on cuckoo search algorithm
CN103077216B (en) The method of subgraph match device and subgraph match
CN115293919A (en) Graph neural network prediction method and system oriented to social network distribution generalization
CN104933143A (en) Method and device for acquiring recommended object
CN112598062A (en) Image identification method and device
CN111475511A (en) Data storage method, data access method, data storage device, data access device and data access equipment based on tree structure
CN111475158A (en) Sub-domain dividing method and device, electronic equipment and computer readable storage medium
CN117077067B (en) Information system automatic deployment planning method based on intelligent matching
CN114239237A (en) Power distribution network simulation scene generation system and method supporting digital twinning
CN108038118B (en) KD tree construction method with position priority and minimum external rectangle maintenance
CN112052649B (en) Text generation method, device, electronic equipment and storage medium
CN114036345A (en) Method and device for processing track data and storage medium
CN113076448A (en) Community discovery method based on extremely large cliques and strongly connected components
CN111428035A (en) Entity clustering method and device
CN111950615A (en) Network fault feature selection method based on tree species optimization algorithm
CN111460088A (en) Similar text retrieval method, device and system
US8843495B2 (en) High-efficiency selection of runtime rules for programmable search

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant