CN106980695B - Component retrieval method in power field and storage method thereof

Publication number
CN106980695B
Authority
CN
China
Prior art keywords
component
information
data structure
database
retrieval
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201710217551.7A
Other languages
Chinese (zh)
Other versions
CN106980695A (en)
Inventor
谢光彬
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Yibin Power Supply Co Of Sichuan Electric Power Corp
Original Assignee
Yibin Power Supply Co Of Sichuan Electric Power Corp

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20 Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24 Querying
    • G06F16/245 Query processing
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20 Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/22 Indexing; Data structures therefor; Storage structures

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Software Systems (AREA)
  • Computational Linguistics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention discloses a component retrieval method for the power domain and a storage method thereof. The method comprises: designing a data structure for component storage oriented to the power domain according to the integration requirements of power enterprise application systems; building a power-domain component library on an Oracle database, with component identification information and component features stored directly in the database; storing the document information and source code of each component as files, with only the path information of the corresponding files stored in the database; determining the component category information within the component identification code according to a classification method for power-domain components, and filtering the component library according to that information; determining a set of similar components according to a combination of component features; and performing retrieval evaluation. The method improves retrieval efficiency and yields satisfactory retrieval results.

Description

Component retrieval method in power field and storage method thereof
Technical Field
The invention relates to the field of power systems, and in particular to a component retrieval method for the power domain and a storage method thereof.
Background
Software reuse is an important area of software engineering and is regarded as a principal way to resolve the software crisis, raise software productivity and quality, and improve software openness and adaptability to external change. Component-based software reuse is the focus of current reuse research and is considered one of the key factors in successful reuse. It mainly involves three processes: the development of reusable components, the management of component libraries, and the construction of component-based application systems. Component library management links the other two processes, and building application systems requires that enough components be available to support development. However, once a large number of reusable components exist, finding and selecting the appropriate one becomes very difficult, so an efficient retrieval method is needed to let the user locate the desired component correctly.
Component retrieval has always been a key factor restricting component reuse. If retrieval efficiency is too low or the retrieved components are unsuitable, software developers often give up on reusing components. At present, the main component retrieval methods are based on keywords, on component facets, and on a knowledge base. Each has its advantages and its limitations. Keyword-based retrieval: because keywords overlap one another, and because keyword terms exhibit both synonymy (several words with one meaning) and polysemy (one word with several meanings), retrieval by keyword gives unsatisfactory results. Facet-based retrieval: it is difficult to describe the facets of components in every domain accurately, and retrieval efficiency falls as the number of facets grows. Knowledge-base-based retrieval: this approach depends on the construction rules and completeness of the knowledge base, and currently remains at the theoretical stage.
The retrieval efficiency of components is closely related to the storage scheme of the component library: if the structure of the library is flawed, good retrieval performance is hard to obtain no matter how efficient the accompanying algorithm is. Moreover, because of the inherent differences between domains, it is difficult to find a storage scheme that suits the components of every domain.
Disclosure of Invention
The invention provides a component retrieval method for the power domain and a storage method thereof. It addresses the limitations, unsatisfactory retrieval results, and low retrieval efficiency of existing power-domain component retrieval methods, and achieves improved retrieval efficiency and satisfactory retrieval results.
To solve the above technical problem, the application provides a power-domain component retrieval method comprising the following steps:
(1) designing a data structure for component storage oriented to the power domain according to the integration requirements of power enterprise application systems;
(2) building a power-domain component library on an Oracle database, and storing component identification information and component features directly in the database;
(3) storing the document information and source code of each component as files, and storing only the path information of the corresponding files in the database;
(4) determining the component category information within the component identification code according to a classification method for power-domain components, and filtering the component library according to that information, so that the retrieval range is narrowed quickly;
(5) determining a set of similar components according to a combination of component features;
(6) performing retrieval evaluation.
In order to better implement the method of the present invention, further, in step (1), the data structure for component storage comprises component identification information, component features, component document information, and component source code.
In order to better implement the method of the present invention, further, in step (1), constructing the data structure for component storage comprises the following steps:
(1.1) dividing the component identification information into five layers using a functional decomposition classification method;
(1.2) describing the internal features and external features of the component separately.
In order to better implement the method of the present invention, further, the five layers in step (1.1) are, from top to bottom, a system layer, a subsystem layer, a sub-subsystem layer, a sequence number, and a version number.
In order to better implement the method of the present invention, further, in step (1.2), the internal features of the component comprise an interface, a development platform, and a development language; the external features of the component comprise a type, keywords, an abstract, a developer, a time of entry into the library, and a reuse count.
In order to better implement the method of the present invention, further, step (6) performs retrieval evaluation using positive samples and negative samples.
A power-domain component storage method, comprising the following steps:
(1) designing a data structure for component storage oriented to the power domain according to the integration requirements of power enterprise application systems;
(2) building a power-domain component library on an Oracle database, and storing component identification information and component features directly in the database;
(3) storing the document information and source code of each component as files, and storing only the path information of the corresponding files in the database.
In order to better implement the method of the present invention, further, in step (1), the data structure for component storage comprises component identification information, component features, component document information, and component source code.
In order to better implement the method of the present invention, further, in step (1), constructing the data structure for component storage comprises the following steps:
(1.1) dividing the component identification information into five layers using a functional decomposition classification method;
(1.2) describing the internal features and external features of the component separately.
In order to better implement the method of the present invention, further, the five layers in step (1.1) are, from top to bottom, a system layer, a subsystem layer, a sub-subsystem layer, a sequence number, and a version number.
In order to better implement the method of the present invention, further, in step (1.2), the internal features of the component comprise an interface, a development platform, and a development language; the external features of the component comprise a type, keywords, an abstract, a developer, a time of entry into the library, and a reuse count.
The development language is C.
Compared with the prior art, the invention has the following advantages and beneficial effects:
The invention designs a data structure for storing power-domain components according to the integration requirements of power enterprise application systems, which solves the classification problem for power-domain components. The classification method makes retrieval convenient for users, makes the domain-oriented component library simple to extend during power enterprise application system integration, improves the efficiency of the integrated system, and meets the needs of power system informatization.
Detailed Description
The invention provides a component retrieval method for the power domain and a storage method thereof. It addresses the limitations, unsatisfactory retrieval results, and low retrieval efficiency of existing power-domain component retrieval methods, and achieves improved retrieval efficiency and satisfactory retrieval results.
So that the above objects, features, and advantages of the present invention can be understood more clearly, the invention is described in further detail below. It should be noted that, where no conflict arises, the embodiments of the present application and the features of those embodiments may be combined with one another.
In the following description, numerous specific details are set forth in order to provide a thorough understanding of the present invention. The invention may, however, be practiced in ways other than those specifically described, so its scope is not limited by the specific embodiments disclosed below.
The application provides a power-domain component retrieval method, which comprises the following steps:
(1) designing a data structure for component storage oriented to the power domain according to the integration requirements of power enterprise application systems;
(2) building a power-domain component library on an Oracle database, and storing component identification information and component features directly in the database;
(3) storing the document information and source code of each component as files, and storing only the path information of the corresponding files in the database;
(4) determining the component category information within the component identification code according to a classification method for power-domain components, and filtering the component library according to that information, so that the retrieval range is narrowed quickly;
(5) determining a set of similar components according to a combination of component features;
(6) performing retrieval evaluation.
The data structure for component storage expresses the format in which a component is stored in the component library system; defining a suitable component data structure is the precondition for effective component storage and retrieval. At present, however, the data structure for component storage is often neglected, which is an important reason why existing component retrieval is inefficient. Based on the requirements of power enterprise application system integration, this embodiment provides a data structure for component storage in the power domain and solves the classification problem for power-domain components. The data structure comprises component identification information, component features, component document information, and component source code, as shown in Table 1.
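Table 1. Data structure for power-domain component storage
[Table 1 appears only as an image in the original publication (Figure GDA0001284137960000041); its contents are not reproduced here.]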
The identification information of a component is a code that uniquely identifies the component within the component library and reflects how the component is classified. Classifying components across all domains is very complex, whereas the classification of components in a specific domain (here, the power domain) follows certain rules, since a system is assembled from components corresponding to each of its functions. A classification method based on functional decomposition is therefore suitable for the components of power enterprise application systems. Under this method the component identification information is encoded in 5 layers, from top to bottom: a system layer, a subsystem layer, a sub-subsystem layer, a sequence number, and a version number. The system layer indicates which power company's integrated application system the component belongs to; for example, ha denotes the Huaian power company. The subsystem layer represents the relatively general subsystems of the integrated application system, such as safety production management, electricity utilization management, and scheduling management, where dd denotes scheduling management. The sub-subsystem layer represents the specific items within each subsystem, such as load prediction (fhyc) and fault analysis (gzfx) within scheduling management. The component sequence number denotes the nth component of the same subsystem. The component version number distinguishes different versions of the component with the same sequence number. This classification method makes retrieval convenient for users and simplifies extension of the domain-oriented component library.
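The layered identification code described above can be sketched in Python as follows. This is only an illustration under stated assumptions: the text gives the example codes ha, dd, fhyc, and gzfx but does not specify field widths, so the fixed widths used here (2, 2, 4, 3, and 2 characters) and the names `ComponentCode`, `encode`, `decode`, and `category_prefix` are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ComponentCode:
    """Five-layer component identification code (field widths are assumed)."""
    system: str     # system layer, e.g. "ha"
    subsystem: str  # subsystem layer, e.g. "dd" (scheduling management)
    item: str       # third layer, e.g. "gzfx" (fault analysis)
    sequence: int   # nth component under the same category
    version: int    # version of the component with that sequence number

    def encode(self) -> str:
        # Assumed layout: 2 + 2 + 4 letters, 3-digit sequence, 2-digit version.
        return (f"{self.system}{self.subsystem}{self.item}"
                f"{self.sequence:03d}{self.version:02d}")

    @classmethod
    def decode(cls, code: str) -> "ComponentCode":
        if len(code) != 13:
            raise ValueError(f"expected 13 characters, got {len(code)}")
        return cls(code[0:2], code[2:4], code[4:8],
                   int(code[8:11]), int(code[11:13]))

    def category_prefix(self) -> str:
        # The first three layers form the category used to narrow a search.
        return self.system + self.subsystem + self.item
```

Because the category layers come first in the code, all components of one category share a common prefix, which is what makes a prefix (fuzzy) query on the identification field an effective first filter.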
Component features describe information related to component management as well as information intrinsic to the component itself, and are divided into external features and internal features. The external features include the component type, keywords describing the component, an abstract (describing the component's function), the developer, the time of entry into the library, the number of times the component has been reused, and so on; they are used to determine the set of similar components during retrieval. The internal features include the interface, development platform, development language, and so on; they are used to select the most suitable component from a set of similar components.
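The split between external and internal features might be modelled as in the Python sketch below. The field names, the keyword/abstract matching rule in `similar_set`, and the reuse-count tie-break in `most_suitable` are all assumptions made for illustration; the text only states which features exist and what each group is used for.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class ExternalFeatures:
    type: str
    keywords: List[str]
    abstract: str       # describes the component's function
    developer: str
    entry_date: str     # time of entry into the library, ISO date string
    reuse_count: int


@dataclass
class InternalFeatures:
    interface: str
    platform: str
    language: str


@dataclass
class Component:
    code: str
    external: ExternalFeatures
    internal: InternalFeatures


def similar_set(components: List[Component], term: str) -> List[Component]:
    """External features select the set of similar components."""
    return [c for c in components
            if term in c.external.keywords or term in c.external.abstract]


def most_suitable(candidates: List[Component], platform: str,
                  language: str) -> Optional[Component]:
    """Internal features pick one component from the similar set.

    Ties are broken by reuse count, which is an assumption of this sketch.
    """
    matches = [c for c in candidates
               if c.internal.platform == platform
               and c.internal.language == language]
    return max(matches, key=lambda c: c.external.reuse_count, default=None)
```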
A power-domain component library is built on an Oracle database. Component identification information and component features are stored directly in the database, while the document information and source code of each component are stored as files, with only the path information of the corresponding files stored in the database. This simplifies building and maintaining the component library: maintaining a component's document information or source code only requires operating on the corresponding file.
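The storage scheme above (identification and features in the database, documents and source code on disk, paths in the database) can be sketched as follows. To keep the example self-contained it uses Python's built-in `sqlite3` as a stand-in for the Oracle database; the table name `component`, its columns, and the file names `README.txt` and `component.c` are hypothetical.

```python
import sqlite3
from pathlib import Path


def create_library(conn: sqlite3.Connection) -> None:
    # Identification and features live in the database; files are referenced by path.
    conn.execute(
        "CREATE TABLE component ("
        "component_code TEXT PRIMARY KEY, developer TEXT, abstract TEXT, "
        "doc_path TEXT, source_path TEXT)"
    )


def store_component(conn: sqlite3.Connection, root: Path, code: str,
                    developer: str, abstract: str,
                    doc_text: str, source_text: str) -> None:
    comp_dir = root / code
    comp_dir.mkdir(parents=True, exist_ok=True)
    doc_path = comp_dir / "README.txt"
    source_path = comp_dir / "component.c"
    doc_path.write_text(doc_text, encoding="utf-8")
    source_path.write_text(source_text, encoding="utf-8")
    # Only the paths are written to the database, not the file contents.
    conn.execute(
        "INSERT INTO component VALUES (?, ?, ?, ?, ?)",
        (code, developer, abstract, str(doc_path), str(source_path)),
    )
    conn.commit()


def load_source(conn: sqlite3.Connection, code: str) -> str:
    row = conn.execute(
        "SELECT source_path FROM component WHERE component_code = ?", (code,)
    ).fetchone()
    if row is None:
        raise KeyError(code)
    return Path(row[0]).read_text(encoding="utf-8")
```

With this layout, updating a component's source means rewriting one file; the database row is untouched unless the path itself changes.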
Following this storage scheme, a power-domain component library is built on an Oracle database, and components that have been designed and verified are added to it. For example, while integrating the scheduling management subsystem, suppose a component that performs harmonic analysis is needed for the fault analysis item of the Huaian power company's scheduling management subsystem. Under the two-step retrieval method, the first step forms the search expression "haddgzfx" (ha, dd, and gzfx concatenated) and runs a fuzzy query against the Component_distribution table, which stores the component description information in the component library. The query has the form select * from Component_distribution where Component_Code like 'haddgzfx%', where Component_Code is the component identification information field. This step greatly narrows the search range. The second step queries by component features: components whose developer is Li Ming, that entered the library after November 1, 2003, and whose abstract contains "harmonic analysis". Once the retrieval result is obtained, the component can be reused.
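A minimal sketch of the two-step retrieval in this example is given below, again using Python's `sqlite3` in place of Oracle. The table and identification field names follow the text; the other columns, the 13-character codes, and all demo rows (including the developer "Wang Wei") are made up for illustration.

```python
import sqlite3

# Hypothetical demo rows: (Component_Code, developer, entry_date, abstract).
DEMO_ROWS = [
    ("haddgzfx00101", "Li Ming", "2003-12-05", "harmonic analysis of fault records"),
    ("haddgzfx00201", "Li Ming", "2003-10-01", "harmonic analysis, earlier entry"),
    ("haddgzfx00301", "Wang Wei", "2004-02-01", "harmonic analysis"),
    ("haddfhyc00101", "Li Ming", "2004-01-10", "short-term load prediction"),
]


def build_demo_library() -> sqlite3.Connection:
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE Component_distribution ("
        "Component_Code TEXT PRIMARY KEY, developer TEXT, "
        "entry_date TEXT, abstract TEXT)"
    )
    conn.executemany(
        "INSERT INTO Component_distribution VALUES (?, ?, ?, ?)", DEMO_ROWS
    )
    return conn


def filter_by_category(conn: sqlite3.Connection, prefix: str):
    """Step 1: fuzzy (prefix) query on the identification code."""
    cur = conn.execute(
        "SELECT Component_Code, developer, entry_date, abstract "
        "FROM Component_distribution WHERE Component_Code LIKE ? "
        "ORDER BY Component_Code",
        (prefix + "%",),
    )
    return cur.fetchall()


def filter_by_features(rows, developer: str, entered_after: str, term: str):
    """Step 2: narrow the category by features; ISO dates compare as strings."""
    return [code for code, dev, entered, abstract in rows
            if dev == developer and entered > entered_after and term in abstract]
```

Calling `filter_by_category(conn, "haddgzfx")` and then `filter_by_features(rows, "Li Ming", "2003-11-01", "harmonic analysis")` mirrors the two steps of the worked example.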
For a component library with few components, the two-step retrieval method is enough to obtain the required component. As the component library grows, however, the two-step method yields only a set of similar components, and the retrieval results must then be evaluated. The straightforward approach is to run each candidate on an input sample and compare its output with the output expected of the desired component. Although this can find the required component, it is time-consuming, inefficient, and quite cumbersome. To evaluate retrieval results efficiently, the samples of a tested component are divided into positive samples and negative samples and stored in test cases. A positive sample is an input-output pair in which the output is what the component actually produces when executed on the input; any other pair is a negative sample. The set of similar components is evaluated against these stored positive and negative samples, so the components need not actually be executed during evaluation, which greatly improves the efficiency of retrieval evaluation.
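One way to read this evaluation step is sketched below in Python. The text does not give a scoring rule, so the rule used here is an assumption: a candidate is rejected if any wanted input-output pair appears among its stored negative samples, and the remaining candidates are ranked by how many wanted pairs appear among their stored positive samples. The names `StoredSamples` and `evaluate` are hypothetical.

```python
from dataclasses import dataclass
from typing import Dict, List, Set, Tuple

Sample = Tuple[str, str]  # (input, output) pair


@dataclass
class StoredSamples:
    positive: Set[Sample]  # pairs the component is known to produce
    negative: Set[Sample]  # pairs the component is known not to produce


def evaluate(candidates: Dict[str, StoredSamples],
             wanted: List[Sample]) -> List[str]:
    """Rank similar components using stored samples only; nothing is executed."""
    ranked = []
    for code, samples in candidates.items():
        if any(pair in samples.negative for pair in wanted):
            continue  # recorded as behaviour this component does not have
        score = sum(1 for pair in wanted if pair in samples.positive)
        if score > 0:
            ranked.append((score, code))
    # Highest score first; component code breaks ties deterministically.
    ranked.sort(key=lambda item: (-item[0], item[1]))
    return [code for _, code in ranked]
```

The evaluation is a set lookup per wanted pair, so its cost depends on the number of stored samples rather than on the running time of the candidate components.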
The above description is only a preferred embodiment of the present invention, and is not intended to limit the present invention in any way, and all simple modifications and equivalent variations of the above embodiments according to the technical spirit of the present invention are included in the scope of the present invention.
In another aspect, the present application further provides a power-domain component storage method, comprising the following steps:
(1) designing a data structure for component storage oriented to the power domain according to the integration requirements of power enterprise application systems;
(2) building a power-domain component library on an Oracle database, and storing component identification information and component features directly in the database;
(3) storing the document information and source code of each component as files, and storing only the path information of the corresponding files in the database.
The data structure for component storage expresses the format in which a component is stored in the component library system; defining a suitable component data structure is the precondition for effective component storage and retrieval. At present, however, the data structure for component storage is often neglected, which is an important reason why existing component retrieval is inefficient. Based on the requirements of power enterprise application system integration, this embodiment provides a data structure for component storage in the power domain and solves the classification problem for power-domain components. The data structure comprises component identification information, component features, component document information, and component source code, as shown in Table 1.
The identification information of a component is a code that uniquely identifies the component within the component library and reflects how the component is classified. Classifying components across all domains is very complex, whereas the classification of components in a specific domain (here, the power domain) follows certain rules, since a system is assembled from components corresponding to each of its functions. A classification method based on functional decomposition is therefore suitable for the components of power enterprise application systems. Under this method the component identification information is encoded in 5 layers, from top to bottom: a system layer, a subsystem layer, a sub-subsystem layer, a sequence number, and a version number. The system layer indicates which power company's integrated application system the component belongs to; for example, ha denotes the Huaian power company. The subsystem layer represents the relatively general subsystems of the integrated application system, such as safety production management, electricity utilization management, and scheduling management, where dd denotes scheduling management. The sub-subsystem layer represents the specific items within each subsystem, such as load prediction (fhyc) and fault analysis (gzfx) within scheduling management. The component sequence number denotes the nth component of the same subsystem. The component version number distinguishes different versions of the component with the same sequence number. This classification method makes retrieval convenient for users and simplifies extension of the domain-oriented component library.
Component features describe information related to component management as well as information intrinsic to the component itself, and are divided into external features and internal features. The external features include the component type, keywords describing the component, an abstract (describing the component's function), the developer, the time of entry into the library, the number of times the component has been reused, and so on; they are used to determine the set of similar components during retrieval. The internal features include the interface, development platform, development language, and so on; they are used to select the most suitable component from a set of similar components.
A power-domain component library is built on an Oracle database. Component identification information and component features are stored directly in the database, while the document information and source code of each component are stored as files, with only the path information of the corresponding files stored in the database. This simplifies building and maintaining the component library: maintaining a component's document information or source code only requires operating on the corresponding file.
The development language is C.
The technical solution in the embodiments of the present application has at least the following technical effects or advantages:
The invention designs a data structure for storing power-domain components according to the integration requirements of power enterprise application systems, which solves the classification problem for power-domain components. The classification method makes retrieval convenient for users, makes the domain-oriented component library simple to extend during power enterprise application system integration, improves the efficiency of the integrated system, and meets the needs of power system informatization.
While preferred embodiments of the present invention have been described, additional variations and modifications in those embodiments may occur to those skilled in the art once they learn of the basic inventive concepts. Therefore, it is intended that the appended claims be interpreted as including preferred embodiments and all such alterations and modifications as fall within the scope of the invention.
It will be apparent to those skilled in the art that various changes and modifications may be made in the present invention without departing from the spirit and scope of the invention. Thus, if such modifications and variations of the present invention fall within the scope of the claims of the present invention and their equivalents, the present invention is also intended to include such modifications and variations.

Claims (4)

1. A power domain component retrieval method, characterized by comprising the following steps:
(1) designing a data structure for component storage oriented to the power domain according to the integration requirements of power enterprise application systems;
(2) building a power-domain component library on an Oracle database, and storing component identification information and component features directly in the database;
(3) storing the document information and source code of each component as files, and storing only the path information of the corresponding files in the database;
(4) determining the component category information within the component identification code according to a classification method for power-domain components, and filtering the component library according to that information;
(5) determining a set of similar components according to a combination of component features;
(6) performing retrieval evaluation;
wherein in step (1), the data structure for component storage comprises: component identification information, component features, component document information, and component source code;
in step (1), constructing the data structure for component storage comprises the following steps:
(1.1) dividing the component identification information into five layers using a functional decomposition classification method;
(1.2) describing the internal features and external features of the component separately;
the five layers in step (1.1) are, from top to bottom, a system layer, a subsystem layer, a sub-subsystem layer, a sequence number, and a version number.
2. The power domain component retrieval method of claim 1, wherein: in step (1.2), the internal features of the component comprise an interface, a development platform, and a development language; and the external features of the component comprise a type, keywords, an abstract, a developer, a time of entry into the library, and a reuse count.
3. The power domain component retrieval method of claim 2, wherein: in step (6), retrieval evaluation is performed using positive samples and negative samples.
4. A power domain component storage method, characterized by comprising the following steps:
(1) designing a data structure for component storage oriented to the power domain according to the integration requirements of power enterprise application systems;
(2) building a power-domain component library on an Oracle database, and storing component identification information and component features directly in the database;
(3) storing the document information and source code of each component as files, and storing only the path information of the corresponding files in the database;
wherein in step (1), the data structure for component storage comprises component identification information, component features, component document information, and component source code;
in step (1), constructing the data structure for component storage comprises the following steps:
(1.1) dividing the component identification information into five layers using a functional decomposition classification method;
(1.2) describing the internal features and external features of the component separately;
the five layers in step (1.1) are, from top to bottom, a system layer, a subsystem layer, a sub-subsystem layer, a sequence number, and a version number; in step (1.2), the internal features of the component comprise an interface, a development platform, and a development language; the external features of the component comprise a type, keywords, an abstract, a developer, a time of entry into the library, and a reuse count; and the development language is C.
CN201710217551.7A 2017-04-05 2017-04-05 Component retrieval method in power field and storage method thereof Active CN106980695B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201710217551.7A CN106980695B (en) 2017-04-05 2017-04-05 Component retrieval method in power field and storage method thereof

Publications (2)

Publication Number Publication Date
CN106980695A CN106980695A (en) 2017-07-25
CN106980695B CN106980695B (en) 2020-06-09

Family

ID=59345562

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201710217551.7A Active CN106980695B (en) 2017-04-05 2017-04-05 Component retrieval method in power field and storage method thereof

Country Status (1)

Country Link
CN (1) CN106980695B (en)

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1877523A (en) * 2006-06-23 2006-12-13 清华大学 Method for unified management of component library supporting heterogeneous component
CN101996257A (en) * 2010-11-30 2011-03-30 中国人民解放军信息工程大学 Method for searching reconfigurable route exchange platform component
CN104318739A (en) * 2014-10-11 2015-01-28 国网辽宁省电力有限公司鞍山供电公司 Safety and quality on-site supervision master station for power and wireless data terminal

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9679253B2 (en) * 2014-11-06 2017-06-13 Copperleaf Technologies Inc. Methods for maintaining infrastructure equipment and related apparatus

Also Published As

Publication number Publication date
CN106980695A (en) 2017-07-25

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant