CN106570094A - Fixed value term matching method and matching system - Google Patents

Fixed value term matching method and matching system Download PDF

Info

Publication number
CN106570094A
CN106570094A CN201610928276.5A CN201610928276A CN106570094A CN 106570094 A CN106570094 A CN 106570094A CN 201610928276 A CN201610928276 A CN 201610928276A CN 106570094 A CN106570094 A CN 106570094A
Authority
CN
China
Prior art keywords
definite value
value item
characteristic vector
item
value
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201610928276.5A
Other languages
Chinese (zh)
Other versions
CN106570094B (en
Inventor
林志超
罗步升
王英民
郑兆典
孙迪飞
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Huizhou Power Supply Bureau of Guangdong Power Grid Co Ltd
Original Assignee
Huizhou Power Supply Bureau of Guangdong Power Grid Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Huizhou Power Supply Bureau of Guangdong Power Grid Co Ltd filed Critical Huizhou Power Supply Bureau of Guangdong Power Grid Co Ltd
Priority to CN201610928276.5A priority Critical patent/CN106570094B/en
Publication of CN106570094A publication Critical patent/CN106570094A/en
Application granted granted Critical
Publication of CN106570094B publication Critical patent/CN106570094B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2455Query execution
    • G06F16/24553Query execution of query operations

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Machine Translation (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention discloses a fixed value term matching method, which comprises the following steps: generating an eigenvector corresponding to each to-be-matched fixed value term; using the cosine theorem to calculate the cosine value of the included angle between the eigenvectors; determining whether the cosine value of the included angle is within a preset threshold range, and if so, determining that matching is successful. The invention also provides a fixed value term matching system. Via the above way, the correct rate of fixed value term matching can be improved, so that a large amount of manual operation can be avoided, and thus the work efficiency can be improved.

Description

Definite value item matching process and matching system
Technical field
The present invention relates to technical field of electric power, more particularly to a kind of definite value item matching process and matching system.
Background technology
Guarantor's letter system and Production MIS, definite value due to lacking relevant industries specification, in current power industry Not consistent is described to the title of same secondary device, definite value item in the system such as adjust, and the functional requirement such as definite value automatic checking Definite value item information between system be able to must be exchanged.Therefore, the matching for how being automatically performed definite value item practices function Key, is also a difficult point.
At present, the main employing technological means for solving the above problems has:Keyword match, editing distance, manual association etc. Method.
However, because identical keyword can occur in the definite value project that can not be matched, using keyword match meeting The match is successful to make original unmatched definite value project;And editing distance is to count to be transformed into another character string from a character string Required edit step, according to edit step number describing the similarity degree of two character strings, due to there are different words Symbol string editing causes the presence of a large amount of mispairing situations apart from identical, digital Chinese is synonymous but editing distance not parity problem;In addition, Completely manual association then brings huge workload, and completing manual association needs to expend substantial amounts of manpower and time.
The content of the invention
The present invention provides a kind of definite value item matching process and matching system to solve above-mentioned technical problem, it is possible to increase definite value The accuracy of item matching, it is to avoid a large amount of artificial operations further improve operating efficiency.
To solve above-mentioned technical problem, the present invention provides a kind of definite value item matching process, comprises the steps:Generation is treated With each self-corresponding characteristic vector of definite value item;Included angle cosine value between the characteristic vector is calculated using the cosine law;Judge Whether the included angle cosine value is in the range of pre-set limit, if in the range of pre-set limit, being judged as that the match is successful.
Further, among the step of generation definite value item to be matched each self-corresponding characteristic vector, including:To treat The definite value item of matching is analyzed according to preassigned pattern and then generates the corresponding characteristic vector.
Further, described the definite value item to be matched is analyzed according to preassigned pattern and then is generated corresponding The characteristic vector the step of among, including following sub-step:According to the title of definite value item described in term table scan, matching bag The term that contains simultaneously is converted to the first predefined value;The term is deleted from the title, according to delete position by the name Title be divided into it is multiple first segmentation, each it is described first segmentation in extract comprising ordinal number ordinal number phrase and be converted to second make a reservation for Justice value;The ordinal number phrase is deleted from first segmentation comprising ordinal number, according to delete position multiple the are again split into Two-section, digital value is extracted using as determiner in each described second segmentation;According to first predefined value, described Two predefined values and the determiner construct the characteristic vector.
Further, before the step of generation definite value item to be matched each self-corresponding characteristic vector, also include:Receive The project name of collection definite value item, builds accordingly the nomenclature.
Further, among the step of the structure nomenclature, including:Compile to the list item of nomenclature each described Number, and give weighted value.
To solve above-mentioned technical problem, the present invention also provides a kind of definite value item matching system, including:
Further, signal generating unit, for generating each self-corresponding characteristic vector of definite value item to be matched;Computing unit, the meter Calculate unit to be connected with the growing element, for calculating the included angle cosine value between the characteristic vector using the cosine law;With And judging unit, the judging unit is connected with the computing unit, for judging whether the included angle cosine value is limiting in advance In the range of value, if in the range of pre-set limit, being judged as that the match is successful.
Further, the signal generating unit is used to be analyzed the definite value item to be matched further according to preassigned pattern Generate the corresponding characteristic vector.
Further, specifically for the title according to definite value item described in term table scan, matching is included the signal generating unit Term and be converted to the first predefined value;The term is deleted from the title, according to delete position by the title Be divided into it is multiple first segmentation, each it is described first segmentation in extract comprising ordinal number ordinal number phrase and be converted to second predefine Value;The ordinal number phrase is deleted from first segmentation comprising ordinal number, according to delete position multiple second are again split into Segmentation, digital value is extracted using as determiner in each described second segmentation;According to first predefined value, described second Predefined value and the determiner construct the characteristic vector.
Further, the definite value item matching system also includes the structural unit being connected with the signal generating unit, the structure Unit is made for collecting the project name of definite value item, the nomenclature is built accordingly.
Further, the structural unit is additionally operable to the list item numbering of nomenclature each described, and gives weighted value.
The definite value item matching process and matching system of the present invention, has the advantages that:Using characteristic vector mode, bag The characteristic informations such as technical term, ordinal number and numeral are contained, it is to avoid it is original unmatched fixed to make caused by simple judgement keyword The value project problem that the match is successful;To ordinal number and the processing mode of numerical characteristic in characteristic vector, program function is set to keep stable, When avoiding the character string of different expression using program process agreement, it is necessary to which modification program could meet asking for various requirement Topic;Matching process is automatically performed, and saves time, manpower, improves operating efficiency.
Description of the drawings
Fig. 1 is the flow chart of definite value item matching process embodiment of the present invention.
Fig. 2 is the flow chart of characteristic vector generation method in the matching process of definite value item shown in Fig. 1.
The functional structure chart of Fig. 3 definite value item matching system embodiments of the present invention.
Specific embodiment
Below in conjunction with the accompanying drawings the present invention is described in detail with embodiment.
As shown in figure 1, the definite value item matching process of embodiment of the present invention, it comprises the steps:
Step S11, generates each self-corresponding characteristic vector of definite value item to be matched.I.e. each definite value item correspondence can generate a feature Generally speaking vector, definite value item matching refers to the matching between two definite value items.
Step S12, using the cosine law included angle cosine value between characteristic vector is calculated.Calculate two using the cosine law Included angle cosine value between each self-corresponding characteristic vector of individual definite value item.
Whether step S13, judge included angle cosine value in the range of pre-set limit.The pre-set limit defines error permission Scope, only when the included angle cosine value is in the error range of considered critical, can just be judged as that the match is successful.
Specifically, in step s 13, if more than the angle between the characteristic vector corresponding to two definite value items to be matched String value is judged as that the match is successful in the range of pre-set limit, then;No person, then be judged as that it fails to match.
For example, its concrete matching process substantially can be following manner:
By equipment one(Such as protect letter system and Production MIS)Definite value item 1, the definite value item 2 ... that middle nomenclature is included is calmly Value item n successively in the way of traveling through respectively with equipment two(Such as fixed value adjusting system)Definite value item 1 ' that middle nomenclature is included, Definite value item 2 ' ... definite value item n ' is matched.Specifically, by the characteristic vector 1 generated by definite value item 1 successively with by definite value item 1 ' the characteristic vector 1 ' for generating is matched, if matching is unsuccessful, is being carried out with the characteristic vector 2 ' generated by definite value item 2 ' Matching ... is until the match is successful or all matching is completed;Then above-mentioned steps are repeated to definite value item 2.Certainly, equipment one It is middle to need the definite value item of matching to be set on demand in advance, it is not necessarily required to whole matchings.
Among the step of generation definite value item to be matched each self-corresponding characteristic vector, i.e., in step s 11, including:Will Definite value item to be matched is analyzed according to preassigned pattern and then generates corresponding characteristic vector.Specifically, the step S11 bag Include following sub-step:
Step S111, term extraction.
Specifically, according to the title of term table scan definite value item, match the term that includes and be converted to the first predefined value.
Step S112, ordinal number is extracted.
Specifically, term is deleted from title, title is divided into into multiple first segmentations according to delete position, each the The ordinal number phrase comprising ordinal number is extracted in one segmentation and the second predefined value is converted to.
Step S113, digital extraction.
Specifically, ordinal number phrase is deleted from the comprising ordinal number first segmentation, is again split into according to delete position multiple Second segmentation, digital value is extracted using as determiner in each second segmentation.
And step S114, it is vectorial according to characteristic value construction feature is extracted.
Specifically, according to the first predefined value, the second predefined value and determiner structural feature vector.
In one embodiment, specifically before the step of generating definite value item to be matched each self-corresponding characteristic vector, i.e., Before step S11, also including step S10:The project name of definite value item is collected, nomenclature is built accordingly.
Wherein, the step of nomenclature is built among S10, including:Number to the list item of each nomenclature, and give weight Value.The parameter of characteristic vector can be optimized by way of numbering and giving weighted value, that what is improved definite value item the match is successful is accurate Property.
In addition, the present invention also provides a kind of definite value item matching system, including:Signal generating unit 11, computing unit 12 and sentence Disconnected unit 13.Specifically:
Signal generating unit 11, for generating each self-corresponding characteristic vector of definite value item to be matched.
Computing unit 12, computing unit 12 is connected with growing element, for being calculated between characteristic vector using the cosine law Included angle cosine value.
And judging unit 13, whether judging unit 13 is connected with computing unit 12, for judging included angle cosine value pre- In the range of value of limiting, if in the range of pre-set limit, being judged as that the match is successful.
The signal generating unit 11 is used to that definite value item to be matched to be analyzed according to preassigned pattern and then be generated corresponding Characteristic vector.
Further, signal generating unit 11 matches the term for including specifically for the title according to term table scan definite value item And be converted to the first predefined value;Term is deleted from title, title is divided into into multiple first segmentations according to delete position; The ordinal number phrase comprising ordinal number is extracted in each first segmentation and be converted to the second predefined value;By ordinal number phrase from comprising ordinal number First segmentation in delete, according to delete position be again split into it is multiple second segmentation;Digital value is extracted in each second segmentation Using as determiner;According to the first predefined value, the second predefined value and determiner structural feature vector.
Definite value item matching system also includes the structural unit 10 being connected with signal generating unit 11, and structural unit 10 is used to collect fixed The project name of value item, builds accordingly nomenclature.Wherein, the structural unit 10 is additionally operable to be numbered to the list item of each nomenclature, And give weighted value.
The definite value item matching process and matching system of the present invention, has the advantages that:
1. characteristic vector mode is adopted, the characteristic informations such as technical term, ordinal number and numeral are contained, it is to avoid be simple to judge crucial The original unmatched definite value project problem that the match is successful is made caused by word.
2. to ordinal number and the processing mode of numerical characteristic in characteristic vector, make program function keep stable, it is to avoid to adopt Program process agree to and during the character string of different expression, it is necessary to modification program could meet the problem of various requirement.
3. matching process is automatically performed, and saves time, manpower, improves operating efficiency.
Embodiments of the present invention are these are only, not thereby the scope of the claims of the limit value present invention, it is every using the present invention Equivalent structure or equivalent flow conversion that specification and accompanying drawing content are made, or directly or indirectly it is used in other related technologies Field, is included within the scope of the present invention.

Claims (10)

1. a kind of definite value item matching process, it is characterised in that comprise the steps:
Generate each self-corresponding characteristic vector of definite value item to be matched;
Included angle cosine value between the characteristic vector is calculated using the cosine law;
The included angle cosine value is judged whether in the range of pre-set limit, if in the range of pre-set limit, being judged as matching Success.
2. definite value item matching process according to claim 1, it is characterised in that generate definite value item to be matched each described Among the step of corresponding characteristic vector, including:The definite value item to be matched is analyzed according to preassigned pattern and then raw Into the corresponding characteristic vector.
3. definite value item matching process according to claim 2, it is characterised in that described by the definite value item to be matched Among the step of being analyzed according to preassigned pattern and then generate the corresponding characteristic vector, including following sub-step:
According to the title of definite value item described in term table scan, match the term that includes and be converted to the first predefined value;
The term is deleted from the title, the title is divided into into multiple first segmentations according to delete position, at each The ordinal number phrase comprising ordinal number is extracted in first segmentation and be converted to the second predefined value;
The ordinal number phrase is deleted from first segmentation comprising ordinal number, according to delete position multiple second are again split into Segmentation, digital value is extracted using as determiner in each described second segmentation;
The characteristic vector is constructed according to first predefined value, second predefined value and the determiner.
4. definite value item matching process according to claim 3, it is characterised in that generate definite value item to be matched each described Before the step of corresponding characteristic vector, also include:
The project name of definite value item is collected, the nomenclature is built accordingly.
5. definite value item matching process according to claim 4, it is characterised in that the step of the structure nomenclature Among, including:
To the list item numbering of nomenclature each described, and give weighted value.
6. a kind of definite value item matching system, it is characterised in that include:
Signal generating unit, for generating each self-corresponding characteristic vector of definite value item to be matched;
Computing unit, the computing unit is connected with the growing element, for calculating the characteristic vector using the cosine law Between included angle cosine value;
And judging unit, the judging unit is connected with the computing unit, for judging that whether the included angle cosine value exists In the range of pre-set limit, if in the range of pre-set limit, being judged as that the match is successful.
7. definite value item matching system according to claim 6, it is characterised in that:
The signal generating unit is used to that the definite value item to be matched to be analyzed according to preassigned pattern and then be generated corresponding The characteristic vector.
8. definite value item matching system according to claim 7, it is characterised in that:
The signal generating unit matches the term that includes and is converted to specifically for the title according to definite value item described in term table scan First predefined value;The term is deleted from the title, the title is divided into into multiple first points according to delete position Section, extracts the ordinal number phrase comprising ordinal number and is converted to the second predefined value in each described first segmentation;By the ordinal number Phrase is deleted from first segmentation comprising ordinal number, multiple second segmentations is again split into according to delete position, in each institute State and extract in the second segmentation digital value using as determiner;According to first predefined value, second predefined value and The determiner constructs the characteristic vector.
9. definite value item matching system according to claim 8, it is characterised in that:
The definite value item matching system also includes the structural unit being connected with the signal generating unit, and the structural unit is used to collect The project name of definite value item, builds accordingly the nomenclature.
10. definite value item matching system according to claim 9, it is characterised in that:
The structural unit is additionally operable to the list item numbering of nomenclature each described, and gives weighted value.
CN201610928276.5A 2016-10-31 2016-10-31 Definite value item matching process and matching system Active CN106570094B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201610928276.5A CN106570094B (en) 2016-10-31 2016-10-31 Definite value item matching process and matching system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201610928276.5A CN106570094B (en) 2016-10-31 2016-10-31 Definite value item matching process and matching system

Publications (2)

Publication Number Publication Date
CN106570094A true CN106570094A (en) 2017-04-19
CN106570094B CN106570094B (en) 2019-06-28

Family

ID=58534386

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201610928276.5A Active CN106570094B (en) 2016-10-31 2016-10-31 Definite value item matching process and matching system

Country Status (1)

Country Link
CN (1) CN106570094B (en)

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101937440A (en) * 2009-06-30 2011-01-05 华为技术有限公司 Feature selection method and device
CN102213759A (en) * 2011-04-08 2011-10-12 东南大学 Characteristic matching method of underground water target based on power spectrum
CN102332262A (en) * 2011-09-23 2012-01-25 哈尔滨工业大学深圳研究生院 Method for intelligently identifying songs based on audio features
CN103605644A (en) * 2013-12-02 2014-02-26 哈尔滨工业大学 Pivot language translation method and device based on similarity matching
CN105335899A (en) * 2015-11-11 2016-02-17 国网山东省电力公司德州供电公司 Intelligent power line naming system

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101937440A (en) * 2009-06-30 2011-01-05 华为技术有限公司 Feature selection method and device
CN102213759A (en) * 2011-04-08 2011-10-12 东南大学 Characteristic matching method of underground water target based on power spectrum
CN102332262A (en) * 2011-09-23 2012-01-25 哈尔滨工业大学深圳研究生院 Method for intelligently identifying songs based on audio features
CN103605644A (en) * 2013-12-02 2014-02-26 哈尔滨工业大学 Pivot language translation method and device based on similarity matching
CN105335899A (en) * 2015-11-11 2016-02-17 国网山东省电力公司德州供电公司 Intelligent power line naming system

Also Published As

Publication number Publication date
CN106570094B (en) 2019-06-28

Similar Documents

Publication Publication Date Title
CN107729322B (en) Word segmentation method and device and sentence vector generation model establishment method and device
CN112632292A (en) Method, device and equipment for extracting service keywords and storage medium
EP3153978A1 (en) Address search method and device
CN103970733B (en) A kind of Chinese new word identification method based on graph structure
CN110866093A (en) Machine question-answering method and device
CN104462105B (en) Chinese word cutting method, device and server
CN103336766A (en) Short text garbage identification and modeling method and device
WO2016045567A1 (en) Webpage data analysis method and device
CN110414005B (en) Intention recognition method, electronic device and storage medium
EP3232336A1 (en) Method and device for recognizing stop word
CN106649250A (en) Method and device for identifying emotional new words
CN116226350A (en) Document query method, device, equipment and storage medium
EP4193261A1 (en) Test script generation from test specifications using natural language processing
CN112148852A (en) Intelligent customer service method and device, storage medium and computer equipment
CN106603538A (en) Invasion detection method and system
CN110147449A (en) File classification method and device
CN111723192A (en) Code recommendation method and device
CN103309851B (en) The rubbish recognition methods of short text and system
CN108629124B (en) Method for automatically generating simulation parameter data based on active graph path
CN110909138B (en) User intention identification method and system
CN104408036A (en) Correlated topic recognition method and device
CN106570094A (en) Fixed value term matching method and matching system
CN112529629A (en) Malicious user comment brushing behavior identification method and system
CN111325562A (en) Grain safety tracing system and method
CN109241124B (en) Method and system for quickly retrieving similar character strings

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant