CN113627168B - Method, device, medium and equipment for checking component packaging conflict - Google Patents

Publication number
CN113627168B
Authority
CN
China
Prior art keywords
data
keywords
packaging
specifications
package
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202110945684.2A
Other languages
Chinese (zh)
Other versions
CN113627168A (en
Inventor
谢国清
林颖朝
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shenzhen Yuncai Network Technology Co ltd
Original Assignee
Shenzhen Yuncai Network Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shenzhen Yuncai Network Technology Co ltd filed Critical Shenzhen Yuncai Network Technology Co ltd
Priority to CN202110945684.2A priority Critical patent/CN113627168B/en
Publication of CN113627168A publication Critical patent/CN113627168A/en
Application granted granted Critical
Publication of CN113627168B publication Critical patent/CN113627168B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/279Recognition of textual entities
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/08Logistics, e.g. warehousing, loading or distribution; Inventory or stock management
    • G06Q10/087Inventory or stock management, e.g. order filling, procurement or balancing against orders
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/10Office automation; Time management
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02PCLIMATE CHANGE MITIGATION TECHNOLOGIES IN THE PRODUCTION OR PROCESSING OF GOODS
    • Y02P90/00Enabling technologies with a potential contribution to greenhouse gas [GHG] emissions mitigation
    • Y02P90/30Computing systems specially adapted for manufacturing

Landscapes

  • Engineering & Computer Science (AREA)
  • Business, Economics & Management (AREA)
  • Theoretical Computer Science (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Human Resources & Organizations (AREA)
  • Economics (AREA)
  • Physics & Mathematics (AREA)
  • Strategic Management (AREA)
  • General Physics & Mathematics (AREA)
  • Tourism & Hospitality (AREA)
  • Marketing (AREA)
  • General Business, Economics & Management (AREA)
  • Quality & Reliability (AREA)
  • Operations Research (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • General Health & Medical Sciences (AREA)
  • General Engineering & Computer Science (AREA)
  • Artificial Intelligence (AREA)
  • Health & Medical Sciences (AREA)
  • Computational Linguistics (AREA)
  • Data Mining & Analysis (AREA)
  • Accounting & Taxation (AREA)
  • Finance (AREA)
  • Development Economics (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The application discloses a method, device, medium and equipment for checking component packaging conflict. The method comprises: performing text processing on a bill of materials to obtain a plurality of keywords; identifying the package specifications among the plurality of keywords; when at least two package specifications exist among the keywords, detecting the relevant state of the at least two package specifications; and when the relevant state of the at least two package specifications is a conflict state, determining that the bill of materials is abnormal. The method and device address the problem that data abnormalities in a bill of materials cannot be detected when the bill of materials is read.

Description

Method, device, medium and equipment for checking component packaging conflict
Technical Field
The present invention relates to the field of data processing technologies, and in particular, to a method, an apparatus, a medium, and a device for inspecting component packaging conflicts.
Background
Existing purchasing systems typically obtain purchasing demand information from a bill of materials provided by a user. However, the bill of materials may contain mismatched parameters, and the purchasing system cannot detect such data abnormalities when it reads the bill of materials. Any abnormality in the bill of materials therefore propagates into the data that the purchasing system finally generates.
Disclosure of Invention
The present application has been made in order to solve the above technical problems. The embodiment of the application provides a method, a device, a medium and equipment for checking component packaging conflict, which can solve the problem that whether data abnormality exists in a bill of materials can not be detected when the bill of materials is read.
According to one aspect of the present application, there is provided a method for inspecting component package conflict, including: text processing is carried out on the bill of materials to obtain a plurality of keywords; identifying package specifications of the plurality of keywords; detecting the relevant state of at least two packaging specifications when the at least two packaging specifications exist in the keywords; and determining that the bill of materials is abnormal when the relevant states of the at least two packaging specifications are conflict states.
In an embodiment, when at least two package specifications exist in the plurality of keywords, detecting the relevant status of the at least two package specifications includes: comparing the at least two packaging specifications in pairs to obtain a comparison result; and determining the relevant states of the at least two packaging specifications according to the comparison result.
In an embodiment, comparing the at least two package specifications in pairs to obtain the comparison result includes: substituting the package specifications, two at a time, into a function to obtain two function values, wherein the function comprises a hash function; comparing the two function values with the data package dictionary to obtain the two preset package specifications corresponding to the two function values; and when the two preset package specifications are equal, determining that the two package specifications corresponding to the two function values have the same meaning.
In an embodiment, the determining the relevant status of the at least two package specifications according to the comparison result includes: and when the comparison result shows that the two packaging specifications corresponding to the two function values have different meanings, determining that the relevant states of the two packaging specifications corresponding to the two function values are conflict states.
In an embodiment, the identifying the package specification of the plurality of keywords includes: and comparing the keywords with the data encapsulation dictionary, and determining encapsulation specifications of the keywords.
In an embodiment, the comparing the plurality of keywords to the data package dictionary, determining the package specification of the plurality of keywords comprises: and when the keyword is the same as the data in the data encapsulation dictionary, determining the encapsulation specification of the keyword.
In an embodiment, the text processing the bill of materials, obtaining a plurality of keywords includes: acquiring a bill of materials file; analyzing the bill of materials to obtain target BOM data of the bill of materials; acquiring column header data and element data of each single product in the target BOM data according to the target BOM data; wherein the bill of materials contains the material information of a plurality of single products; cleaning and converting the element data according to the column header data to generate analysis electronic element data; and packaging the BOM analysis data according to the column header data and the analysis element data; and obtaining a plurality of keywords according to the BOM analysis data.
In an embodiment, the text processing the bill of materials, obtaining a plurality of keywords includes: extracting a material text from the bill of materials; performing text preprocessing on the material text to obtain a preprocessed material text; and carrying out vocabulary word segmentation on the preprocessed material text to obtain the keywords.
In an embodiment, performing text preprocessing on the material text to obtain the preprocessed material text includes: performing code conversion on the material text to obtain a coded material text; converting full-width characters in the coded material text into half-width characters to obtain a character-converted material text; and deleting the interfering characters in the character-converted material text to obtain the preprocessed material text, wherein the interfering characters comprise emoji, tabs, or invisible characters.
In an embodiment, the vocabulary word segmentation is performed on the preprocessed material text, and obtaining the plurality of keywords includes: splitting the preprocessed material text into the keywords according to the word segmentation dictionary; the word segmentation dictionary comprises a preset word segmentation dictionary.
In an embodiment, the splitting the preprocessed material text into the plurality of keywords according to the word segmentation dictionary includes: splitting the preprocessed material text into the keywords according to the word segmentation dictionary; the preprocessed material text comprises a plurality of unstructured data, and the keywords comprise a plurality of structured data.
According to another aspect of the present application, there is provided an inspection apparatus for component package conflict, including: the processing module is used for carrying out text processing on the bill of materials to obtain a plurality of keywords; the identification module is used for identifying the packaging specifications of the keywords; the detection module is used for detecting the relevant states of at least two packaging specifications when the at least two packaging specifications exist in the keywords; and the determining module is used for determining that the bill of materials is abnormal when the relevant states of the at least two packaging specifications are conflict states.
According to another aspect of the present application, there is provided a computer readable storage medium storing a computer program for executing a method for inspecting a component package conflict as described in any one of the above embodiments.
According to another aspect of the present application, there is provided an electronic device including: a processor; a memory for storing the processor-executable instructions; the processor is configured to execute the method for checking component packaging conflict according to any one of the embodiments.
According to the method, device, medium and equipment for checking component packaging conflict provided by the present application, text processing is first performed on the bill of materials to eliminate characters that would interfere with subsequent analysis. Because raw text cannot be substituted directly into a function, the processed text is then split into a plurality of keywords. Next, the package specifications of the keywords are identified, and the data is encapsulated to prevent illegal access to the data between different modules. When at least two package specifications exist among the keywords, a synonym check is performed on them, because two differently written package specifications may express the same meaning. Finally, whether the bill of materials is abnormal is judged from the one or more relevant states: a conflict state indicates that the bill of materials is abnormal, so the bill of materials can be checked on the basis of package conflicts.
Drawings
The foregoing and other objects, features and advantages of the present application will become more apparent from the following more particular description of embodiments of the present application, as illustrated in the accompanying drawings. The accompanying drawings are included to provide a further understanding of embodiments of the application and are incorporated in and constitute a part of this specification, illustrate the application and not constitute a limitation to the application. In the drawings, like reference numerals generally refer to like parts or steps.
Fig. 1 is a flowchart of a method for inspecting component package conflict according to an exemplary embodiment of the present application.
Fig. 2 is a flowchart of a method for inspecting component package conflict according to another exemplary embodiment of the present application.
Fig. 3 is a flowchart of a method for inspecting component package conflict according to another exemplary embodiment of the present application.
Fig. 4 is a flowchart of a method for inspecting component package conflict according to another exemplary embodiment of the present application.
Fig. 5 is a flowchart of a method for inspecting component package conflict according to another exemplary embodiment of the present application.
Fig. 6 is a flowchart of a method for inspecting component package conflict according to another exemplary embodiment of the present application.
Fig. 7 is a flowchart of a method for inspecting component package conflict according to another exemplary embodiment of the present application.
Fig. 8 is a schematic structural diagram of an inspection apparatus for component package conflict according to an exemplary embodiment of the present application.
Fig. 9 is a schematic structural view of an inspection apparatus for component package conflict provided in another exemplary embodiment of the present application.
Fig. 10 is a block diagram of an electronic device according to an exemplary embodiment of the present application.
Detailed Description
Hereinafter, example embodiments according to the present application will be described in detail with reference to the accompanying drawings. It should be apparent that the described embodiments are only some of the embodiments of the present application and not all of the embodiments of the present application, and it should be understood that the present application is not limited by the example embodiments described herein.
Exemplary method
Fig. 1 is a flowchart of a method for inspecting component package conflict according to an exemplary embodiment of the present application, where, as shown in fig. 1, the method for inspecting component package conflict includes:
Step 100: Perform text processing on the bill of materials to obtain a plurality of keywords.
The bill of materials may be a BOM file that has already been parsed and recognized, from which the material text can be extracted. Text extracted directly from the BOM file may contain interfering or invisible characters that disturb subsequent analysis. It is also unstructured data, on which mathematical calculation cannot be performed directly. Text processing is therefore performed on the bill of materials to obtain a plurality of keywords; for example, the text can be segmented into words by dictionary-matching word segmentation. The bill of materials may be a component purchasing list.
The bill of materials contains multiple rows of data and can be processed row by row: each row is split into a plurality of keywords, the conflict check is performed on the keywords of that row, and only then are text processing and the conflict check performed on the next row.
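The row-by-row flow described above can be sketched as follows. This is a minimal illustration, not the patented implementation: `check_bom_rows`, `split_keywords` and `has_conflict` are hypothetical names, and the two callables stand in for the text processing and conflict check that the later steps describe.

```python
from typing import Callable, Iterable, List


def check_bom_rows(
    rows: Iterable[str],
    split_keywords: Callable[[str], List[str]],
    has_conflict: Callable[[List[str]], bool],
) -> List[int]:
    """Return the indices of rows whose keywords contain a package conflict.

    Each row is fully processed (split, then checked) before the next row
    is touched, mirroring the row-by-row order described in the text.
    """
    abnormal_rows = []
    for index, row in enumerate(rows):
        keywords = split_keywords(row)   # text processing for this row
        if has_conflict(keywords):       # conflict check for this row
            abnormal_rows.append(index)
    return abnormal_rows


# Toy stand-ins (assumptions for the demo only): whitespace splitting and a
# check that flags rows naming two different known package codes.
KNOWN_PACKAGES = {"0603", "0805"}


def toy_conflict(keywords: List[str]) -> bool:
    return len({k for k in keywords if k in KNOWN_PACKAGES}) > 1


demo_rows = ["RES 10K 0603 0805", "CAP 100nF 0603"]
abnormal = check_bom_rows(demo_rows, str.split, toy_conflict)
```

Passing the two stages in as callables keeps the loop independent of how segmentation and synonym checking are actually implemented.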
Step 200: Identify the package specifications of the plurality of keywords.
After the keywords are obtained, they can be collected into a keyword list, and the package specification corresponding to each keyword is looked up from that list. A keyword may correspond to one package specification or to several, and the package specifications of different keywords may be the same or different. While the package specifications are being identified, keywords that are not package data can be screened out and discarded. Here, package refers to the component package of an electronic component, that is, the outline and pin layout of the component: the outline and solder-point positions used when the actual component is soldered to the circuit board, including its external dimensions, the board area it occupies, and the pin pitch.
Step 300: when at least two package specifications exist in the plurality of keywords, detecting the relevant state of the at least two package specifications.
When only one package specification exists among the keywords, no package conflict can arise. When at least two package specifications exist, a synonym check is required for every pair, because some package specifications represent the same meaning even though their raw text differs.
Step 400: When the relevant state of the at least two package specifications is a conflict state, determine that the bill of materials is abnormal.
When two or more package specifications are in a conflict state, the bill of materials contains an abnormality, such as a parameter collocation error.
According to the method for checking component packaging conflict, text processing is first performed on the bill of materials to eliminate characters that would interfere with subsequent analysis. Because raw text cannot be substituted directly into a function, the processed text is then split into a plurality of keywords. Next, the package specifications of the keywords are identified, and the data is encapsulated to prevent illegal access to the data between different modules. When at least two package specifications exist among the keywords, a synonym check is performed on them, because two differently written package specifications may express the same meaning. Finally, whether the bill of materials is abnormal is judged from the one or more relevant states: a conflict state indicates that the bill of materials is abnormal, so the bill of materials can be checked on the basis of package conflicts.
The specific implementation of step 100 described above may be adjusted to: acquiring a bill of materials; parsing the bill of materials to obtain target BOM data of the bill of materials; acquiring, from the target BOM data, the column header data and the element data of each single product, wherein the bill of materials contains the material information of a plurality of single products; cleaning and converting the element data according to the column header data to generate parsed element data; packaging the BOM analysis data according to the column header data and the parsed element data; and obtaining a plurality of keywords from the BOM analysis data.
The bill of materials, that is, the BOM (Bill of Material) file, describes the structure of a product in a data format. It is a computer-recognizable data file and the link through which enterprises in the electronics manufacturing supply chain communicate about their main business. The bill of materials here is the original BOM file uploaded by the customer and recognized by the system; such files differ in data content and format. Only after the system has obtained the bill of materials can the subsequent parsing and recognition be performed, which makes it possible to unify the formats of the various bills of materials and correct their errors.
The target BOM data refers to data used for representing single products in the bill of materials, and the target BOM data is BOM data obtained after format unification and error correction are carried out. After the BOM data are subjected to format unification and error correction, the system can more accurately identify the material information of the single product, so that more accurate quotation is performed on the required single product, and the probability of error quotation is reduced. The column header data refers to the beginning data of each column of BOM data, and the single product may be an electronic component. After the BOM analysis data is obtained, keyword extraction can be performed according to the BOM analysis data, and an identification basis is provided for subsequent inspection of package conflict.
Because bill of materials formats differ between companies and manual entry is error-prone, the data is cleaned in advance and converted to a unified format so that the system can recognize the bill of materials more accurately and quote accordingly. The whole parsing and recognition process is performed by the system without manual work, which saves labor cost, avoids secondary errors caused by staff fatigue or inexperience, and improves recognition efficiency.
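As a rough sketch of the parsing, cleaning and format unification described above, the following assumes the bill of materials arrives as CSV text. The function `parse_bom`, the `HEADER_ALIASES` table and the sample part number are hypothetical and only illustrate the idea of mapping varied column headers onto unified names and cleaning the element data; the patent does not specify a file format or a header vocabulary.

```python
import csv
import io
from typing import Dict, List

# Hypothetical alias table: maps header spellings seen in customer files
# onto unified column names.
HEADER_ALIASES = {
    "part number": "model",
    "qty": "quantity",
    "package": "package",
}


def parse_bom(csv_text: str) -> Dict[str, list]:
    """Split CSV text into unified column headers and cleaned element rows."""
    reader = csv.reader(io.StringIO(csv_text))
    # Drop rows that are empty or contain only whitespace cells.
    rows = [row for row in reader if any(cell.strip() for cell in row)]
    if not rows:
        return {"headers": [], "items": []}
    # Column header data: trimmed, lower-cased, then unified via the aliases.
    headers = [
        HEADER_ALIASES.get(h.strip().lower(), h.strip().lower())
        for h in rows[0]
    ]
    # Element data of each single product: trimmed and keyed by header.
    items: List[Dict[str, str]] = [
        dict(zip(headers, (cell.strip() for cell in row))) for row in rows[1:]
    ]
    return {"headers": headers, "items": items}


sample = "Part Number, Qty ,Package\nR-EXAMPLE-1, 100 ,0603\n\n"
parsed = parse_bom(sample)
```

Keywords for the later steps could then be drawn from the cleaned `items` rather than from the raw file.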
Fig. 2 is a flowchart of a method for inspecting component package conflict according to another exemplary embodiment of the present application, as shown in fig. 2, the step 300 may include:
Step 310: Compare the at least two package specifications in pairs to obtain a comparison result.
The package specifications are compared pair by pair: after the first pair has been compared, the next pair is compared. The comparison of two package specifications can be implemented with a hash table. A hash table maps a set of keywords onto a finite, contiguous address range according to a chosen hash function and collision-handling method, and uses the image of each keyword in that range as the storage location of its record. A hash table lookup takes O(1) time on average regardless of how large the table is, which keeps the comparison efficient.
Step 320: Determine the relevant state of the at least two package specifications according to the comparison result.
From the comparison result of any two of the package specifications, the relevant state of those two package specifications can be determined. The relevant state of two package specifications is either a non-conflict state or a conflict state.
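Steps 310 and 320 can be sketched as a loop over every unordered pair of package specifications. The names below (`relevant_states`, `bill_is_abnormal`, the state labels and the `CANONICAL` table) are assumptions made for this illustration; a Python `dict` is used as the hash table, and the table entries are sample data only.

```python
from itertools import combinations
from typing import Callable, Dict, Sequence, Tuple

NON_CONFLICT = "non-conflict"
CONFLICT = "conflict"


def relevant_states(
    specs: Sequence[str],
    same_meaning: Callable[[str, str], bool],
) -> Dict[Tuple[str, str], str]:
    """Compare every pair of specs and record the state of each pair."""
    return {
        (a, b): NON_CONFLICT if same_meaning(a, b) else CONFLICT
        for a, b in combinations(specs, 2)
    }


def bill_is_abnormal(states: Dict[Tuple[str, str], str]) -> bool:
    """A single conflicting pair is enough to flag the bill of materials."""
    return CONFLICT in states.values()


# Sample synonym table (dict lookups are hash-table lookups in Python).
CANONICAL = {"0603": "0603", "1608": "0603", "0805": "0805"}


def same_by_table(a: str, b: str) -> bool:
    return CANONICAL[a] == CANONICAL[b]


states = relevant_states(["0603", "1608", "0805"], same_by_table)
```

With a single package specification `combinations` yields no pairs, so the result is empty and no conflict is reported, which matches the single-specification case in the text.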
Fig. 3 is a flowchart of a method for inspecting component package conflict according to another exemplary embodiment of the present application, as shown in fig. 3, the step 310 may include:
Step 311: Substitute the package specifications, two at a time, into a function to obtain two function values, wherein the function comprises a hash function.
Each package specification is input into the hash function to obtain the hash value of the corresponding keyword. For example, substituting keyword k1 and keyword k2 into the hash function f yields the hash values f(k1) and f(k2).
Step 312: Compare the two function values with the data package dictionary to obtain the two preset package specifications corresponding to the two function values.
The preset package specifications corresponding to the two hash values are looked up in the data package dictionary, which stores each preset package specification together with its corresponding hash values in advance.
Step 313: when the two preset package specifications are equal, the two package specifications corresponding to the two function values are determined to have the same meaning.
When two different hash values are found to correspond to the same preset package specification in the data package dictionary, keyword k1 and keyword k2 are synonyms: if the preset package specifications looked up for f(k1) and f(k2) are the same, k1 and k2 are called synonyms with respect to the hash function.
For example, suppose keywords k1 and k2 correspond to the package specifications 0603 and 1608, and the data package dictionary presets the hash values 7c4a8d09ca3762af61e59520943dc26494f8941b and b08fff87998d20605f134b632cb8fa3398008116 as corresponding to the preset package specification 0603. Substituting 0603 and 1608 into the hash function gives 7c4a8d09ca3762af61e59520943dc26494f8941b for 0603 and b08fff87998d20605f134b632cb8fa3398008116 for 1608. Looking both hash values up in the data package dictionary returns the preset package specification 0603 in each case, so 0603 and 1608 represent the same meaning although they are written differently, and keywords k1 and k2 are synonyms.
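The lookup in steps 311 to 313 might look like the sketch below. The text does not name the hash function, so SHA-1 from Python's `hashlib` is an assumption here, as are the names `spec_hash`, `PRESET_BY_HASH` and `same_meaning`. The table is built by hashing the sample strings at import time rather than by hard-coding digests, and the entries (1608 as the metric name of imperial 0603, 2012 as the metric name of imperial 0805) are sample data.

```python
import hashlib
from typing import Dict


def spec_hash(spec: str) -> str:
    """Hash a package specification string (SHA-1 is an assumed choice)."""
    return hashlib.sha1(spec.encode("utf-8")).hexdigest()


# Data package dictionary: hash value -> preset package specification.
# Several hash values may point at the same preset specification.
PRESET_BY_HASH: Dict[str, str] = {
    spec_hash("0603"): "0603",
    spec_hash("1608"): "0603",
    spec_hash("0805"): "0805",
    spec_hash("2012"): "0805",
}


def same_meaning(spec_a: str, spec_b: str,
                 table: Dict[str, str] = PRESET_BY_HASH) -> bool:
    """Return True when both specs resolve to the same preset specification."""
    preset_a = table.get(spec_hash(spec_a))
    preset_b = table.get(spec_hash(spec_b))
    if preset_a is None or preset_b is None:
        raise KeyError("package specification not in data package dictionary")
    return preset_a == preset_b


synonyms = same_meaning("0603", "1608")      # same preset specification
different = same_meaning("0603", "0805")     # different preset specifications
```

A `False` result corresponds to the conflict state described in step 321.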
In one embodiment, as shown in fig. 3, the step 320 may include:
Step 321: When the comparison result shows that the two package specifications corresponding to the two function values have different meanings, determine that the relevant state of those two package specifications is a conflict state.
The same component cannot have two different package sizes, so two package specifications with different meanings indicate a conflict. For example, after k1 and k2 are substituted into the hash function to obtain f(k1) and f(k2), the preset package specifications corresponding to f(k1) and f(k2) are looked up in the data package dictionary. If the two preset package specifications differ, keyword k1 and keyword k2 have different meanings and the two package specifications are in a conflict state, which also indicates that the bill of materials is abnormal.
Fig. 4 is a flowchart of a method for inspecting component package conflict according to another exemplary embodiment of the present application, as shown in fig. 4, the step 200 may include:
Step 210: Compare the keywords with the data package dictionary to determine the package specifications of the keywords.
The comparison can be performed against a key-value database, in which a key and its value form a key-value pair; each key may correspond to one value, and one key corresponding to several values is also supported. The data package dictionary is a collection of key-value pairs that is mutable and unordered and whose keys are unique. The data package dictionary may directly use an existing dictionary.
In an embodiment, the implementation of step 210 may be further adjusted to: determine the package specification of a keyword when the keyword matches data in the data package dictionary.
Given a keyword as input, the value returned for it shows whether the keyword corresponds to data in the data package dictionary. If the keyword has no corresponding data in the dictionary, an abnormal prompt is returned once its absence is detected. If the keyword exists in the dictionary, its package specification can be determined; a keyword may have one package specification or several.
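A minimal sketch of this key-value lookup, using a Python `dict` as the data package dictionary. The names `PACKAGE_DICTIONARY`, `UnknownKeywordError` and `lookup_package_specs` are hypothetical, and the entries are sample data; values are lists so that one key can carry several package specifications, and a missing key produces an explicit error in place of the "abnormal prompt".

```python
from typing import Dict, List


class UnknownKeywordError(KeyError):
    """Raised when a keyword has no entry in the data package dictionary."""


# Sample dictionary: keyword -> one or more package specifications.
PACKAGE_DICTIONARY: Dict[str, List[str]] = {
    "0603": ["0603"],
    "1608": ["0603"],
    "SOT-23": ["SOT-23"],
}


def lookup_package_specs(
    keyword: str,
    dictionary: Dict[str, List[str]] = PACKAGE_DICTIONARY,
) -> List[str]:
    """Return the package specifications recorded for a keyword."""
    try:
        return dictionary[keyword]
    except KeyError:
        raise UnknownKeywordError(
            f"keyword not in data package dictionary: {keyword!r}"
        ) from None


def package_keywords(keywords: List[str]) -> List[str]:
    """Keep only the keywords that the dictionary recognizes as packages."""
    return [k for k in keywords if k in PACKAGE_DICTIONARY]


found = lookup_package_specs("1608")
filtered = package_keywords(["RES", "10K", "0603", "1608"])
```

`package_keywords` shows the filtering variant, where non-package keywords are screened out quietly instead of raising.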
Fig. 5 is a flowchart of a method for inspecting component package conflict according to another exemplary embodiment of the present application, as shown in fig. 5, the step 100 may include:
Step 110: Extract the material text from the bill of materials.
The material text is extracted from the recognized and cleaned BOM file (bill of materials) and may include information such as the name, model, specification and quantity of each material.
Step 120: Perform text preprocessing on the material text to obtain the preprocessed material text.
The material text is cleaned by deleting characters that would interfere with the analysis result, which yields preprocessed material text free of interfering or invisible characters and reduces the noise in the text.
Step 130: and carrying out vocabulary word segmentation on the preprocessed material text to obtain a plurality of keywords.
The material text is subjected to text preprocessing, so that clean material text can be obtained, keywords play a key role in the clean material text, and even the keywords can determine the text orientation, therefore, the whole text can be summarized by extracting a plurality of keywords in the text. The text is unstructured data and cannot be directly substituted into the function to be calculated, so that the text of the material needs to be converted into keywords, and the keywords are structured data, so that mathematical calculation is facilitated.
Fig. 6 is a flowchart of a method for checking component package conflicts according to another exemplary embodiment of the present application. As shown in Fig. 6, step 120 may include:
Step 121: Perform encoding conversion on the material text to obtain the encoded material text.
For example, GB2312 is converted to UTF-8. Converting the encoding of the material file makes it easier for two systems that use different encoding schemes to communicate and reduces the likelihood of interference during analysis.
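A minimal sketch of the GB2312 to UTF-8 conversion mentioned above, using Python's built-in codecs. The function name `to_utf8` and the choice to replace undecodable bytes rather than fail are assumptions made for illustration.

```python
def to_utf8(raw: bytes, source_encoding: str = "gb2312") -> bytes:
    """Decode bytes from the source encoding and re-encode them as UTF-8."""
    # errors="replace" is an assumed policy: undecodable bytes become U+FFFD
    # instead of aborting the whole bill of materials.
    text = raw.decode(source_encoding, errors="replace")
    return text.encode("utf-8")
```

For input that is valid GB2312, decoding the result as UTF-8 gives back the same characters that were originally encoded.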
Step 122: Convert the full-width characters in the encoded material text into half-width characters to obtain the character-converted material text.
A full-width character occupies two standard character positions, while a half-width character occupies one. Systems generally treat the half-width forms of English letters, digits, and symbols as the basic codes, so converting full-width characters to half-width characters makes the text easier for the system to recognize and process.
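The conversion in step 122 can be illustrated with the fixed offset between the Unicode full-width forms (U+FF01 to U+FF5E) and their ASCII counterparts. The function name `fullwidth_to_halfwidth` is hypothetical, and the sketch assumes the text has already been decoded to a Python string.

```python
def fullwidth_to_halfwidth(text: str) -> str:
    """Map full-width ASCII variants and the ideographic space to half-width."""
    out = []
    for ch in text:
        code = ord(ch)
        if code == 0x3000:                 # ideographic (full-width) space
            out.append(" ")
        elif 0xFF01 <= code <= 0xFF5E:     # full-width '!' through '~'
            out.append(chr(code - 0xFEE0))
        else:                              # everything else is left untouched
            out.append(ch)
    return "".join(out)
```

Characters outside the full-width block, such as Chinese characters or ordinary ASCII, pass through unchanged.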
Step 123: Delete the interfering characters in the character-converted material text to obtain the preprocessed material text.
The interfering characters include emoji, tab characters, or invisible characters.
After character conversion, the material text may still contain useless symbols that carry none of the text's meaning and interfere with it. The interfering characters are therefore deleted to clean the material text further, which finally yields the preprocessed material text.
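One possible way to delete the interfering characters is sketched below. The emoji code point ranges are a rough approximation chosen for illustration, and treating Unicode categories `Cc` (control characters, including tabs) and `Cf` (format characters such as zero-width spaces) as "invisible" is likewise an assumption; the application does not specify how these characters are recognized.

```python
import re
import unicodedata

# Approximate emoji ranges (assumption): pictographs plus older symbol/dingbat blocks.
_EMOJI = re.compile("[\U0001F300-\U0001FAFF\u2600-\u27BF]")


def remove_interfering_chars(text: str) -> str:
    """Delete emoji, tabs and other control characters, and invisible format characters."""
    text = _EMOJI.sub("", text)
    return "".join(
        ch for ch in text
        if unicodedata.category(ch) not in ("Cc", "Cf")
    )
```

Because tabs are deleted rather than replaced, adjacent fields run together; a variant that substitutes a space may suit tab-separated input better.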
Fig. 7 is a flowchart of a method for checking component package conflicts according to another exemplary embodiment of the present application. As shown in Fig. 7, step 130 may include:
Step 131: Split the preprocessed material text into a plurality of keywords according to the word segmentation dictionary.
The word segmentation dictionary includes a preset word segmentation dictionary.
The material text is segmented using dictionary-matching word segmentation. The quality of dictionary-matching segmentation grows with the richness of the word segmentation dictionary, so an existing word segmentation dictionary can be called and a large number of industry-specific terms added to it according to the content and attributes of the material text, which makes the keywords split from the material text more comprehensive and complete.
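Dictionary-matching segmentation can be sketched with forward maximum matching, one common variant of the approach. The application does not say which matching strategy is used, so the strategy, the function name `segment`, and the fallback of emitting unmatched characters one at a time are all assumptions.

```python
from typing import List, Set


def segment(text: str, dictionary: Set[str]) -> List[str]:
    """Split text by always taking the longest dictionary entry at each position."""
    max_len = max(len(word) for word in dictionary)
    tokens: List[str] = []
    i = 0
    while i < len(text):
        if text[i].isspace():              # whitespace only separates tokens
            i += 1
            continue
        for size in range(min(max_len, len(text) - i), 0, -1):
            piece = text[i:i + size]
            if piece in dictionary:        # longest match wins
                tokens.append(piece)
                i += size
                break
        else:                              # no entry starts here: emit one character
            tokens.append(text[i])
            i += 1
    return tokens
```

With the dictionary `{"RES", "10K", "0603", "SOT", "SOT-23"}`, the text `"RES 10K 0603 SOT-23"` is split into `["RES", "10K", "0603", "SOT-23"]`; the longer entry `SOT-23` is preferred over `SOT`, which is why a richer dictionary gives more complete keywords.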
In an embodiment, step 131 may be further implemented as: splitting the preprocessed material text into a plurality of keywords according to the word segmentation dictionary, wherein the preprocessed material text comprises a plurality of unstructured data and the keywords comprise a plurality of structured data.
Text is unstructured data and cannot be substituted directly into a function for calculation, so the material text needs to be converted into keywords, which are structured data and lend themselves to mathematical calculation. For example, substituting a keyword, as structured data, into a hash function yields the hash address (hash value) corresponding to that keyword.
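As an illustration of substituting a keyword into a hash function, the sketch below uses SHA-256 from Python's `hashlib`. The application does not name a specific hash function, so SHA-256 and the function name `keyword_hash` are assumptions. Python's built-in `hash()` is avoided here because its result for strings changes between interpreter runs by default.

```python
import hashlib


def keyword_hash(keyword: str) -> str:
    """Return a stable hexadecimal hash value for a keyword."""
    return hashlib.sha256(keyword.encode("utf-8")).hexdigest()
```

The same keyword always produces the same 64-character value, so the value can serve as a lookup key in a precomputed table.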
The bill of materials contains multiple rows of data, and each row can be processed independently during text processing. Each row contains several pieces of text. Starting from the first row of the bill of materials, text processing is performed, keywords are segmented, and the package specifications of the keywords in that row are identified. If the keywords in the first row carry at least two package specifications, the relevant state of the at least two package specifications is detected to determine whether they conflict and hence whether the first row is abnormal. Once the first row has been checked, keyword segmentation and conflict checking are performed on the next row to determine whether it is abnormal.
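The row-by-row flow can be sketched as below. Whitespace splitting stands in for the dictionary-based segmentation, and the table `SPEC_OF`, which maps each recognized keyword to an assumed canonical package specification, is purely illustrative; the names `SPEC_OF` and `find_abnormal_rows` do not come from the application.

```python
from typing import Dict, List

# Hypothetical table: keyword -> canonical package specification.
# Keywords that share a canonical value are treated as having the same meaning.
SPEC_OF: Dict[str, str] = {
    "0603": "0603",
    "1608": "0603",
    "0805": "0805",
    "2012": "0805",
}


def find_abnormal_rows(rows: List[str]) -> List[int]:
    """Return the indexes of rows whose package specifications conflict."""
    abnormal = []
    for index, row in enumerate(rows):
        keywords = row.split()                         # stand-in for segmentation
        specs = [k for k in keywords if k in SPEC_OF]  # keywords that are package specs
        meanings = {SPEC_OF[k] for k in specs}
        if len(specs) >= 2 and len(meanings) > 1:      # two specs, different meanings
            abnormal.append(index)
    return abnormal
```

For the rows `["RES 10K 0603", "CAP 100nF 0603 0805", "RES 1K 0603 1608"]` the function returns `[1]`: the first row has a single specification, the second names two different packages, and the third names two specifications that the table treats as equivalent.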
Exemplary apparatus
Fig. 8 is a schematic structural diagram of an apparatus for checking component package conflicts according to an exemplary embodiment of the present application. As shown in Fig. 8, the apparatus 8 for checking component package conflicts includes: a processing module 81, configured to perform text processing on the bill of materials to obtain a plurality of keywords; an identification module 82, configured to identify the package specifications of the plurality of keywords; a detection module 83, configured to detect the relevant state of at least two package specifications when at least two package specifications exist in the plurality of keywords; and a determining module 84, configured to determine that the bill of materials is abnormal when the relevant state of the at least two package specifications is a conflict state.
The apparatus 8 for checking component package conflicts first performs text processing on the bill of materials through the processing module 81, removing characters that would interfere with subsequent analysis. A plurality of keywords is then extracted from the processed text: because the nature of text means it cannot be substituted directly into a function, the text has to be processed again to obtain the keywords. Next, the identification module 82 identifies the package specifications of the keywords; packaging the data can prevent illegal access to data between different modules. When at least two package specifications exist among the plurality of keywords, the detection module 83 must also perform a synonym check on the relevant state of the at least two package specifications, because two different package specifications may express the same meaning. Finally, whether the bill of materials is abnormal is judged from the one or more relevant states; if a relevant state is a conflict state, the determining module 84 determines that the bill of materials is abnormal, so that the bill of materials is checked on the basis of package conflicts.
Fig. 9 is a schematic structural diagram of an apparatus for checking component package conflicts according to another exemplary embodiment of the present application. As shown in Fig. 9, the detection module 83 may include: a comparison unit 831, configured to compare the at least two package specifications in pairs to obtain a comparison result; and a state determination unit 832, configured to determine the relevant state of the at least two package specifications according to the comparison result.
In an embodiment, as shown in Fig. 9, the comparison unit 831 may include: a calculation subunit 8311, configured to substitute the at least two package specifications into the function one at a time to obtain two function values, wherein the function comprises a hash function; a comparison subunit 8312, configured to compare the two function values with the data package dictionary to obtain the preset package specifications corresponding to the two function values; and a meaning determination subunit 8313, configured to determine that the two package specifications corresponding to the two function values have the same meaning when the two preset package specifications are equal.
In an embodiment, as shown in Fig. 9, the state determination unit 832 may include: a conflict determination subunit 8321, configured to determine that the relevant state of the two package specifications corresponding to the two function values is a conflict state when the comparison result indicates that the two package specifications have different meanings.
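The pairwise comparison performed by these subunits can be sketched as follows. It assumes SHA-256 as the hash function, a hypothetical alias table `_PRESET`, and an extra `"unknown"` state for specifications missing from the dictionary; none of these details are specified in the application.

```python
import hashlib
from itertools import combinations
from typing import Dict, List, Optional


def spec_hash(spec: str) -> str:
    """Hash a package specification (SHA-256 is an assumed choice)."""
    return hashlib.sha256(spec.encode("utf-8")).hexdigest()


# Hypothetical preset package specifications and the spellings that map to them.
_PRESET = {"SOT-23": ["SOT-23", "SOT23"], "0603": ["0603"]}

# Data package dictionary: hash value -> preset package specification.
HASH_TO_PRESET: Dict[str, str] = {
    spec_hash(alias): preset
    for preset, aliases in _PRESET.items()
    for alias in aliases
}


def relevant_state(spec_a: str, spec_b: str) -> str:
    """Compare two specifications through their preset specifications."""
    preset_a: Optional[str] = HASH_TO_PRESET.get(spec_hash(spec_a))
    preset_b: Optional[str] = HASH_TO_PRESET.get(spec_hash(spec_b))
    if preset_a is None or preset_b is None:
        return "unknown"                   # assumed handling of unlisted specs
    return "same" if preset_a == preset_b else "conflict"


def has_conflict(specs: List[str]) -> bool:
    """True if any pair of specifications is in the conflict state."""
    return any(relevant_state(a, b) == "conflict" for a, b in combinations(specs, 2))
```

Here `relevant_state("SOT23", "SOT-23")` gives `"same"` because both hash values map to one preset specification, while `relevant_state("SOT23", "0603")` gives `"conflict"`.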
In an embodiment, as shown in Fig. 9, the identification module 82 may include: a determining unit 821, configured to compare the plurality of keywords with the data package dictionary and determine the package specifications of the plurality of keywords.
The determining unit 821 described above may be further configured to: determine the package specification of a keyword when the keyword is the same as data in the data package dictionary.
In an embodiment, as shown in Fig. 9, the processing module 81 may include: an extracting unit 811, configured to extract the material text from the bill of materials; a preprocessing unit 812, configured to perform text preprocessing on the material text to obtain the preprocessed material text; and a word segmentation unit 813, configured to perform word segmentation on the preprocessed material text to obtain a plurality of keywords.
In an embodiment, as shown in Fig. 9, the preprocessing unit 812 may include: an encoding subunit 8121, configured to perform encoding conversion on the material text to obtain the encoded material text; a character conversion subunit 8122, configured to convert the full-width characters in the encoded material text into half-width characters to obtain the character-converted material text; and a deleting subunit 8123, configured to delete the interfering characters in the character-converted material text to obtain the preprocessed material text.
In an embodiment, as shown in Fig. 9, the word segmentation unit 813 may include: a word segmentation subunit 8131, configured to split the preprocessed material text into a plurality of keywords according to the word segmentation dictionary.
In an embodiment, the word segmentation subunit 8131 may be further configured to: split the preprocessed material text into a plurality of keywords according to the word segmentation dictionary, wherein the preprocessed material text comprises a plurality of unstructured data and the keywords comprise a plurality of structured data.
Exemplary electronic device
Next, an electronic device according to an embodiment of the present application is described with reference to fig. 10. The electronic device may be either or both of the first device and the second device, or a stand-alone device independent thereof, which may communicate with the first device and the second device to receive the acquired input signals therefrom.
Fig. 10 illustrates a block diagram of an electronic device according to an embodiment of the present application.
As shown in fig. 10, the electronic device 10 includes one or more processors 11 and a memory 12.
The processor 11 may be a Central Processing Unit (CPU) or other form of processing unit having data processing and/or instruction execution capabilities, and may control other components in the electronic device 10 to perform desired functions.
Memory 12 may include one or more computer program products, which may include various forms of computer-readable storage media, such as volatile memory and/or non-volatile memory. The volatile memory may include, for example, random access memory (RAM) and/or cache memory. The non-volatile memory may include, for example, read-only memory (ROM), a hard disk, flash memory, and the like. One or more computer program instructions may be stored on the computer-readable storage medium and executed by the processor 11 to implement the method for checking component package conflicts of the embodiments of the present application described above and/or other desired functions. Various contents such as an input signal, a signal component, a noise component, and the like may also be stored in the computer-readable storage medium.
In one example, the electronic device 10 may further include: an input device 13 and an output device 14, which are interconnected by a bus system and/or other forms of connection mechanisms (not shown).
When the electronic device is a stand-alone device, the input means 13 may be a communication network connector for receiving the acquired input signals from the first device and the second device.
In addition, the input device 13 may also include, for example, a keyboard, a mouse, and the like.
The output device 14 may output various information to the outside, including the determined distance information, direction information, and the like. The output means 14 may include, for example, a display, speakers, a printer, and a communication network and remote output devices connected thereto, etc.
Of course, for simplicity, Fig. 10 shows only some of the components of the electronic device 10 that are relevant to the present application; components such as buses and input/output interfaces are omitted. In addition, the electronic device 10 may include any other suitable components depending on the particular application.
The program code of the computer program product for performing the operations of embodiments of the present application may be written in any combination of one or more programming languages, including object-oriented programming languages such as Java or C++ and conventional procedural programming languages such as the "C" programming language or similar programming languages. The program code may execute entirely on the user's computing device, partly on the user's device as a stand-alone software package, partly on the user's computing device and partly on a remote computing device, or entirely on the remote computing device or server.
The computer-readable storage medium may employ any combination of one or more readable media. The readable medium may be a readable signal medium or a readable storage medium. The readable storage medium may include, for example, but is not limited to, an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device, or a combination of any of the foregoing. More specific examples (a non-exhaustive list) of the readable storage medium include the following: an electrical connection having one or more wires, a portable disk, a hard disk, random access memory (RAM), read-only memory (ROM), erasable programmable read-only memory (EPROM or flash memory), optical fiber, portable compact disc read-only memory (CD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the foregoing.
The foregoing description has been presented for purposes of illustration and description. Furthermore, this description is not intended to limit the embodiments of the application to the form disclosed herein. Although a number of example aspects and embodiments have been discussed above, a person of ordinary skill in the art will recognize certain variations, modifications, alterations, additions, and subcombinations thereof.

Claims (10)

1. A method for checking component package conflicts, characterized by comprising:
performing text processing on a bill of materials to obtain a plurality of keywords, wherein performing text processing on the bill of materials to obtain the plurality of keywords comprises: acquiring a bill of materials file; parsing the bill of materials to obtain target BOM data of the bill of materials; acquiring, from the target BOM data, column header data and element data of each single product in the target BOM data, wherein the bill of materials contains the material information of a plurality of single products; cleaning and converting the element data according to the column header data to generate parsed electronic element data; packaging BOM parsing data according to the column header data and the parsed electronic element data; and obtaining the plurality of keywords according to the BOM parsing data, wherein the keywords are structured data;
identifying package specifications of the plurality of keywords;
detecting the relevant state of at least two package specifications when the at least two package specifications exist in the plurality of keywords;
wherein detecting the relevant state of the at least two package specifications when the at least two package specifications exist in the plurality of keywords comprises:
comparing the at least two package specifications in pairs to obtain a comparison result;
wherein comparing the at least two package specifications in pairs to obtain the comparison result comprises: substituting the at least two package specifications into a function one at a time to obtain two function values, wherein the function comprises a hash function, and a package specification is input into the hash function to obtain the hash value corresponding to the keyword; comparing the two function values with a data package dictionary to obtain two preset package specifications corresponding to the two function values, wherein the data package dictionary is preset with preset package specifications and the hash values corresponding to the preset package specifications; and when the two preset package specifications are equal, determining that the two package specifications corresponding to the two function values have the same meaning;
determining the relevant state of the at least two package specifications according to the comparison result;
wherein determining the relevant state of the at least two package specifications according to the comparison result comprises: when the comparison result shows that the two package specifications corresponding to the two function values have different meanings, determining that the relevant state of the two package specifications corresponding to the two function values is a conflict state; and
when the relevant state of the at least two package specifications is a conflict state, determining that the bill of materials is abnormal.
2. The method of claim 1, wherein identifying the package specifications of the plurality of keywords comprises:
comparing the plurality of keywords with the data package dictionary and determining the package specifications of the plurality of keywords.
3. The method of claim 2, wherein comparing the plurality of keywords with the data package dictionary and determining the package specifications of the plurality of keywords comprises:
when a keyword is the same as data in the data package dictionary, determining the package specification of the keyword.
4. The method of claim 1, wherein performing text processing on the bill of materials to obtain the plurality of keywords comprises:
extracting a material text from the bill of materials;
performing text preprocessing on the material text to obtain a preprocessed material text; and
performing word segmentation on the preprocessed material text to obtain the plurality of keywords.
5. The method of claim 4, wherein performing text preprocessing on the material text to obtain the preprocessed material text comprises:
performing encoding conversion on the material text to obtain an encoded material text;
converting full-width characters in the encoded material text into half-width characters to obtain a character-converted material text; and
deleting the interfering characters in the character-converted material text to obtain the preprocessed material text, wherein the interfering characters comprise emoji, tab characters, or invisible characters.
6. The method of claim 4, wherein performing word segmentation on the preprocessed material text to obtain the plurality of keywords comprises:
splitting the preprocessed material text into the plurality of keywords according to a word segmentation dictionary, wherein the word segmentation dictionary comprises a preset word segmentation dictionary.
7. The method of claim 6, wherein splitting the preprocessed material text into the plurality of keywords according to the word segmentation dictionary comprises:
splitting the preprocessed material text into the plurality of keywords according to the word segmentation dictionary; wherein
the preprocessed material text comprises a plurality of unstructured data, and the plurality of keywords comprises a plurality of structured data.
8. An apparatus for checking component package conflicts, comprising:
a processing module, configured to perform text processing on a bill of materials to obtain a plurality of keywords;
wherein the processing module is configured to: acquire a bill of materials file; parse the bill of materials to obtain target BOM data of the bill of materials; acquire, from the target BOM data, column header data and element data of each single product in the target BOM data, wherein the bill of materials contains the material information of a plurality of single products; clean and convert the element data according to the column header data to generate parsed electronic element data; package BOM parsing data according to the column header data and the parsed electronic element data; and obtain the plurality of keywords according to the BOM parsing data, wherein the keywords are structured data;
an identification module, configured to identify the package specifications of the plurality of keywords;
a detection module, configured to detect the relevant state of at least two package specifications when the at least two package specifications exist in the plurality of keywords;
wherein the detection module comprises: a comparison unit, configured to compare the at least two package specifications in pairs to obtain a comparison result, the comparison unit comprising: a calculation subunit, configured to substitute the at least two package specifications into a function one at a time to obtain two function values, wherein the function comprises a hash function, and a package specification is input into the hash function to obtain the hash value corresponding to the keyword; a comparison subunit, configured to compare the two function values with a data package dictionary to obtain the preset package specifications corresponding to the two function values, wherein the data package dictionary is preset with preset package specifications and the hash values corresponding to the preset package specifications; and a meaning determination subunit, configured to determine that the two package specifications corresponding to the two function values have the same meaning when the two preset package specifications are equal; and a state determination unit, configured to determine the relevant state of the at least two package specifications according to the comparison result, the state determination unit comprising: a conflict determination subunit, configured to determine that the relevant state of the two package specifications corresponding to the two function values is a conflict state when the comparison result is that the two package specifications have different meanings;
and
a determining module, configured to determine that the bill of materials is abnormal when the relevant state of the at least two package specifications is a conflict state.
9. A computer-readable storage medium storing a computer program for executing the method for checking component package conflicts according to any one of claims 1-7.
10. An electronic device, comprising:
a processor;
a memory for storing instructions executable by the processor;
wherein the processor is configured to execute the method for checking component package conflicts according to any one of claims 1-7.
CN202110945684.2A 2021-08-16 2021-08-16 Method, device, medium and equipment for checking component packaging conflict Active CN113627168B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110945684.2A CN113627168B (en) 2021-08-16 2021-08-16 Method, device, medium and equipment for checking component packaging conflict

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110945684.2A CN113627168B (en) 2021-08-16 2021-08-16 Method, device, medium and equipment for checking component packaging conflict

Publications (2)

Publication Number Publication Date
CN113627168A CN113627168A (en) 2021-11-09
CN113627168B true CN113627168B (en) 2023-06-30

Family

ID=78386211

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110945684.2A Active CN113627168B (en) 2021-08-16 2021-08-16 Method, device, medium and equipment for checking component packaging conflict

Country Status (1)

Country Link
CN (1) CN113627168B (en)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113988068B (en) * 2021-12-29 2022-04-15 深圳前海硬之城信息技术有限公司 Word segmentation method, device, equipment and storage medium of BOM text
CN114386368B (en) * 2022-03-23 2022-07-19 百芯智能制造科技(深圳)有限公司 Processing method and processing device for packaging specification item data
CN116452124B (en) * 2023-06-15 2023-08-18 商飞软件有限公司 Real-time state list construction system suitable for civil aircraft time control

Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1549670A (en) * 2003-05-21 2004-11-24 华为技术有限公司 Method and apparatus for examining uniformity
CN102447673A (en) * 2010-09-30 2012-05-09 突触计算机系统(上海)有限公司 Method and equipment for depacking multimedia file carrying a package format
CN107633353A (en) * 2017-09-08 2018-01-26 南京国电南自电网自动化有限公司 Material BOM tables based on EXCEL Table V BA close rule inspection method
CN110502674A (en) * 2019-08-27 2019-11-26 云汉芯城(上海)互联网科技股份有限公司 It is a kind of based on the classification method of calibration of BOM type selecting, device
CN111543070A (en) * 2018-02-09 2020-08-14 英特尔公司 Trusted IOT device configuration and loading
CN111858698A (en) * 2020-06-02 2020-10-30 深圳市鼎盛光电有限公司 Data checking method, data checking device and terminal equipment
CN112447693A (en) * 2019-09-03 2021-03-05 联发科技股份有限公司 Semiconductor device with a plurality of semiconductor chips
CN112667480A (en) * 2020-12-30 2021-04-16 平安证券股份有限公司 Dynamic monitoring method and device for business data, electronic equipment and storage medium
CN112801621A (en) * 2021-01-30 2021-05-14 深圳捷创电子科技有限公司 BOM bill of material sorting and approving method, system and computer readable storage medium

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP4385387B1 (en) * 2009-07-02 2009-12-16 修平 西山 Database system with attributed key-value store

Also Published As

Publication number Publication date
CN113627168A (en) 2021-11-09

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant