CN113627168A - Method, device, medium and equipment for checking component packaging conflict - Google Patents

Method, device, medium and equipment for checking component packaging conflict Download PDF

Info

Publication number
CN113627168A
CN113627168A CN202110945684.2A CN202110945684A CN113627168A CN 113627168 A CN113627168 A CN 113627168A CN 202110945684 A CN202110945684 A CN 202110945684A CN 113627168 A CN113627168 A CN 113627168A
Authority
CN
China
Prior art keywords
packaging
keywords
data
specifications
text
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202110945684.2A
Other languages
Chinese (zh)
Other versions
CN113627168B (en
Inventor
谢国清
林颖朝
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shenzhen Yuncai Network Technology Co ltd
Original Assignee
Shenzhen Yuncai Network Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shenzhen Yuncai Network Technology Co ltd filed Critical Shenzhen Yuncai Network Technology Co ltd
Priority to CN202110945684.2A priority Critical patent/CN113627168B/en
Publication of CN113627168A publication Critical patent/CN113627168A/en
Application granted granted Critical
Publication of CN113627168B publication Critical patent/CN113627168B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/279Recognition of textual entities
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/08Logistics, e.g. warehousing, loading or distribution; Inventory or stock management
    • G06Q10/087Inventory or stock management, e.g. order filling, procurement or balancing against orders
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/10Office automation; Time management
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02PCLIMATE CHANGE MITIGATION TECHNOLOGIES IN THE PRODUCTION OR PROCESSING OF GOODS
    • Y02P90/00Enabling technologies with a potential contribution to greenhouse gas [GHG] emissions mitigation
    • Y02P90/30Computing systems specially adapted for manufacturing

Landscapes

  • Engineering & Computer Science (AREA)
  • Business, Economics & Management (AREA)
  • Theoretical Computer Science (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Human Resources & Organizations (AREA)
  • Economics (AREA)
  • Physics & Mathematics (AREA)
  • Strategic Management (AREA)
  • General Physics & Mathematics (AREA)
  • Tourism & Hospitality (AREA)
  • Marketing (AREA)
  • General Business, Economics & Management (AREA)
  • Quality & Reliability (AREA)
  • Operations Research (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • General Health & Medical Sciences (AREA)
  • General Engineering & Computer Science (AREA)
  • Artificial Intelligence (AREA)
  • Health & Medical Sciences (AREA)
  • Computational Linguistics (AREA)
  • Data Mining & Analysis (AREA)
  • Accounting & Taxation (AREA)
  • Finance (AREA)
  • Development Economics (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The application discloses a method, a device, a medium and equipment for checking component packaging conflict, which comprises the steps of performing text processing on a bill of materials to obtain a plurality of keywords; identifying packaging specifications of the plurality of keywords; when at least two packaging specifications exist in the plurality of keywords, detecting the relevant states of the at least two packaging specifications; and when the relevant states of the at least two packaging specifications are conflict states, determining that the bill of materials is abnormal. The problem that whether data are abnormal or not in the bill of materials cannot be detected when the bill of materials is read can be solved.

Description

Method, device, medium and equipment for checking component packaging conflict
Technical Field
The application relates to the technical field of data processing, in particular to a method, a device, a medium and equipment for checking component packaging conflict.
Background
Existing procurement systems typically obtain procurement requirements information from a user-provided bill of materials. However, the problem that parameter collocation is wrong may exist in the bill of materials provided by the user, and the purchasing system cannot detect whether data abnormality exists in the bill of materials when reading the bill of materials, so that if the data abnormality occurs in the bill of materials, the data generated by the purchasing system is influenced finally.
Disclosure of Invention
The present application is proposed to solve the above-mentioned technical problems. Embodiments of the present application provide a method, an apparatus, a medium, and a device for checking a component packaging conflict, which can solve a problem that whether data abnormality exists in a bill of materials cannot be detected when the bill of materials is read.
According to an aspect of the present application, there is provided a method for checking a component packaging conflict, including: performing text processing on the bill of materials to obtain a plurality of keywords; identifying packaging specifications of the plurality of keywords; when at least two packaging specifications exist in the plurality of keywords, detecting the relevant states of the at least two packaging specifications; and when the relevant states of the at least two packaging specifications are conflict states, determining that the bill of materials is abnormal.
In an embodiment, when there are at least two package specifications in the plurality of keywords, the detecting the relevant states of the at least two package specifications includes: comparing the at least two packaging specifications pairwise to obtain a comparison result; and determining the relevant states of the at least two packaging specifications according to the comparison result.
In an embodiment, the pairwise comparing the at least two package specifications to obtain a comparison result includes: substituting the at least two packaging specifications into a function once to obtain two function values; wherein the function comprises a hash function; comparing the two function values with the data packaging dictionary to obtain two preset packaging specifications corresponding to the two function values; and when the two preset packaging specifications are equal, determining that the two packaging specifications corresponding to the two function values have the same meaning.
In an embodiment, the determining the relevant states of the at least two package specifications according to the comparison result includes: and when the comparison result indicates that the two packaging specifications corresponding to the two function values have different meanings, determining that the relevant states of the two packaging specifications corresponding to the two function values are conflict states.
In one embodiment, the identifying the package specifications of the keywords comprises: and comparing the plurality of keywords with the data packaging dictionary, and determining the packaging specifications of the plurality of keywords.
In an embodiment, the comparing the plurality of keywords with the data package dictionary and determining the package specifications of the plurality of keywords comprises: and when the keyword is the same as the data in the data packaging dictionary, determining the packaging specification of the keyword.
In an embodiment, the text processing the bill of materials, and obtaining a plurality of keywords includes: acquiring a bill of material file; analyzing the bill of materials to obtain target BOM data of the bill of materials; acquiring column header data and element data of each single product in the target BOM data according to the target BOM data; the bill of materials comprises material information of a plurality of single products; according to the column header data, cleaning and converting the element data to generate analytic electronic element data; packaging the column header data and the analysis element data into BOM analysis data; and obtaining a plurality of keywords according to the BOM analysis data.
In an embodiment, the text processing the bill of materials, and obtaining a plurality of keywords includes: extracting a material text from the bill of materials; performing text preprocessing on the material text to obtain a preprocessed material text; and performing vocabulary word segmentation on the preprocessed material text to obtain the plurality of keywords.
In an embodiment, the text preprocessing on the material text, and obtaining the preprocessed material text includes: performing code conversion on the material text to obtain a coded material text; converting full-angle characters in the coding material text into half-angle characters to obtain a converter material text; deleting the interference characters in the transfer character material text to obtain a preprocessed material text; wherein the distracting characters include emoticons, tabs, or invisible characters.
In an embodiment, the performing vocabulary segmentation on the preprocessed material text to obtain the plurality of keywords includes: splitting the preprocessed material text into the plurality of keywords according to the word segmentation dictionary; wherein the segmentation dictionary comprises a preset segmentation dictionary.
In an embodiment, the splitting the preprocessed material text into the plurality of keywords according to the word segmentation dictionary includes: splitting the preprocessed material text into the plurality of keywords according to the word segmentation dictionary; the preprocessed material text comprises a plurality of unstructured data, and the plurality of keywords comprise a plurality of structured data.
According to another aspect of the present application, there is provided a device for inspecting a component packaging conflict, including: the processing module is used for performing text processing on the bill of materials to obtain a plurality of keywords; the identification module is used for identifying the packaging specifications of the keywords; the detection module is used for detecting the relevant states of at least two packaging specifications when at least two packaging specifications exist in the plurality of keywords; and the determining module is used for determining that the bill of material is abnormal when the relevant states of the at least two packaging specifications are conflict states.
According to another aspect of the present application, a computer-readable storage medium is provided, where the storage medium stores a computer program for executing a method for checking a collision of component packages according to any of the above embodiments.
According to another aspect of the present application, there is provided an electronic apparatus including: a processor; a memory for storing the processor-executable instructions; the processor is configured to execute the method for checking a component packaging conflict according to any one of the embodiments.
According to the method, the device, the medium and the equipment for checking the component packaging conflict, firstly, the text processing is carried out on the bill of materials, and characters which can interfere with the follow-up analysis are eliminated. And extracting a plurality of keywords from the processed text, wherein the text cannot be directly substituted into the function for processing due to the attribute of the text, so that the text needs to be processed again to obtain a plurality of keywords. And then identifying the packaging specifications of a plurality of keywords, and packaging the data to prevent illegal access of the data among different modules. When at least two package specifications exist in the plurality of keywords, synonymy checking of the related states of the at least two package specifications is also required, because the meanings of the two different package specifications may be the same. And finally, judging whether the bill of materials is abnormal or not according to one or more relevant states, and if the relevant states are conflict states, reflecting that the bill of materials is abnormal, so that the bill of materials can be checked based on packaging conflicts.
Drawings
The above and other objects, features and advantages of the present application will become more apparent by describing in more detail embodiments of the present application with reference to the attached drawings. The accompanying drawings are included to provide a further understanding of the embodiments of the application and are incorporated in and constitute a part of this specification, illustrate embodiments of the application and together with the description serve to explain the principles of the application. In the drawings, like reference numbers generally represent like parts or steps.
Fig. 1 is a schematic flow chart of a method for checking a collision of component packages according to an exemplary embodiment of the present application.
Fig. 2 is a schematic flow chart of a method for checking a collision of a component package according to another exemplary embodiment of the present application.
Fig. 3 is a schematic flow chart of a method for checking a collision of a component package according to another exemplary embodiment of the present application.
Fig. 4 is a schematic flowchart of a method for checking a collision of component packages according to another exemplary embodiment of the present application.
Fig. 5 is a schematic flowchart of a method for checking a collision of component packages according to another exemplary embodiment of the present application.
Fig. 6 is a schematic flowchart of a method for checking a collision of component packages according to another exemplary embodiment of the present application.
Fig. 7 is a schematic flowchart of a method for checking a collision of component packages according to another exemplary embodiment of the present application.
Fig. 8 is a schematic structural diagram of an apparatus for inspecting a collision of component packages according to an exemplary embodiment of the present application.
Fig. 9 is a schematic structural diagram of an apparatus for inspecting a collision of component packages according to another exemplary embodiment of the present application.
Fig. 10 is a block diagram of an electronic device provided in an exemplary embodiment of the present application.
Detailed Description
Hereinafter, example embodiments according to the present application will be described in detail with reference to the accompanying drawings. It should be understood that the described embodiments are only some embodiments of the present application and not all embodiments of the present application, and that the present application is not limited by the example embodiments described herein.
Exemplary method
Fig. 1 is a schematic flow chart of a method for checking a component packaging conflict according to an exemplary embodiment of the present application, where as shown in fig. 1, the method for checking a component packaging conflict includes:
step 100: and performing text processing on the bill of materials to obtain a plurality of keywords.
The bill of material may be a BOM file identified by parsing, and the material text may be extracted from the BOM file identified by parsing. Text extracted directly from a BOM file may have interfering characters or many invisible characters that interfere with subsequent analysis. And the texts directly extracted from the BOM files are all unstructured data, and the unstructured data cannot be directly subjected to mathematical computation. Therefore, the bill of materials needs to be subjected to text processing first, and a plurality of keywords are obtained from the bill of materials, and the words of the text can be segmented by adopting a word segmentation mode based on dictionary matching, so that the keywords can be obtained from the words. The bill of materials may be a list of purchased components.
The bill of materials contains multiple rows of data, when the bill of materials is subjected to text processing, the text processing can be carried out on one line, each line is divided into multiple keywords, the multiple keywords in each line are subjected to conflict check, and after the conflict check of one line of data is processed, the text processing and the conflict check of the next line are carried out.
Step 200: package specifications for a plurality of keywords are identified.
After a plurality of keywords are obtained, a keyword list can be formed, and the packaging specification corresponding to the keywords is searched according to the keyword list. The plurality of keywords may correspond to one package specification or a plurality of package specifications, and the package specifications of the plurality of keywords may be the same or different. The packaging specifications of the keywords are identified, non-packaged data can be screened out from the keywords, and the non-packaged data is filtered out. The packaging refers to component packaging of the electronic component, and the packaging refers to distribution of the shape and pins of the component, that is, the electronic component packaging refers to the appearance and the positions of welding points indicated when the actual electronic component is welded to the circuit board, and includes the external dimension, the occupied space position, the spaces among pins and the like of the actual electronic component.
Step 300: when at least two packaging specifications exist in the plurality of keywords, detecting the relevant states of the at least two packaging specifications.
When only one packaging specification exists in the keywords, the packaging conflict cannot be generated by one packaging specification. When there are at least two package specifications in the plurality of keywords, it is necessary to perform a synonymous check for every two package specifications because some package specifications may represent the same meaning although the original data is different.
Step 400: and when the relevant states of at least two packaging specifications are conflict states, determining that the bill of materials is abnormal.
When the relevant states of two package specifications are conflict states, or the relevant states of more than two package specifications are conflict states, it can reflect that there is an exception in the bill of material, for example, there is an exception of parameter collocation error.
According to the method for checking the component packaging conflict, firstly, the bill of materials is subjected to text processing, and characters which can interfere with subsequent analysis are eliminated. And extracting a plurality of keywords from the processed text, wherein the text cannot be directly substituted into the function for processing due to the attribute of the text, so that the text needs to be processed again to obtain a plurality of keywords. And then identifying the packaging specifications of a plurality of keywords, and packaging the data to prevent illegal access of the data among different modules. When at least two package specifications exist in the plurality of keywords, synonymy checking of the related states of the at least two package specifications is also required, because the meanings of the two different package specifications may be the same. And finally, judging whether the bill of materials is abnormal or not according to one or more relevant states, and if the relevant states are conflict states, reflecting that the bill of materials is abnormal, so that the bill of materials can be checked based on packaging conflicts.
The specific implementation of the above step 200 can be adjusted as follows: acquiring a bill of materials; analyzing the bill of materials to obtain target BOM data of the bill of materials; acquiring column header data and element data of each single product in the target BOM data according to the target BOM data; the bill of materials comprises material information of a plurality of single products; according to the column header data, cleaning and converting the element data to generate analytic element data; packaging the column header data and the analytic element data into BOM analytic data; and analyzing the data according to the BOM to obtain a plurality of keywords.
The bill of material (bom of material) file, i.e. the file describing the structure of the product in data format, is a data file that can be identified by the computer, and is also a link for the electronic manufacturing supply chain enterprise to contact and communicate the main business. The bill of material is the original BOM file uploaded by the client identified by the system, the data content, format and the like are different, and the system can perform the subsequent analysis and identification process only after obtaining the bill of material, so that the unification of formats and the correction of mistakes and omissions of various bills of material are facilitated.
The target BOM data refers to data used for representing a single item in a bill of materials, and the target BOM data is BOM data obtained after format unification and error correction are carried out. After the BOM data is subjected to format unification and error correction, the system can more accurately identify the material information of the single product, and further more accurately quote the required single product, so that the probability of wrong quotation is reduced. The column head data refers to the head data of each column of BOM data, and a single product can be an electronic component. After the BOM analysis data is obtained, keyword extraction can be performed according to the BOM analysis data, and an identification basis is provided for follow-up inspection and packaging conflicts.
Because the bill of material formats of different companies are different, manual input is difficult to avoid mistakes and omissions, and therefore data is cleaned in advance and the formats are converted in a unified mode, so that the system can recognize material information more accurately, and price quotation is accurate. The whole analysis and identification process is carried out by the system, manpower is not needed, the labor cost is saved, meanwhile, secondary error conditions caused by fatigue of workers or insufficient experience and the like are avoided again, and the identification efficiency is higher.
Fig. 2 is a schematic flow chart of a method for checking a component packaging conflict according to another exemplary embodiment of the present application, and as shown in fig. 2, the step 300 may include:
step 310: and comparing at least two packaging specifications pairwise to obtain a comparison result.
And comparing the plurality of packaging specifications pairwise, and comparing the next pair of packaging specifications after comparing one pair of packaging specifications. The comparison of the two package specifications may be achieved by a hash table. The hash table is a table in which a set of keywords are mapped onto a limited continuous address set (segment) according to a set hash function and a method for handling collisions, and images of the keywords in the address set are recorded as storage locations in the table. The time complexity of the hash algorithm is O (1), and the calculation time consumed by O (1) is the same no matter the linked list is large or small, so that the comparison efficiency can be ensured.
Step 320: and determining the relevant states of at least two packaging specifications according to the comparison result.
According to the comparison result of any two of the at least two packaging specifications, the relevant state of any two packaging specifications can be determined. The relevant states of the two package specifications may be non-conflicting or conflicting states.
Fig. 3 is a schematic flowchart of a method for checking a component packaging conflict according to another exemplary embodiment of the present application, and as shown in fig. 3, the step 310 may include:
step 311: substituting at least two packaging specifications into the function once to obtain two function values; wherein the function comprises a hash function.
The hash function is a hash function, and the package specification is input into the hash function to obtain a hash value corresponding to the keyword. For example, regarding the keyword k1 and the keyword k2, the hash values f (k1) and f (k2) are obtained by substituting the keyword k1 and the keyword k2 into the hash function.
Step 312: comparing the two function values with the data packaging dictionary to obtain two preset packaging specifications corresponding to the two function values;
and inquiring the preset packaging specifications corresponding to the two hash values in the data packaging dictionary. The data packaging dictionary is preset with a preset packaging specification and a hash value corresponding to the preset packaging specification in advance.
Step 313: and when the two preset packaging specifications are equal, determining that the two packaging specifications corresponding to the two function values have the same meaning.
When two different hash values corresponding to the same preset packaging specification are searched in the data packaging dictionary, the keyword k1 and the keyword k2 are synonyms. For example, after the keyword k1 and the keyword k2 are substituted into a hash function, hash values f (k1) and f (k2) are obtained, preset package specifications corresponding to f (k1) and f (k2) are inquired in a data package dictionary, and if the two preset package specifications are the same, the keyword k1 and the keyword k2 are called synonyms for the hash function.
Keywords k1 and k2 correspond to package specifications 0603 and 1608, respectively, and hash values corresponding to 0603 are preset in a data package dictionary to be 7c4a8d09ca3762af61e59520943dc26494f8941b and b08fff87998d20605f134b632cb8fa 3398008116. 0603 and 1608 are respectively substituted into the hash function to be calculated, the hash value of 0603 is 7c4a8d09ca3762af61e59520943dc26494f8941b, the hash value of 1608 is b08fff87998d20605f134b632cb8fa 33988116, the preset package specifications corresponding to the hash values of 0603 and 1608 are inquired in the data package dictionary, and the preset package specifications corresponding to the hash values of 0603 and 1608 are 0603 according to the preset hash value of the data package dictionary, so that 0603 and 1608 represent the same meaning although the expression forms are different, and the keyword k1 and the keyword k2 are synonyms.
In one embodiment, as shown in fig. 3, the step 320 may include:
step 321: and when the comparison result shows that the two packaging specifications corresponding to the two function values have different meanings, determining the relevant states of the two packaging specifications corresponding to the two function values as conflict states.
When the two package specifications have different meanings, the relevant states of the two package specifications corresponding to the two function values can be determined to be conflict states. The same component cannot correspond to two package sizes, and therefore, when two package specifications with different meanings appear, a conflict can be judged to appear. For example, after k1 and k2 are substituted into a hash function, hash values f (k1) and f (k2) are obtained, preset package specifications corresponding to f (k1) and f (k2) are inquired in a data package dictionary, if two preset package specifications corresponding to f (k1) and f (k2) are different, a keyword k1 and a keyword k2 have different meanings, the two package specifications are in a conflict state, and the conflict state also indicates that the bill of materials is abnormal.
Fig. 4 is a schematic flow chart of a method for checking a component packaging conflict according to another exemplary embodiment of the present application, and as shown in fig. 4, the step 200 may include:
step 210: and comparing the plurality of keywords with the data packaging dictionary, and determining the packaging specifications of the plurality of keywords.
Comparing the plurality of keywords can be performed by using a key-value database, wherein a key represents a keyword, a value represents a value, the key and the value can be used as a key-value pair, each key can correspond to a value, and one key can also be supported to correspond to a plurality of values. The data packaging dictionary is a collection of data of key-value key value pairs, and has the characteristics of variable, unordered and non-repeated key. The data package dictionary may call an existing dictionary directly.
In an embodiment, the specific implementation of step 210 may be further adjusted as follows: and determining the packaging specification of the key words when the key words are the same as the data in the data packaging dictionary.
By inputting the keyword and then obtaining the corresponding value returned according to the keyword, whether the keyword corresponds to the data in the data packaging dictionary can be judged. If the keyword is input but the data in the data packaging dictionary does not have corresponding data, the keyword is detected to be absent, and an abnormal prompt is returned. When the keyword exists in the data packaging dictionary, the packaging specification of the keyword can be judged, and the packaging specification of the keyword can be one or more.
Fig. 5 is a schematic flow chart of a method for checking a collision of a component package according to another exemplary embodiment of the present application, and as shown in fig. 5, the step 100 may include:
step 110: and extracting the material text from the bill of material.
Extracting a material text from the identified and cleaned BOM file (bill of material), wherein the material text can comprise information of name, model, specification, quantity and the like of the material.
Step 120: and performing text preprocessing on the material text to obtain a preprocessed material text.
And (3) text cleaning is carried out on the material text, characters which can cause interference on the analysis result are deleted, and the preprocessed material text with the interference characters or invisible characters removed is obtained, so that the interference items of the text are reduced.
Step 130: and performing vocabulary word segmentation on the preprocessed material text to obtain a plurality of keywords.
The material text is preprocessed to obtain a clean material text, and some key words play a key role in the clean material text, even the key words can determine the text orientation, so that the whole text can be summarized by extracting a plurality of key words from the text. And the texts are all some unstructured data and cannot be directly substituted into the function for calculation, so that the material texts need to be converted into key words, and the key words are structured data, so that mathematical calculation is convenient for the key words.
Fig. 6 is a schematic flow chart of a method for checking a component packaging conflict according to another exemplary embodiment of the present application, and as shown in fig. 6, the step 120 may include:
step 121: and performing code conversion on the material text to obtain a coded material text.
For example, gb2312 is converted to UTF 8. The material file is subjected to code conversion, two systems using different coding schemes can communicate conveniently, and the possibility of interference possibly generated in the analysis process is reduced.
Step 122: and converting full-angle characters in the coded material text into half-angle characters to obtain a converter material text.
One character in the full-angle character occupies two standard character positions, one character in the half-angle character occupies one standard character position, and usually, in the system, the half-angle shapes of English letters, number keys and symbol keys are processed as basic codes. Thus, converting full-angle characters to half-angle characters facilitates recognition and processing by the system.
Step 123: and deleting the interference characters in the text of the material of the transfer character to obtain the preprocessed text of the material.
Wherein the distracting characters include emoticons, tabs, or invisible characters.
After the material text is subjected to character conversion, some useless symbols may exist, so that the meaning of the text cannot be reflected, and the material text is interfered. Therefore, the interference characters in the text of the material of the transfer character are deleted, the text of the material is further cleaned, and the preprocessed text of the material is finally obtained.
Fig. 7 is a schematic flowchart of a method for checking a collision of a component package according to another exemplary embodiment of the present application, and as shown in fig. 7, the step 130 may include:
step 131: and splitting the preprocessed material text into a plurality of keywords according to the word segmentation dictionary.
The word segmentation dictionary comprises a preset word segmentation dictionary.
And performing word segmentation on the material text by adopting a word segmentation mode based on dictionary matching. The effect of the word segmentation mode based on dictionary matching is in direct proportion to the richness of the word segmentation dictionary, so that the existing word segmentation dictionary can be called, a large number of professional words of related industries are added into the word segmentation dictionary artificially according to the content and the attribute of the material text, and the keywords split from the material text are more comprehensive and complete.
In an embodiment, the specific implementation of the step 131 may be further adjusted as follows: splitting the preprocessed material text into a plurality of keywords according to the word segmentation dictionary; the preprocessed material text comprises a plurality of unstructured data, and the plurality of keywords comprise a plurality of structured data.
The texts are all unstructured data and cannot be directly substituted into the functions for calculation, so that the material texts need to be converted into key words, and the key words are structured data, so that mathematical calculation is convenient to perform on the key words. For example, substituting a keyword of the structured data into a hash function may obtain a hash address (hash value) corresponding to the keyword.
The bill of materials contains multiple rows of data, and when the bill of materials is subjected to text processing, each row of data can be processed independently. Each line of data comprises a plurality of texts, starting from a first line of data in a bill of materials, firstly performing text processing on the first line of data, dividing a keyword, identifying packaging specifications of the plurality of keywords in the first line, if the plurality of keywords in the first line have at least two packaging specifications, detecting the relevant states of the at least two packaging specifications, determining whether the two packaging specifications conflict or not, thereby determining whether the first line of data is abnormal or not, and continuing performing keyword division and conflict check on the next line of data after the first line of data is checked, and determining whether the next line of data is abnormal or not.
Exemplary devices
Fig. 8 is a schematic structural diagram of an apparatus for inspecting a collision of a component package according to an exemplary embodiment of the present application, where, as shown in fig. 8, the apparatus 8 for inspecting a collision of a component package includes: the processing module 81 is used for performing text processing on the bill of materials to obtain a plurality of keywords; an identification module 82 for identifying packaging specifications of a plurality of keywords; a detection module 83, configured to detect a correlation state of at least two package specifications when at least two package specifications exist in the plurality of keywords; and a determining module 84, configured to determine that there is an exception to the bill of material when the relevant status of the at least two package specifications is a conflict status.
The inspection device 8 for the component packaging conflict provided by the application firstly performs text processing on the bill of materials through the processing module 81, and eliminates characters which can interfere with subsequent analysis. And extracting a plurality of keywords from the processed text, wherein the text cannot be directly substituted into the function for processing due to the attribute of the text, so that the text needs to be processed again to obtain a plurality of keywords. The packaging specification of the keywords is then identified by the identification module 82, and the packaging of the data can prevent illegal access of the data between different modules. When there are at least two package specifications in the plurality of keywords, the detection module 83 is further required to perform a synonymous check on the relevant states of the at least two package specifications, because the meanings of the two different package specifications may be the same. Finally, whether the bill of materials is abnormal or not is judged according to one or more relevant states, if the relevant states are conflict states, the determining module 84 can determine that the bill of materials is abnormal, and therefore the bill of materials can be checked based on the packaging conflict.
Fig. 9 is a schematic structural diagram of an apparatus for inspecting a collision of a component package according to another exemplary embodiment of the present application, and as shown in fig. 9, the detection module 83 may include: the comparison unit 831 is configured to compare at least two package specifications with each other in pairs to obtain a comparison result; a determine status unit 832 for determining the relevant status of at least two package specifications according to the comparison result.
In an embodiment, as shown in fig. 9, the comparing unit 831 may include: a calculation subunit 8311, configured to substitute at least two package specifications into the function at a time to obtain two function values; wherein the function comprises a hash function; a comparison subunit 8312, configured to compare the two function values with the data encapsulation dictionary to obtain preset encapsulation specifications corresponding to the two function values; a meaning determining subunit 8313 is configured to determine that the two package specifications corresponding to the two function values have the same meaning when the two preset package specifications are equal to each other.
In an embodiment, as shown in fig. 9, the status determining unit 832 may include: a conflict determination subunit 8321 is configured to determine that the relevant state of the two package specifications corresponding to the two function values is a conflict state when the comparison result indicates that the two package specifications corresponding to the two function values have different meanings.
In one embodiment, as shown in fig. 9, the identification module 82 may include: the determining and packaging unit 821 is used for comparing the plurality of keywords with the data packaging dictionary and determining the packaging specifications of the plurality of keywords.
The determination encapsulating unit 821 may be further configured to: and determining the packaging specification of the key words when the key words are the same as the data in the data packaging dictionary.
In an embodiment, as shown in fig. 9, the processing module 81 may include: the extracting unit 811 is used for extracting the material texts from the bill of materials; a preprocessing unit 812, configured to perform text preprocessing on the material text to obtain a preprocessed material text; the word segmentation unit 813 is configured to perform word segmentation on the preprocessed material text to obtain a plurality of keywords.
In an embodiment, as shown in fig. 9, the preprocessing unit 812 may include: the coding subunit 8121 is configured to perform coding conversion on the material text to obtain a coded material text; a rotor subunit 8122, configured to convert full-angle characters in the encoded material text into half-angle characters, and obtain a rotor material text; and the character deleting subunit 8123 is configured to delete the interfering characters in the material text of the character to obtain the preprocessed material text.
In an embodiment, as shown in fig. 9, the word segmentation unit 813 may include: and the word segmentation subunit 8131 is configured to split the preprocessed material text into a plurality of keywords according to the word segmentation dictionary.
In an embodiment, the participle subunit 8131 may be further configured to: splitting the preprocessed material text into a plurality of keywords according to the word segmentation dictionary; the preprocessed material text comprises a plurality of unstructured data, and the plurality of keywords comprise a plurality of structured data.
Exemplary electronic device
Next, an electronic apparatus according to an embodiment of the present application is described with reference to fig. 10. The electronic device may be either or both of the first device and the second device, or a stand-alone device separate from them, which stand-alone device may communicate with the first device and the second device to receive the acquired input signals therefrom.
FIG. 10 illustrates a block diagram of an electronic device in accordance with an embodiment of the present application.
As shown in fig. 10, the electronic device 10 includes one or more processors 11 and memory 12.
The processor 11 may be a Central Processing Unit (CPU) or other form of processing unit having data processing capabilities and/or instruction execution capabilities, and may control other components in the electronic device 10 to perform desired functions.
Memory 12 may include one or more computer program products that may include various forms of computer-readable storage media, such as volatile memory and/or non-volatile memory. The volatile memory may include, for example, Random Access Memory (RAM), cache memory (cache), and/or the like. The non-volatile memory may include, for example, Read Only Memory (ROM), hard disk, flash memory, etc. One or more computer program instructions may be stored on the computer-readable storage medium and executed by the processor 11 to implement the method for checking the collision of the component package of the various embodiments of the present application described above and/or other desired functions. Various contents such as an input signal, a signal component, a noise component, etc. may also be stored in the computer-readable storage medium.
In one example, the electronic device 10 may further include: an input device 13 and an output device 14, which are interconnected by a bus system and/or other form of connection mechanism (not shown).
When the electronic device is a stand-alone device, the input means 13 may be a communication network connector for receiving the acquired input signals from the first device and the second device.
The input device 13 may also include, for example, a keyboard, a mouse, and the like.
The output device 14 may output various information including the determined distance information, direction information, and the like to the outside. The output devices 14 may include, for example, a display, speakers, a printer, and a communication network and its connected remote output devices, among others.
Of course, for simplicity, only some of the components of the electronic device 10 relevant to the present application are shown in fig. 10, and components such as buses, input/output interfaces, and the like are omitted. In addition, the electronic device 10 may include any other suitable components depending on the particular application.
The computer program product may be written with program code for performing the operations of embodiments of the present application in any combination of one or more programming languages, including an object oriented programming language such as Java, C + + or the like and conventional procedural programming languages, such as the "C" programming language or similar programming languages. The program code may execute entirely on the user's computing device, partly on the user's device, as a stand-alone software package, partly on the user's computing device and partly on a remote computing device, or entirely on the remote computing device or server.
The computer-readable storage medium may take any combination of one or more readable media. The readable medium may be a readable signal medium or a readable storage medium. A readable storage medium may include, for example, but not limited to, an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device, or a combination of any of the foregoing. More specific examples (a non-exhaustive list) of the readable storage medium include: an electrical connection having one or more wires, a portable disk, a hard disk, a Random Access Memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or flash memory), an optical fiber, a portable compact disc read-only memory (CD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the foregoing.
The foregoing description has been presented for purposes of illustration and description. Furthermore, the description is not intended to limit embodiments of the application to the form disclosed herein. While a number of example aspects and embodiments have been discussed above, those of skill in the art will recognize certain variations, modifications, alterations, additions and sub-combinations thereof.

Claims (14)

1. A method for checking a component packaging conflict is characterized by comprising the following steps:
performing text processing on the bill of materials to obtain a plurality of keywords;
identifying packaging specifications of the plurality of keywords;
when at least two packaging specifications exist in the plurality of keywords, detecting the relevant states of the at least two packaging specifications; and
and when the relevant states of the at least two packaging specifications are conflict states, determining that the bill of materials is abnormal.
2. The method according to claim 1, wherein when at least two package specifications exist in the plurality of keywords, the detecting the relevant status of the at least two package specifications comprises:
comparing the at least two packaging specifications pairwise to obtain a comparison result;
and determining the relevant states of the at least two packaging specifications according to the comparison result.
3. The method of claim 2, wherein pairwise comparing the at least two package specifications comprises:
substituting the at least two packaging specifications into a function once to obtain two function values; wherein the function comprises a hash function;
comparing the two function values with the data packaging dictionary to obtain two preset packaging specifications corresponding to the two function values;
and when the two preset packaging specifications are equal, determining that the two packaging specifications corresponding to the two function values have the same meaning.
4. The method of claim 2, wherein said determining the relative status of the at least two package sizes according to the comparison comprises:
and when the comparison result indicates that the two packaging specifications corresponding to the two function values have different meanings, determining that the relevant states of the two packaging specifications corresponding to the two function values are conflict states.
5. The method of claim 1, wherein identifying the package specifications for the plurality of keywords comprises:
and comparing the plurality of keywords with the data packaging dictionary, and determining the packaging specifications of the plurality of keywords.
6. The method of claim 5, wherein comparing the plurality of keywords to a data package dictionary and determining a package specification for the plurality of keywords comprises:
and when the keyword is the same as the data in the data packaging dictionary, determining the packaging specification of the keyword.
7. The method of claim 1, wherein the texting the bill of materials to obtain the plurality of keywords comprises:
acquiring a bill of material file;
analyzing the bill of materials to obtain target BOM data of the bill of materials;
acquiring column header data and element data of each single product in the target BOM data according to the target BOM data; the bill of materials comprises material information of a plurality of single products;
according to the column header data, cleaning and converting the element data to generate analytic electronic element data; and
packaging the column header data and the analysis element data into BOM analysis data;
and obtaining a plurality of keywords according to the BOM analysis data.
8. The method of claim 1, wherein the texting the bill of materials to obtain the plurality of keywords comprises:
extracting a material text from the bill of materials;
performing text preprocessing on the material text to obtain a preprocessed material text; and
and performing vocabulary word segmentation on the preprocessed material text to obtain the plurality of keywords.
9. The method of claim 8, wherein the text preprocessing the material text to obtain a preprocessed material text comprises:
performing code conversion on the material text to obtain a coded material text;
converting full-angle characters in the coding material text into half-angle characters to obtain a converter material text; and
deleting the interference characters in the transfer material text to obtain a preprocessed material text; wherein the distracting characters include emoticons, tabs, or invisible characters.
10. The method of claim 8, wherein the lexical segmentation of the preprocessed material text to obtain the plurality of keywords comprises:
splitting the preprocessed material text into the plurality of keywords according to the word segmentation dictionary; wherein the segmentation dictionary comprises a preset segmentation dictionary.
11. The method of claim 10, wherein the splitting the preprocessed material text into the plurality of keywords according to the segmentation dictionary comprises:
splitting the preprocessed material text into the plurality of keywords according to the word segmentation dictionary; the preprocessed material text comprises a plurality of unstructured data, and the plurality of keywords comprise a plurality of structured data.
12. An inspection apparatus for a component packaging conflict, comprising:
the processing module is used for performing text processing on the bill of materials to obtain a plurality of keywords;
the identification module is used for identifying the packaging specifications of the keywords;
the detection module is used for detecting the relevant states of at least two packaging specifications when at least two packaging specifications exist in the plurality of keywords; and
and the determining module is used for determining that the bill of material is abnormal when the relevant states of the at least two packaging specifications are conflict states.
13. A computer-readable storage medium storing a computer program for executing a method of checking a component packaging conflict according to any one of claims 1 to 11.
14. An electronic device, the electronic device comprising:
a processor;
a memory for storing the processor-executable instructions;
the processor is configured to execute a method for checking a component packaging conflict according to any one of claims 1 to 11.
CN202110945684.2A 2021-08-16 2021-08-16 Method, device, medium and equipment for checking component packaging conflict Active CN113627168B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110945684.2A CN113627168B (en) 2021-08-16 2021-08-16 Method, device, medium and equipment for checking component packaging conflict

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110945684.2A CN113627168B (en) 2021-08-16 2021-08-16 Method, device, medium and equipment for checking component packaging conflict

Publications (2)

Publication Number Publication Date
CN113627168A true CN113627168A (en) 2021-11-09
CN113627168B CN113627168B (en) 2023-06-30

Family

ID=78386211

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110945684.2A Active CN113627168B (en) 2021-08-16 2021-08-16 Method, device, medium and equipment for checking component packaging conflict

Country Status (1)

Country Link
CN (1) CN113627168B (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113988068A (en) * 2021-12-29 2022-01-28 深圳前海硬之城信息技术有限公司 Word segmentation method, device, equipment and storage medium of BOM text
CN114386368A (en) * 2022-03-23 2022-04-22 百芯智能制造科技(深圳)有限公司 Processing method and processing device for packaging specification item data
CN116452124A (en) * 2023-06-15 2023-07-18 商飞软件有限公司 Real-time state list construction system suitable for civil aircraft time control

Citations (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1549670A (en) * 2003-05-21 2004-11-24 华为技术有限公司 Method and apparatus for examining uniformity
US20110004638A1 (en) * 2009-07-02 2011-01-06 Shuhei Nishiyama Attributed key-value-store database system
CN102447673A (en) * 2010-09-30 2012-05-09 突触计算机系统(上海)有限公司 Method and equipment for depacking multimedia file carrying a package format
CN107633353A (en) * 2017-09-08 2018-01-26 南京国电南自电网自动化有限公司 Material BOM tables based on EXCEL Table V BA close rule inspection method
CN110502674A (en) * 2019-08-27 2019-11-26 云汉芯城(上海)互联网科技股份有限公司 It is a kind of based on the classification method of calibration of BOM type selecting, device
CN111543070A (en) * 2018-02-09 2020-08-14 英特尔公司 Trusted IOT device configuration and loading
CN111858698A (en) * 2020-06-02 2020-10-30 深圳市鼎盛光电有限公司 Data checking method, data checking device and terminal equipment
CN112447693A (en) * 2019-09-03 2021-03-05 联发科技股份有限公司 Semiconductor device with a plurality of semiconductor chips
CN112667480A (en) * 2020-12-30 2021-04-16 平安证券股份有限公司 Dynamic monitoring method and device for business data, electronic equipment and storage medium
CN112801621A (en) * 2021-01-30 2021-05-14 深圳捷创电子科技有限公司 BOM bill of material sorting and approving method, system and computer readable storage medium

Patent Citations (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1549670A (en) * 2003-05-21 2004-11-24 华为技术有限公司 Method and apparatus for examining uniformity
US20110004638A1 (en) * 2009-07-02 2011-01-06 Shuhei Nishiyama Attributed key-value-store database system
CN102447673A (en) * 2010-09-30 2012-05-09 突触计算机系统(上海)有限公司 Method and equipment for depacking multimedia file carrying a package format
CN107633353A (en) * 2017-09-08 2018-01-26 南京国电南自电网自动化有限公司 Material BOM tables based on EXCEL Table V BA close rule inspection method
CN111543070A (en) * 2018-02-09 2020-08-14 英特尔公司 Trusted IOT device configuration and loading
CN110502674A (en) * 2019-08-27 2019-11-26 云汉芯城(上海)互联网科技股份有限公司 It is a kind of based on the classification method of calibration of BOM type selecting, device
CN112447693A (en) * 2019-09-03 2021-03-05 联发科技股份有限公司 Semiconductor device with a plurality of semiconductor chips
CN111858698A (en) * 2020-06-02 2020-10-30 深圳市鼎盛光电有限公司 Data checking method, data checking device and terminal equipment
CN112667480A (en) * 2020-12-30 2021-04-16 平安证券股份有限公司 Dynamic monitoring method and device for business data, electronic equipment and storage medium
CN112801621A (en) * 2021-01-30 2021-05-14 深圳捷创电子科技有限公司 BOM bill of material sorting and approving method, system and computer readable storage medium

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113988068A (en) * 2021-12-29 2022-01-28 深圳前海硬之城信息技术有限公司 Word segmentation method, device, equipment and storage medium of BOM text
CN114386368A (en) * 2022-03-23 2022-04-22 百芯智能制造科技(深圳)有限公司 Processing method and processing device for packaging specification item data
CN116452124A (en) * 2023-06-15 2023-07-18 商飞软件有限公司 Real-time state list construction system suitable for civil aircraft time control
CN116452124B (en) * 2023-06-15 2023-08-18 商飞软件有限公司 Real-time state list construction system suitable for civil aircraft time control

Also Published As

Publication number Publication date
CN113627168B (en) 2023-06-30

Similar Documents

Publication Publication Date Title
CN112015430B (en) JavaScript code translation method, device, computer equipment and storage medium
CN113627168B (en) Method, device, medium and equipment for checking component packaging conflict
US9690788B2 (en) File type recognition analysis method and system
US11416531B2 (en) Systems and methods for parsing log files using classification and a plurality of neural networks
CN111176996A (en) Test case generation method and device, computer equipment and storage medium
CN108664471B (en) Character recognition error correction method, device, equipment and computer readable storage medium
CN111133396B (en) Production facility monitoring device, production facility monitoring method, and recording medium
CN111881183B (en) Enterprise name matching method and device, storage medium and electronic equipment
CN109189372B (en) Development script generation method of insurance product and terminal equipment
CN116127105B (en) Data collection method and device for big data platform
US20100332216A1 (en) Method for translating computer generated log files
CN113657088A (en) Interface document analysis method and device, electronic equipment and storage medium
CN111625567A (en) Data model matching method, device, computer system and readable storage medium
CN113627173A (en) Manufacturer name identification method and device, electronic equipment and readable medium
CN117874118A (en) Feature data conversion method, device, electronic equipment and readable storage medium
CN117827923A (en) Query demand processing method and device, computer equipment and storage medium
CN117633639A (en) Method and device for automatically identifying difficulty level of examination questions
CN112579629A (en) Method for helping purchasers of electronic component enterprises to accurately find products
CN112579781A (en) Text classification method and device, electronic equipment and medium
CN116340172A (en) Data collection method and device based on test scene and test case detection method
CN114743012A (en) Text recognition method and device
CN113627892B (en) BOM data identification method and electronic equipment thereof
CN110807082A (en) Quality spot check item determination method, system, electronic device and readable storage medium
CN113807807A (en) Component parameter identification method and device, electronic equipment and readable medium
US20210318949A1 (en) Method for checking file data, computer device and readable storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant