WO2018220688A1 - Générateur de dictionnaire, procédé de génération de dictionnaire, et programme - Google Patents
Générateur de dictionnaire, procédé de génération de dictionnaire, et programme Download PDFInfo
- Publication number
- WO2018220688A1 WO2018220688A1 PCT/JP2017/019947 JP2017019947W WO2018220688A1 WO 2018220688 A1 WO2018220688 A1 WO 2018220688A1 JP 2017019947 W JP2017019947 W JP 2017019947W WO 2018220688 A1 WO2018220688 A1 WO 2018220688A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- group
- identification information
- unit
- dictionary
- name
- Prior art date
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/237—Lexical tools
Definitions
- the present invention relates to a dictionary generation device, a dictionary generation method, and a program.
- Patent Document 1 discloses a document processing apparatus that extracts a morpheme group having a dependency relationship from a document, classifies the extracted morpheme group according to its viewpoint, and classifies the document according to a classification result. It is disclosed.
- the dictionary generation device includes an extraction unit that extracts identification information and a noun from a document file, a combination of the identification information extracted by the extraction unit, and the extracted noun.
- a relationship evaluation unit that evaluates the correlation between the identification information and the noun based on the occurrence frequency, and a group generation unit that generates a group of identification information or a group of nouns based on the evaluation result by the relationship evaluation unit.
- the document file is a work-related document file
- the extraction unit extracts work object identification information and a work object name from the work-related document file
- the relationship evaluation unit includes: Based on the co-occurrence frequency of the identification information of the work object and the name of the work object, the correlation between the identification information of the work object and the name of the work object is evaluated, and the group generation unit A group of identification information of an object or a group of names of work objects is generated.
- the document file is a document file for a request or report for maintenance work or repair work
- the extraction unit extracts the replacement part identification information and the name of the replacement part from the document file.
- the relationship evaluation unit evaluates the correlation between the identification information of the replacement part and the name of the replacement part based on the co-occurrence frequency of the identification information of the replacement part and the name of the replacement part
- the group generation unit Generates a group of identification information of replacement parts or a group of names of replacement parts.
- a representative determination unit that determines a representative of identification information or a representative word of a part name
- a dictionary output unit that outputs a dictionary in which identification information included in the group, a representative word of the group determined by the representative word determination unit or a representative of the identification information, and a noun included in each group are associated with each other; .
- the apparatus further includes a group updating unit that evaluates the correlation among the subgroups within the group generated by the group generation unit and updates the group based on the increase / decrease in the evaluation result.
- a group updating unit that evaluates the correlation among the subgroups within the group generated by the group generation unit and updates the group based on the increase / decrease in the evaluation result.
- the apparatus further includes an analysis unit that generates statistical data regarding work using the dictionary output by the dictionary output unit.
- the dictionary generation method includes an extraction step for extracting identification information to be lexicized and a noun from a document file, the identification information extracted by the extraction step, an extracted noun, A relationship evaluation step for evaluating the correlation between the identification information and the noun based on the co-occurrence frequency, and a group generation step for generating a group of identification information or a group of nouns based on the evaluation result of the relationship evaluation step And have.
- the program according to the present invention also includes an extraction step for extracting identification information and nouns to be lexicized from a document file, a combination of the identification information extracted by the extraction step and the extracted nouns.
- a relationship evaluation step for evaluating the correlation between the identification information and the noun based on the occurrence frequency
- a group generation step for generating a group of identification information or a group of nouns based on the evaluation result of the relationship evaluation step. Let the computer run.
- FIG. 3 is a diagram illustrating a hardware configuration of a dictionary generation device 3.
- FIG. 3 is a diagram illustrating a functional configuration of a dictionary generation device 3.
- FIG. 3 is a diagram illustrating a more detailed functional configuration of a search unit 334.
- FIG. It is a figure which illustrates the information registered into a dictionary. It is a flowchart explaining the dictionary production
- FIG. It is a flowchart explaining a dictionary addition process (S20). It is a flowchart explaining a dictionary arrangement
- FIG. 1 is a diagram illustrating a hardware configuration of the dictionary generation device 3.
- the dictionary generation device 3 includes a CPU 300, a memory 302, an HDD 304, a network interface 306 (network IF 306), a display device 308, and an input device 310, and these configurations are connected via a bus 312. Connected to each other. That is, the dictionary generation device 3 is a computer device.
- the CPU 300 is, for example, a central processing unit.
- the memory 302 is, for example, a volatile memory and functions as a main storage device.
- the HDD 304 is, for example, a hard disk drive device, and stores a computer program (for example, the dictionary generation program 32 in FIG.
- the network IF 306 is an interface for performing wired or wireless communication.
- the display device 308 is, for example, a liquid crystal display.
- the input device 310 is, for example, a keyboard and a mouse.
- FIG. 2 is a diagram illustrating a functional configuration of the dictionary generation device 3.
- a dictionary generation program 32 is installed in the dictionary generation apparatus 3 of this example, and a work information storage unit 360 is configured.
- the dictionary generation program 32 is stored in, for example, a recording medium such as a CD-ROM, and is installed in the dictionary generation apparatus 3 via this recording medium.
- the dictionary generation program 32 includes an extraction unit 320, a relationship evaluation unit 322, a group generation unit 324, a representative determination unit 326, an important word determination unit 328, a group update unit 330, a dictionary output unit 332, a search unit 334, and an analysis unit 342.
- Part or all of the dictionary generation program 32 may be realized by hardware such as an ASIC, or may be realized by partially borrowing an OS (Operating System) function.
- OS Operating System
- the extraction unit 320 extracts identification information to be dictionaryd and nouns from the document file.
- the lexicalization target may be anything as long as identification information is given, but is a work target that is a work target, for example.
- a part that is a target of maintenance work of the device will be described as a specific example.
- the extraction unit 320 of this example extracts part identification information (hereinafter referred to as part ID) and a noun that is likely to be a part name from a maintenance work or repair work request document or report document file.
- the relationship evaluation unit 322 evaluates the correlation between the identification information and the noun based on the co-occurrence frequency of the identification information extracted by the extraction unit 320 and the extracted noun.
- the relationship evaluation unit 322 of this example counts the frequency (co-occurrence frequency) at which nouns that are likely to be component names and component IDs appear within a predetermined unit, and generates a co-occurrence frequency matrix.
- the default unit is a range of a document in which it can be determined that the component ID and the component name appear together, and includes, for example, a document file unit, an input field unit, a paragraph unit, and a sentence unit.
- the relationship evaluation unit 322 of this example calculates a correlation coefficient between the component name and the component ID based on the generated co-occurrence frequency matrix.
- the group generation unit 324 generates a group of identification information or a group of nouns based on the evaluation result by the relationship evaluation unit 322. For example, the group generation unit 324 clusters the component names based on the co-occurrence frequency of the component name and the component ID counted by the relationship evaluation unit 322. The group generation unit 324 of this example calculates the distance between the component names using the co-occurrence frequency of the component ID as a vector for each component name, and groups the component names having the calculated distances close to each other. Note that other clustering methods, distance measures, and the like may be used.
- the representative determining unit 326 determines a representative word that is a representative of identification information or a representative of a noun from identification information or a group of nouns generated by the group generation unit 324. For example, the representative determining unit 326 determines the representative of the identification information or the representative word based on the appearance frequency of the identification information or noun in the group. The representative determining unit 326 of this example uses, as a representative word, the most frequently used term (component name) in the group from the group of component names generated by the group generating unit 324.
- the important word determination unit 328 determines an important word representing a target name corresponding to this identification information from nouns (part names) associated with the same identification information. For example, the important word determination unit 328 selects an important word from a group of nouns associated with the same identification information based on their appearance frequencies. The important word determination unit 328 of this example sets the same part ID and the term (part name) having the highest co-occurrence frequency as the important words.
- the group update unit 330 evaluates the correlation among the subgroups in the group generated by the group generation unit 324, and updates the group based on the increase / decrease in the evaluation result.
- the group updating unit 330 in this example searches for subgroups in which the correlation coefficient is greater than or equal to the reference value among the subgroups in the group, and if a subgroup that is greater than or equal to the reference value is found, If a subgroup exceeding the reference value is not found, the subgroup is deleted from the dictionary.
- the dictionary output unit 332 Based on the group generated by the group generation unit 324 or the group updated by the group update unit 330, the dictionary output unit 332 includes the identification information included in each group and the representative determined by the representative determination unit 326.
- the dictionary that associates the nouns included in each group with each other is output to the display device 308 or the work information storage unit 360.
- the dictionary output unit 332 of the present example includes a dictionary (FIG. 4A) that associates a component ID belonging to each group with a representative word of the group, and a component name belonging to each group.
- the dictionary (FIG. 4B) associated with the representative word of the group is output to the work information storage unit 360.
- the search unit 334 searches the identification information or name using the dictionary output by the dictionary output unit 332. More specifically, as illustrated in FIG. 3, the search unit 334 includes a synonym search unit 336, an inventory search unit 338, and a component ID search unit 340.
- the synonym search unit 336 refers to the dictionary and extracts part names belonging to the same group for the input part names. As a result, name identification becomes possible, and document files can be analyzed.
- the inventory search unit 338 extracts the part name of the replacement part from the document file of the maintenance work request form, refers to the dictionary, and represents the representative word associated with the extracted part name. Based on the above, the part IDs belonging to the same group are extracted, and the parts inventory is confirmed.
- the component ID search unit 340 refers to the dictionary and extracts component IDs belonging to the same group based on the representative words associated with the input component name.
- the analysis unit 342 controls the synonym search unit 336 for a plurality of input work document files (request document file or report file) to collate the names in the document file with the representative words.
- Statistical data relating to work is generated based on the appearance frequency and the like.
- the analysis unit 342 inputs a document file of a report including the contents of maintenance work (part names of replacement parts, etc.) and the work man-hours required for the work, and the input document A graph illustrated in FIG. 11 is created by associating the part names of the file with representative words according to a dictionary and counting the work man-hours for each representative word.
- FIG. 5 is a flowchart for explaining dictionary generation processing (S10) by the dictionary generation device 3.
- S10 dictionary generation processing
- FIG. 5 a flowchart for explaining dictionary generation processing (S10) by the dictionary generation device 3.
- S ⁇ b> 100 The transition of the data file in this flowchart is illustrated in FIG.
- the extraction unit 320 of the dictionary generation device 3 extracts a part ID and a part name from the input document file.
- the component ID is information that can uniquely identify the component, such as a model number of the component or a component inventory management number.
- step 105 the dictionary generation program 32 repeats the process of S100 until the extraction process is completed from all the document files, and when the extraction process is completed, the process proceeds to the process of S110.
- step 110 the relationship evaluation unit 322, for the component ID and the component name extracted by the extraction unit 320, the appearance frequency of the component name (FIG. 15A) and the appearance frequency of the component ID (FIG. 15B). )) Is counted, and a part name and part ID co-occurrence frequency matrix (FIG. 15C) is generated from the number of times the part ID and part name appear simultaneously (co-occurrence frequency).
- weighting may be performed in counting up the number of appearances.
- the relationship evaluation unit 322 performs weighting so that when a plurality of sets of component IDs and component names appear simultaneously, the count number is smaller than when only one set of component IDs and component names appears.
- the important word determination unit 328 determines, from among the part names corresponding to the part ID, the part name having the maximum co-occurrence frequency as the important word of this part ID.
- the keyword determining unit 328 may calculate a correlation coefficient matrix from the co-occurrence frequency matrix of the component ID and the component name, and may determine the component name having the maximum correlation coefficient as the keyword.
- step 120 (S120) the group generation unit 324 groups the component names based on the similarity by clustering based on the co-occurrence frequency matrix of the component names and the component IDs generated by the relationship evaluation unit 322.
- the clustering of this example is based on the co-occurrence frequency of the part name and the part ID, but is not limited to this, and various clustering methods and distance measures can be adopted.
- step 125 the representative determining unit 325 determines, for each group generated by the group generating unit 324, the component name having the highest appearance frequency from the component names belonging to the group as the representative word of this group. To do.
- step 130 the dictionary output unit 332 collates the representative words determined by the representative word determination unit 325 for each group generated by the group generation unit 324.
- step 135 the dictionary output unit 332 stores the part name, part ID, representative word, and important word of each group after name identification as a dictionary in the work information storage unit 360.
- FIG. 6 is a flowchart illustrating the dictionary addition process (S20).
- S20 dictionary addition process
- FIG. 6 a case where an entry is added to the dictionary generated in the dictionary generation process (S10) will be described as a specific example.
- the transition of the data file in this flowchart is illustrated in FIG.
- the group update unit 330 reads the component name and component ID from the work information storage unit 360.
- the component name and component ID to be read may include the component name and component ID newly extracted from the document file in addition to the component name and component ID extracted when the dictionary is first created.
- step 205 the group updating unit 330 reads the dictionary from the work information storage unit 360, and performs name identification of the read component names according to the read dictionary.
- step 210 the group update unit 330 instructs the relationship evaluation unit 322 to generate a co-occurrence frequency matrix.
- the relationship evaluation unit 322 counts the number of times the component ID and the component name appear simultaneously (co-occurrence frequency) for the read component ID and component name, and the co-occurrence frequency matrix of the component name and the component ID. (FIG. 16A) is generated.
- step 215 the group update unit 330 instructs the relationship evaluation unit 322 to generate a correlation coefficient matrix.
- the relationship evaluation unit 322 calculates a correlation coefficient between the component ID and the component name based on the counted co-occurrence frequencies, and generates a correlation coefficient matrix (FIG. 16B).
- step 220 the group update unit 330 instructs the group generation unit 324 to perform regrouping.
- the group generation unit 324 groups the component names again based on the component name and component ID co-occurrence frequency matrix generated by the relationship evaluation unit 322.
- step 225 for each group generated by the group generation unit 324, the representative determination unit 325 re-assigns the part name having the highest appearance frequency from the part names belonging to the group to the representative word of this group. decide.
- step 230 the group updating unit 330 acquires the top N component IDs and correlation coefficients having the highest correlation coefficient for each representative word of each group, and sets this value as the baseline.
- step 235 the group updating unit 330 synthesizes a frequency matrix of part names for each group, and searches for a combination whose correlation coefficient is higher than the set baseline.
- step 240 the dictionary generation program 32 proceeds to the process of S245 when a combination whose correlation coefficient is higher than the baseline is found, and proceeds to the process of S250 when it is not found.
- step 245 the group updating unit 330 additionally registers a combination having a higher correlation coefficient than the baseline in the dictionary.
- the dictionary generation program 32 repeats the processes of S205 to S245 a predetermined number of times, and after repeating the predetermined number of times, ends the dictionary addition process (S20).
- FIG. 7 is a flowchart for explaining the dictionary organizing process (S30).
- the dictionary generated in the dictionary generation process (S10) is organized will be described as a specific example.
- the transition of the data file in this flowchart is illustrated in FIG.
- the group update unit 330 reads the component name and component ID from the work information storage unit 360.
- the component name and component ID to be read may include the component name and component ID newly extracted from the document file in addition to the component name and component ID extracted when the dictionary is first created.
- step 305 the group updating unit 330 instructs the relationship evaluation unit 322 to generate a co-occurrence frequency matrix.
- the relationship evaluation unit 322 counts the number of times the component ID and the component name appear simultaneously (co-occurrence frequency) for the read component ID and component name, and the co-occurrence frequency matrix of the component name and the component ID. Is generated.
- step 310 the group update unit 330 instructs the relationship evaluation unit 322 to generate a correlation coefficient matrix.
- the relationship evaluation unit 322 calculates a correlation coefficient between the component ID and the component name based on the counted co-occurrence frequency, and generates a correlation coefficient matrix.
- step 315 the group update unit 330 acquires the top N component IDs and correlation coefficients having the highest correlation coefficient for each representative word of each group, and sets this value as the baseline.
- step 320 the group updating unit 330 synthesizes a frequency matrix of part names for each group, and searches for a combination whose correlation coefficient is higher than the set baseline.
- step 325 the dictionary generation program 32 proceeds to the process of S330 when a combination whose correlation coefficient is higher than the baseline is found, and proceeds to the process of S335 when it is not found.
- step 330 the group updating unit 330 updates the dictionary with a combination having a higher correlation coefficient than the baseline.
- step 335 the group updating unit 330 deletes the combination from the dictionary when no combination having a correlation coefficient higher than the baseline is found.
- FIG. 8 is a diagram for more specifically explaining a part name extraction method by the extraction unit 320.
- the extraction unit 320 performs morphological analysis, dependency analysis, case analysis, and the like, and obtains an analysis result. Subsequently, the extraction unit 320 obtains part name candidates by matching with a dictionary, and further narrows down to part names having a dependency relationship with “exchange” based on the segment dependency and the morpheme dependency. . At this time, a case including a meaning indicating negative such as “not exchanged” or a case including a meaning indicating a future schedule such as “scheduled replacement” is excluded.
- the extraction unit 320 performs morphological analysis on the input document file, matches it with the dictionary, labels the part names in the morpheme string as illustrated in FIG. 8A, and outputs them as learning data.
- the extraction unit 320 performs sequence labeling using the learning data. Subsequently, as illustrated in FIG. 8B, the extraction unit 320 gives a document file that is not used for learning to the component name extraction model, predicts the morpheme label, and selects a component name that does not exist in the dictionary. Extract as candidates and register in dictionary. Prior to registration, a person may visually confirm, or filtering may be performed by setting a threshold value for the appearance frequency or the like.
- the dictionary generation device 3 extracts the part name and part ID of the replacement part that is the object of maintenance work from the maintenance work request form or report document file, and shares the part name and part ID. Based on the frequency of occurrence, a dictionary of part names can be created efficiently. Furthermore, the dictionary generation device 3 performs name identification on the document file of the request form or report using the created dictionary, and analyzes the document file. Thereby, statistical evaluation regarding the maintenance work becomes possible.
- the component names are grouped and the representative word and the important word are determined for each group.
- the component ID may be grouped to determine the representative of the component ID and the important ID.
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Health & Medical Sciences (AREA)
- Artificial Intelligence (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Computational Linguistics (AREA)
- General Health & Medical Sciences (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
L'invention concerne un générateur de dictionnaire servant à la génération efficiente d'un dictionnaire des noms compris dans un fichier de document. Le générateur de dictionnaire comprend: une unité d'extraction servant à extraire, de l'intérieur d'un fichier de document, un nom et des informations d'identification à mettre sous la forme d'un dictionnaire; une unité d'évaluation de relations servant à évaluer la corrélation entre les informations d'identification et le nom sur la base de la fréquence d'occurrence conjointe des informations d'identification extraites par l'unité d'extraction et du nom extrait; et une unité de génération de groupes servant à générer un groupe d'informations d'identification ou un groupe de noms sur la base du résultat de l'évaluation par l'unité d'évaluation.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PCT/JP2017/019947 WO2018220688A1 (fr) | 2017-05-29 | 2017-05-29 | Générateur de dictionnaire, procédé de génération de dictionnaire, et programme |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PCT/JP2017/019947 WO2018220688A1 (fr) | 2017-05-29 | 2017-05-29 | Générateur de dictionnaire, procédé de génération de dictionnaire, et programme |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2018220688A1 true WO2018220688A1 (fr) | 2018-12-06 |
Family
ID=64455251
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/JP2017/019947 WO2018220688A1 (fr) | 2017-05-29 | 2017-05-29 | Générateur de dictionnaire, procédé de génération de dictionnaire, et programme |
Country Status (1)
Country | Link |
---|---|
WO (1) | WO2018220688A1 (fr) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110110969A (zh) * | 2019-04-10 | 2019-08-09 | 中国科学院国家空间科学中心 | 一种空间环境预报产品基本质量自动检验评估方法及系统 |
JP2021157257A (ja) * | 2020-03-25 | 2021-10-07 | 株式会社富士通エフサス | 生成装置、入力補助装置、生成方法、入力補助方法、生成プログラムおよび入力補助プログラム |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2006195756A (ja) * | 2005-01-13 | 2006-07-27 | Just Syst Corp | 情報検索装置、情報検索サイトの改善方法提示装置 |
JP2007140861A (ja) * | 2005-11-17 | 2007-06-07 | Konica Minolta Medical & Graphic Inc | 情報処理システム、情報処理方法、およびプログラム |
WO2014002776A1 (fr) * | 2012-06-25 | 2014-01-03 | 日本電気株式会社 | Système d'extraction de synonymes, procédé et support d'enregistrement |
-
2017
- 2017-05-29 WO PCT/JP2017/019947 patent/WO2018220688A1/fr active Application Filing
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2006195756A (ja) * | 2005-01-13 | 2006-07-27 | Just Syst Corp | 情報検索装置、情報検索サイトの改善方法提示装置 |
JP2007140861A (ja) * | 2005-11-17 | 2007-06-07 | Konica Minolta Medical & Graphic Inc | 情報処理システム、情報処理方法、およびプログラム |
WO2014002776A1 (fr) * | 2012-06-25 | 2014-01-03 | 日本電気株式会社 | Système d'extraction de synonymes, procédé et support d'enregistrement |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110110969A (zh) * | 2019-04-10 | 2019-08-09 | 中国科学院国家空间科学中心 | 一种空间环境预报产品基本质量自动检验评估方法及系统 |
JP2021157257A (ja) * | 2020-03-25 | 2021-10-07 | 株式会社富士通エフサス | 生成装置、入力補助装置、生成方法、入力補助方法、生成プログラムおよび入力補助プログラム |
JP7478003B2 (ja) | 2020-03-25 | 2024-05-02 | 株式会社富士通エフサス | 生成装置、生成方法および生成プログラム |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
JP7282940B2 (ja) | 電子記録の文脈検索のためのシステム及び方法 | |
US10489439B2 (en) | System and method for entity extraction from semi-structured text documents | |
US10558754B2 (en) | Method and system for automating training of named entity recognition in natural language processing | |
US9424524B2 (en) | Extracting facts from unstructured text | |
KR20160124742A (ko) | 비정형 텍스트내의 특징들의 중의성을 해소하는 방법 | |
JP6767042B2 (ja) | シナリオパッセージ分類器、シナリオ分類器、及びそのためのコンピュータプログラム | |
GB2555207A (en) | System and method for identifying passages in electronic documents | |
Deselaers et al. | Automatic medical image annotation in ImageCLEF 2007: Overview, results, and discussion | |
Limsettho et al. | Automatic unsupervised bug report categorization | |
KR20200071877A (ko) | 자율 증강형 반복 학습을 이용한 정보 추출 방법 및 시스템 | |
JP5834795B2 (ja) | 情報処理装置及びプログラム | |
CN102789473A (zh) | 标识符检索方法和设备 | |
CN113971398A (zh) | 一种面向网络安全领域实体快速识别的词典构造方法 | |
JP6522446B2 (ja) | ラベル付与装置、方法およびプログラム | |
WO2018220688A1 (fr) | Générateur de dictionnaire, procédé de génération de dictionnaire, et programme | |
US11580499B2 (en) | Method, system and computer-readable medium for information retrieval | |
KR20170044408A (ko) | 프로젝트의 추천 시스템 및 방법 | |
EP4258107A1 (fr) | Procédé et système de découverte automatisée d'actifs d'intelligence artificielle et d'apprentissage machine dans une entreprise | |
AU2019290658B2 (en) | Systems and methods for identifying and linking events in structured proceedings | |
RU2546555C1 (ru) | Способ автоматической классификации формализованных документов в системе электронного документооборота | |
Taghva et al. | Acronym expansion via hidden Markov models | |
Afzal et al. | Towards semantic annotation of bioinformatics services: building a controlled vocabulary | |
Álvarez-Rodríguez et al. | Towards a stepwise method for unifying and reconciling corporate names in public contracts metadata: The CORFU technique | |
KR20210023453A (ko) | 리뷰 광고 매칭 장치 및 방법 | |
Ďuračík et al. | Using concepts of text based plagiarism detection in source code plagiarism analysis |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 17912201 Country of ref document: EP Kind code of ref document: A1 |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 17912201 Country of ref document: EP Kind code of ref document: A1 |
|
NENP | Non-entry into the national phase |
Ref country code: JP |