FR3059797B1 - Automated method of establishing thesaurus of named entities that can include a plurality of hierarchical levels, and the use of such thesaurus - Google Patents

Automated method of establishing thesaurus of named entities that can include a plurality of hierarchical levels, and the use of such thesaurus Download PDF

Info

Publication number
FR3059797B1
FR3059797B1 FR1662034A FR1662034A FR3059797B1 FR 3059797 B1 FR3059797 B1 FR 3059797B1 FR 1662034 A FR1662034 A FR 1662034A FR 1662034 A FR1662034 A FR 1662034A FR 3059797 B1 FR3059797 B1 FR 3059797B1
Authority
FR
France
Prior art keywords
thesaurus
list
establishing
include
plurality
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
FR1662034A
Other languages
French (fr)
Other versions
FR3059797A1 (en
Inventor
Christophe Lecante
Florian Carichon
Romain Billet
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Tecknowmetrix Sas
Original Assignee
Tecknowmetrix Sas
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Tecknowmetrix Sas filed Critical Tecknowmetrix Sas
Priority to FR1662034A priority Critical patent/FR3059797B1/en
Publication of FR3059797A1 publication Critical patent/FR3059797A1/en
Application granted granted Critical
Publication of FR3059797B1 publication Critical patent/FR3059797B1/en
Application status is Active legal-status Critical
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/20Handling natural language data
    • G06F17/27Automatic analysis, e.g. parsing
    • G06F17/2765Recognition
    • G06F17/2775Phrasal analysis, e.g. finite state techniques, chunking
    • G06F17/278Named entity recognition
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/21Design, administration or maintenance of databases
    • G06F16/215Improving data quality; Data cleansing, e.g. de-duplication, removing invalid entries or correcting typographical errors

Abstract

A method for automatically generating text data from a named entity thesaurus, wherein a named entity may include a plurality of entities of different hierarchical levels, including: an extraction step, from text data, of a character string designating a named entity; a string processing step for establishing a list comprising at least one entity segment; a step of eliminating the list of entity segments already present in the thesaurus, to form a list of new segments; and a step of updating the thesaurus for each new segment of the list of new segments. The invention also relates to a method for automatically indexing and identifying textual data, using the thesaurus.
FR1662034A 2016-12-07 2016-12-07 Automated method of establishing thesaurus of named entities that can include a plurality of hierarchical levels, and the use of such thesaurus Active FR3059797B1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
FR1662034A FR3059797B1 (en) 2016-12-07 2016-12-07 Automated method of establishing thesaurus of named entities that can include a plurality of hierarchical levels, and the use of such thesaurus

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
FR1662034A FR3059797B1 (en) 2016-12-07 2016-12-07 Automated method of establishing thesaurus of named entities that can include a plurality of hierarchical levels, and the use of such thesaurus

Publications (2)

Publication Number Publication Date
FR3059797A1 FR3059797A1 (en) 2018-06-08
FR3059797B1 true FR3059797B1 (en) 2019-10-18

Family

ID=58401713

Family Applications (1)

Application Number Title Priority Date Filing Date
FR1662034A Active FR3059797B1 (en) 2016-12-07 2016-12-07 Automated method of establishing thesaurus of named entities that can include a plurality of hierarchical levels, and the use of such thesaurus

Country Status (1)

Country Link
FR (1) FR3059797B1 (en)

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7526486B2 (en) * 2006-05-22 2009-04-28 Initiate Systems, Inc. Method and system for indexing information about entities with respect to hierarchies
US8112402B2 (en) * 2007-02-26 2012-02-07 Microsoft Corporation Automatic disambiguation based on a reference resource

Also Published As

Publication number Publication date
FR3059797A1 (en) 2018-06-08

Similar Documents

Publication Publication Date Title
BR112015000622A2 (en) method and device for hiding privacy information
GB2551919A (en) Unlearning techniques for adaptive language models in text entry
GB201618161D0 (en) Improved method, system and software for searching, identifying, retrieving and presenting electronic documents
BR112015015904A2 (en) natural language rendering of structured search queries
JP2017519300A5 (en)
RU2014101126A (en) Automatic extraction of nameed essentials from text
BR112017002283A2 (en) Method and apparatus for automatically generating a dictionary of events on an IoT network
WO2015191731A8 (en) Systems and methods for software analytics
BR112016024522A2 (en) non-transient computer readable storage medium, and method
BR112016029514A2 (en) three-dimensional depth map (3d) of structured light based on content filtering
BR112018013524A2 (en) memory recovery method and apparatus
BR112013030366A2 (en) computer program method, apparatus and product
EA201891827A1 (en) Registry and method of automated administration of smart contracts using blocks
JP2011192145A5 (en)
BR112017005605A2 (en) automated verification of a software system
BR112016024779A2 (en) service delivery management system and method
GB2525719A8 (en) Method and system for providing a vulnerability management and verification service
CL2017001872A1 (en) Update classifier models understanding of language for a personal digital assistant based on massive outsourcing
CL2015001952A1 (en) Procedure character recognition, comprising reading an image of character, image processing, segmentation character, extraction edge, feature extraction for each point edge of each character with the distances from the edge points of support, processing of features, calculation template matching.
BR112018003372A2 (en) method for providing staged shaving recommendations, computer program executable on a processing unit, personal care system, and shaving appliance
Kuang et al. Classification on ADHD with deep learning
MX2016002294A (en) Text input method and device.
GB2558826A8 (en) Mitigation of anti-sandbox malware techniques
WO2016177337A8 (en) System and method for image segmentation
AR102682A1 (en) Systems and methods for optimizing fracturing operations training

Legal Events

Date Code Title Description
PLFP Fee payment

Year of fee payment: 2

PLSC Search report ready

Effective date: 20180608

PLFP Fee payment

Year of fee payment: 3