EP1217533A2 - Verfahren und Rechnersystem zur Wortartauszeichnung unvollständige Sätze - Google Patents
Verfahren und Rechnersystem zur Wortartauszeichnung unvollständige Sätze Download PDFInfo
- Publication number
- EP1217533A2 EP1217533A2 EP01129760A EP01129760A EP1217533A2 EP 1217533 A2 EP1217533 A2 EP 1217533A2 EP 01129760 A EP01129760 A EP 01129760A EP 01129760 A EP01129760 A EP 01129760A EP 1217533 A2 EP1217533 A2 EP 1217533A2
- Authority
- EP
- European Patent Office
- Prior art keywords
- phrase
- identifier
- pos
- context
- context information
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/205—Parsing
- G06F40/211—Syntactic parsing, e.g. based on context-free grammar [CFG] or unification grammars
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/279—Recognition of textual entities
- G06F40/284—Lexical analysis, e.g. tokenisation or collocates
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10—TECHNICAL SUBJECTS COVERED BY FORMER USPC
- Y10S—TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10S707/00—Data processing: database and file management or data structures
- Y10S707/99931—Database or file accessing
- Y10S707/99933—Query processing, i.e. searching
- Y10S707/99934—Query formulation, input preparation, or translation
Definitions
- the invention generally relates to a method and a computer system for disambiguating a phrase in a linguistic system, and in particular to part-of-speech tagging.
- part-of-speech POS
- the function of a part-of-speech tagger is to associate each word or corresponding sub-unit in a text with an abstract morpho-syntactic category being represented by a tag.
- POS-tagged text is used in a variety of text manipulation processes, for example in a parser or syntactical analyzer allowing the recognition, extraction and normalization of semantic structures in the text. These structures may be used for text mining, indexing, understanding, and dialog systems.
- tags are for briefness also denoted as tags or POS-tags.
- the abstraction to general categories in a POS-tagger allows the creation of effective multilinguistic parsers, since text analysis rules can be described using a limited number of categories rather than using specific rules for each of the languages.
- POS-tagger performs three functions:
- POS-taggers can attain correct assignment of POS-tags with a success rate of more than 95% accuracy, but these tests are usually performed on text comprising complete sentences.
- documents often contain text composed of incomplete sentences: e.g. titles, lists of items, subheadings. Such phrases are often incorrectly tagged by POS-taggers.
- the present invention has been made in consideration of the above situation, and has its primary object to enhance the quality of a POS-tagger.
- Another object of the invention is to provide a method and a computer system for supporting a POS-tagger in disambiguating a phrase, by introducing grammatical constraints.
- a further object of the present invention is to provide method and a computer system for POS-tagging of a phrase based on the phrase supplemented with context information.
- the present invention provides in a first aspect a method for assigning at least one part-of-speech tag to a phrase.
- the method comprises the steps of obtaining an identifier for the phrase, the identifier being associated with context information; supplementing the phrase with the context information; and assigning at least one POS-tag to the phrase based on the supplemented phrase.
- a method for grammatically disambiguating a phrase comprises the steps of getting the phrase and getting an identifier for the phrase, the identifier being associated with artificial information; supplementing the phrase with the artificial information; and grammatically disambiguating the phrase based on the supplemented phrase.
- a computer system for part-of-speech tagging of a phrase comprising identifier input means for training an identifier associated to context information, a context storage comprising a plurality of context information items, an identifier storage connected to the context storage and comprising a plurality of identifiers each of which being associated with at least one context information item of the plurality of context information items.
- the computer system further comprises a context supplementer, connected to the identifier input and the context storage, for supplementing the phrase with the associated context information of the obtained identifier; and a POS-tagger, connected to the context supplementer, for identifying the part of speech of each part of the phrase based on the supplemented phrase.
- structural information of the phrase in a plurality of phrases or textual information is used to define the identifier for the phrase.
- the identifier being defined in such a way is applicable to a plurality of phrases at once, e.g. to instruction lists, groups of database contents or similar groups of phrases.
- the method of the present invention is used for a headword extracting application or for deriving a formal structure of the phrase for an Automatic Term Encoding process.
- the first step in the method from Start 10 to End 14 is the step 11 of obtaining an identifier, then the phrase 100 is associated 150 to the identifier.
- the identifier is associated 210 to context information which in step 12 is supplemented to the phrase 100.
- the context information comprises at least pre-context 201 or post-context 202 information.
- the supplemented phrase 120 is tagged by the POS tagger in step 13.
- the part corresponding to the original phrase 100 is retrieved in step 110.
- a phrase may be defined as comprising at least one part of a natural or artificial language.
- the method for POS-tagging may be continued as indicated by the dotted line connected to dotted connection point A.
- Such optional steps are discussed below with reference to Fig. 3.
- the identifier is a main grammatical category of the phrase.
- the identifier is defined by a structural property of the phrase (e.g. headline, instruction list).
- POS-tagger In a POS-tagger supplementing a context, at least two different identifiers or grammatical categories are defined, for example: VerbPhrase or NounPhrase for phrases which as a whole grammatically represent a noun or a verb. Corresponding to the method of the present invention a verb/noun phrase will be supplemented by Verb/NounPhrase context information for the POS-tagging or disambiguating of the phrase.
- the POS-tagger uses the following VerbPhrase and NounPhrase context information:
- the context information may comprise textual information, POS-tags or information adapted for the POS-tagger.
- the identifier for the category VerbPhrase in step 11 is obtained, the identifier being associated to context information as indicated above, and the corresponding pre-context 201 and post-context information 202 in step 12 is supplemented to the phrase 100: "The technicians close the door to someone".
- the POS-tagger uses the tags +VERB for verbs, +NOUN_SG for singular nouns, +NOUN_PL for plural nouns, +ART for articles, +PREP for preposition, +PRON for pronouns, +SENT for end of sentence marker for the step 13 of assigning the at least one POS-tag to the phrase 120: "the+ART technician+NOUN_PL close+VERB the+ART door+NOUN_SG to+PREP someone+PRON .+SENT".
- step 110 The supplemented context information is removed from the phrase when the POS-tagging process is finished.
- the result of step 110 is: "close+VERB the+ART door+NOUN_SG”.
- the step 11 of obtaining the identifier may be implemented in various manners for example the phrase "close the door" could be part of an instruction list in a document thereby being associated to the identifier for the grammatical category VerbPhrase. Further the phrase could be an input by an user, the identifier being automatically obtained on evaluation of an interaction history with the user or even manually obtained by input of the user.
- a second aspect of the invention is described in the following: a method for use in a computer system for grammatically disambiguating a phrase comprises the steps of getting the phrase; getting an identifier for the phrase, the identifier being associated to artificial information; supplementing the phrase with the artificial information and grammatically disambiguating the phrase based on the supplemented phrase.
- this second method is not limited to POS-taggers and can be seen as a more general version of the first method of the invention. Therefore all parts of the detailed description of the present invention above and following below are applicable to the second method as well although they are discussed with reference to the first method of the present invention only.
- Fig.1 improves prior art POS-tagging processes.
- Fig. 6 illustrates the steps in a common POS-tagger from start 60 to end 66.
- step 61 of getting a phrase 100 it is tokenised in step 62 into Token1 to Token3 101 to 103.
- Potential tags Tag11 to Tag32 111-132 are provided in step 63 by evaluating each token 101-103 based on lexical information.
- the step 63 of providing potential tags 111-132 may comprise a morphological analysis of the tokens 101-103. For example, for identifying a word "swam" as a simple past tense of the verb "swim".
- step 64 by disambiguating the tags 111-132 a single tag 113, 121, 132 is assigned to each token 101, 102, 103.
- the disambiguated tags 113, 121 and 132 are assembled to the tokens 101, 102, 103 of the phrase 100 in step 65 resulting in the tagged phrase 190.
- POS-taggers for example use Finite State Transducers (FSTs) or Hidden Markov Models (HMM) in the POS-tagging process.
- FSTs Finite State Transducers
- HMM Hidden Markov Models
- the method of the present invention is applicable to any prior art POS-tagger.
- the steps 11 and 12 of obtaining the identifier and supplementing the phrase can be performed with the step 61 of getting the phrase, wherein the step 13 of assigning the text summarizes the steps 63 and 64 of providing potential tags and disambiguating tags.
- the steps of the method as shown in Fig. 1 may be inserted in the step 64 of disambiguating tags, for example in case more than one potential tag is provided in step 63 for one token of the phrase.
- Fig. 2 illustrates in more detail the step 11 of obtaining the identifier for a preferred embodiment of the present invention from Start 20 to End 26.
- the phrase 100 and the associated 150 identifier in step 21 are obtained, and the identifier is mapped in step 22 to a plurality of potential categories 160 for the phrase.
- the mapping actually is a step of pre-selecting categories.
- the plurality of categories 160 are main grammatical categories of the phrase.
- the plurality of categories 160 is provided in step 23 for a selection which can be an external selection 24. In case no external selection 24 for the most probable category is made it is selected in step 25 as default.
- the phrase now is associated 161 to the most probable category being associated to the context information 211.
- the at least one POS-tag assigned to the phrase is selected from potential POS-tags for the phrase without context and the most probable category for the phrase is selected by evaluating the potential POS-tags. In fact such an evaluation eliminates the need for the further disambiguation of the POS-tags.
- Fig. 3 illustrates ways of using the method of Fig. 1 for optional applications, starting from connection point 15 to end 34.
- a first optional step 31 the tagged phrase or the phrase tags are stored or outputted.
- the optional step 32 of extracting a headword out of the phrase based on the phrase with the at least one assigned POS-tag is another application using the method of the present invention.
- a formal structure for the phrase is derived, that covers variations of the original phrase. The steps 32 and 33 are discussed in more detail in the following.
- the phrase "close the door” could be a part of a traveling dictionary including short phrases for every-day use, which has to be translated into different languages.
- the lexicographers specify this phrase as the term they want to encode and provide the general grammatical category for the phrase.
- the latter may also be derived by a structural property of the phrase, e.g. in case the phrase is part of an instruction list.
- the grammatical category obtained in this example is VerbPhrase.
- the tagged phrase "close +VERB the +ART door +NOUN_SG” is used to generate a regular expression capturing variations of the phrase.
- Syntactic categories resulting from the tagging process are mapped to more general grammatical tags.
- the POS-tag +VERB resulting from disambiguating and identifying the affected verb is mapped to the more generic qualifier V, which covers all types of verbs.
- the POS-tags +NOUN_SG (for noun, proper noun, or abbreviation), are replaced by the global qualifier N to which all noun tags are mapped.
- These generic tags generalize the initial expression.
- the mapping rules can also insert additional information: for example, a rule can specify that adjectives can be inserted between two nouns or that several adverbs can be added after a verb.
- the rules applied by the method in the step of deriving a formal structure 33 are language and tagger dependent. The phrase finally leads to the formal structure: "close V: ADV* the D: door N:”.
- a further application of this invention involves information retrieval, taking advantage of the methods described above by using the result of the Automatic Term Encoding.
- an application can determine all the different variations of a multiword expression thereby catching all the terms matching the regular expression. For example, for the phrase: "dense matrix" we will get the following results from the different steps of the Automatic Term Encoding process:
- a specific automatically applied grammar rule has added the possibility of having zero or more adjectives (ADJ for adjective) before a noun.
- the step 32 in Fig. 3 of extracting a headword out of the phrase based on the phrase with at least one assigned POS-tag is the next application using the method of the present invention.
- the phrase "alarm sensor switch” can be identified as a NounPhrase by the obtained corresponding identifier. Consequently the phrase is supplemented to the sentence "the alarm sensor switch who works well”. The disambiguation of the supplemented phrase leads to the result: "the alarm +NOUN_SG sensor +NOUN_SG switch +NOUN_SG who works well”.
- a headword in the phrase is identified to be "switch".
- Fig. 4 the functional units involved in POS-tagging, headword extracting or formal structuring processes are illustrated.
- the context supplementer 44 is connected to identifier input means 43, a POS-tagger 45 and a context storage 42, being connected to the identifier storage 41.
- the context supplementer 44 obtains an identifier for a phrase via the identifier input 43.
- the phrase may be obtained from a data storage 49 or a phrase input 48, being connected to the context supplementer 44.
- the context storage 42 comprises a plurality of context information items for being supplemented to a phrase.
- the identifier storage 41 comprises a plurality of identifiers, each of which being associated to at least one context information item of the context storage 42.
- the context supplementer 44 selects a context information item according to the obtained identifier from the context storage 42.
- the phrase is supplemented with the selected context information, both together being the input for the POS-tagger 45.
- the POS-tagger performs the POS-tagging process leading to the tagged phrase or the phrase tags. The result can be displayed or outputted at the output 83, or even stored to the data storage 49.
- the computer system further comprises a category storage 47 comprising a plurality of categories, each identifier being associated with at least one category of the category storage 47 and each category being associated to at least one context information item in the context storage 42.
- a category is obtained via category input 82 the context information that has to be supplemented to the phrase can be selected directly.
- An obtained identifier may be mapped to the category and consequently to the context information.
- a category evaluator 46 performs the pre-selection of probable categories e.g. main grammatical categories for the phrase according to the identifier and selects a most probable category from the pre-selected categories. The selection may be performed by external selection via the selection means 81 or according to selection rules stored in the data storage 49.
- the most probable category is selected based on potential POS-tags for the phrase, which are provided by the data storage together with the phrase.
- the context information may comprise at least pre-context or post-context information, each of which may be represented by at least one POS-tag or textual information.
- the POS-tagger 45 may be connected to a headword extractor 84 for performing the headword extraction process based on the tagged phrase, or a formalizer 85 for deriving a formal structure for the phrase, that covers variations of the original phrase.
- the formalizer 85 may be connected to a morphological generator 86 and the data storage 49.
- the data storage 49 may function as an input or output data storage for the phrase or the tagged phrase, and further may comprise rules for POS-tagging, formalizing or headword extraction processes.
- Fig. 5 illustrates a computer system with a CPU 50, a keyboard 51, a display 52, a pointing device 53, a wired/wireless interface 54, audio input means 55, audio output means 56, a secondary storage 57, printer 58 and a primary storage 59.
- the primary storage 59 comprises a computer program comprising processor-executable instructions implementing: a context supplementer 44 for supplementing the context information to the phrase and a POS-tagger 45 for assigning the at least one POS-tag to the phrase.
- the primary storage 59 further includes a context storage 42 comprising a plurality of context information items and an identifier storage 41 comprising a plurality of identifiers, each of which is associated with at least one context information item of the plurality of context information items.
- the CPU 50 executes the processor- executable instructions stored in the primary storage 59, thereby performing the implemented methods of the present invention.
- the keyboard 51 may be used as identifier input 43 to obtain an identifier for a phrase.
- the identifier is one of the plurality of identifiers of the identifier storage 41 and therefore is associated with a context information item of the plurality of context information items.
- the phrase is supplemented with the context information item by the context supplementer 44.
- the supplemented phrase being input for the POS-tagger 45 is evaluated for assigning at least one POS-tag to the phrase. Any rules used in the POS-tagging process are stored as a part of the POS-tagger 45.
- the keyboard 51 and the pointing device 53 can be used as identifier input 43, category input 82 or phrase input 48.
- the display 52 or the printer 58 can serve as result output 83, and in combination with the keyboard 51 or the pointing device 53 may be used as selection means 81.
- the audio input means 55 can be used as one of the input means or the selection means 81, whereas the audio output means 56 can be used as the result output 83.
- the secondary storage 57 serves as part of the data storage 49 and may be a hard disk, CD, DVD or the like. The secondary storage typically is used for storing language dependent data, mainly because it is exchangeable.
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US738987 | 2000-12-19 | ||
US09/738,987 US6910004B2 (en) | 2000-12-19 | 2000-12-19 | Method and computer system for part-of-speech tagging of incomplete sentences |
Publications (2)
Publication Number | Publication Date |
---|---|
EP1217533A2 true EP1217533A2 (de) | 2002-06-26 |
EP1217533A3 EP1217533A3 (de) | 2005-09-21 |
Family
ID=24970328
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
EP01129760A Ceased EP1217533A3 (de) | 2000-12-19 | 2001-12-13 | Verfahren und Rechnersystem zur Wortartauszeichnung unvollständige Sätze |
Country Status (3)
Country | Link |
---|---|
US (1) | US6910004B2 (de) |
EP (1) | EP1217533A3 (de) |
JP (1) | JP2002215617A (de) |
Cited By (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
GB2390704A (en) * | 2002-07-09 | 2004-01-14 | Canon Kk | Automatic summary generation and display |
US7263530B2 (en) | 2003-03-12 | 2007-08-28 | Canon Kabushiki Kaisha | Apparatus for and method of summarising text |
US9563192B2 (en) | 2014-01-02 | 2017-02-07 | Rockwell Automation Technologies, Inc. | Software workstation and method for employing appended metadata in industrial automation software |
EP3264281A1 (de) * | 2016-07-01 | 2018-01-03 | Wipro Limited | Verfahren und system zur automatischen identifizierung von verkehrsverstössen in einem oder mehreren testfällen |
US10878174B1 (en) | 2020-06-24 | 2020-12-29 | Starmind Ag | Advanced text tagging using key phrase extraction and key phrase generation |
US11379763B1 (en) | 2021-08-10 | 2022-07-05 | Starmind Ag | Ontology-based technology platform for mapping and filtering skills, job titles, and expertise topics |
Families Citing this family (148)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8645137B2 (en) | 2000-03-16 | 2014-02-04 | Apple Inc. | Fast, language-independent method for user authentication by voice |
US6859771B2 (en) * | 2001-04-23 | 2005-02-22 | Microsoft Corporation | System and method for identifying base noun phrases |
GB2407657B (en) * | 2003-10-30 | 2006-08-23 | Vox Generation Ltd | Automated grammar generator (AGG) |
US7865354B2 (en) * | 2003-12-05 | 2011-01-04 | International Business Machines Corporation | Extracting and grouping opinions from text documents |
KR100669241B1 (ko) * | 2004-12-15 | 2007-01-15 | 한국전자통신연구원 | 화행 정보를 이용한 대화체 음성합성 시스템 및 방법 |
US8677377B2 (en) | 2005-09-08 | 2014-03-18 | Apple Inc. | Method and apparatus for building an intelligent automated assistant |
JP3963394B2 (ja) | 2005-12-28 | 2007-08-22 | インターナショナル・ビジネス・マシーンズ・コーポレーション | ソフトウェア障害情報をレポートするための装置 |
KR100764174B1 (ko) * | 2006-03-03 | 2007-10-08 | 삼성전자주식회사 | 음성 대화 서비스 장치 및 방법 |
US9318108B2 (en) | 2010-01-18 | 2016-04-19 | Apple Inc. | Intelligent automated assistant |
WO2008061002A2 (en) * | 2006-11-14 | 2008-05-22 | Networked Insights, Inc. | Method and system for automatically identifying users to participate in an electronic conversation |
US8977255B2 (en) | 2007-04-03 | 2015-03-10 | Apple Inc. | Method and system for operating a multi-function portable electronic device using voice-activation |
US20080249762A1 (en) * | 2007-04-05 | 2008-10-09 | Microsoft Corporation | Categorization of documents using part-of-speech smoothing |
US9053089B2 (en) * | 2007-10-02 | 2015-06-09 | Apple Inc. | Part-of-speech tagging using latent analogy |
US8620662B2 (en) * | 2007-11-20 | 2013-12-31 | Apple Inc. | Context-aware unit selection |
US9330720B2 (en) | 2008-01-03 | 2016-05-03 | Apple Inc. | Methods and apparatus for altering audio output signals |
US7925743B2 (en) * | 2008-02-29 | 2011-04-12 | Networked Insights, Llc | Method and system for qualifying user engagement with a website |
US8996376B2 (en) | 2008-04-05 | 2015-03-31 | Apple Inc. | Intelligent text-to-speech conversion |
US8521512B2 (en) * | 2008-04-30 | 2013-08-27 | Deep Sky Concepts, Inc | Systems and methods for natural language communication with a computer |
US10496753B2 (en) | 2010-01-18 | 2019-12-03 | Apple Inc. | Automatically adapting user interfaces for hands-free interaction |
US20100030549A1 (en) | 2008-07-31 | 2010-02-04 | Lee Michael M | Mobile device having human language translation capability with positional feedback |
WO2010067118A1 (en) | 2008-12-11 | 2010-06-17 | Novauris Technologies Limited | Speech recognition involving a mobile device |
US20100257182A1 (en) * | 2009-04-06 | 2010-10-07 | Equiom Labs Llc | Automated dynamic style guard for electronic documents |
US9805020B2 (en) | 2009-04-23 | 2017-10-31 | Deep Sky Concepts, Inc. | In-context access of stored declarative knowledge using natural language expression |
US8275788B2 (en) | 2009-11-17 | 2012-09-25 | Glace Holding Llc | System and methods for accessing web pages using natural language |
US8972445B2 (en) | 2009-04-23 | 2015-03-03 | Deep Sky Concepts, Inc. | Systems and methods for storage of declarative knowledge accessible by natural language in a computer capable of appropriately responding |
US10241644B2 (en) | 2011-06-03 | 2019-03-26 | Apple Inc. | Actionable reminder entries |
US10241752B2 (en) | 2011-09-30 | 2019-03-26 | Apple Inc. | Interface for a virtual digital assistant |
US10706373B2 (en) | 2011-06-03 | 2020-07-07 | Apple Inc. | Performing actions associated with task items that represent tasks to perform |
US9858925B2 (en) | 2009-06-05 | 2018-01-02 | Apple Inc. | Using context information to facilitate processing of commands in a virtual assistant |
US9431006B2 (en) | 2009-07-02 | 2016-08-30 | Apple Inc. | Methods and apparatuses for automatic speech recognition |
US20110161067A1 (en) * | 2009-12-29 | 2011-06-30 | Dynavox Systems, Llc | System and method of using pos tagging for symbol assignment |
US20110161073A1 (en) * | 2009-12-29 | 2011-06-30 | Dynavox Systems, Llc | System and method of disambiguating and selecting dictionary definitions for one or more target words |
US10705794B2 (en) | 2010-01-18 | 2020-07-07 | Apple Inc. | Automatically adapting user interfaces for hands-free interaction |
US10679605B2 (en) | 2010-01-18 | 2020-06-09 | Apple Inc. | Hands-free list-reading by intelligent automated assistant |
US10553209B2 (en) | 2010-01-18 | 2020-02-04 | Apple Inc. | Systems and methods for hands-free notification summaries |
US10276170B2 (en) | 2010-01-18 | 2019-04-30 | Apple Inc. | Intelligent automated assistant |
DE112011100329T5 (de) | 2010-01-25 | 2012-10-31 | Andrew Peter Nelson Jerram | Vorrichtungen, Verfahren und Systeme für eine Digitalkonversationsmanagementplattform |
US8682667B2 (en) | 2010-02-25 | 2014-03-25 | Apple Inc. | User profiling for selecting user specific voice input processing information |
US9858338B2 (en) * | 2010-04-30 | 2018-01-02 | International Business Machines Corporation | Managed document research domains |
US20120151386A1 (en) * | 2010-12-10 | 2012-06-14 | Microsoft Corporation | Identifying actions in documents using options in menus |
US10762293B2 (en) | 2010-12-22 | 2020-09-01 | Apple Inc. | Using parts-of-speech tagging and named entity recognition for spelling correction |
US9262612B2 (en) | 2011-03-21 | 2016-02-16 | Apple Inc. | Device access using voice authentication |
US10057736B2 (en) | 2011-06-03 | 2018-08-21 | Apple Inc. | Active transport based notifications |
US8994660B2 (en) | 2011-08-29 | 2015-03-31 | Apple Inc. | Text correction processing |
US20130311506A1 (en) * | 2012-01-09 | 2013-11-21 | Google Inc. | Method and apparatus for user query disambiguation |
US10134385B2 (en) | 2012-03-02 | 2018-11-20 | Apple Inc. | Systems and methods for name pronunciation |
US9483461B2 (en) | 2012-03-06 | 2016-11-01 | Apple Inc. | Handling speech synthesis of content for multiple languages |
US9280610B2 (en) | 2012-05-14 | 2016-03-08 | Apple Inc. | Crowd sourcing information to fulfill user requests |
US9721563B2 (en) | 2012-06-08 | 2017-08-01 | Apple Inc. | Name recognition system |
US9495129B2 (en) | 2012-06-29 | 2016-11-15 | Apple Inc. | Device, method, and user interface for voice-activated navigation and browsing of a document |
US9280520B2 (en) * | 2012-08-02 | 2016-03-08 | American Express Travel Related Services Company, Inc. | Systems and methods for semantic information retrieval |
US9576574B2 (en) | 2012-09-10 | 2017-02-21 | Apple Inc. | Context-sensitive handling of interruptions by intelligent digital assistant |
US9547647B2 (en) | 2012-09-19 | 2017-01-17 | Apple Inc. | Voice-based media searching |
CN104969289B (zh) | 2013-02-07 | 2021-05-28 | 苹果公司 | 数字助理的语音触发器 |
US9368114B2 (en) | 2013-03-14 | 2016-06-14 | Apple Inc. | Context-sensitive handling of interruptions |
US9262555B2 (en) | 2013-03-15 | 2016-02-16 | Yahoo! Inc. | Machine for recognizing or generating Jabba-type sequences |
WO2014144579A1 (en) | 2013-03-15 | 2014-09-18 | Apple Inc. | System and method for updating an adaptive speech recognition model |
US9530094B2 (en) * | 2013-03-15 | 2016-12-27 | Yahoo! Inc. | Jabba-type contextual tagger |
US9195940B2 (en) | 2013-03-15 | 2015-11-24 | Yahoo! Inc. | Jabba-type override for correcting or improving output of a model |
US9311058B2 (en) | 2013-03-15 | 2016-04-12 | Yahoo! Inc. | Jabba language |
KR101759009B1 (ko) | 2013-03-15 | 2017-07-17 | 애플 인크. | 적어도 부분적인 보이스 커맨드 시스템을 트레이닝시키는 것 |
US9275035B2 (en) | 2013-05-14 | 2016-03-01 | English Helper Inc. | Method and system to determine part-of-speech |
WO2014197336A1 (en) | 2013-06-07 | 2014-12-11 | Apple Inc. | System and method for detecting errors in interactions with a voice-based digital assistant |
WO2014197334A2 (en) | 2013-06-07 | 2014-12-11 | Apple Inc. | System and method for user-specified pronunciation of words for speech synthesis and recognition |
US9582608B2 (en) | 2013-06-07 | 2017-02-28 | Apple Inc. | Unified ranking with entropy-weighted information for phrase-based semantic auto-completion |
WO2014197335A1 (en) | 2013-06-08 | 2014-12-11 | Apple Inc. | Interpreting and acting upon commands that involve sharing information with remote devices |
KR101959188B1 (ko) | 2013-06-09 | 2019-07-02 | 애플 인크. | 디지털 어시스턴트의 둘 이상의 인스턴스들에 걸친 대화 지속성을 가능하게 하기 위한 디바이스, 방법 및 그래픽 사용자 인터페이스 |
US10176167B2 (en) | 2013-06-09 | 2019-01-08 | Apple Inc. | System and method for inferring user intent from speech inputs |
WO2014200731A1 (en) | 2013-06-13 | 2014-12-18 | Apple Inc. | System and method for emergency calls initiated by voice command |
KR101749009B1 (ko) | 2013-08-06 | 2017-06-19 | 애플 인크. | 원격 디바이스로부터의 활동에 기초한 스마트 응답의 자동 활성화 |
JP6263052B2 (ja) * | 2014-03-07 | 2018-01-17 | 日本放送協会 | タグ付与知識学習装置およびそのプログラム、ならびに、タグ付与装置およびそのプログラム |
US9620105B2 (en) | 2014-05-15 | 2017-04-11 | Apple Inc. | Analyzing audio input for efficient speech and music recognition |
US10592095B2 (en) | 2014-05-23 | 2020-03-17 | Apple Inc. | Instantaneous speaking of content on touch devices |
US9502031B2 (en) | 2014-05-27 | 2016-11-22 | Apple Inc. | Method for supporting dynamic grammars in WFST-based ASR |
US9715875B2 (en) | 2014-05-30 | 2017-07-25 | Apple Inc. | Reducing the need for manual start/end-pointing and trigger phrases |
US9430463B2 (en) | 2014-05-30 | 2016-08-30 | Apple Inc. | Exemplar-based natural language processing |
US10078631B2 (en) | 2014-05-30 | 2018-09-18 | Apple Inc. | Entropy-guided text prediction using combined word and character n-gram language models |
US9734193B2 (en) | 2014-05-30 | 2017-08-15 | Apple Inc. | Determining domain salience ranking from ambiguous words in natural speech |
US9966065B2 (en) | 2014-05-30 | 2018-05-08 | Apple Inc. | Multi-command single utterance input method |
US9785630B2 (en) | 2014-05-30 | 2017-10-10 | Apple Inc. | Text prediction using combined word N-gram and unigram language models |
US10289433B2 (en) | 2014-05-30 | 2019-05-14 | Apple Inc. | Domain specific language for encoding assistant dialog |
US10170123B2 (en) | 2014-05-30 | 2019-01-01 | Apple Inc. | Intelligent assistant for home automation |
US9633004B2 (en) | 2014-05-30 | 2017-04-25 | Apple Inc. | Better resolution when referencing to concepts |
US9760559B2 (en) | 2014-05-30 | 2017-09-12 | Apple Inc. | Predictive text input |
US9842101B2 (en) | 2014-05-30 | 2017-12-12 | Apple Inc. | Predictive conversion of language input |
US9338493B2 (en) | 2014-06-30 | 2016-05-10 | Apple Inc. | Intelligent automated assistant for TV user interactions |
US10659851B2 (en) | 2014-06-30 | 2020-05-19 | Apple Inc. | Real-time digital assistant knowledge updates |
US10446141B2 (en) | 2014-08-28 | 2019-10-15 | Apple Inc. | Automatic speech recognition based on user feedback |
US9818400B2 (en) | 2014-09-11 | 2017-11-14 | Apple Inc. | Method and apparatus for discovering trending terms in speech requests |
US10789041B2 (en) | 2014-09-12 | 2020-09-29 | Apple Inc. | Dynamic thresholds for always listening speech trigger |
US9668121B2 (en) | 2014-09-30 | 2017-05-30 | Apple Inc. | Social reminders |
US9886432B2 (en) | 2014-09-30 | 2018-02-06 | Apple Inc. | Parsimonious handling of word inflection via categorical stem + suffix N-gram language models |
US9646609B2 (en) | 2014-09-30 | 2017-05-09 | Apple Inc. | Caching apparatus for serving phonetic pronunciations |
US10127911B2 (en) | 2014-09-30 | 2018-11-13 | Apple Inc. | Speaker identification and unsupervised speaker adaptation techniques |
US10074360B2 (en) | 2014-09-30 | 2018-09-11 | Apple Inc. | Providing an indication of the suitability of speech recognition |
US10552013B2 (en) | 2014-12-02 | 2020-02-04 | Apple Inc. | Data detection |
US9711141B2 (en) | 2014-12-09 | 2017-07-18 | Apple Inc. | Disambiguating heteronyms in speech synthesis |
US10572810B2 (en) | 2015-01-07 | 2020-02-25 | Microsoft Technology Licensing, Llc | Managing user interaction for input understanding determinations |
US9865280B2 (en) | 2015-03-06 | 2018-01-09 | Apple Inc. | Structured dictation using intelligent automated assistants |
US9886953B2 (en) | 2015-03-08 | 2018-02-06 | Apple Inc. | Virtual assistant activation |
US10567477B2 (en) | 2015-03-08 | 2020-02-18 | Apple Inc. | Virtual assistant continuity |
US9721566B2 (en) | 2015-03-08 | 2017-08-01 | Apple Inc. | Competing devices responding to voice triggers |
US9899019B2 (en) | 2015-03-18 | 2018-02-20 | Apple Inc. | Systems and methods for structured stem and suffix language models |
US9842105B2 (en) | 2015-04-16 | 2017-12-12 | Apple Inc. | Parsimonious continuous-space phrase representations for natural language processing |
US10083688B2 (en) | 2015-05-27 | 2018-09-25 | Apple Inc. | Device voice control for selecting a displayed affordance |
US10127220B2 (en) | 2015-06-04 | 2018-11-13 | Apple Inc. | Language identification from short strings |
US9578173B2 (en) | 2015-06-05 | 2017-02-21 | Apple Inc. | Virtual assistant aided communication with 3rd party service in a communication session |
US10101822B2 (en) | 2015-06-05 | 2018-10-16 | Apple Inc. | Language input correction |
US11025565B2 (en) | 2015-06-07 | 2021-06-01 | Apple Inc. | Personalized prediction of responses for instant messaging |
US10186254B2 (en) | 2015-06-07 | 2019-01-22 | Apple Inc. | Context-based endpoint detection |
US10255907B2 (en) | 2015-06-07 | 2019-04-09 | Apple Inc. | Automatic accent detection using acoustic models |
US10249297B2 (en) | 2015-07-13 | 2019-04-02 | Microsoft Technology Licensing, Llc | Propagating conversational alternatives using delayed hypothesis binding |
US10803207B2 (en) | 2015-07-23 | 2020-10-13 | Autodesk, Inc. | System-level approach to goal-driven design |
US10747498B2 (en) | 2015-09-08 | 2020-08-18 | Apple Inc. | Zero latency digital assistant |
US10671428B2 (en) | 2015-09-08 | 2020-06-02 | Apple Inc. | Distributed personal assistant |
US9697820B2 (en) | 2015-09-24 | 2017-07-04 | Apple Inc. | Unit-selection text-to-speech synthesis using concatenation-sensitive neural networks |
US11010550B2 (en) | 2015-09-29 | 2021-05-18 | Apple Inc. | Unified language modeling framework for word prediction, auto-completion and auto-correction |
US10366158B2 (en) | 2015-09-29 | 2019-07-30 | Apple Inc. | Efficient word encoding for recurrent neural network language models |
US11587559B2 (en) | 2015-09-30 | 2023-02-21 | Apple Inc. | Intelligent device identification |
US10691473B2 (en) | 2015-11-06 | 2020-06-23 | Apple Inc. | Intelligent automated assistant in a messaging environment |
US10049668B2 (en) | 2015-12-02 | 2018-08-14 | Apple Inc. | Applying neural network language models to weighted finite state transducers for automatic speech recognition |
US10223066B2 (en) | 2015-12-23 | 2019-03-05 | Apple Inc. | Proactive assistance based on dialog communication between devices |
US10446143B2 (en) | 2016-03-14 | 2019-10-15 | Apple Inc. | Identification of voice inputs providing credentials |
US9934775B2 (en) | 2016-05-26 | 2018-04-03 | Apple Inc. | Unit-selection text-to-speech synthesis based on predicted concatenation parameters |
US9972304B2 (en) | 2016-06-03 | 2018-05-15 | Apple Inc. | Privacy preserving distributed evaluation framework for embedded personalized systems |
US10249300B2 (en) | 2016-06-06 | 2019-04-02 | Apple Inc. | Intelligent list reading |
US10049663B2 (en) | 2016-06-08 | 2018-08-14 | Apple, Inc. | Intelligent automated assistant for media exploration |
DK179309B1 (en) | 2016-06-09 | 2018-04-23 | Apple Inc | Intelligent automated assistant in a home environment |
US10586535B2 (en) | 2016-06-10 | 2020-03-10 | Apple Inc. | Intelligent digital assistant in a multi-tasking environment |
US10509862B2 (en) | 2016-06-10 | 2019-12-17 | Apple Inc. | Dynamic phrase expansion of language input |
US10490187B2 (en) | 2016-06-10 | 2019-11-26 | Apple Inc. | Digital assistant providing automated status report |
US10192552B2 (en) | 2016-06-10 | 2019-01-29 | Apple Inc. | Digital assistant providing whispered speech |
US10067938B2 (en) | 2016-06-10 | 2018-09-04 | Apple Inc. | Multilingual word prediction |
DK179343B1 (en) | 2016-06-11 | 2018-05-14 | Apple Inc | Intelligent task discovery |
DK179049B1 (en) | 2016-06-11 | 2017-09-18 | Apple Inc | Data driven natural language event detection and classification |
DK179415B1 (en) | 2016-06-11 | 2018-06-14 | Apple Inc | Intelligent device arbitration and control |
DK201670540A1 (en) | 2016-06-11 | 2018-01-08 | Apple Inc | Application integration with a digital assistant |
US10446137B2 (en) * | 2016-09-07 | 2019-10-15 | Microsoft Technology Licensing, Llc | Ambiguity resolving conversational understanding system |
US10043516B2 (en) | 2016-09-23 | 2018-08-07 | Apple Inc. | Intelligent automated assistant |
CN107315733A (zh) * | 2016-11-24 | 2017-11-03 | 海南州云藏藏文信息技术有限公司 | 智能藏文词性自动标注系统 |
US10593346B2 (en) | 2016-12-22 | 2020-03-17 | Apple Inc. | Rank-reduced token representation for automatic speech recognition |
DK201770439A1 (en) | 2017-05-11 | 2018-12-13 | Apple Inc. | Offline personal assistant |
DK179496B1 (en) | 2017-05-12 | 2019-01-15 | Apple Inc. | USER-SPECIFIC Acoustic Models |
DK179745B1 (en) | 2017-05-12 | 2019-05-01 | Apple Inc. | SYNCHRONIZATION AND TASK DELEGATION OF A DIGITAL ASSISTANT |
DK201770432A1 (en) | 2017-05-15 | 2018-12-21 | Apple Inc. | Hierarchical belief states for digital assistants |
DK201770431A1 (en) | 2017-05-15 | 2018-12-20 | Apple Inc. | Optimizing dialogue policy decisions for digital assistants using implicit feedback |
DK179560B1 (en) | 2017-05-16 | 2019-02-18 | Apple Inc. | FAR-FIELD EXTENSION FOR DIGITAL ASSISTANT SERVICES |
US11295089B2 (en) | 2020-03-01 | 2022-04-05 | International Business Machines Corporation | Dynamically enhancing an instrument using multi-stem definitions |
Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP0327266B1 (de) * | 1988-02-05 | 1995-08-30 | AT&T Corp. | Verfahren zur Bestimmung von Textteilen und Verwendung |
Family Cites Families (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5610812A (en) | 1994-06-24 | 1997-03-11 | Mitsubishi Electric Information Technology Center America, Inc. | Contextual tagger utilizing deterministic finite state transducer |
US5794177A (en) | 1995-07-19 | 1998-08-11 | Inso Corporation | Method and apparatus for morphological analysis and generation of natural language text |
US5822731A (en) | 1995-09-15 | 1998-10-13 | Infonautics Corporation | Adjusting a hidden Markov model tagger for sentence fragments |
GB9713019D0 (en) | 1997-06-20 | 1997-08-27 | Xerox Corp | Linguistic search system |
DE69802402T2 (de) | 1997-07-04 | 2002-06-06 | Xerox Corp | Hidden-Markov-Modelle (HMM) approximierende endliche Transducer und ihre Verwendung zum Text-Tagging |
JP2002530761A (ja) | 1998-11-17 | 2002-09-17 | ルノー・アンド・オスピー・スピーチ・プロダクツ・ナームローゼ・ベンノートシャープ | 改良された品詞タグ付け方法及び装置 |
US6321372B1 (en) | 1998-12-23 | 2001-11-20 | Xerox Corporation | Executable for requesting a linguistic service |
-
2000
- 2000-12-19 US US09/738,987 patent/US6910004B2/en not_active Expired - Fee Related
-
2001
- 2001-12-12 JP JP2001378036A patent/JP2002215617A/ja active Pending
- 2001-12-13 EP EP01129760A patent/EP1217533A3/de not_active Ceased
Patent Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP0327266B1 (de) * | 1988-02-05 | 1995-08-30 | AT&T Corp. | Verfahren zur Bestimmung von Textteilen und Verwendung |
Non-Patent Citations (2)
Title |
---|
KENNETH LITKOWSKI: "Dictionary Parsing Project" [Online] December 1998 (1998-12), CL RESEARCH , XP002327389 Retrieved from the Internet: URL:http://www.clres.com/dpp.html> * sentence 1, paragraph 1 * * sentence 2, paragraph 3 * * sentences 1-7, paragraph 4 * * |
TETSUYA NASUKAWA: "Robust parsing based on discourse information: completing partial parses of ill-formed sentences on the basis of discourse information" June 1995 (1995-06), PROCEEDINGS OF THE 33RD CONFERENCE ON ASSOCIATION FOR COMPUTATIONAL LINGUISTICS PAGES (39-46), ASSOCIATION FOR COMPUTATIONAL LINGUISTICS , CAMBRIDGE, MASSACHUSETTS , XP002327316 * page 42, left-hand column, lines 12-16 * * page 42, right-hand column, lines 15-20 * * |
Cited By (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
GB2390704A (en) * | 2002-07-09 | 2004-01-14 | Canon Kk | Automatic summary generation and display |
US7234942B2 (en) | 2002-07-09 | 2007-06-26 | Canon Kabushiki Kaisha | Summarisation representation apparatus |
US7263530B2 (en) | 2003-03-12 | 2007-08-28 | Canon Kabushiki Kaisha | Apparatus for and method of summarising text |
US9563192B2 (en) | 2014-01-02 | 2017-02-07 | Rockwell Automation Technologies, Inc. | Software workstation and method for employing appended metadata in industrial automation software |
US10423407B2 (en) | 2014-01-02 | 2019-09-24 | Rockwell Automation Technologies, Inc. | Software workstation and method for employing appended metadata in industrial automation software |
EP3264281A1 (de) * | 2016-07-01 | 2018-01-03 | Wipro Limited | Verfahren und system zur automatischen identifizierung von verkehrsverstössen in einem oder mehreren testfällen |
US10545854B2 (en) | 2016-07-01 | 2020-01-28 | Wipro Limited | Method and a system for automatically identifying violations in one or more test cases |
US10878174B1 (en) | 2020-06-24 | 2020-12-29 | Starmind Ag | Advanced text tagging using key phrase extraction and key phrase generation |
US11379763B1 (en) | 2021-08-10 | 2022-07-05 | Starmind Ag | Ontology-based technology platform for mapping and filtering skills, job titles, and expertise topics |
Also Published As
Publication number | Publication date |
---|---|
US20020077806A1 (en) | 2002-06-20 |
US6910004B2 (en) | 2005-06-21 |
JP2002215617A (ja) | 2002-08-02 |
EP1217533A3 (de) | 2005-09-21 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US6910004B2 (en) | Method and computer system for part-of-speech tagging of incomplete sentences | |
US5930746A (en) | Parsing and translating natural language sentences automatically | |
US8515733B2 (en) | Method, device, computer program and computer program product for processing linguistic data in accordance with a formalized natural language | |
JP2008152760A (ja) | マシンアシスト翻訳ツール | |
WO2003056450A1 (fr) | Procede et appareil d'analyse syntaxique | |
JP2012520528A (ja) | 自然言語テキストの自動的意味ラベリングのためのシステム及び方法 | |
WO1997004405A1 (en) | Method and apparatus for automated search and retrieval processing | |
JP2005182823A (ja) | 縮小されたテキスト本文を生成する方法 | |
JPH083815B2 (ja) | 自然言語の共起関係辞書保守方法 | |
CN111611810A (zh) | 一种多音字读音消歧装置及方法 | |
Kammoun et al. | The MORPH2 new version: A robust morphological analyzer for Arabic texts | |
Stamatatos et al. | A practical chunker for unrestricted text | |
US20020129066A1 (en) | Computer implemented method for reformatting logically complex clauses in an electronic text-based document | |
Alkahtani | Building and verifying parallel corpora between Arabic and English | |
Foufi et al. | Multilingual parsing and MWE detection | |
JP3441400B2 (ja) | 言語変換規則作成装置、及びプログラム記録媒体 | |
Fashwan et al. | A rule based method for adding case ending diacritics for modern standard Arabic texts | |
Jolly et al. | Anatomizing lexicon with natural language Tokenizer Toolkit 3 | |
Pettersson et al. | Automatic verb extraction from historical Swedish texts | |
Sharma et al. | Improving existing punjabi grammar checker | |
Gavhal et al. | Sentence Compression Using Natural Language Processing | |
Chege et al. | Developing an Open source Spell-checker for Gıkuyu | |
Mesfar | Towards a cascade of morpho-syntactic tools for Arabic natural language processing | |
Alansary | Basma: Bibalex standard arabic morphological analyzer | |
JPH0748217B2 (ja) | 文書要約装置 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PUAI | Public reference made under article 153(3) epc to a published international application that has entered the european phase |
Free format text: ORIGINAL CODE: 0009012 |
|
AK | Designated contracting states |
Kind code of ref document: A2 Designated state(s): AT BE CH CY DE DK ES FI FR GB GR IE IT LI LU MC NL PT SE TR |
|
AX | Request for extension of the european patent |
Free format text: AL;LT;LV;MK;RO;SI |
|
PUAL | Search report despatched |
Free format text: ORIGINAL CODE: 0009013 |
|
AK | Designated contracting states |
Kind code of ref document: A3 Designated state(s): AT BE CH CY DE DK ES FI FR GB GR IE IT LI LU MC NL PT SE TR |
|
AX | Request for extension of the european patent |
Extension state: AL LT LV MK RO SI |
|
17P | Request for examination filed |
Effective date: 20060321 |
|
AKX | Designation fees paid |
Designated state(s): DE FR GB |
|
STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: THE APPLICATION HAS BEEN REFUSED |
|
18R | Application refused |
Effective date: 20070520 |