EP0327266A2 - Méthode pour la détermination des élements de langage et utilisation - Google Patents
Méthode pour la détermination des élements de langage et utilisation Download PDFInfo
- Publication number
- EP0327266A2 EP0327266A2 EP89300790A EP89300790A EP0327266A2 EP 0327266 A2 EP0327266 A2 EP 0327266A2 EP 89300790 A EP89300790 A EP 89300790A EP 89300790 A EP89300790 A EP 89300790A EP 0327266 A2 EP0327266 A2 EP 0327266A2
- Authority
- EP
- European Patent Office
- Prior art keywords
- speech
- word
- words
- probability
- contextual
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 238000000034 method Methods 0.000 title claims abstract description 71
- 230000001755 vocal effect Effects 0.000 claims abstract description 4
- 238000009499 grossing Methods 0.000 claims description 6
- 230000001419 dependent effect Effects 0.000 claims description 4
- 230000002194 synthesizing effect Effects 0.000 claims 1
- 230000015572 biosynthetic process Effects 0.000 abstract description 7
- 238000003786 synthesis reaction Methods 0.000 abstract description 7
- 238000010606 normalization Methods 0.000 abstract description 2
- 238000012549 training Methods 0.000 description 6
- 230000008569 process Effects 0.000 description 5
- 206010048232 Yawning Diseases 0.000 description 4
- 239000012530 fluid Substances 0.000 description 4
- 230000006870 function Effects 0.000 description 4
- 238000004364 calculation method Methods 0.000 description 3
- 239000000463 material Substances 0.000 description 3
- 230000001915 proofreading effect Effects 0.000 description 3
- 230000005540 biological transmission Effects 0.000 description 2
- 238000010586 diagram Methods 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 238000005457 optimization Methods 0.000 description 2
- 238000012545 processing Methods 0.000 description 2
- 230000000717 retained effect Effects 0.000 description 2
- 241000545744 Hirudinea Species 0.000 description 1
- 238000007476 Maximum Likelihood Methods 0.000 description 1
- 230000006978 adaptation Effects 0.000 description 1
- 238000013459 approach Methods 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 238000012937 correction Methods 0.000 description 1
- 230000002452 interceptive effect Effects 0.000 description 1
- 230000000877 morphologic effect Effects 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 230000009897 systematic effect Effects 0.000 description 1
- 238000013519 translation Methods 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/16—Sound input; Sound output
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/205—Parsing
- G06F40/211—Syntactic parsing, e.g. based on context-free grammar [CFG] or unification grammars
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/205—Parsing
- G06F40/216—Parsing using statistical methods
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/253—Grammatical analysis; Style critique
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/279—Recognition of textual entities
- G06F40/284—Lexical analysis, e.g. tokenisation or collocates
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/279—Recognition of textual entities
- G06F40/289—Phrasal analysis, e.g. finite state techniques or chunking
-
- G—PHYSICS
- G09—EDUCATION; CRYPTOGRAPHY; DISPLAY; ADVERTISING; SEALS
- G09B—EDUCATIONAL OR DEMONSTRATION APPLIANCES; APPLIANCES FOR TEACHING, OR COMMUNICATING WITH, THE BLIND, DEAF OR MUTE; MODELS; PLANETARIA; GLOBES; MAPS; DIAGRAMS
- G09B19/00—Teaching not covered by other main groups of this subclass
- G09B19/04—Speaking
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/08—Text analysis or generation of parameters for speech synthesis out of text, e.g. grapheme to phoneme translation, prosody generation or stress or intonation determination
- G10L13/10—Prosody rules derived from text; Stress or intonation
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/02—Methods for producing synthetic speech; Speech synthesisers
- G10L13/04—Details of speech synthesis systems, e.g. synthesiser structure or memory management
Definitions
- This invention relates to methods for part-of-speech determination and to methods for usage of the results, including intermediate methods of noun-phrase parsing, and including speech synthesis, speech recognition, training of writers, proofreading, indexing and data retrieval.
- automatic part-of-speech determination can play an important role in automatic speech recognition, in the education and training of writers by computer-assisted methods, in editing and proofreading of documents generated at a word-processing work station, in the indexing of a document, and in various forms of retrieval of word-dependent data from a date base.
- the second principal method which potentially has greater underlying unity is the "n-gram" technique described in the article "The Automatic Tagging of the LOB Corpus", in ICAME News, Vol. 7, pp. 13-33, by G. Leech et al., 1983, University of Lancaster, England.
- Part of the technique there described makes the assigned part of speech depend on the current best choices of parts of speech of certain preceding or following words, based on certain rules as to likely combinations of successive parts of speech. With this analysis, various ad hoc rules are also used, so that, overall, this method is still less accurate than desirable. In addition, this method fails to model lexical probabilities in a systematic fashion.
- parts of speech are assigned to words in a message by optimizing the product of individual word lexical probabilities and normalized three-word contectual probabilities. Normalization employs the contained two-word contextual probabilities. Endpoints of sentences (including multiple spaces between them), punctuation and words occurring with low frequency are assigned lexical probabilities and are otherwise treated as if they were words, so that discontinuities encountered in prior n-gram part-of-speech assignment and the prior use of "ad hoc" rules tend to be avoided. The generality of the technique is thereby established.
- a message in which the words have had parts-of-speech previously assigned has its noun phrases identified in a way that facilitates their use for. speech synthesis.
- This noun phrase parsing also may have other applications.
- the noun phrase parsing method is a highly probabilistic method that initially assigns beginnings and ends of noun phrases at every start or end of a word and progressively eliminates such assignments by eliminating the lowest probability assignments, until only very high probability non-recursive assignments remain.
- non-recursive assignments I mean that no noun phrase assignment is retained that is partly or wholly within another noun phrase.
- the method of this feature of my invention can also retain some high-probability noun phrases that occur wholly within other noun phrases, since such assignments are useful in practice, for example, in speech synthesis.
- noun phrase assignments which are always eliminated are endings without corresponding beginnings (e.g., at the start of a sentence), or beginnings without endings (e.g., at the end of a sentence), but my method further eliminates low-probability assignments of the beginnings and ends of noun phrases; or, to put it another way, retains only the highest probability assignments
- the output of my parts-of-speech assignment method may be the input to my noun-phrase-parsing method.
- my noun-phrase-parsing method the maximum likelihood optimization techniques used in both methods tend to reinforce each other, since each method, by itself, is superior in performance to that of its prior art.
- the message was a text message which has been read and stored in an electronic form.
- the first step then becomes, as indicated in block 11, to read the stored text, sentence by sentence.
- This step requires determining sentence boundaries. There are many known techniques, but I prefer to make the initial assumption that every period ends a sentence and then to discard that sentence and its results when my method subsequently demonstrates that the period had a more likely use.
- Token-izing includes the identification of words and certain non-words, such as punctuation and parentheses.
- non-words such as punctuation and parentheses.
- Token types involved in the process are the actual words of a sentence and structural indicators which inform the process that the end of a sentence has been reached.
- Those structural indicators include, for example, and end-of-sentence indicator, such as the machine-readable character for a period, a heading or paragraph indicator represented by a corresponding formatting character stored in the manuscript, filed, or file, along with the text words, and an end-of-file indicator.
- each final word in a sentence will have its contextual probability measured together with that for the period and the following blank. These three form a "trigram"; and the probability analysis therefore is exploring the question: "How likely is it that this word, as a certain part of speech, can end a sentence?" In this case the contextual probabilities of observing the period in this position is very high (near 1.0); and the contextual probability for the blank is 1.0.
- the resultant contextual probability is just the measured probability of seeing the subject part of speech at the end of a sentence which, in turn, is a statistic that can be tabulated from the text corpus and stored in a permanent memory of the computer.
- my method After token-izing the observed words and characters, as explained in connection with block 12, my method next computes the lexical part of speech probabilities (the probability of observing part of speech i given word j), dependent upon frequency of occurrence, as follows: If every sense of every word of interest appeared with a reasonably high frequency in the Brown Corpus, that calculation would be simply the quotient of the observed frequency of occurrence of the word as a particular part of speech, divided by its total frequency of occurrence, regardless of part of speech.
- the problem is to find an assignment of parts of speech to words that optimizes both lexical and contextual probabilities, both of which are estimated from the Tagged Brown Corpus.
- the lexical probabilities are estimated in the obvious way. For example, the probability that "I” is a pronoun, Prob(PPSS
- the probability that "see” is a verb is estimated to be 771/772.
- the other lexical probability estimates follow the same pattern.
- the contextual probability the probability of observing part of speech X, given the following two parts of speech Y and Z, is estimated by dividing the trigram part-of-speech frequency XYZ by the bigram part-of-speech frequency YZ.
- the other contextual probability estimates follow the same pattern.
- a search is performed in order to find the assignment of part of speech tags to words that optimizes the product of the lexical and contextual probabilities.
- the search enumerates all possible assignments of parts of speech to input words.
- there are four input words, three of which are two ways ambiguous, producing a set of 2 * 2 * 2 * 1 8 possible assignments of parts of speech to input words:
- Each of the eight sequences are then scored by the product of the lexical probabilities and the contextual probabilities, and the best sequence is selected. In this case, the first sequence is by far the best.
- the proposed method is a stochastic analog of precedence parsing. Recall that precedence parsing makes use of a table that says whether to insert an open or close bracket between any two categories (terminal or nonterminal). The proposed method makes use of a table that gives the probabilities of an open and close bracket between all pairs of parts of speech. A sample is shown below for the five parts of speech: AT (article), NN (singular noun), NNS (non-singular noun), VB (uninflected verb), IN (preposition). These probabilities were estimated from about 40,000 words of training material selected from the Brown Corpus. The training material was parsed into noun phrases by laborious semi-automatic means.
- the stochastic parser is given a sequence of parts of speech as input and is asked to insert brackets corresponding to the beginning and end of noun phrases.
- the parser enumerates all possible parsings of the input and scores each of them by the precedence probabilities.
- Each of these parsings is scored by multiplying 6 precedence probabilities, the probability of an openlclose bracket appearing (or not appearing) in any one of the three positions (before the NN, after the NN or after the VB). The parsing with the highest score is returned as output.
- noun phrase parsing as described in FIG. 2, assumes the output from the part of speech assignment of FIG. 1 as its input. But it could also use the results of any other part of speech assignment technique.
- block 24 involves laying out a probability tree for each self-consistent assignment of noun-phrase boundaries. The highest probability assignments are then retained for later processing, e.g., utilization of the results, as indicated in block 25.
- the part of speech tagger 31 is a computer employing the method of FIG. 1.
- Noun phrase parser 32 is a computer employing the method of FIG. 2.
- tagger 31 and parser 32 are applied in a syntax analyzer to provide the input signals for the absolute stress signal generator 18 of FIG. 1 of U.S. Patent No. 3,704,345 issued to C. H. Coker, et al.
- part of speech tagger 41 functions as described in FIG. 1; and noun phrase parser 42 functions as described in FIG. 2.
- the noun phrase and parts of speech information is applied in the text editing system 43, which is of the type described in U. S. Patent No. 4,674,065 issued to F. R. Lange et al.
- part-of-speech tagger 41 and noun phrase parser 42 provide a substitute for "parts of speech" Section 33 in the Lange et al. patent to assist in generating the editing displays therein.
- the accuracy inherent is my method of FIGS. 1 and 2 should yield more useful editing displays than is the case in the prior art.
- text editing system 43 may be the Writer's WorkbenchO system described in Computer Science Technical Report, No. 91 "Writing Tools - The STYLE & Diction Programs", by L. L. Cherry, et al., February 1981, Bell Telephone Laboratories, Incorporated. My methods would be a substitute for the method designated "PARTS" therein.
- lexical probabilities are not the only probabilities that could be improved by smoothing.
- Contextual frequencies also seem to follow Zipf's Law. That is, for the set of all sequences of three parts of speech, we have plotted the frequency of the sequence against its rank on log paper and observed the classic linear relationship and slope of almost -1. It is clear that smoothing techniques could well be applied to contextual frequencies alternatives. The same can also be said for the precedence probabilities used in noun phrase parsing.
- the techniques of my invention also have relevance to other applications, such as speech recognition. Part-of-speech contextual probabilities could make possible better choices for a spoken word which is to be recognized.
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- General Health & Medical Sciences (AREA)
- General Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- General Engineering & Computer Science (AREA)
- Artificial Intelligence (AREA)
- Business, Economics & Management (AREA)
- Human Computer Interaction (AREA)
- Educational Technology (AREA)
- Educational Administration (AREA)
- Probability & Statistics with Applications (AREA)
- Entrepreneurship & Innovation (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Machine Translation (AREA)
- Document Processing Apparatus (AREA)
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US152740 | 1988-02-05 | ||
US07/152,740 US5146405A (en) | 1988-02-05 | 1988-02-05 | Methods for part-of-speech determination and usage |
Publications (3)
Publication Number | Publication Date |
---|---|
EP0327266A2 true EP0327266A2 (fr) | 1989-08-09 |
EP0327266A3 EP0327266A3 (fr) | 1992-01-02 |
EP0327266B1 EP0327266B1 (fr) | 1995-08-30 |
Family
ID=22544213
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
EP89300790A Expired - Lifetime EP0327266B1 (fr) | 1988-02-05 | 1989-01-27 | Méthode pour la détermination des élements de langage et utilisation |
Country Status (9)
Country | Link |
---|---|
US (1) | US5146405A (fr) |
EP (1) | EP0327266B1 (fr) |
JP (1) | JPH0769910B2 (fr) |
KR (1) | KR970006402B1 (fr) |
AU (1) | AU617749B2 (fr) |
CA (1) | CA1301345C (fr) |
DE (1) | DE68923981T2 (fr) |
ES (1) | ES2076952T3 (fr) |
IN (1) | IN175380B (fr) |
Cited By (14)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP0465058A2 (fr) * | 1990-06-28 | 1992-01-08 | AT&T Corp. | Système d'analyse d'un langage écrit |
EP0513918A1 (fr) * | 1991-05-16 | 1992-11-19 | Océ-Nederland B.V. | Méthode de correction d'erreurs dans une phrase en langage naturel |
EP0525470A2 (fr) * | 1991-07-25 | 1993-02-03 | International Business Machines Corporation | Méthode et système de traduction en langage naturel |
WO1996042079A1 (fr) * | 1995-06-13 | 1996-12-27 | British Telecommunications Public Limited Company | Synthese de la parole |
WO1997004405A1 (fr) * | 1995-07-19 | 1997-02-06 | Inso Corporation | Procede et appareil de recherche et extraction automatiques |
US5680628A (en) * | 1995-07-19 | 1997-10-21 | Inso Corporation | Method and apparatus for automated search and retrieval process |
EP0952533A2 (fr) * | 1998-03-23 | 1999-10-27 | Xerox Corporation | Synthèse de textes en utilisant des parties de parole |
BE1011964A3 (fr) * | 1997-11-07 | 2000-03-07 | Motorola Inc | Methode, dispositif et systeme pour la desambiguisation des parties du discours. |
WO2001018788A2 (fr) * | 1999-09-03 | 2001-03-15 | Siemens Aktiengesellschaft | Procede de determination de fins de phrase dans le traitement vocal automatique |
EP0953192B1 (fr) * | 1996-06-28 | 2003-10-29 | Microsoft Corporation | Analyseur syntaxique de langage naturel avec probabilites de nature grammaticale fondees sur dictionnaire |
WO2007006769A1 (fr) * | 2005-07-12 | 2007-01-18 | International Business Machines Corporation | Systeme, programme, et procede de controle pour synthese vocale |
DE202013104836U1 (de) | 2013-10-29 | 2014-01-30 | Foseco International Limited | Speiseraufbau |
US9263059B2 (en) | 2012-09-28 | 2016-02-16 | International Business Machines Corporation | Deep tagging background noises |
US10515138B2 (en) | 2014-04-25 | 2019-12-24 | Mayo Foundation For Medical Education And Research | Enhancing reading accuracy, efficiency and retention |
Families Citing this family (181)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5530863A (en) * | 1989-05-19 | 1996-06-25 | Fujitsu Limited | Programming language processing system with program translation performed by term rewriting with pattern matching |
US5418717A (en) * | 1990-08-27 | 1995-05-23 | Su; Keh-Yih | Multiple score language processing system |
JP2764343B2 (ja) * | 1990-09-07 | 1998-06-11 | 富士通株式会社 | 節/句境界抽出方式 |
US5475796A (en) * | 1991-12-20 | 1995-12-12 | Nec Corporation | Pitch pattern generation apparatus |
US5541836A (en) * | 1991-12-30 | 1996-07-30 | At&T Corp. | Word disambiguation apparatus and methods |
US5267345A (en) * | 1992-02-10 | 1993-11-30 | International Business Machines Corporation | Speech recognition apparatus which predicts word classes from context and words from word classes |
US5383120A (en) * | 1992-03-02 | 1995-01-17 | General Electric Company | Method for tagging collocations in text |
US5293584A (en) * | 1992-05-21 | 1994-03-08 | International Business Machines Corporation | Speech recognition system for natural language translation |
JPH06195373A (ja) * | 1992-12-24 | 1994-07-15 | Sharp Corp | 機械翻訳装置 |
US5440481A (en) * | 1992-10-28 | 1995-08-08 | The United States Of America As Represented By The Secretary Of The Navy | System and method for database tomography |
JPH0756957A (ja) * | 1993-08-03 | 1995-03-03 | Xerox Corp | ユーザへの情報提供方法 |
US5873056A (en) * | 1993-10-12 | 1999-02-16 | The Syracuse University | Natural language processing system for semantic vector representation which accounts for lexical ambiguity |
JPH08508127A (ja) * | 1993-10-15 | 1996-08-27 | エイ・ティ・アンド・ティ・コーポレーション | システムをトレーニングする方法、その結果得られる装置、およびその使用方法 |
JP2986345B2 (ja) * | 1993-10-18 | 1999-12-06 | インターナショナル・ビジネス・マシーンズ・コーポレイション | 音声記録指標化装置及び方法 |
US5510981A (en) * | 1993-10-28 | 1996-04-23 | International Business Machines Corporation | Language translation apparatus and method using context-based translation models |
SE513456C2 (sv) * | 1994-05-10 | 2000-09-18 | Telia Ab | Metod och anordning vid tal- till textomvandling |
US5537317A (en) * | 1994-06-01 | 1996-07-16 | Mitsubishi Electric Research Laboratories Inc. | System for correcting grammer based parts on speech probability |
US5485372A (en) * | 1994-06-01 | 1996-01-16 | Mitsubishi Electric Research Laboratories, Inc. | System for underlying spelling recovery |
US5610812A (en) * | 1994-06-24 | 1997-03-11 | Mitsubishi Electric Information Technology Center America, Inc. | Contextual tagger utilizing deterministic finite state transducer |
US5850561A (en) * | 1994-09-23 | 1998-12-15 | Lucent Technologies Inc. | Glossary construction tool |
US5721938A (en) * | 1995-06-07 | 1998-02-24 | Stuckey; Barbara K. | Method and device for parsing and analyzing natural language sentences and text |
AU5969896A (en) * | 1995-06-07 | 1996-12-30 | International Language Engineering Corporation | Machine assisted translation tools |
US5873660A (en) * | 1995-06-19 | 1999-02-23 | Microsoft Corporation | Morphological search and replace |
US5828991A (en) * | 1995-06-30 | 1998-10-27 | The Research Foundation Of The State University Of New York | Sentence reconstruction using word ambiguity resolution |
US5721902A (en) * | 1995-09-15 | 1998-02-24 | Infonautics Corporation | Restricted expansion of query terms using part of speech tagging |
US5819260A (en) * | 1996-01-22 | 1998-10-06 | Lexis-Nexis | Phrase recognition method and apparatus |
SG49804A1 (en) * | 1996-03-20 | 1998-06-15 | Government Of Singapore Repres | Parsing and translating natural language sentences automatically |
US5999896A (en) * | 1996-06-25 | 1999-12-07 | Microsoft Corporation | Method and system for identifying and resolving commonly confused words in a natural language parser |
US5802533A (en) * | 1996-08-07 | 1998-09-01 | Walker; Randall C. | Text processor |
US6279017B1 (en) * | 1996-08-07 | 2001-08-21 | Randall C. Walker | Method and apparatus for displaying text based upon attributes found within the text |
CN1332340C (zh) * | 1997-03-04 | 2007-08-15 | 石仓博 | 语言分析系统及方法 |
US7672829B2 (en) * | 1997-03-04 | 2010-03-02 | Hiroshi Ishikura | Pivot translation method and system |
WO1999016051A1 (fr) * | 1997-09-24 | 1999-04-01 | Lernout & Hauspie Speech Products N.V | Procede de discrimination des realisations de phonation se ressemblant dans un processus de reconnaissance de la parole |
US6260008B1 (en) * | 1998-01-08 | 2001-07-10 | Sharp Kabushiki Kaisha | Method of and system for disambiguating syntactic word multiples |
US6098042A (en) * | 1998-01-30 | 2000-08-01 | International Business Machines Corporation | Homograph filter for speech synthesis system |
CN1159662C (zh) | 1998-05-13 | 2004-07-28 | 国际商业机器公司 | 连续语音识别中的标点符号自动生成装置及方法 |
US6167370A (en) * | 1998-09-09 | 2000-12-26 | Invention Machine Corporation | Document semantic analysis/selection with knowledge creativity capability utilizing subject-action-object (SAO) structures |
US6185524B1 (en) * | 1998-12-31 | 2001-02-06 | Lernout & Hauspie Speech Products N.V. | Method and apparatus for automatic identification of word boundaries in continuous text and computation of word boundary scores |
CA2367320A1 (fr) | 1999-03-19 | 2000-09-28 | Trados Gmbh | Systeme de gestion de flux des travaux |
US20060116865A1 (en) | 1999-09-17 | 2006-06-01 | Www.Uniscape.Com | E-services translation utilizing machine translation and translation memory |
WO2001033409A2 (fr) * | 1999-11-01 | 2001-05-10 | Kurzweil Cyberart Technologies, Inc. | Systeme generateur de poesie informatise |
US6633846B1 (en) | 1999-11-12 | 2003-10-14 | Phoenix Solutions, Inc. | Distributed realtime speech recognition system |
US9076448B2 (en) | 1999-11-12 | 2015-07-07 | Nuance Communications, Inc. | Distributed real time speech recognition system |
US7725307B2 (en) | 1999-11-12 | 2010-05-25 | Phoenix Solutions, Inc. | Query engine for processing voice based queries including semantic decoding |
US6665640B1 (en) | 1999-11-12 | 2003-12-16 | Phoenix Solutions, Inc. | Interactive speech based learning/training system formulating search queries based on natural language parsing of recognized user queries |
US7050977B1 (en) | 1999-11-12 | 2006-05-23 | Phoenix Solutions, Inc. | Speech-enabled server for internet website and method |
US6615172B1 (en) | 1999-11-12 | 2003-09-02 | Phoenix Solutions, Inc. | Intelligent query engine for processing voice based queries |
US7392185B2 (en) | 1999-11-12 | 2008-06-24 | Phoenix Solutions, Inc. | Speech based learning/training system using semantic decoding |
US7120574B2 (en) | 2000-04-03 | 2006-10-10 | Invention Machine Corporation | Synonym extension of search queries with validation |
US7962326B2 (en) * | 2000-04-20 | 2011-06-14 | Invention Machine Corporation | Semantic answering system and method |
US6810375B1 (en) | 2000-05-31 | 2004-10-26 | Hapax Limited | Method for segmentation of text |
US6684202B1 (en) * | 2000-05-31 | 2004-01-27 | Lexis Nexis | Computer-based system and method for finding rules of law in text |
US6941513B2 (en) | 2000-06-15 | 2005-09-06 | Cognisphere, Inc. | System and method for text structuring and text generation |
US6952666B1 (en) * | 2000-07-20 | 2005-10-04 | Microsoft Corporation | Ranking parser for a natural language processing system |
US7171349B1 (en) | 2000-08-11 | 2007-01-30 | Attensity Corporation | Relational text index creation and searching |
US6732098B1 (en) | 2000-08-11 | 2004-05-04 | Attensity Corporation | Relational text index creation and searching |
US6738765B1 (en) | 2000-08-11 | 2004-05-18 | Attensity Corporation | Relational text index creation and searching |
US6728707B1 (en) | 2000-08-11 | 2004-04-27 | Attensity Corporation | Relational text index creation and searching |
US6732097B1 (en) | 2000-08-11 | 2004-05-04 | Attensity Corporation | Relational text index creation and searching |
US6741988B1 (en) | 2000-08-11 | 2004-05-25 | Attensity Corporation | Relational text index creation and searching |
US8272873B1 (en) | 2000-10-16 | 2012-09-25 | Progressive Language, Inc. | Language learning system |
DE10057634C2 (de) * | 2000-11-21 | 2003-01-30 | Bosch Gmbh Robert | Verfahren zur Verarbeitung von Text in einer Rechnereinheit und Rechnereinheit |
US6978239B2 (en) * | 2000-12-04 | 2005-12-20 | Microsoft Corporation | Method and apparatus for speech synthesis without prosody modification |
US7263488B2 (en) * | 2000-12-04 | 2007-08-28 | Microsoft Corporation | Method and apparatus for identifying prosodic word boundaries |
US6910004B2 (en) * | 2000-12-19 | 2005-06-21 | Xerox Corporation | Method and computer system for part-of-speech tagging of incomplete sentences |
US20020129066A1 (en) * | 2000-12-28 | 2002-09-12 | Milward David R. | Computer implemented method for reformatting logically complex clauses in an electronic text-based document |
US6859771B2 (en) * | 2001-04-23 | 2005-02-22 | Microsoft Corporation | System and method for identifying base noun phrases |
US7177792B2 (en) * | 2001-05-31 | 2007-02-13 | University Of Southern California | Integer programming decoder for machine translation |
US8214196B2 (en) * | 2001-07-03 | 2012-07-03 | University Of Southern California | Syntax-based statistical translation model |
US9009590B2 (en) * | 2001-07-31 | 2015-04-14 | Invention Machines Corporation | Semantic processor for recognition of cause-effect relations in natural language documents |
JP2003242176A (ja) * | 2001-12-13 | 2003-08-29 | Sony Corp | 情報処理装置および方法、記録媒体、並びにプログラム |
US6988063B2 (en) * | 2002-02-12 | 2006-01-17 | Sunflare Co., Ltd. | System and method for accurate grammar analysis using a part-of-speech tagged (POST) parser and learners' model |
US7620538B2 (en) | 2002-03-26 | 2009-11-17 | University Of Southern California | Constructing a translation lexicon from comparable, non-parallel corpora |
US20030191645A1 (en) * | 2002-04-05 | 2003-10-09 | Guojun Zhou | Statistical pronunciation model for text to speech |
AU2003280474A1 (en) * | 2002-06-28 | 2004-01-19 | Conceptual Speech, Llc | Multi-phoneme streamer and knowledge representation speech recognition system and method |
US7567902B2 (en) * | 2002-09-18 | 2009-07-28 | Nuance Communications, Inc. | Generating speech recognition grammars from a large corpus of data |
US20040167870A1 (en) * | 2002-12-06 | 2004-08-26 | Attensity Corporation | Systems and methods for providing a mixed data integration service |
US10733976B2 (en) * | 2003-03-01 | 2020-08-04 | Robert E. Coifman | Method and apparatus for improving the transcription accuracy of speech recognition software |
US7496498B2 (en) * | 2003-03-24 | 2009-02-24 | Microsoft Corporation | Front-end architecture for a multi-lingual text-to-speech system |
KR100481598B1 (ko) * | 2003-05-26 | 2005-04-08 | 한국전자통신연구원 | 복합 형태소 분석 장치 및 방법 |
US8548794B2 (en) | 2003-07-02 | 2013-10-01 | University Of Southern California | Statistical noun phrase translation |
US7711545B2 (en) * | 2003-07-02 | 2010-05-04 | Language Weaver, Inc. | Empirical methods for splitting compound words with application to machine translation |
US7475010B2 (en) * | 2003-09-03 | 2009-01-06 | Lingospot, Inc. | Adaptive and scalable method for resolving natural language ambiguities |
US7813916B2 (en) | 2003-11-18 | 2010-10-12 | University Of Utah | Acquisition and application of contextual role knowledge for coreference resolution |
US7983896B2 (en) | 2004-03-05 | 2011-07-19 | SDL Language Technology | In-context exact (ICE) matching |
US20100262621A1 (en) * | 2004-03-05 | 2010-10-14 | Russ Ross | In-context exact (ice) matching |
US7698125B2 (en) * | 2004-03-15 | 2010-04-13 | Language Weaver, Inc. | Training tree transducers for probabilistic operations |
US8296127B2 (en) * | 2004-03-23 | 2012-10-23 | University Of Southern California | Discovery of parallel text portions in comparable collections of corpora and training using comparable texts |
US8666725B2 (en) * | 2004-04-16 | 2014-03-04 | University Of Southern California | Selection and use of nonstatistical translation components in a statistical machine translation framework |
US7664748B2 (en) * | 2004-07-12 | 2010-02-16 | John Eric Harrity | Systems and methods for changing symbol sequences in documents |
GB2417103A (en) * | 2004-08-11 | 2006-02-15 | Sdl Plc | Natural language translation system |
US8600728B2 (en) | 2004-10-12 | 2013-12-03 | University Of Southern California | Training for a text-to-text application which uses string to tree conversion for training and decoding |
US20060122834A1 (en) * | 2004-12-03 | 2006-06-08 | Bennett Ian M | Emotion detection device & method for use in distributed systems |
US8676563B2 (en) | 2009-10-01 | 2014-03-18 | Language Weaver, Inc. | Providing human-generated and machine-generated trusted translations |
US8886517B2 (en) | 2005-06-17 | 2014-11-11 | Language Weaver, Inc. | Trust scoring for language translation systems |
US7974833B2 (en) | 2005-06-21 | 2011-07-05 | Language Weaver, Inc. | Weighted system of expressing language information using a compact notation |
US7389222B1 (en) | 2005-08-02 | 2008-06-17 | Language Weaver, Inc. | Task parallelization in a text-to-text system |
US7813918B2 (en) * | 2005-08-03 | 2010-10-12 | Language Weaver, Inc. | Identifying documents which form translated pairs, within a document collection |
JP2007058509A (ja) * | 2005-08-24 | 2007-03-08 | Toshiba Corp | 言語処理システム |
US8700404B1 (en) * | 2005-08-27 | 2014-04-15 | At&T Intellectual Property Ii, L.P. | System and method for using semantic and syntactic graphs for utterance classification |
US7624020B2 (en) * | 2005-09-09 | 2009-11-24 | Language Weaver, Inc. | Adapter for allowing both online and offline training of a text to text system |
US10319252B2 (en) | 2005-11-09 | 2019-06-11 | Sdl Inc. | Language capability assessment and training apparatus and techniques |
US20100280818A1 (en) * | 2006-03-03 | 2010-11-04 | Childers Stephen R | Key Talk |
US8943080B2 (en) | 2006-04-07 | 2015-01-27 | University Of Southern California | Systems and methods for identifying parallel documents and sentence fragments in multilingual document collections |
EP2024863B1 (fr) | 2006-05-07 | 2018-01-10 | Varcode Ltd. | Systeme et procede pour ameliorer la gestion de la qualite dans une chaine logistique de produits |
US7562811B2 (en) | 2007-01-18 | 2009-07-21 | Varcode Ltd. | System and method for improved quality management in a product logistic chain |
US8886518B1 (en) | 2006-08-07 | 2014-11-11 | Language Weaver, Inc. | System and method for capitalizing machine translated text |
US8521506B2 (en) * | 2006-09-21 | 2013-08-27 | Sdl Plc | Computer-implemented method, computer software and apparatus for use in a translation system |
US9645993B2 (en) | 2006-10-10 | 2017-05-09 | Abbyy Infopoisk Llc | Method and system for semantic searching |
US9984071B2 (en) | 2006-10-10 | 2018-05-29 | Abbyy Production Llc | Language ambiguity detection of text |
US9235573B2 (en) | 2006-10-10 | 2016-01-12 | Abbyy Infopoisk Llc | Universal difference measure |
US8548795B2 (en) * | 2006-10-10 | 2013-10-01 | Abbyy Software Ltd. | Method for translating documents from one language into another using a database of translations, a terminology dictionary, a translation dictionary, and a machine translation system |
US8214199B2 (en) * | 2006-10-10 | 2012-07-03 | Abbyy Software, Ltd. | Systems for translating sentences between languages using language-independent semantic structures and ratings of syntactic constructions |
US9047275B2 (en) | 2006-10-10 | 2015-06-02 | Abbyy Infopoisk Llc | Methods and systems for alignment of parallel text corpora |
US8145473B2 (en) | 2006-10-10 | 2012-03-27 | Abbyy Software Ltd. | Deep model statistics method for machine translation |
US20080086298A1 (en) * | 2006-10-10 | 2008-04-10 | Anisimovich Konstantin | Method and system for translating sentences between langauges |
US9633005B2 (en) | 2006-10-10 | 2017-04-25 | Abbyy Infopoisk Llc | Exhaustive automatic processing of textual information |
US8195447B2 (en) | 2006-10-10 | 2012-06-05 | Abbyy Software Ltd. | Translating sentences between languages using language-independent semantic structures and ratings of syntactic constructions |
US8433556B2 (en) | 2006-11-02 | 2013-04-30 | University Of Southern California | Semi-supervised training for statistical word alignment |
US9122674B1 (en) | 2006-12-15 | 2015-09-01 | Language Weaver, Inc. | Use of annotations in statistical machine translation |
CA2675216A1 (fr) * | 2007-01-10 | 2008-07-17 | Nick Koudas | Procede et systeme pour une decouverte d'informations et une analyse de texte |
US8468149B1 (en) | 2007-01-26 | 2013-06-18 | Language Weaver, Inc. | Multi-lingual online community |
US8615389B1 (en) | 2007-03-16 | 2013-12-24 | Language Weaver, Inc. | Generation and exploitation of an approximate language model |
US8959011B2 (en) | 2007-03-22 | 2015-02-17 | Abbyy Infopoisk Llc | Indicating and correcting errors in machine translation systems |
US8831928B2 (en) | 2007-04-04 | 2014-09-09 | Language Weaver, Inc. | Customizable machine translation service |
JP2010526386A (ja) | 2007-05-06 | 2010-07-29 | バーコード リミティド | バーコード標識を利用する品質管理のシステムと方法 |
KR100887726B1 (ko) * | 2007-05-28 | 2009-03-12 | 엔에이치엔(주) | 자동 띄어쓰기 방법 및 그 시스템 |
US8825466B1 (en) | 2007-06-08 | 2014-09-02 | Language Weaver, Inc. | Modification of annotated bilingual segment pairs in syntax-based machine translation |
US8812296B2 (en) | 2007-06-27 | 2014-08-19 | Abbyy Infopoisk Llc | Method and system for natural language dictionary generation |
CN101802812B (zh) * | 2007-08-01 | 2015-07-01 | 金格软件有限公司 | 使用互联网语料库的自动的上下文相关的语言校正和增强 |
US8595642B1 (en) | 2007-10-04 | 2013-11-26 | Great Northern Research, LLC | Multiple shell multi faceted graphical user interface |
WO2009063465A2 (fr) | 2007-11-14 | 2009-05-22 | Varcode Ltd. | Système et procédé de gestion de qualité utilisant des indicateurs de codes à barres |
US11704526B2 (en) | 2008-06-10 | 2023-07-18 | Varcode Ltd. | Barcoded indicators for quality management |
US9262409B2 (en) | 2008-08-06 | 2016-02-16 | Abbyy Infopoisk Llc | Translation of a selected text fragment of a screen |
US8190423B2 (en) * | 2008-09-05 | 2012-05-29 | Trigent Software Ltd. | Word sense disambiguation using emergent categories |
US9262403B2 (en) * | 2009-03-02 | 2016-02-16 | Sdl Plc | Dynamic generation of auto-suggest dictionary for natural language translation |
GB2468278A (en) * | 2009-03-02 | 2010-09-08 | Sdl Plc | Computer assisted natural language translation outputs selectable target text associated in bilingual corpus with input target text from partial translation |
US8666730B2 (en) * | 2009-03-13 | 2014-03-04 | Invention Machine Corporation | Question-answering system and method based on semantic labeling of text documents and user questions |
US8990064B2 (en) | 2009-07-28 | 2015-03-24 | Language Weaver, Inc. | Translating documents based on content |
CA2774278C (fr) * | 2009-09-25 | 2018-10-30 | Shady Shehata | Procedes et systemes permettant d'extraire des phrases cles a partir d'un texte naturel en vue d'une indexation par un moteur de recherche |
US8380486B2 (en) | 2009-10-01 | 2013-02-19 | Language Weaver, Inc. | Providing machine-generated translations and corresponding trust levels |
US20110161073A1 (en) * | 2009-12-29 | 2011-06-30 | Dynavox Systems, Llc | System and method of disambiguating and selecting dictionary definitions for one or more target words |
US20110161067A1 (en) * | 2009-12-29 | 2011-06-30 | Dynavox Systems, Llc | System and method of using pos tagging for symbol assignment |
CA2787390A1 (fr) | 2010-02-01 | 2011-08-04 | Ginger Software, Inc. | Correction linguistique automatique sensible au contexte utilisant un corpus internet en particulier pour des dispositifs a petit clavier |
US10417646B2 (en) | 2010-03-09 | 2019-09-17 | Sdl Inc. | Predicting the cost associated with translating textual content |
US8788260B2 (en) * | 2010-05-11 | 2014-07-22 | Microsoft Corporation | Generating snippets based on content features |
US9128929B2 (en) | 2011-01-14 | 2015-09-08 | Sdl Language Technologies | Systems and methods for automatically estimating a translation time including preparation time in addition to the translation itself |
US11003838B2 (en) | 2011-04-18 | 2021-05-11 | Sdl Inc. | Systems and methods for monitoring post translation editing |
US8694303B2 (en) | 2011-06-15 | 2014-04-08 | Language Weaver, Inc. | Systems and methods for tuning parameters in statistical machine translation |
US8620837B2 (en) | 2011-07-11 | 2013-12-31 | Accenture Global Services Limited | Determination of a basis for a new domain model based on a plurality of learned models |
EP2546760A1 (fr) | 2011-07-11 | 2013-01-16 | Accenture Global Services Limited | Fourniture d'entrées utilisateurs dans des systèmes pour découvrir conjointement des sujets et des sentiments |
US8676730B2 (en) * | 2011-07-11 | 2014-03-18 | Accenture Global Services Limited | Sentiment classifiers based on feature extraction |
US8886515B2 (en) | 2011-10-19 | 2014-11-11 | Language Weaver, Inc. | Systems and methods for enhancing machine translation post edit review processes |
US8942973B2 (en) | 2012-03-09 | 2015-01-27 | Language Weaver, Inc. | Content page URL translation |
US8971630B2 (en) | 2012-04-27 | 2015-03-03 | Abbyy Development Llc | Fast CJK character recognition |
US8989485B2 (en) | 2012-04-27 | 2015-03-24 | Abbyy Development Llc | Detecting a junction in a text line of CJK characters |
US10261994B2 (en) | 2012-05-25 | 2019-04-16 | Sdl Inc. | Method and system for automatic management of reputation of translators |
US8807422B2 (en) | 2012-10-22 | 2014-08-19 | Varcode Ltd. | Tamper-proof quality management barcode indicators |
US9152623B2 (en) | 2012-11-02 | 2015-10-06 | Fido Labs, Inc. | Natural language processing system and method |
US9152622B2 (en) | 2012-11-26 | 2015-10-06 | Language Weaver, Inc. | Personalized machine translation via online adaptation |
US9811517B2 (en) | 2013-01-29 | 2017-11-07 | Tencent Technology (Shenzhen) Company Limited | Method and system of adding punctuation and establishing language model using a punctuation weighting applied to chinese speech recognized text |
CN103971684B (zh) * | 2013-01-29 | 2015-12-09 | 腾讯科技(深圳)有限公司 | 一种添加标点的方法、系统及其语言模型建立方法、装置 |
CN104143331B (zh) | 2013-05-24 | 2015-12-09 | 腾讯科技(深圳)有限公司 | 一种添加标点的方法和系统 |
US9311299B1 (en) * | 2013-07-31 | 2016-04-12 | Google Inc. | Weakly supervised part-of-speech tagging with coupled token and type constraints |
US9213694B2 (en) | 2013-10-10 | 2015-12-15 | Language Weaver, Inc. | Efficient online domain adaptation |
RU2592395C2 (ru) | 2013-12-19 | 2016-07-20 | Общество с ограниченной ответственностью "Аби ИнфоПоиск" | Разрешение семантической неоднозначности при помощи статистического анализа |
RU2586577C2 (ru) | 2014-01-15 | 2016-06-10 | Общество с ограниченной ответственностью "Аби ИнфоПоиск" | Фильтрация дуг в синтаксическом графе |
RU2596600C2 (ru) | 2014-09-02 | 2016-09-10 | Общество с ограниченной ответственностью "Аби Девелопмент" | Способы и системы обработки изображений математических выражений |
US9626358B2 (en) | 2014-11-26 | 2017-04-18 | Abbyy Infopoisk Llc | Creating ontologies by analyzing natural language texts |
US10157168B2 (en) * | 2015-03-10 | 2018-12-18 | Asymmetrica Labs Inc. | Systems and methods for asymmetrical formatting of word spaces according to the uncertainty between words |
US9703394B2 (en) * | 2015-03-24 | 2017-07-11 | Google Inc. | Unlearning techniques for adaptive language models in text entry |
WO2016185474A1 (fr) | 2015-05-18 | 2016-11-24 | Varcode Ltd. | Marquage à l'encre thermochromique pour des étiquettes de qualité activables |
WO2017006326A1 (fr) | 2015-07-07 | 2017-01-12 | Varcode Ltd. | Indicateur de qualité électronique |
US10635863B2 (en) | 2017-10-30 | 2020-04-28 | Sdl Inc. | Fragment recall and adaptive automated translation |
US10817676B2 (en) | 2017-12-27 | 2020-10-27 | Sdl Inc. | Intelligent routing services and systems |
US10956670B2 (en) | 2018-03-03 | 2021-03-23 | Samurai Labs Sp. Z O.O. | System and method for detecting undesirable and potentially harmful online behavior |
US10599767B1 (en) * | 2018-05-31 | 2020-03-24 | The Ultimate Software Group, Inc. | System for providing intelligent part of speech processing of complex natural language |
US11256867B2 (en) | 2018-10-09 | 2022-02-22 | Sdl Inc. | Systems and methods of machine learning for digital assets and message creation |
RU2721190C1 (ru) | 2018-12-25 | 2020-05-18 | Общество с ограниченной ответственностью "Аби Продакшн" | Обучение нейронных сетей с использованием функций потерь, отражающих зависимости между соседними токенами |
CN111353295A (zh) * | 2020-02-27 | 2020-06-30 | 广东博智林机器人有限公司 | 序列标注方法、装置、存储介质及计算机设备 |
US11594213B2 (en) * | 2020-03-03 | 2023-02-28 | Rovi Guides, Inc. | Systems and methods for interpreting natural language search queries |
Family Cites Families (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US3704345A (en) * | 1971-03-19 | 1972-11-28 | Bell Telephone Labor Inc | Conversion of printed text into synthetic speech |
JPS58175074A (ja) * | 1982-04-07 | 1983-10-14 | Toshiba Corp | 構文分析方式 |
US4674065A (en) * | 1982-04-30 | 1987-06-16 | International Business Machines Corporation | System for detecting and correcting contextual errors in a text processing system |
US4456973A (en) * | 1982-04-30 | 1984-06-26 | International Business Machines Corporation | Automatic text grade level analyzer for a text processing system |
US4688195A (en) * | 1983-01-28 | 1987-08-18 | Texas Instruments Incorporated | Natural-language interface generating system |
US4580218A (en) * | 1983-09-08 | 1986-04-01 | At&T Bell Laboratories | Indexing subject-locating method |
US4692941A (en) * | 1984-04-10 | 1987-09-08 | First Byte | Real-time text-to-speech conversion system |
JPS6140672A (ja) * | 1984-07-31 | 1986-02-26 | Hitachi Ltd | 多品詞解消処理方式 |
-
1988
- 1988-02-05 US US07/152,740 patent/US5146405A/en not_active Expired - Lifetime
-
1989
- 1989-01-27 EP EP89300790A patent/EP0327266B1/fr not_active Expired - Lifetime
- 1989-01-27 ES ES89300790T patent/ES2076952T3/es not_active Expired - Lifetime
- 1989-01-27 DE DE68923981T patent/DE68923981T2/de not_active Expired - Fee Related
- 1989-02-01 AU AU28990/89A patent/AU617749B2/en not_active Ceased
- 1989-02-03 CA CA000590100A patent/CA1301345C/fr not_active Expired - Fee Related
- 1989-02-04 KR KR1019890001364A patent/KR970006402B1/ko not_active IP Right Cessation
- 1989-02-04 JP JP1024794A patent/JPH0769910B2/ja not_active Expired - Fee Related
-
1990
- 1990-01-16 IN IN46MA1990 patent/IN175380B/en unknown
Non-Patent Citations (5)
Title |
---|
EUROPEAN CONFERENCE ON SPEECH TECHNOLOGY vol. 1, September 1987, EDINGBURG,GB pages 389 - 392; & E. VIVALDA: 'Contextual syntactic analysis for text-to-speech conversion' * |
ICAME NEWS vol. 7, 1983, LANCASTER,GB pages 13 - 33; G. LEECH ET. AL.: 'The automatic tagging of the LOB corpus' * |
ICASSP 85 PROCEEDINGS vol. 4, March 1985, FLORIDA,US pages 1577 - 1580; & B. MERIALDO: 'Probabilistic grammar for phonetic to french transcription' * |
ICASSP 85 PROCEEDINGS vol. 4, March, FLORIDA, US pages 1577-1580; A.M. DEROUAULT & BMERIALDO: 'Probabilistic grammar for phonetic to french transcription' * |
PROCEEDINGS OF THE SPRING JOINT COMPUTER CONFERENCE, ATLANTIC CITY, N.J., US 30 April 1968, WASHINGTON,US pages 339 - 344; J. ALLAN: 'Machine-to-man communication by speech Part II: Synthesis of prosodic features of speech by rule' * |
Cited By (29)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP0465058A3 (en) * | 1990-06-28 | 1995-03-22 | American Telephone & Telegraph | Written language parser system |
EP0465058A2 (fr) * | 1990-06-28 | 1992-01-08 | AT&T Corp. | Système d'analyse d'un langage écrit |
EP0513918A1 (fr) * | 1991-05-16 | 1992-11-19 | Océ-Nederland B.V. | Méthode de correction d'erreurs dans une phrase en langage naturel |
US5768603A (en) * | 1991-07-25 | 1998-06-16 | International Business Machines Corporation | Method and system for natural language translation |
EP0525470A3 (fr) * | 1991-07-25 | 1994-04-13 | Ibm | |
US5477451A (en) * | 1991-07-25 | 1995-12-19 | International Business Machines Corp. | Method and system for natural language translation |
US5805832A (en) * | 1991-07-25 | 1998-09-08 | International Business Machines Corporation | System for parametric text to text language translation |
EP0525470A2 (fr) * | 1991-07-25 | 1993-02-03 | International Business Machines Corporation | Méthode et système de traduction en langage naturel |
WO1996042079A1 (fr) * | 1995-06-13 | 1996-12-27 | British Telecommunications Public Limited Company | Synthese de la parole |
US6330538B1 (en) | 1995-06-13 | 2001-12-11 | British Telecommunications Public Limited Company | Phonetic unit duration adjustment for text-to-speech system |
WO1997004405A1 (fr) * | 1995-07-19 | 1997-02-06 | Inso Corporation | Procede et appareil de recherche et extraction automatiques |
US5680628A (en) * | 1995-07-19 | 1997-10-21 | Inso Corporation | Method and apparatus for automated search and retrieval process |
US5794177A (en) * | 1995-07-19 | 1998-08-11 | Inso Corporation | Method and apparatus for morphological analysis and generation of natural language text |
US5890103A (en) * | 1995-07-19 | 1999-03-30 | Lernout & Hauspie Speech Products N.V. | Method and apparatus for improved tokenization of natural language text |
EP0953192B1 (fr) * | 1996-06-28 | 2003-10-29 | Microsoft Corporation | Analyseur syntaxique de langage naturel avec probabilites de nature grammaticale fondees sur dictionnaire |
BE1011964A3 (fr) * | 1997-11-07 | 2000-03-07 | Motorola Inc | Methode, dispositif et systeme pour la desambiguisation des parties du discours. |
EP0952533A2 (fr) * | 1998-03-23 | 1999-10-27 | Xerox Corporation | Synthèse de textes en utilisant des parties de parole |
EP0952533A3 (fr) * | 1998-03-23 | 2005-08-03 | Xerox Corporation | Synthèse de textes en utilisant des parties de parole |
WO2001018788A3 (fr) * | 1999-09-03 | 2001-09-07 | Siemens Ag | Procede de determination de fins de phrase dans le traitement vocal automatique |
WO2001018788A2 (fr) * | 1999-09-03 | 2001-03-15 | Siemens Aktiengesellschaft | Procede de determination de fins de phrase dans le traitement vocal automatique |
US8751235B2 (en) | 2005-07-12 | 2014-06-10 | Nuance Communications, Inc. | Annotating phonemes and accents for text-to-speech system |
WO2007006769A1 (fr) * | 2005-07-12 | 2007-01-18 | International Business Machines Corporation | Systeme, programme, et procede de controle pour synthese vocale |
US9263059B2 (en) | 2012-09-28 | 2016-02-16 | International Business Machines Corporation | Deep tagging background noises |
US9472209B2 (en) | 2012-09-28 | 2016-10-18 | International Business Machines Corporation | Deep tagging background noises |
US9972340B2 (en) | 2012-09-28 | 2018-05-15 | International Business Machines Corporation | Deep tagging background noises |
DE202013104836U1 (de) | 2013-10-29 | 2014-01-30 | Foseco International Limited | Speiseraufbau |
US10515138B2 (en) | 2014-04-25 | 2019-12-24 | Mayo Foundation For Medical Education And Research | Enhancing reading accuracy, efficiency and retention |
US11093688B2 (en) | 2014-04-25 | 2021-08-17 | Mayo Foundation For Medical Education And Research | Enhancing reading accuracy, efficiency and retention |
US11531804B2 (en) | 2014-04-25 | 2022-12-20 | Mayo Foundation For Medical Education And Research | Enhancing reading accuracy, efficiency and retention |
Also Published As
Publication number | Publication date |
---|---|
EP0327266B1 (fr) | 1995-08-30 |
KR970006402B1 (ko) | 1997-04-28 |
KR890013549A (ko) | 1989-09-23 |
ES2076952T3 (es) | 1995-11-16 |
US5146405A (en) | 1992-09-08 |
AU617749B2 (en) | 1991-12-05 |
DE68923981T2 (de) | 1996-05-15 |
IN175380B (fr) | 1995-06-10 |
JPH01224796A (ja) | 1989-09-07 |
DE68923981D1 (de) | 1995-10-05 |
EP0327266A3 (fr) | 1992-01-02 |
JPH0769910B2 (ja) | 1995-07-31 |
CA1301345C (fr) | 1992-05-19 |
AU2899089A (en) | 1989-08-10 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
EP0327266B1 (fr) | Méthode pour la détermination des élements de langage et utilisation | |
Duyck et al. | WordGen: A tool for word selection and nonword generation in Dutch, English, German, and French | |
US6424983B1 (en) | Spelling and grammar checking system | |
Oostdijk | Corpus linguistics and the automatic analysis of English | |
US4868750A (en) | Collocational grammar system | |
US5510981A (en) | Language translation apparatus and method using context-based translation models | |
Monaghan et al. | Words in puddles of sound: Modelling psycholinguistic effects in speech segmentation | |
US4864501A (en) | Word annotation system | |
Wan et al. | Speech errors and the representation of tone in Mandarin Chinese | |
US20100332217A1 (en) | Method for text improvement via linguistic abstractions | |
EP0971294A2 (fr) | Procédé et appareil de recherche et extraction automatiques | |
JPH05189481A (ja) | 翻訳用コンピュータ操作方法、字句モデル生成方法、モデル生成方法、翻訳用コンピュータシステム、字句モデル生成コンピュータシステム及びモデル生成コンピュータシステム | |
WO1997004405A9 (fr) | Procede et appareil de recherche et extraction automatiques | |
WO2012039686A1 (fr) | Procédés et systèmes pour une correction automatisée de texte | |
JP3765799B2 (ja) | 自然言語処理装置、自然言語処理方法及び自然言語処理プログラム | |
Zupan et al. | How to tag non-standard language: Normalisation versus domain adaptation for slovene historical and user-generated texts | |
Besdouri et al. | Improvement of the cota-orthography system through language modeling | |
Navas et al. | Assigning phrase breaks using CARTs for Basque TTS | |
KR20040018008A (ko) | 품사 태깅 장치 및 태깅 방법 | |
Minghu et al. | Segmentation of Mandarin Braille word and Braille translation based on multi-knowledge | |
Sanders | Using probabilistic methods to predict phrase boundaries for a text-to-speech system | |
Hasegawa-Johnson et al. | Arabic speech and language technology | |
Bayer et al. | Theoretical and computational linguistics: toward a mutual understanding | |
Sečujski et al. | A software tool for semi-automatic part-of-speech tagging and sentence accentuation in Serbian language | |
Mengliev et al. | Building a comprehensive Uzbek lexicon: bridging dialects for text standardization |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PUAI | Public reference made under article 153(3) epc to a published international application that has entered the european phase |
Free format text: ORIGINAL CODE: 0009012 |
|
AK | Designated contracting states |
Kind code of ref document: A2 Designated state(s): DE ES FR GB IT NL SE |
|
PUAL | Search report despatched |
Free format text: ORIGINAL CODE: 0009013 |
|
AK | Designated contracting states |
Kind code of ref document: A3 Designated state(s): DE ES FR GB IT NL SE |
|
17P | Request for examination filed |
Effective date: 19920624 |
|
17Q | First examination report despatched |
Effective date: 19940311 |
|
RAP3 | Party data changed (applicant data changed or rights of an application transferred) |
Owner name: AT&T CORP. |
|
GRAA | (expected) grant |
Free format text: ORIGINAL CODE: 0009210 |
|
AK | Designated contracting states |
Kind code of ref document: B1 Designated state(s): DE ES FR GB IT NL SE |
|
ITF | It: translation for a ep patent filed | ||
ET | Fr: translation filed | ||
REF | Corresponds to: |
Ref document number: 68923981 Country of ref document: DE Date of ref document: 19951005 |
|
REG | Reference to a national code |
Ref country code: ES Ref legal event code: FG2A Ref document number: 2076952 Country of ref document: ES Kind code of ref document: T3 |
|
PLBE | No opposition filed within time limit |
Free format text: ORIGINAL CODE: 0009261 |
|
STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: NO OPPOSITION FILED WITHIN TIME LIMIT |
|
26N | No opposition filed | ||
PGFP | Annual fee paid to national office [announced via postgrant information from national office to epo] |
Ref country code: FR Payment date: 20011221 Year of fee payment: 14 |
|
PGFP | Annual fee paid to national office [announced via postgrant information from national office to epo] |
Ref country code: SE Payment date: 20011227 Year of fee payment: 14 |
|
REG | Reference to a national code |
Ref country code: GB Ref legal event code: IF02 |
|
PGFP | Annual fee paid to national office [announced via postgrant information from national office to epo] |
Ref country code: NL Payment date: 20020107 Year of fee payment: 14 Ref country code: GB Payment date: 20020107 Year of fee payment: 14 |
|
PGFP | Annual fee paid to national office [announced via postgrant information from national office to epo] |
Ref country code: ES Payment date: 20020121 Year of fee payment: 14 |
|
PGFP | Annual fee paid to national office [announced via postgrant information from national office to epo] |
Ref country code: DE Payment date: 20020328 Year of fee payment: 14 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: GB Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20030127 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: SE Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20030128 Ref country code: ES Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20030128 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: NL Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20030801 Ref country code: DE Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20030801 |
|
EUG | Se: european patent has lapsed | ||
GBPC | Gb: european patent ceased through non-payment of renewal fee | ||
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: FR Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20030930 |
|
NLV4 | Nl: lapsed or anulled due to non-payment of the annual fee |
Effective date: 20030801 |
|
REG | Reference to a national code |
Ref country code: FR Ref legal event code: ST |
|
REG | Reference to a national code |
Ref country code: ES Ref legal event code: FD2A Effective date: 20030128 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: IT Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES;WARNING: LAPSES OF ITALIAN PATENTS WITH EFFECTIVE DATE BEFORE 2007 MAY HAVE OCCURRED AT ANY TIME BEFORE 2007. THE CORRECT EFFECTIVE DATE MAY BE DIFFERENT FROM THE ONE RECORDED. Effective date: 20050127 |