BR9914102A - Extração de frase independente da linguagem - Google Patents
Extração de frase independente da linguagemInfo
- Publication number
- BR9914102A BR9914102A BR9914102-7A BR9914102A BR9914102A BR 9914102 A BR9914102 A BR 9914102A BR 9914102 A BR9914102 A BR 9914102A BR 9914102 A BR9914102 A BR 9914102A
- Authority
- BR
- Brazil
- Prior art keywords
- sequence
- words
- count
- word
- language
- Prior art date
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/31—Indexing; Data structures therefor; Storage structures
- G06F16/313—Selection or weighting of terms for indexing
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Software Systems (AREA)
- Data Mining & Analysis (AREA)
- Databases & Information Systems (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Machine Translation (AREA)
Abstract
Patente de Invenção: <B>"EXTRAçãO DE FRASE INDEPENDENTE DA LINGUAGEM"<D>. Método de extração de frases significativas de um ou mais documentos armazenados em um meio legível por computador. Uma seq³ência de palavras é lida a partir de um ou mais documentos (110) e uma contagem é determinada para cada palavra na seq³ência com base no comprimento da palavras (120). A cotangem para cada palavra na seq³ência é comparada a uma contagem limite (130). A seq³ência de palavras será indicada como sendo uma frase significativa, se o número de palavras nas seq³ência que têm a contagem maior que a contagem limite for igual ou exceder um número predeterminado (140).
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US09/165,675 US6104990A (en) | 1998-09-28 | 1998-09-28 | Language independent phrase extraction |
PCT/US1999/022629 WO2000019334A1 (en) | 1998-09-28 | 1999-09-28 | Language independent phrase extraction |
Publications (1)
Publication Number | Publication Date |
---|---|
BR9914102A true BR9914102A (pt) | 2001-07-31 |
Family
ID=22599957
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
BR9914102-7A BR9914102A (pt) | 1998-09-28 | 1999-09-28 | Extração de frase independente da linguagem |
Country Status (8)
Country | Link |
---|---|
US (1) | US6104990A (pt) |
EP (1) | EP1125220A1 (pt) |
AU (1) | AU6276199A (pt) |
BR (1) | BR9914102A (pt) |
CA (1) | CA2345428A1 (pt) |
IL (1) | IL142280A0 (pt) |
MX (1) | MXPA01003202A (pt) |
WO (1) | WO2000019334A1 (pt) |
Families Citing this family (35)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6616703B1 (en) * | 1996-10-16 | 2003-09-09 | Sharp Kabushiki Kaisha | Character input apparatus with character string extraction portion, and corresponding storage medium |
US6760746B1 (en) | 1999-09-01 | 2004-07-06 | Eric Schneider | Method, product, and apparatus for processing a data request |
US7447626B2 (en) * | 1998-09-28 | 2008-11-04 | Udico Holdings | Method and apparatus for generating a language independent document abstract |
US6789230B2 (en) * | 1998-10-09 | 2004-09-07 | Microsoft Corporation | Creating a summary having sentences with the highest weight, and lowest length |
US7188138B1 (en) | 1999-03-22 | 2007-03-06 | Eric Schneider | Method, product, and apparatus for resource identifier registration and aftermarket services |
US9141717B2 (en) | 1999-03-22 | 2015-09-22 | Esdr Network Solutions Llc | Methods, systems, products, and devices for processing DNS friendly identifiers |
US6338082B1 (en) | 1999-03-22 | 2002-01-08 | Eric Schneider | Method, product, and apparatus for requesting a network resource |
US8037168B2 (en) | 1999-07-15 | 2011-10-11 | Esdr Network Solutions Llc | Method, product, and apparatus for enhancing resolution services, registration services, and search services |
USRE43690E1 (en) | 1999-03-22 | 2012-09-25 | Esdr Network Solutions Llc | Search engine request method, product, and apparatus |
USRE44207E1 (en) | 1999-09-01 | 2013-05-07 | Esdr Network Solutions Llc | Network resource access method, product, and apparatus |
US20050235031A1 (en) * | 1999-09-10 | 2005-10-20 | Eric Schneider | Hyperlink generation and enhanced spell check method, product, apparatus, and user interface system |
US6845369B1 (en) * | 2000-01-14 | 2005-01-18 | Relevant Software Inc. | System, apparatus and method for using and managing digital information |
GB0004578D0 (en) * | 2000-02-25 | 2000-04-19 | Xrefer Com Limited | Automated data cross-referencing method |
US7571234B2 (en) * | 2000-06-08 | 2009-08-04 | Aol Llc | Authentication of electronic data |
US6978419B1 (en) * | 2000-11-15 | 2005-12-20 | Justsystem Corporation | Method and apparatus for efficient identification of duplicate and near-duplicate documents and text spans using high-discriminability text fragments |
US6721728B2 (en) * | 2001-03-02 | 2004-04-13 | The United States Of America As Represented By The Administrator Of The National Aeronautics And Space Administration | System, method and apparatus for discovering phrases in a database |
US7325249B2 (en) * | 2001-04-30 | 2008-01-29 | Aol Llc | Identifying unwanted electronic messages |
US7565402B2 (en) * | 2002-01-05 | 2009-07-21 | Eric Schneider | Sitemap access method, product, and apparatus |
US7315810B2 (en) * | 2002-01-07 | 2008-01-01 | Microsoft Corporation | Named entity (NE) interface for multiple client application programs |
US7650562B2 (en) * | 2002-02-21 | 2010-01-19 | Xerox Corporation | Methods and systems for incrementally changing text representation |
US7131117B2 (en) * | 2002-09-04 | 2006-10-31 | Sbc Properties, L.P. | Method and system for automating the analysis of word frequencies |
US7590695B2 (en) | 2003-05-09 | 2009-09-15 | Aol Llc | Managing electronic messages |
US7739602B2 (en) | 2003-06-24 | 2010-06-15 | Aol Inc. | System and method for community centric resource sharing based on a publishing subscription model |
US7519559B1 (en) | 2003-10-30 | 2009-04-14 | Aol Llc | Messaging stamp authority |
US20050114131A1 (en) * | 2003-11-24 | 2005-05-26 | Kirill Stoimenov | Apparatus and method for voice-tagging lexicon |
US20060025091A1 (en) * | 2004-08-02 | 2006-02-02 | Matsushita Electric Industrial Co., Ltd | Method for creating and using phrase history for accelerating instant messaging input on mobile devices |
US20070055514A1 (en) * | 2005-09-08 | 2007-03-08 | Beattie Valerie L | Intelligent tutoring feedback |
US7895205B2 (en) * | 2008-03-04 | 2011-02-22 | Microsoft Corporation | Using core words to extract key phrases from documents |
US8145482B2 (en) * | 2008-05-25 | 2012-03-27 | Ezra Daya | Enhancing analysis of test key phrases from acoustic sources with key phrase training models |
EP2488963A1 (en) * | 2009-10-15 | 2012-08-22 | Rogers Communications Inc. | System and method for phrase identification |
US8880989B2 (en) | 2012-01-30 | 2014-11-04 | Microsoft Corporation | Educating users and enforcing data dissemination policies |
US9087039B2 (en) | 2012-02-07 | 2015-07-21 | Microsoft Technology Licensing, Llc | Language independent probabilistic content matching |
US20140350941A1 (en) * | 2013-05-21 | 2014-11-27 | Microsoft Corporation | Method For Finding Elements In A Webpage Suitable For Use In A Voice User Interface (Disambiguation) |
CN104615654B (zh) * | 2014-12-30 | 2017-09-22 | 中国联合网络通信有限公司广东省分公司 | 一种文本摘要获取方法及装置 |
US11210287B2 (en) * | 2020-01-30 | 2021-12-28 | Walmart Apollo, Llc | Systems and methods for a title quality scoring framework |
Family Cites Families (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4965763A (en) * | 1987-03-03 | 1990-10-23 | International Business Machines Corporation | Computer method for automatic extraction of commonly specified information from business correspondence |
JP2783558B2 (ja) * | 1988-09-30 | 1998-08-06 | 株式会社東芝 | 要約生成方法および要約生成装置 |
JPH0418673A (ja) * | 1990-05-11 | 1992-01-22 | Hitachi Ltd | テキスト情報抽出方法および装置 |
US5182708A (en) * | 1990-12-11 | 1993-01-26 | Ricoh Corporation | Method and apparatus for classifying text |
US5384703A (en) * | 1993-07-02 | 1995-01-24 | Xerox Corporation | Method and apparatus for summarizing documents according to theme |
US5689716A (en) * | 1995-04-14 | 1997-11-18 | Xerox Corporation | Automatic method of generating thematic summaries |
US5924108A (en) * | 1996-03-29 | 1999-07-13 | Microsoft Corporation | Document summarizer for word processors |
US5960383A (en) * | 1997-02-25 | 1999-09-28 | Digital Equipment Corporation | Extraction of key sections from texts using automatic indexing techniques |
-
1998
- 1998-09-28 US US09/165,675 patent/US6104990A/en not_active Expired - Lifetime
-
1999
- 1999-09-28 EP EP99950011A patent/EP1125220A1/en not_active Withdrawn
- 1999-09-28 IL IL14228099A patent/IL142280A0/xx unknown
- 1999-09-28 BR BR9914102-7A patent/BR9914102A/pt not_active IP Right Cessation
- 1999-09-28 AU AU62761/99A patent/AU6276199A/en not_active Abandoned
- 1999-09-28 CA CA002345428A patent/CA2345428A1/en not_active Abandoned
- 1999-09-28 MX MXPA01003202A patent/MXPA01003202A/es unknown
- 1999-09-28 WO PCT/US1999/022629 patent/WO2000019334A1/en not_active Application Discontinuation
Also Published As
Publication number | Publication date |
---|---|
US6104990A (en) | 2000-08-15 |
MXPA01003202A (es) | 2003-07-14 |
WO2000019334A1 (en) | 2000-04-06 |
AU6276199A (en) | 2000-04-17 |
IL142280A0 (en) | 2002-03-10 |
EP1125220A1 (en) | 2001-08-22 |
CA2345428A1 (en) | 2000-04-06 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
BR9914102A (pt) | Extração de frase independente da linguagem | |
Pyles et al. | The Origins and Development of the English | |
BR9905978A (pt) | Identificação automática de linguagem que usa tanto a informação de n-grama como a de palavra | |
Attia | An ambiguity-controlled morphological analyzer for modern standard Arabic modeling finite state networks | |
Haiman | From V/2 to subject clitics: Evidence from Northern Italian | |
Cartoni et al. | How comparable are parallel corpora? Measuring the distribution of general vocabulary and connectives | |
Taljard et al. | A comparison of approaches to word class tagging: Disjunctively vs. conjunctively written Bantu languages | |
Diab et al. | Automatic processing of modern standard Arabic text | |
Kaalep | An Estonian morphological analyser and the impact of a corpus on its development | |
Harris | Revisiting anaphoric islands | |
Asgari et al. | Linguistic resources and topic models for the analysis of persian poems | |
Jenkins et al. | Conservative stemming for search and indexing | |
Wrihatni et al. | Low Malay Language as A Stimulant for Bahasa Indonesia Development | |
Megerdoomian et al. | Processing Persian text: Tokenization in the Shiraz project | |
Iranpour Mobarakeh et al. | Verb detection in persian corpus | |
Naess | Spatial deixis in Pileni | |
Lamarche | Gender agreement and suppletion in French | |
Liang et al. | THE UNIFORM INFORMATION DENSITY HYPOTHESIS: THE CASE OF SUBJET-DOUBLING IN FRENCH | |
Montenegro et al. | Automated question generator for Tagalog informational texts using case markers | |
Gorbachov | Grammatical Transformations in Ukrainian-English Translation of Official Texts | |
Harmon | Proto-Manobo pronouns and case marking particles | |
Giurgea | On the Evolution of articles into agreement markers in Romanian and Albanian | |
DuBois | Incipient semanticization of possessive ablaut in Mayan | |
Janda | A Stranger in the Lexicon: The Aspectual Status of Russian смочь ‘be able, manage (to)’ | |
Handayani | RELIGIOUS SONGS TRANSLATION IN POST-PANDEMIC ERA: COMPARING THE TRANSLATION METHODS DONE BY HUMAN AND MACHINE |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
B08F | Application dismissed because of non-payment of annual fees [chapter 8.6 patent gazette] |
Free format text: REFERENTE A 4A, 5A, 6A, 7A E 8A ANUIDADES. |
|
B08K | Patent lapsed as no evidence of payment of the annual fee has been furnished to inpi [chapter 8.11 patent gazette] |
Free format text: REFERENTE AO DESPACHO 8.6 PUBLICADO NA RPI 1911 DE 21/08/2007. |