BR9914102A - Language independent phrase extraction - Google Patents

Language independent phrase extraction

Info

Publication number
BR9914102A
Authority
BR
Brazil
Prior art keywords
sequence
words
count
word
language
Prior art date
Application number
BR9914102-7A
Other languages
English (en)
Inventor
Garnet R Chaney
Robert F Richardson
Original Assignee
Prompt Software Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Prompt Software Inc
Publication of BR9914102A

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30 Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/31 Indexing; Data structures therefor; Storage structures
    • G06F16/313 Selection or weighting of terms for indexing

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Software Systems (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Machine Translation (AREA)

Abstract

Patent of Invention: "LANGUAGE INDEPENDENT PHRASE EXTRACTION". A method of extracting significant phrases from one or more documents stored on a computer-readable medium. A sequence of words is read from the one or more documents (110), and a count is determined for each word in the sequence based on the length of the word (120). The count for each word in the sequence is compared to a threshold count (130). The sequence of words is indicated as a significant phrase if the number of words in the sequence whose count exceeds the threshold count equals or exceeds a predetermined number (140).
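The four steps of the abstract (110 to 140) can be illustrated with a minimal Python sketch. This is not the patented implementation: the abstract says only that the count is "based on" word length, so taking the count to be the character length itself is an assumption, as are the whitespace tokenization, the sliding window, and the hypothetical names `word_count`, `is_significant`, `extract_phrases`, `threshold`, `min_long_words`, and `window`.

```python
from typing import List


def word_count(word: str) -> int:
    # Step 120 (assumption): the count is simply the word's character length.
    return len(word)


def is_significant(words: List[str], threshold: int = 5, min_long_words: int = 2) -> bool:
    # Step 130: compare each word's count to the threshold count.
    long_words = sum(1 for w in words if word_count(w) > threshold)
    # Step 140: the sequence qualifies if enough words exceed the threshold.
    return long_words >= min_long_words


def extract_phrases(text: str, window: int = 2, threshold: int = 5,
                    min_long_words: int = 2) -> List[str]:
    # Step 110 (assumption): read word sequences as fixed-size sliding
    # windows over whitespace-separated tokens.
    words = text.split()
    phrases = []
    for i in range(len(words) - window + 1):
        seq = words[i:i + window]
        if is_significant(seq, threshold, min_long_words):
            phrases.append(" ".join(seq))
    return phrases


if __name__ == "__main__":
    sample = "the quick language independent phrase extraction works"
    print(extract_phrases(sample))
```

Because the test relies only on word length and not on a dictionary or stop-word list, the same function can in principle be applied to text in any language that separates words with whitespace, which appears to be the sense in which the method is language independent.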
BR9914102-7A 1998-09-28 1999-09-28 Language independent phrase extraction BR9914102A (pt)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US09/165,675 US6104990A (en) 1998-09-28 1998-09-28 Language independent phrase extraction
PCT/US1999/022629 WO2000019334A1 (en) 1998-09-28 1999-09-28 Language independent phrase extraction

Publications (1)

Publication Number Publication Date
BR9914102A (pt) 2001-07-31

Family

ID=22599957

Family Applications (1)

Application Number Title Priority Date Filing Date
BR9914102-7A BR9914102A (pt) 1998-09-28 1999-09-28 Language independent phrase extraction

Country Status (8)

Country Link
US (1) US6104990A (pt)
EP (1) EP1125220A1 (pt)
AU (1) AU6276199A (pt)
BR (1) BR9914102A (pt)
CA (1) CA2345428A1 (pt)
IL (1) IL142280A0 (pt)
MX (1) MXPA01003202A (pt)
WO (1) WO2000019334A1 (pt)

Families Citing this family (35)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6616703B1 (en) * 1996-10-16 2003-09-09 Sharp Kabushiki Kaisha Character input apparatus with character string extraction portion, and corresponding storage medium
US6760746B1 (en) 1999-09-01 2004-07-06 Eric Schneider Method, product, and apparatus for processing a data request
US7447626B2 (en) * 1998-09-28 2008-11-04 Udico Holdings Method and apparatus for generating a language independent document abstract
US6789230B2 (en) * 1998-10-09 2004-09-07 Microsoft Corporation Creating a summary having sentences with the highest weight, and lowest length
US7188138B1 (en) 1999-03-22 2007-03-06 Eric Schneider Method, product, and apparatus for resource identifier registration and aftermarket services
US9141717B2 (en) 1999-03-22 2015-09-22 Esdr Network Solutions Llc Methods, systems, products, and devices for processing DNS friendly identifiers
US6338082B1 (en) 1999-03-22 2002-01-08 Eric Schneider Method, product, and apparatus for requesting a network resource
US8037168B2 (en) 1999-07-15 2011-10-11 Esdr Network Solutions Llc Method, product, and apparatus for enhancing resolution services, registration services, and search services
USRE43690E1 (en) 1999-03-22 2012-09-25 Esdr Network Solutions Llc Search engine request method, product, and apparatus
USRE44207E1 (en) 1999-09-01 2013-05-07 Esdr Network Solutions Llc Network resource access method, product, and apparatus
US20050235031A1 (en) * 1999-09-10 2005-10-20 Eric Schneider Hyperlink generation and enhanced spell check method, product, apparatus, and user interface system
US6845369B1 (en) * 2000-01-14 2005-01-18 Relevant Software Inc. System, apparatus and method for using and managing digital information
GB0004578D0 (en) * 2000-02-25 2000-04-19 Xrefer Com Limited Automated data cross-referencing method
US7571234B2 (en) * 2000-06-08 2009-08-04 Aol Llc Authentication of electronic data
US6978419B1 (en) * 2000-11-15 2005-12-20 Justsystem Corporation Method and apparatus for efficient identification of duplicate and near-duplicate documents and text spans using high-discriminability text fragments
US6721728B2 (en) * 2001-03-02 2004-04-13 The United States Of America As Represented By The Administrator Of The National Aeronautics And Space Administration System, method and apparatus for discovering phrases in a database
US7325249B2 (en) * 2001-04-30 2008-01-29 Aol Llc Identifying unwanted electronic messages
US7565402B2 (en) * 2002-01-05 2009-07-21 Eric Schneider Sitemap access method, product, and apparatus
US7315810B2 (en) * 2002-01-07 2008-01-01 Microsoft Corporation Named entity (NE) interface for multiple client application programs
US7650562B2 (en) * 2002-02-21 2010-01-19 Xerox Corporation Methods and systems for incrementally changing text representation
US7131117B2 (en) * 2002-09-04 2006-10-31 Sbc Properties, L.P. Method and system for automating the analysis of word frequencies
US7590695B2 (en) 2003-05-09 2009-09-15 Aol Llc Managing electronic messages
US7739602B2 (en) 2003-06-24 2010-06-15 Aol Inc. System and method for community centric resource sharing based on a publishing subscription model
US7519559B1 (en) 2003-10-30 2009-04-14 Aol Llc Messaging stamp authority
US20050114131A1 (en) * 2003-11-24 2005-05-26 Kirill Stoimenov Apparatus and method for voice-tagging lexicon
US20060025091A1 (en) * 2004-08-02 2006-02-02 Matsushita Electric Industrial Co., Ltd Method for creating and using phrase history for accelerating instant messaging input on mobile devices
US20070055514A1 (en) * 2005-09-08 2007-03-08 Beattie Valerie L Intelligent tutoring feedback
US7895205B2 (en) * 2008-03-04 2011-02-22 Microsoft Corporation Using core words to extract key phrases from documents
US8145482B2 (en) * 2008-05-25 2012-03-27 Ezra Daya Enhancing analysis of test key phrases from acoustic sources with key phrase training models
EP2488963A1 (en) * 2009-10-15 2012-08-22 Rogers Communications Inc. System and method for phrase identification
US8880989B2 (en) 2012-01-30 2014-11-04 Microsoft Corporation Educating users and enforcing data dissemination policies
US9087039B2 (en) 2012-02-07 2015-07-21 Microsoft Technology Licensing, Llc Language independent probabilistic content matching
US20140350941A1 (en) * 2013-05-21 2014-11-27 Microsoft Corporation Method For Finding Elements In A Webpage Suitable For Use In A Voice User Interface (Disambiguation)
CN104615654B (zh) * 2014-12-30 2017-09-22 中国联合网络通信有限公司广东省分公司 一种文本摘要获取方法及装置
US11210287B2 (en) * 2020-01-30 2021-12-28 Walmart Apollo, Llc Systems and methods for a title quality scoring framework

Family Cites Families (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4965763A (en) * 1987-03-03 1990-10-23 International Business Machines Corporation Computer method for automatic extraction of commonly specified information from business correspondence
JP2783558B2 (ja) * 1988-09-30 1998-08-06 株式会社東芝 要約生成方法および要約生成装置
JPH0418673A (ja) * 1990-05-11 1992-01-22 Hitachi Ltd テキスト情報抽出方法および装置
US5182708A (en) * 1990-12-11 1993-01-26 Ricoh Corporation Method and apparatus for classifying text
US5384703A (en) * 1993-07-02 1995-01-24 Xerox Corporation Method and apparatus for summarizing documents according to theme
US5689716A (en) * 1995-04-14 1997-11-18 Xerox Corporation Automatic method of generating thematic summaries
US5924108A (en) * 1996-03-29 1999-07-13 Microsoft Corporation Document summarizer for word processors
US5960383A (en) * 1997-02-25 1999-09-28 Digital Equipment Corporation Extraction of key sections from texts using automatic indexing techniques

Also Published As

Publication number Publication date
US6104990A (en) 2000-08-15
MXPA01003202A (es) 2003-07-14
WO2000019334A1 (en) 2000-04-06
AU6276199A (en) 2000-04-17
IL142280A0 (en) 2002-03-10
EP1125220A1 (en) 2001-08-22
CA2345428A1 (en) 2000-04-06

Similar Documents

Publication Publication Date Title
BR9914102A (pt) Language independent phrase extraction
Pyles et al. The Origins and Development of the English
BR9905978A (pt) Automatic language identification using both n-gram and word information
Attia An ambiguity-controlled morphological analyzer for modern standard Arabic modeling finite state networks
Haiman From V/2 to subject clitics: Evidence from Northern Italian
Cartoni et al. How comparable are parallel corpora? Measuring the distribution of general vocabulary and connectives
Taljard et al. A comparison of approaches to word class tagging: Disjunctively vs. conjunctively written Bantu languages
Diab et al. Automatic processing of modern standard Arabic text
Kaalep An Estonian morphological analyser and the impact of a corpus on its development
Harris Revisiting anaphoric islands
Asgari et al. Linguistic resources and topic models for the analysis of persian poems
Jenkins et al. Conservative stemming for search and indexing
Wrihatni et al. Low Malay Language as A Stimulant for Bahasa Indonesia Development
Megerdoomian et al. Processing Persian text: Tokenization in the Shiraz project
Iranpour Mobarakeh et al. Verb detection in persian corpus
Naess Spatial deixis in Pileni
Lamarche Gender agreement and suppletion in French
Liang et al. THE UNIFORM INFORMATION DENSITY HYPOTHESIS: THE CASE OF SUBJET-DOUBLING IN FRENCH
Montenegro et al. Automated question generator for Tagalog informational texts using case markers
Gorbachov Grammatical Transformations in Ukrainian-English Translation of Official Texts
Harmon Proto-Manobo pronouns and case marking particles
Giurgea On the Evolution of articles into agreement markers in Romanian and Albanian
DuBois Incipient semanticization of possessive ablaut in Mayan
Janda A Stranger in the Lexicon: The Aspectual Status of Russian смочь ‘be able, manage (to)’
Handayani RELIGIOUS SONGS TRANSLATION IN POST-PANDEMIC ERA: COMPARING THE TRANSLATION METHODS DONE BY HUMAN AND MACHINE

Legal Events

Date Code Title Description
B08F Application dismissed because of non-payment of annual fees [chapter 8.6 patent gazette]

Free format text: REGARDING THE 4TH, 5TH, 6TH, 7TH AND 8TH ANNUAL FEES.

B08K Patent lapsed as no evidence of payment of the annual fee has been furnished to inpi [chapter 8.11 patent gazette]

Free format text: REGARDING DECISION 8.6 PUBLISHED IN RPI 1911 OF 21/08/2007.