HK1091668A1 - System and method of creating and using compact linguistic data - Google Patents

System and method of creating and using compact linguistic data

Info

Publication number
HK1091668A1
HK1091668A1 HK06108040.7A HK06108040A HK1091668A1 HK 1091668 A1 HK1091668 A1 HK 1091668A1 HK 06108040 A HK06108040 A HK 06108040A HK 1091668 A1 HK1091668 A1 HK 1091668A1
Authority
HK
Hong Kong
Prior art keywords
words
creating
linguistic data
mapped
characters
Prior art date
Application number
HK06108040.7A
Other languages
English (en)
Inventor
Vadim Fux
Michael G Elizarov
Original Assignee
Ontario Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from PCT/CA2003/001023 external-priority patent/WO2004006122A2/en
Application filed by Ontario Inc filed Critical Ontario Inc
Publication of HK1091668A1 publication Critical patent/HK1091668A1/xx

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/205Parsing
    • G06F40/216Parsing using statistical methods
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10TECHNICAL SUBJECTS COVERED BY FORMER USPC
    • Y10STECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10S707/00Data processing: database and file management or data structures
    • Y10S707/99931Database or file accessing
    • Y10S707/99937Sorting

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Artificial Intelligence (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Health & Medical Sciences (AREA)
  • General Health & Medical Sciences (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Probability & Statistics with Applications (AREA)
  • Machine Translation (AREA)
  • Document Processing Apparatus (AREA)
  • Communication Control (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
HK06108040.7A 2002-07-03 2006-07-18 System and method of creating and using compact linguistic data HK1091668A1 (en)

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
US39390302P 2002-07-03 2002-07-03
US10/289,656 US7269548B2 (en) 2002-07-03 2002-11-07 System and method of creating and using compact linguistic data
CA002411227A CA2411227C (en) 2002-07-03 2002-11-07 System and method of creating and using compact linguistic data
PCT/CA2003/001023 WO2004006122A2 (en) 2002-07-03 2003-07-03 System and method of creating and using compact linguistic data

Publications (1)

Publication Number Publication Date
HK1091668A1 true HK1091668A1 (en) 2007-01-26

Family

ID=30770900

Family Applications (1)

Application Number Title Priority Date Filing Date
HK06108040.7A HK1091668A1 (en) 2002-07-03 2006-07-18 System and method of creating and using compact linguistic data

Country Status (6)

Country Link
US (3) US7269548B2 (xx)
JP (1) JP2009266244A (xx)
CN (1) CN1703692A (xx)
AT (1) ATE506651T1 (xx)
CA (1) CA2411227C (xx)
HK (1) HK1091668A1 (xx)

Families Citing this family (56)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
USRE43082E1 (en) 1998-12-10 2012-01-10 Eatoni Ergonomics, Inc. Touch-typable devices based on ambiguous codes and methods to design such devices
US7312726B2 (en) 2004-06-02 2007-12-25 Research In Motion Limited Handheld electronic device with text disambiguation
US7091885B2 (en) * 2004-06-02 2006-08-15 2012244 Ontario Inc. Handheld electronic device with text disambiguation
US7711542B2 (en) * 2004-08-31 2010-05-04 Research In Motion Limited System and method for multilanguage text input in a handheld electronic device
US7895218B2 (en) 2004-11-09 2011-02-22 Veveo, Inc. Method and system for performing searches for television content using reduced text input
FR2878344B1 (fr) * 2004-11-22 2012-12-21 Sionnest Laurent Guyot Dispositif de commandes et d'entree de donnees
KR20070088687A (ko) * 2004-12-01 2007-08-29 화이트스모크 인코포레이션 문서의 질을 자동으로 향상시키는 시스템 및 방법
US7779011B2 (en) 2005-08-26 2010-08-17 Veveo, Inc. Method and system for dynamically processing ambiguous, reduced text search queries and highlighting results thereof
US7788266B2 (en) * 2005-08-26 2010-08-31 Veveo, Inc. Method and system for processing ambiguous, multi-term search queries
US7644054B2 (en) 2005-11-23 2010-01-05 Veveo, Inc. System and method for finding desired results by incremental search using an ambiguous keypad with the input containing orthographic and typographic errors
US7529741B2 (en) * 2006-03-06 2009-05-05 Veveo, Inc. Methods and systems for segmenting relative user preferences into fine-grain and coarse-grain collections
US8073860B2 (en) 2006-03-30 2011-12-06 Veveo, Inc. Method and system for incrementally selecting and providing relevant search engines in response to a user query
US7539676B2 (en) 2006-04-20 2009-05-26 Veveo, Inc. User interface methods and systems for selecting and presenting content based on relationships between the user and other members of an organization
US7646868B2 (en) * 2006-08-29 2010-01-12 Intel Corporation Method for steganographic cryptography
US8423908B2 (en) * 2006-09-08 2013-04-16 Research In Motion Limited Method for identifying language of text in a handheld electronic device and a handheld electronic device incorporating the same
US7752193B2 (en) * 2006-09-08 2010-07-06 Guidance Software, Inc. System and method for building and retrieving a full text index
EP2062171A4 (en) 2006-09-14 2010-10-06 Veveo Inc METHOD AND SYSTEMS FOR THE DYNAMIC REORGANIZATION OF SEARCH RESULTS IN HIERARCHICALLY ORGANIZED CLAUSE CLUSTERS
WO2008045690A2 (en) 2006-10-06 2008-04-17 Veveo, Inc. Linear character selection display interface for ambiguous text input
US20080091427A1 (en) * 2006-10-11 2008-04-17 Nokia Corporation Hierarchical word indexes used for efficient N-gram storage
US8078884B2 (en) 2006-11-13 2011-12-13 Veveo, Inc. Method of and system for selecting and presenting content based on user identification
AU2007323859A1 (en) * 2006-11-19 2008-05-29 Rmax, Llc Internet-based computer for mobile and thin client users
US8048363B2 (en) * 2006-11-20 2011-11-01 Kimberly Clark Worldwide, Inc. Container with an in-mold label
US8103499B2 (en) * 2007-03-22 2012-01-24 Tegic Communications, Inc. Disambiguation of telephone style key presses to yield Chinese text using segmentation and selective shifting
US8549424B2 (en) 2007-05-25 2013-10-01 Veveo, Inc. System and method for text disambiguation and context designation in incremental search
US8176419B2 (en) * 2007-12-19 2012-05-08 Microsoft Corporation Self learning contextual spell corrector
JP2009245308A (ja) * 2008-03-31 2009-10-22 Fujitsu Ltd 文書校正支援プログラム、文書校正支援方法および文書校正支援装置
US7663511B2 (en) * 2008-06-18 2010-02-16 Microsoft Corporation Dynamic character encoding
US7730061B2 (en) * 2008-09-12 2010-06-01 International Business Machines Corporation Fast-approximate TFIDF
CN101533403B (zh) * 2008-11-07 2010-12-01 广东国笔科技股份有限公司 一种派生词生成方法及系统
US20100332215A1 (en) * 2009-06-26 2010-12-30 Nokia Corporation Method and apparatus for converting text input
US20110191330A1 (en) 2010-02-04 2011-08-04 Veveo, Inc. Method of and System for Enhanced Content Discovery Based on Network and Device Access Behavior
JP5337308B2 (ja) * 2010-08-06 2013-11-06 インターナショナル・ビジネス・マシーンズ・コーポレーション 文字列生成方法、プログラム及びシステム
JP5392228B2 (ja) * 2010-10-14 2014-01-22 株式会社Jvcケンウッド 番組検索装置および番組検索方法
JP5392227B2 (ja) * 2010-10-14 2014-01-22 株式会社Jvcケンウッド フィルタリング装置およびフィルタリング方法
JP5605288B2 (ja) * 2011-03-31 2014-10-15 富士通株式会社 出現マップ生成方法、ファイル抽出方法、出現マップ生成プログラム、ファイル抽出プログラム、出現マップ生成装置、およびファイル抽出装置
JPWO2012150637A1 (ja) * 2011-05-02 2014-07-28 富士通株式会社 抽出方法、情報処理方法、抽出プログラム、情報処理プログラム、抽出装置、および情報処理装置
US8924446B2 (en) 2011-12-29 2014-12-30 Verisign, Inc. Compression of small strings
CN102831224B (zh) * 2012-08-24 2018-09-04 北京百度网讯科技有限公司 一种数据索引库的建立方法、搜索建议生成方法和装置
US9329778B2 (en) 2012-09-07 2016-05-03 International Business Machines Corporation Supplementing a virtual input keyboard
US10304465B2 (en) 2012-10-30 2019-05-28 Google Technology Holdings LLC Voice control user interface for low power mode
US10373615B2 (en) 2012-10-30 2019-08-06 Google Technology Holdings LLC Voice control user interface during low power mode
US9584642B2 (en) 2013-03-12 2017-02-28 Google Technology Holdings LLC Apparatus with adaptive acoustic echo control for speakerphone mode
US10381001B2 (en) 2012-10-30 2019-08-13 Google Technology Holdings LLC Voice control user interface during low-power mode
US9805018B1 (en) 2013-03-15 2017-10-31 Steven E. Richfield Natural language processing for analyzing internet content and finding solutions to needs expressed in text
US8788263B1 (en) * 2013-03-15 2014-07-22 Steven E. Richfield Natural language processing for analyzing internet content and finding solutions to needs expressed in text
USD772898S1 (en) 2013-03-15 2016-11-29 H2 & Wf3 Research, Llc Display screen with graphical user interface for a document management system
USD788115S1 (en) 2013-03-15 2017-05-30 H2 & Wf3 Research, Llc. Display screen with graphical user interface for a document management system
EP3069248A4 (en) * 2013-11-14 2017-06-21 3M Innovative Properties Company Systems and methods for obfuscating data using dictionary
US8768712B1 (en) 2013-12-04 2014-07-01 Google Inc. Initiating actions based on partial hotwords
US9799049B2 (en) * 2014-12-15 2017-10-24 Nuance Communications, Inc. Enhancing a message by providing supplemental content in the message
US20160170971A1 (en) * 2014-12-15 2016-06-16 Nuance Communications, Inc. Optimizing a language model based on a topic of correspondence messages
KR20180031291A (ko) * 2016-09-19 2018-03-28 삼성전자주식회사 다중 언어 번역 및 예측 장치 및 방법
US10120860B2 (en) * 2016-12-21 2018-11-06 Intel Corporation Methods and apparatus to identify a count of n-grams appearing in a corpus
US10877998B2 (en) * 2017-07-06 2020-12-29 Durga Turaga Highly atomized segmented and interrogatable data systems (HASIDS)
US10740381B2 (en) * 2018-07-18 2020-08-11 International Business Machines Corporation Dictionary editing system integrated with text mining
CN110673836B (zh) * 2019-08-22 2023-05-23 创新先进技术有限公司 一种代码补全方法、装置、计算设备及存储介质

Family Cites Families (43)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4403303A (en) * 1981-05-15 1983-09-06 Beehive International Terminal configuration manager
US4500955A (en) 1981-12-31 1985-02-19 International Business Machines Corporation Full word coding for information processing
US4814746A (en) * 1983-06-01 1989-03-21 International Business Machines Corporation Data compression method
US4843389A (en) * 1986-12-04 1989-06-27 International Business Machines Corp. Text compression and expansion method and apparatus
US4864503A (en) * 1987-02-05 1989-09-05 Toltran, Ltd. Method of using a created international language as an intermediate pathway in translation between two national languages
US5126739A (en) * 1989-01-13 1992-06-30 Stac Electronics Data compression apparatus and method
US5146221A (en) * 1989-01-13 1992-09-08 Stac, Inc. Data compression apparatus and method
EP0464191B1 (en) * 1990-01-19 1996-03-27 Hewlett-Packard Limited Compressed data access
US5254990A (en) * 1990-02-26 1993-10-19 Fujitsu Limited Method and apparatus for compression and decompression of data
DE69123660T2 (de) * 1990-08-13 1997-04-17 Fujitsu Ltd Datenkompressionsmethode und Gerät
SG47586A1 (en) * 1990-12-21 1998-04-17 British Telecomm Speech coding
US5325091A (en) * 1992-08-13 1994-06-28 Xerox Corporation Text-compression technique using frequency-ordered array of word-number mappers
US5657423A (en) * 1993-02-22 1997-08-12 Texas Instruments Incorporated Hardware filter circuit and address circuitry for MPEG encoded data
US5509088A (en) * 1993-12-06 1996-04-16 Xerox Corporation Method for converting CCITT compressed data using a balanced tree
JPH07192095A (ja) 1993-12-27 1995-07-28 Nec Corp 文字列入力装置
US5798721A (en) * 1994-03-14 1998-08-25 Mita Industrial Co., Ltd. Method and apparatus for compressing text data
US5684478A (en) * 1994-12-06 1997-11-04 Cennoid Technologies, Inc. Method and apparatus for adaptive data compression
US5847697A (en) * 1995-01-31 1998-12-08 Fujitsu Limited Single-handed keyboard having keys with multiple characters and character ambiguity resolution logic
US5818437A (en) * 1995-07-26 1998-10-06 Tegic Communications, Inc. Reduced keyboard disambiguating computer
GB2305746B (en) 1995-09-27 2000-03-29 Canon Res Ct Europe Ltd Data compression apparatus
US5778361A (en) * 1995-09-29 1998-07-07 Microsoft Corporation Method and system for fast indexing and searching of text in compound-word languages
JP3566441B2 (ja) * 1996-01-30 2004-09-15 シャープ株式会社 テキスト圧縮用辞書作成装置
US6169672B1 (en) * 1996-07-03 2001-01-02 Hitachi, Ltd. Power converter with clamping circuit
US5951623A (en) * 1996-08-06 1999-09-14 Reynar; Jeffrey C. Lempel- Ziv data compression technique utilizing a dictionary pre-filled with frequent letter combinations, words and/or phrases
US6023670A (en) * 1996-08-19 2000-02-08 International Business Machines Corporation Natural language determination using correlation between common words
US6414610B1 (en) * 1997-02-24 2002-07-02 Rodney J Smith Data compression
US6618506B1 (en) * 1997-09-23 2003-09-09 International Business Machines Corporation Method and apparatus for improved compression and decompression
JPH11143877A (ja) * 1997-10-22 1999-05-28 Internatl Business Mach Corp <Ibm> 圧縮方法、辞書の見出し語インデックス・データを圧縮するための方法、及び機械翻訳システム
US5896321A (en) * 1997-11-14 1999-04-20 Microsoft Corporation Text completion system for a miniature computer
US6075470A (en) * 1998-02-26 2000-06-13 Research In Motion Limited Block-wise adaptive statistical data compressor
US6646573B1 (en) * 1998-12-04 2003-11-11 America Online, Inc. Reduced keyboard text input system for the Japanese language
US6219731B1 (en) * 1998-12-10 2001-04-17 Eaton: Ergonomics, Inc. Method and apparatus for improved multi-tap text input
GB2347240A (en) * 1999-02-22 2000-08-30 Nokia Mobile Phones Ltd Communication terminal having a predictive editor application
US6668092B1 (en) * 1999-07-30 2003-12-23 Sun Microsystems, Inc. Memory efficient variable-length encoding/decoding system
US6904402B1 (en) * 1999-11-05 2005-06-07 Microsoft Corporation System and iterative method for lexicon, segmentation and language model joint optimization
US6516305B1 (en) * 2000-01-14 2003-02-04 Microsoft Corporation Automatic inference of models for statistical code compression
EP1213643A1 (en) * 2000-12-05 2002-06-12 Inventec Appliances Corp. Intelligent dictionary input method
US7103534B2 (en) * 2001-03-31 2006-09-05 Microsoft Corporation Machine learning contextual approach to word determination for text input via reduced keypad keys
US6400286B1 (en) * 2001-06-20 2002-06-04 Unisys Corporation Data compression method and apparatus implemented with limited length character tables
US6587057B2 (en) * 2001-07-25 2003-07-01 Quicksilver Technology, Inc. High performance memory efficient variable-length coding decoder
US6653954B2 (en) * 2001-11-07 2003-11-25 International Business Machines Corporation System and method for efficient data compression
US20030182279A1 (en) * 2002-03-19 2003-09-25 Willows Kevin John Progressive prefix input method for data entry
US6657565B2 (en) * 2002-03-21 2003-12-02 International Business Machines Corporation Method and system for improving lossless compression efficiency

Also Published As

Publication number Publication date
CA2411227C (en) 2007-01-09
US20040006455A1 (en) 2004-01-08
CN1703692A (zh) 2005-11-30
ATE506651T1 (de) 2011-05-15
US7269548B2 (en) 2007-09-11
CA2411227A1 (en) 2004-01-03
US20100211381A1 (en) 2010-08-19
US7809553B2 (en) 2010-10-05
US20080015844A1 (en) 2008-01-17
JP2009266244A (ja) 2009-11-12

Similar Documents

Publication Publication Date Title
HK1091668A1 (en) System and method of creating and using compact linguistic data
SG142159A1 (en) Index structure of metadata, method for providing indices of metadata, and metadata searching method and apparatus using the indices of metadata
ATE518195T1 (de) Adaptives dokumenteinstufungsverfahren auf der basis des benutzerverhaltens
TW347516B (en) Browse by prompted keyword phrases with an improved user interface
IL172518A0 (en) System and method for configuring voice readers using semantic analysis
CA2318299A1 (en) Metadata exchange
ATE297992T1 (de) Recombinante herstellung aromatischer polyketiden
DE60126462D1 (de) Client/Server basiertes Spracherkennungssystem
Greenlee et al. Search for K 0 L--> μ+ e and K 0 L--> e+ e
SE0004319L (sv) System och förfarande
UA24036C2 (uk) Словhик алфавітhої іhоземhої мови
ATE433378T1 (de) Kommunikationssystem bei der herstellung von printmedien
AU2001276232A1 (en) Coding input method using numeral keys to input numbers, symbols and english letters
Stephens Witch-figures in recent children's fiction: the subaltern and the subversive
EP1187441A3 (en) Audio recognition method and device for sequence of numbers
MY128802A (en) Sub-optimal variable length coding
EA199800029A1 (ru) СПОСОБ ПОЛУЧЕНИЯ БЕНЗО[b]ТИОФЕНОВ
AU2000251210A1 (en) An alphabet character input device
DE60045663D1 (de) Rlichem interferon-alpha
KR20140060406A (ko) 수치정보 한글 표현 방법
TW348235B (en) Method of spelling check using Pinyin and universal characters
PT1406425E (pt) Metodo de producao de uma identificacao isolada multimedia por um fornecedor de acesso
TW200516425A (en) Character searching method
TW200723030A (en) A four-codes input method for inputting chinese character
CN101017406B (zh) 闪电快速输入法

Legal Events

Date Code Title Description
PE Patent expired

Effective date: 20230702