CN102016837B - System and method for classification and retrieval of Chinese-type characters and character components - Google Patents

System and method for classification and retrieval of Chinese-type characters and character components

Info

Publication number
CN102016837B
Authority
CN
China
Prior art keywords
radical
radicals
recurring
word
stroke
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CN200880125478.XA
Other languages
English (en)
Chinese (zh)
Other versions
CN102016837A (zh)
Inventor
沃伦·丹尼尔·蔡尔德
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Individual
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Individual
Publication of CN102016837A
Application granted
Publication of CN102016837B
Expired - Fee Related
Anticipated expiration

Classifications

    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/50 Information retrieval; Database structures therefor; File system structures therefor of still image data
    • G06F16/53 Querying
    • G06F16/532 Query formulation, e.g. graphical querying
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/10 Text processing
    • G06F40/12 Use of codes for handling textual entities
    • G06F40/126 Character encoding
    • G06F40/129 Handling non-Latin characters, e.g. kana-to-kanji conversion
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/50 Information retrieval; Database structures therefor; File system structures therefor of still image data
    • G06F16/58 Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually

Landscapes

  • Engineering & Computer Science
  • Theoretical Computer Science
  • Physics & Mathematics
  • General Engineering & Computer Science
  • General Physics & Mathematics
  • Data Mining & Analysis
  • Databases & Information Systems
  • Audiology, Speech & Language Pathology
  • Health & Medical Sciences
  • Artificial Intelligence
  • Library & Information Science
  • Computational Linguistics
  • General Health & Medical Sciences
  • Mathematical Physics
  • Document Processing Apparatus
  • Machine Translation
  • Information Retrieval, Db Structures And Fs Structures Therefor
  • Character Discrimination
  • User Interface Of Digital Computer
  • Input From Keyboards Or The Like
CN200880125478.XA 2007-11-26 2008-11-25 System and method for classification and retrieval of Chinese-type characters and character components Expired - Fee Related CN102016837B (zh)

Applications Claiming Priority (7)

Application Number Priority Date Filing Date Title
US99012307P 2007-11-26 2007-11-26
US99016607P 2007-11-26 2007-11-26
US60/990,123 2007-11-26
US60/990,166 2007-11-26
US99101007P 2007-11-29 2007-11-29
US60/991,010 2007-11-29
PCT/US2008/084750 WO2009070615A1 (en) 2007-11-26 2008-11-25 System and method for classification and retrieval of chinese-type characters and character components

Publications (2)

Publication Number Publication Date
CN102016837A (zh) 2011-04-13
CN102016837B (zh) 2014-08-20

Family

ID=40678958

Family Applications (2)

Application Number Title Priority Date Filing Date
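CN200880125478.XA Expired - Fee Related CN102016837B (zh) 2007-11-26 2008-11-25 System and method for classification and retrieval of Chinese-type characters and character components
CN2008801254775A Expired - Fee Related CN102016836B (zh) 2007-11-26 2008-11-25 Modular system and method for managing Chinese, Japanese, and Korean linguistic data in electronic form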

Family Applications After (1)

Application Number Title Priority Date Filing Date
CN2008801254775A Expired - Fee Related CN102016836B (zh) 2007-11-26 2008-11-25 Modular system and method for managing Chinese, Japanese, and Korean linguistic data in electronic form

Country Status (5)

Country Link
US (2) US8433709B2 (en)
JP (4) JP2011509442A (en)
CN (2) CN102016837B (en)
TW (2) TWI468954B (en)
WO (2) WO2009070615A1 (en)

Families Citing this family (44)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8564544B2 (en) 2006-09-06 2013-10-22 Apple Inc. Touch screen device, method, and graphical user interface for customizing display of content category icons
GB0624571D0 (en) * 2006-12-08 2007-01-17 Cambridge Silicon Radio Ltd Authenticating Devices for Communications
US8689132B2 (en) 2007-01-07 2014-04-01 Apple Inc. Portable electronic device, method, and graphical user interface for displaying electronic documents and lists
CN105117376B (zh) * 2007-04-10 2018-07-10 谷歌有限责任公司 Multi-mode input method editor
US8266514B2 (en) * 2008-06-26 2012-09-11 Microsoft Corporation Map service
US9824071B2 (en) * 2008-12-03 2017-11-21 Microsoft Technology Licensing, Llc Viewing messages and message attachments in different languages
US20120010870A1 (en) * 2010-07-09 2012-01-12 Vladimir Selegey Electronic dictionary and dictionary writing system
US20120038652A1 (en) * 2010-08-12 2012-02-16 Palm, Inc. Accepting motion-based character input on mobile computing devices
JP2012079252A (ja) * 2010-10-06 2012-04-19 Fujitsu Ltd Information terminal device, character input method, and character input program
US8914743B2 (en) * 2010-11-12 2014-12-16 Apple Inc. Device, method, and graphical user interface for navigating a list of identifiers
US20120156658A1 (en) * 2010-12-16 2012-06-21 Nicholas Fuzzell Methods for teaching and/or learning chinese, and related systems
WO2012174703A1 (en) * 2011-06-20 2012-12-27 Microsoft Corporation Hover translation of search result captions
JP2013041350A (ja) * 2011-08-12 2013-02-28 Panasonic Corp Touch table system
KR101870729B1 (ko) * 2011-09-01 2018-07-20 삼성전자주식회사 Translation apparatus and method using a translation tree structure in a portable terminal
KR20130080515A (ko) * 2012-01-05 2013-07-15 삼성전자주식회사 Display apparatus and method for editing characters displayed on the display apparatus
US9229928B2 (en) * 2012-03-13 2016-01-05 Nulu, Inc. Language learning platform using relevant and contextual content
TWI449000B (zh) * 2012-03-23 2014-08-11 Chinese Foundation For Digitization Technology Multimedia Chinese Character Learning Method
US9274609B2 (en) 2012-07-23 2016-03-01 Mingyan Xie Inputting radical on touch screen device
US20140344670A1 (en) * 2013-05-14 2014-11-20 Pandaworks Inc. Dba Contentpanda Method and system for on-demand delivery of predefined in-context web content
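KR20150028627A (ko) * 2013-09-06 2015-03-16 삼성전자주식회사 Method for converting user handwriting into text information and electronic device for performing the same
JP2015060095A (ja) * 2013-09-19 2015-03-30 株式会社東芝 Speech translation apparatus, speech translation method, and program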
WO2015112250A1 (en) * 2014-01-22 2015-07-30 Speak Agent, Inc. Visual-kinesthetic language construction
CN104808806B (zh) * 2014-01-28 2019-10-25 北京三星通信技术研究有限公司 Method and apparatus for Chinese character input based on uncertainty information
TW201530357A (zh) * 2014-01-29 2015-08-01 Chiu-Huei Teng Chinese input method for electronic devices
RU2640322C2 (ru) * 2014-01-30 2017-12-27 Общество с ограниченной ответственностью "Аби Девелопмент" Methods and systems for efficient automatic character recognition
WO2015167556A1 (en) * 2014-04-30 2015-11-05 Hewlett-Packard Development Company, L.P. Generating color similarity measures
WO2016029045A2 (en) * 2014-08-21 2016-02-25 Jobu Productions Lexical dialect analysis system
JP6466138B2 (ja) * 2014-11-04 2019-02-06 株式会社東芝 Foreign-language sentence creation support apparatus, method, and program
US20160147741A1 (en) * 2014-11-26 2016-05-26 Adobe Systems Incorporated Techniques for providing a user interface incorporating sign language
US9740684B2 (en) * 2015-02-18 2017-08-22 Lenovo (Singapore) Pte. Ltd. Determining homonyms of logogram input
CN106997245A (zh) * 2016-01-24 2017-08-01 杨文韬 Method for building an input method lexicon from a Chinese language model
US10031949B2 (en) * 2016-03-03 2018-07-24 Tic Talking Holdings Inc. Interest based content distribution
US10176623B2 (en) 2016-05-02 2019-01-08 Tic Talking Holdings Inc. Facilitation of depiction of geographic relationships via a user interface
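CN108346426B (zh) * 2018-02-01 2020-12-08 威盛电子(深圳)有限公司 Speech recognition apparatus and speech recognition method
TWI659411B (zh) * 2018-03-01 2019-05-11 大陸商芋頭科技(杭州)有限公司 Multilingual mixed speech recognition method
CN109147784B (zh) * 2018-09-10 2021-06-08 百度在线网络技术(北京)有限公司 Voice interaction method, device, and storage medium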
US11017771B2 (en) * 2019-01-18 2021-05-25 Adobe Inc. Voice command matching during testing of voice-assisted application prototypes for languages with non-phonetic alphabets
US10964322B2 (en) 2019-01-23 2021-03-30 Adobe Inc. Voice interaction tool for voice-assisted application prototypes
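TWI725608B (zh) * 2019-11-11 2021-04-21 財團法人資訊工業策進會 Speech synthesis system, method, and non-transitory computer-readable medium
CN111753556B (zh) * 2020-06-24 2022-01-04 掌阅科技股份有限公司 Bilingual parallel reading method, terminal, and computer storage medium
CN113536005B (zh) * 2021-09-17 2021-12-24 网娱互动科技(北京)股份有限公司 Method and system for finding similar pictures or fonts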
WO2023146416A1 (en) * 2022-01-28 2023-08-03 John Chu Character retrieval method and apparatus, electronic device and medium
CN116738966A (zh) * 2022-03-01 2023-09-12 衍利行资产有限公司 Method and system for analyzing text containing Chinese characters
US12112128B2 (en) * 2022-09-28 2024-10-08 Korea Electric Power Corporation Apparatus and method for generating word embedding library

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
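CN1144354A (zh) * 1995-04-25 1997-03-05 齐兰发展股份有限公司 Enhanced character entry system
CN1464430A (zh) * 2002-06-11 2003-12-31 富士施乐株式会社 System for distinguishing organization names in Asian-language writing systems
CN1581075A (zh) * 2003-07-31 2005-02-16 国际商业机器公司 Chinese/English vocabulary learning tool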

Family Cites Families (19)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH01114976A (ja) * 1987-10-28 1989-05-08 Sharp Corp Dictionary structure of a document processing apparatus
JPH0540747A (ja) * 1991-08-07 1993-02-19 Matsushita Electric Ind Co Ltd Word processor
JPH05151197A (ja) * 1991-11-14 1993-06-18 Chinka Oka Method of inputting Chinese characters into a computer
US5257938A (en) * 1992-01-30 1993-11-02 Tien Hsin C Game for encoding of ideographic characters simulating english alphabetic letters
US5923778A (en) * 1996-06-12 1999-07-13 Industrial Technology Research Institute Hierarchical representation of reference database for an on-line Chinese character recognition system
JP2000163418A (ja) * 1997-12-26 2000-06-16 Canon Inc Natural language processing apparatus and method, and storage medium storing the program therefor
US7257528B1 (en) * 1998-02-13 2007-08-14 Zi Corporation Of Canada, Inc. Method and apparatus for Chinese character text input
CN1145872C (zh) * 1999-01-13 2004-04-14 国际商业机器公司 Method for automatically segmenting and recognizing handwritten Chinese characters, and system using the method
US6625335B1 (en) * 2000-05-11 2003-09-23 Matsushita Electric Industrial Co., Ltd. Method and apparatus for assigning keywords to documents
JP3838857B2 (ja) * 2000-09-19 2006-10-25 沖電気工業株式会社 Dictionary device
US20060139315A1 (en) * 2001-01-17 2006-06-29 Kim Min-Kyum Apparatus and method for inputting alphabet characters on keypad
CN1403960A (zh) * 2001-08-27 2003-03-19 无敌科技股份有限公司 Method for spelling words by computer
US7680649B2 (en) * 2002-06-17 2010-03-16 International Business Machines Corporation System, method, program product, and networking use for recognizing words and their parts of speech in one or more natural languages
JP2005157472A (ja) * 2003-11-20 2005-06-16 Sharp Corp Character input device and character input method
TW200527226A (en) * 2004-02-11 2005-08-16 Cheng-Fu Lee Chinese system for sorting and searching
KR20050092999A (ko) * 2004-03-17 2005-09-23 샤프전자(주) Method for searching Chinese characters in an electronic dictionary
US7523102B2 (en) * 2004-06-12 2009-04-21 Getty Images, Inc. Content search in complex language, such as Japanese
US20070052868A1 (en) * 2005-09-02 2007-03-08 Charisma Communications, Inc. Multimedia accessible universal input device
JP2007087216A (ja) * 2005-09-22 2007-04-05 Toshiba Corp Hierarchical dictionary creation apparatus, program, and hierarchical dictionary creation method

Also Published As

Publication number Publication date
US20110320468A1 (en) 2011-12-29
CN102016837A (zh) 2011-04-13
CN102016836A (zh) 2011-04-13
WO2009070619A1 (en) 2009-06-04
HK1156710A1 (en) 2012-06-15
TW200945065A (en) 2009-11-01
JP2011505040A (ja) 2011-02-17
JP2016186805A (ja) 2016-10-27
TWI496012B (zh) 2015-08-11
WO2009070615A1 (en) 2009-06-04
TW200945066A (en) 2009-11-01
US8433709B2 (en) 2013-04-30
TWI468954B (zh) 2015-01-11
JP2011509442A (ja) 2011-03-24
JP2014142951A (ja) 2014-08-07
JP5666307B2 (ja) 2015-02-12
HK1156418A1 (en) 2012-06-08
US20100257173A1 (en) 2010-10-07
US8521738B2 (en) 2013-08-27
CN102016836B (zh) 2013-03-13

Similar Documents

Publication Publication Date Title
CN102016837B (zh) System and method for classification and retrieval of Chinese-type characters and character components
Van Atteveldt et al. Computational analysis of communication
US6721451B1 (en) Apparatus and method for reading a document image
CN102298582B (zh) Data search and matching method and system
US5586198A (en) Method and apparatus for identifying characters in ideographic alphabet
CN102449579B (zh) Integrated Chinese character input method
US8261200B2 (en) Increasing retrieval performance of images by providing relevance feedback on word images contained in the images
US8015203B2 (en) Document recognizing apparatus and method
CA2775879C (en) Systems and methods for processing data
US20100083173A1 (en) Method and system for applying metadata to data sets of file objects
JP2016186805A5 (en)
US20120109994A1 (en) Robust auto-correction for data retrieval
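CN115310436A (zh) Document outline extraction method and apparatus, electronic device, and storage medium
JP4972271B2 (ja) Search result presentation device
CN112989011B (zh) Data query method, data query apparatus, and electronic device
JP2008262248A (ja) Character search method
CN1373431A (zh) Method for displaying unfamiliar words and electronic device for displaying digital articles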
HK1156710B (en) System and method for classification and retrieval of chinese-type characters and character components
JP5741298B2 (ja) Dictionary creation device, dictionary creation method, and program
Tanaka-Ishii et al. Kansuke: A logograph look-up interface based on a few modified stroke prototypes
Balasubramanian Document Annotation and Retrieval Systems
Tarte et al. Digital Palaeography: New Machines and Old Texts (Dagstuhl Seminar 14302)
HK1170818B (en) Robust auto-correction for data retrieval
HK1156418B (en) Modular system and method for managing chinese, japanese, and korean linguistic data in electronic form
TW201528007A (zh) Multi-scheme phonetic dictionary retrieval system

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
REG Reference to a national code

Ref country code: HK

Ref legal event code: DE

Ref document number: 1156710

Country of ref document: HK

C14 Grant of patent or utility model
GR01 Patent grant
REG Reference to a national code

Ref country code: HK

Ref legal event code: GR

Ref document number: 1156710

Country of ref document: HK

CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20140820

CF01 Termination of patent right due to non-payment of annual fee