CN102016837B - 中文型文字及文字偏旁的分类及检索的系统与方法 - Google Patents
中文型文字及文字偏旁的分类及检索的系统与方法 Download PDFInfo
- Publication number
- CN102016837B CN102016837B CN200880125478.XA CN200880125478A CN102016837B CN 102016837 B CN102016837 B CN 102016837B CN 200880125478 A CN200880125478 A CN 200880125478A CN 102016837 B CN102016837 B CN 102016837B
- Authority
- CN
- China
- Prior art keywords
- radical
- radicals
- recurring
- word
- stroke
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/50—Information retrieval; Database structures therefor; File system structures therefor of still image data
- G06F16/53—Querying
- G06F16/532—Query formulation, e.g. graphical querying
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/10—Text processing
- G06F40/12—Use of codes for handling textual entities
- G06F40/126—Character encoding
- G06F40/129—Handling non-Latin characters, e.g. kana-to-kanji conversion
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/50—Information retrieval; Database structures therefor; File system structures therefor of still image data
- G06F16/58—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Data Mining & Analysis (AREA)
- Databases & Information Systems (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Health & Medical Sciences (AREA)
- Artificial Intelligence (AREA)
- Library & Information Science (AREA)
- Computational Linguistics (AREA)
- General Health & Medical Sciences (AREA)
- Mathematical Physics (AREA)
- Document Processing Apparatus (AREA)
- Machine Translation (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Character Discrimination (AREA)
- User Interface Of Digital Computer (AREA)
- Input From Keyboards Or The Like (AREA)
Applications Claiming Priority (7)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US99012307P | 2007-11-26 | 2007-11-26 | |
| US99016607P | 2007-11-26 | 2007-11-26 | |
| US60/990,123 | 2007-11-26 | ||
| US60/990,166 | 2007-11-26 | ||
| US99101007P | 2007-11-29 | 2007-11-29 | |
| US60/991,010 | 2007-11-29 | ||
| PCT/US2008/084750 WO2009070615A1 (en) | 2007-11-26 | 2008-11-25 | System and method for classification and retrieval of chinese-type characters and character components |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| CN102016837A CN102016837A (zh) | 2011-04-13 |
| CN102016837B true CN102016837B (zh) | 2014-08-20 |
Family
ID=40678958
Family Applications (2)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CN200880125478.XA Expired - Fee Related CN102016837B (zh) | 2007-11-26 | 2008-11-25 | 中文型文字及文字偏旁的分类及检索的系统与方法 |
| CN2008801254775A Expired - Fee Related CN102016836B (zh) | 2007-11-26 | 2008-11-25 | 管理电子形式的中文、日文及韩文语言数据的模组系统与方法 |
Family Applications After (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CN2008801254775A Expired - Fee Related CN102016836B (zh) | 2007-11-26 | 2008-11-25 | 管理电子形式的中文、日文及韩文语言数据的模组系统与方法 |
Country Status (5)
| Country | Link |
|---|---|
| US (2) | US8433709B2 (enExample) |
| JP (4) | JP2011509442A (enExample) |
| CN (2) | CN102016837B (enExample) |
| TW (2) | TWI468954B (enExample) |
| WO (2) | WO2009070615A1 (enExample) |
Families Citing this family (44)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US8564544B2 (en) | 2006-09-06 | 2013-10-22 | Apple Inc. | Touch screen device, method, and graphical user interface for customizing display of content category icons |
| GB0624571D0 (en) * | 2006-12-08 | 2007-01-17 | Cambridge Silicon Radio Ltd | Authenticating Devices for Communications |
| US8689132B2 (en) | 2007-01-07 | 2014-04-01 | Apple Inc. | Portable electronic device, method, and graphical user interface for displaying electronic documents and lists |
| CN105117376B (zh) * | 2007-04-10 | 2018-07-10 | 谷歌有限责任公司 | 多模式输入法编辑器 |
| US8266514B2 (en) * | 2008-06-26 | 2012-09-11 | Microsoft Corporation | Map service |
| US9824071B2 (en) * | 2008-12-03 | 2017-11-21 | Microsoft Technology Licensing, Llc | Viewing messages and message attachments in different languages |
| US20120010870A1 (en) * | 2010-07-09 | 2012-01-12 | Vladimir Selegey | Electronic dictionary and dictionary writing system |
| US20120038652A1 (en) * | 2010-08-12 | 2012-02-16 | Palm, Inc. | Accepting motion-based character input on mobile computing devices |
| JP2012079252A (ja) * | 2010-10-06 | 2012-04-19 | Fujitsu Ltd | 情報端末装置、文字入力方法および文字入力プログラム |
| US8914743B2 (en) * | 2010-11-12 | 2014-12-16 | Apple Inc. | Device, method, and graphical user interface for navigating a list of identifiers |
| US20120156658A1 (en) * | 2010-12-16 | 2012-06-21 | Nicholas Fuzzell | Methods for teaching and/or learning chinese, and related systems |
| WO2012174703A1 (en) * | 2011-06-20 | 2012-12-27 | Microsoft Corporation | Hover translation of search result captions |
| JP2013041350A (ja) * | 2011-08-12 | 2013-02-28 | Panasonic Corp | タッチテーブルシステム |
| KR101870729B1 (ko) * | 2011-09-01 | 2018-07-20 | 삼성전자주식회사 | 휴대용 단말기의 번역 트리구조를 이용한 번역장치 및 방법 |
| KR20130080515A (ko) * | 2012-01-05 | 2013-07-15 | 삼성전자주식회사 | 디스플레이 장치 및 그 디스플레이 장치에 표시된 문자 편집 방법. |
| US9229928B2 (en) * | 2012-03-13 | 2016-01-05 | Nulu, Inc. | Language learning platform using relevant and contextual content |
| TWI449000B (zh) * | 2012-03-23 | 2014-08-11 | Chinese Foundation For Digitization Technology | Multimedia Chinese Character Learning Method |
| US9274609B2 (en) | 2012-07-23 | 2016-03-01 | Mingyan Xie | Inputting radical on touch screen device |
| US20140344670A1 (en) * | 2013-05-14 | 2014-11-20 | Pandaworks Inc. Dba Contentpanda | Method and system for on-demand delivery of predefined in-context web content |
| KR20150028627A (ko) * | 2013-09-06 | 2015-03-16 | 삼성전자주식회사 | 사용자 필기를 텍스트 정보로 변환하는 방법 및 이를 수행하기 위한 전자 기기 |
| JP2015060095A (ja) * | 2013-09-19 | 2015-03-30 | 株式会社東芝 | 音声翻訳装置、音声翻訳方法およびプログラム |
| WO2015112250A1 (en) * | 2014-01-22 | 2015-07-30 | Speak Agent, Inc. | Visual-kinesthetic language construction |
| CN104808806B (zh) * | 2014-01-28 | 2019-10-25 | 北京三星通信技术研究有限公司 | 根据不确定性信息实现汉字输入的方法和装置 |
| TW201530357A (zh) * | 2014-01-29 | 2015-08-01 | Chiu-Huei Teng | 用於電子裝置之中文輸入法 |
| RU2640322C2 (ru) * | 2014-01-30 | 2017-12-27 | Общество с ограниченной ответственностью "Аби Девелопмент" | Способы и системы эффективного автоматического распознавания символов |
| WO2015167556A1 (en) * | 2014-04-30 | 2015-11-05 | Hewlett-Packard Development Company, L.P. | Generating color similarity measures |
| WO2016029045A2 (en) * | 2014-08-21 | 2016-02-25 | Jobu Productions | Lexical dialect analysis system |
| JP6466138B2 (ja) * | 2014-11-04 | 2019-02-06 | 株式会社東芝 | 外国語文作成支援装置、方法及びプログラム |
| US20160147741A1 (en) * | 2014-11-26 | 2016-05-26 | Adobe Systems Incorporated | Techniques for providing a user interface incorporating sign language |
| US9740684B2 (en) * | 2015-02-18 | 2017-08-22 | Lenovo (Singapore) Pte. Ltd. | Determining homonyms of logogram input |
| CN106997245A (zh) * | 2016-01-24 | 2017-08-01 | 杨文韬 | 一种根据中文语言模型构建输入法词库的方法 |
| US10031949B2 (en) * | 2016-03-03 | 2018-07-24 | Tic Talking Holdings Inc. | Interest based content distribution |
| US10176623B2 (en) | 2016-05-02 | 2019-01-08 | Tic Talking Holdings Inc. | Facilitation of depiction of geographic relationships via a user interface |
| CN108346426B (zh) * | 2018-02-01 | 2020-12-08 | 威盛电子(深圳)有限公司 | 语音识别装置以及语音识别方法 |
| TWI659411B (zh) * | 2018-03-01 | 2019-05-11 | 大陸商芋頭科技(杭州)有限公司 | 一種多語言混合語音識別方法 |
| CN109147784B (zh) * | 2018-09-10 | 2021-06-08 | 百度在线网络技术(北京)有限公司 | 语音交互方法、设备以及存储介质 |
| US11017771B2 (en) * | 2019-01-18 | 2021-05-25 | Adobe Inc. | Voice command matching during testing of voice-assisted application prototypes for languages with non-phonetic alphabets |
| US10964322B2 (en) | 2019-01-23 | 2021-03-30 | Adobe Inc. | Voice interaction tool for voice-assisted application prototypes |
| TWI725608B (zh) * | 2019-11-11 | 2021-04-21 | 財團法人資訊工業策進會 | 語音合成系統、方法及非暫態電腦可讀取媒體 |
| CN111753556B (zh) * | 2020-06-24 | 2022-01-04 | 掌阅科技股份有限公司 | 双语对照阅读的方法、终端及计算机存储介质 |
| CN113536005B (zh) * | 2021-09-17 | 2021-12-24 | 网娱互动科技(北京)股份有限公司 | 一种相似图片或字体查找方法和系统 |
| WO2023146416A1 (en) * | 2022-01-28 | 2023-08-03 | John Chu | Character retrieval method and apparatus, electronic device and medium |
| CN116738966A (zh) * | 2022-03-01 | 2023-09-12 | 衍利行资产有限公司 | 一种分析包括中文字文本的方法和系统 |
| US12112128B2 (en) * | 2022-09-28 | 2024-10-08 | Korea Electric Power Corporation | Apparatus and method for generating word embedding library |
Citations (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN1144354A (zh) * | 1995-04-25 | 1997-03-05 | 齐兰发展股份有限公司 | 增强的字符录入系统 |
| CN1464430A (zh) * | 2002-06-11 | 2003-12-31 | 富士施乐株式会社 | 区分亚洲语言写入系统中组织名称的系统 |
| CN1581075A (zh) * | 2003-07-31 | 2005-02-16 | 国际商业机器公司 | 中文/英文词汇学习工具 |
Family Cites Families (19)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JPH01114976A (ja) * | 1987-10-28 | 1989-05-08 | Sharp Corp | 文書処理装置の辞書構造 |
| JPH0540747A (ja) * | 1991-08-07 | 1993-02-19 | Matsushita Electric Ind Co Ltd | ワードプロセツサー |
| JPH05151197A (ja) * | 1991-11-14 | 1993-06-18 | Chinka Oka | コンピユータに漢字を入力する方法 |
| US5257938A (en) * | 1992-01-30 | 1993-11-02 | Tien Hsin C | Game for encoding of ideographic characters simulating english alphabetic letters |
| US5923778A (en) * | 1996-06-12 | 1999-07-13 | Industrial Technology Research Institute | Hierarchical representation of reference database for an on-line Chinese character recognition system |
| JP2000163418A (ja) * | 1997-12-26 | 2000-06-16 | Canon Inc | 自然言語処理装置及びその方法、及びそのプログラムを格納した記憶媒体 |
| US7257528B1 (en) * | 1998-02-13 | 2007-08-14 | Zi Corporation Of Canada, Inc. | Method and apparatus for Chinese character text input |
| CN1145872C (zh) * | 1999-01-13 | 2004-04-14 | 国际商业机器公司 | 手写汉字自动分割和识别方法以及使用该方法的系统 |
| US6625335B1 (en) * | 2000-05-11 | 2003-09-23 | Matsushita Electric Industrial Co., Ltd. | Method and apparatus for assigning keywords to documents |
| JP3838857B2 (ja) * | 2000-09-19 | 2006-10-25 | 沖電気工業株式会社 | 辞書装置 |
| US20060139315A1 (en) * | 2001-01-17 | 2006-06-29 | Kim Min-Kyum | Apparatus and method for inputting alphabet characters on keypad |
| CN1403960A (zh) * | 2001-08-27 | 2003-03-19 | 无敌科技股份有限公司 | 通过电脑拼字的方法 |
| US7680649B2 (en) * | 2002-06-17 | 2010-03-16 | International Business Machines Corporation | System, method, program product, and networking use for recognizing words and their parts of speech in one or more natural languages |
| JP2005157472A (ja) * | 2003-11-20 | 2005-06-16 | Sharp Corp | 文字入力装置および文字入力方法 |
| TW200527226A (en) * | 2004-02-11 | 2005-08-16 | Cheng-Fu Lee | Chinese system for sorting and searching |
| KR20050092999A (ko) * | 2004-03-17 | 2005-09-23 | 샤프전자(주) | 전자사전에서의 한자검색방법 |
| US7523102B2 (en) * | 2004-06-12 | 2009-04-21 | Getty Images, Inc. | Content search in complex language, such as Japanese |
| US20070052868A1 (en) * | 2005-09-02 | 2007-03-08 | Charisma Communications, Inc. | Multimedia accessible universal input device |
| JP2007087216A (ja) * | 2005-09-22 | 2007-04-05 | Toshiba Corp | 階層型辞書作成装置、プログラムおよび階層型辞書作成方法 |
-
2008
- 2008-11-25 WO PCT/US2008/084750 patent/WO2009070615A1/en not_active Ceased
- 2008-11-25 TW TW97145512A patent/TWI468954B/zh not_active IP Right Cessation
- 2008-11-25 TW TW097145519A patent/TWI496012B/zh not_active IP Right Cessation
- 2008-11-25 JP JP2010535118A patent/JP2011509442A/ja active Pending
- 2008-11-25 WO PCT/US2008/084755 patent/WO2009070619A1/en not_active Ceased
- 2008-11-25 CN CN200880125478.XA patent/CN102016837B/zh not_active Expired - Fee Related
- 2008-11-25 US US12/744,801 patent/US8433709B2/en not_active Expired - Fee Related
- 2008-11-25 CN CN2008801254775A patent/CN102016836B/zh not_active Expired - Fee Related
- 2008-11-25 US US12/744,809 patent/US8521738B2/en not_active Expired - Fee Related
- 2008-11-25 JP JP2010535116A patent/JP5666307B2/ja not_active Expired - Fee Related
-
2014
- 2014-03-12 JP JP2014048371A patent/JP2014142951A/ja active Pending
-
2016
- 2016-06-23 JP JP2016124051A patent/JP2016186805A/ja active Pending
Patent Citations (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN1144354A (zh) * | 1995-04-25 | 1997-03-05 | 齐兰发展股份有限公司 | 增强的字符录入系统 |
| CN1464430A (zh) * | 2002-06-11 | 2003-12-31 | 富士施乐株式会社 | 区分亚洲语言写入系统中组织名称的系统 |
| CN1581075A (zh) * | 2003-07-31 | 2005-02-16 | 国际商业机器公司 | 中文/英文词汇学习工具 |
Also Published As
| Publication number | Publication date |
|---|---|
| US20110320468A1 (en) | 2011-12-29 |
| CN102016837A (zh) | 2011-04-13 |
| CN102016836A (zh) | 2011-04-13 |
| WO2009070619A1 (en) | 2009-06-04 |
| HK1156710A1 (en) | 2012-06-15 |
| TW200945065A (en) | 2009-11-01 |
| JP2011505040A (ja) | 2011-02-17 |
| JP2016186805A (ja) | 2016-10-27 |
| TWI496012B (zh) | 2015-08-11 |
| WO2009070615A1 (en) | 2009-06-04 |
| TW200945066A (en) | 2009-11-01 |
| US8433709B2 (en) | 2013-04-30 |
| TWI468954B (zh) | 2015-01-11 |
| JP2011509442A (ja) | 2011-03-24 |
| JP2014142951A (ja) | 2014-08-07 |
| JP5666307B2 (ja) | 2015-02-12 |
| HK1156418A1 (en) | 2012-06-08 |
| US20100257173A1 (en) | 2010-10-07 |
| US8521738B2 (en) | 2013-08-27 |
| CN102016836B (zh) | 2013-03-13 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| CN102016837B (zh) | 中文型文字及文字偏旁的分类及检索的系统与方法 | |
| Van Atteveldt et al. | Computational analysis of communication | |
| US6721451B1 (en) | Apparatus and method for reading a document image | |
| CN102298582B (zh) | 数据搜索和匹配方法和系统 | |
| US5586198A (en) | Method and apparatus for identifying characters in ideographic alphabet | |
| CN102449579B (zh) | 一体式中文字输入方法 | |
| US8261200B2 (en) | Increasing retrieval performance of images by providing relevance feedback on word images contained in the images | |
| US8015203B2 (en) | Document recognizing apparatus and method | |
| CA2775879C (en) | Systems and methods for processing data | |
| US20100083173A1 (en) | Method and system for applying metadata to data sets of file objects | |
| JP2016186805A5 (enExample) | ||
| US20120109994A1 (en) | Robust auto-correction for data retrieval | |
| CN115310436A (zh) | 一种文档提纲的抽取方法、装置、电子设备及存储介质 | |
| JP4972271B2 (ja) | 検索結果提示装置 | |
| CN112989011B (zh) | 数据查询方法、数据查询装置和电子设备 | |
| JP2008262248A (ja) | 文字検索方法 | |
| CN1373431A (zh) | 显示生字的方法及显示数字文章的电子装置 | |
| HK1156710B (en) | System and method for classification and retrieval of chinese-type characters and character components | |
| JP5741298B2 (ja) | 辞書作成装置、辞書作成方法、およびプログラム | |
| Tanaka-Ishii et al. | Kansuke: A logograph look-up interface based on a few modified stroke prototypes | |
| Balasubramanian | Document Annotation and Retrieval Systems | |
| Tarte et al. | Digital Palaeography: New Machines and Old Texts (Dagstuhl Seminar 14302) | |
| HK1170818B (en) | Robust auto-correction for data retrieval | |
| HK1156418B (en) | Modular system and method for managing chinese, japanese, and korean linguistic data in electronic form | |
| TW201528007A (zh) | 多元拼音字典檢索系統 |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| C06 | Publication | ||
| PB01 | Publication | ||
| C10 | Entry into substantive examination | ||
| SE01 | Entry into force of request for substantive examination | ||
| REG | Reference to a national code |
Ref country code: HK Ref legal event code: DE Ref document number: 1156710 Country of ref document: HK |
|
| C14 | Grant of patent or utility model | ||
| GR01 | Patent grant | ||
| REG | Reference to a national code |
Ref country code: HK Ref legal event code: GR Ref document number: 1156710 Country of ref document: HK |
|
| CF01 | Termination of patent right due to non-payment of annual fee |
Granted publication date: 20140820 |
|
| CF01 | Termination of patent right due to non-payment of annual fee |