JP6273285B2 - 電子文字列をフォーマットするためのフォーマットモジュール、システム及び方法 - Google Patents
電子文字列をフォーマットするためのフォーマットモジュール、システム及び方法 Download PDFInfo
- Publication number
- JP6273285B2 JP6273285B2 JP2015531650A JP2015531650A JP6273285B2 JP 6273285 B2 JP6273285 B2 JP 6273285B2 JP 2015531650 A JP2015531650 A JP 2015531650A JP 2015531650 A JP2015531650 A JP 2015531650A JP 6273285 B2 JP6273285 B2 JP 6273285B2
- Authority
- JP
- Japan
- Prior art keywords
- rule
- language
- rules
- character
- string
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/10—Text processing
- G06F40/12—Use of codes for handling textual entities
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/24—Querying
- G06F16/245—Query processing
- G06F16/2455—Query execution
- G06F16/24564—Applying rules; Deductive queries
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/10—Text processing
- G06F40/12—Use of codes for handling textual entities
- G06F40/163—Handling of whitespace
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/274—Converting codes to words; Guess-ahead of partial word inputs
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/263—Language identification
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Audiology, Speech & Language Pathology (AREA)
- General Health & Medical Sciences (AREA)
- Health & Medical Sciences (AREA)
- Artificial Intelligence (AREA)
- Data Mining & Analysis (AREA)
- Databases & Information Systems (AREA)
- Machine Translation (AREA)
- Document Processing Apparatus (AREA)
Applications Claiming Priority (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| GBGB1216640.1A GB201216640D0 (en) | 2012-09-18 | 2012-09-18 | Formatting module, system and method for formatting an electronic character sequence |
| GB1216640.1 | 2012-09-18 | ||
| PCT/GB2013/052443 WO2014045032A1 (en) | 2012-09-18 | 2013-09-18 | Formatting module, system and method for formatting an electronic character sequence |
Publications (3)
| Publication Number | Publication Date |
|---|---|
| JP2015534171A JP2015534171A (ja) | 2015-11-26 |
| JP2015534171A5 JP2015534171A5 (enExample) | 2016-10-06 |
| JP6273285B2 true JP6273285B2 (ja) | 2018-01-31 |
Family
ID=47144444
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| JP2015531650A Expired - Fee Related JP6273285B2 (ja) | 2012-09-18 | 2013-09-18 | 電子文字列をフォーマットするためのフォーマットモジュール、システム及び方法 |
Country Status (6)
| Country | Link |
|---|---|
| US (2) | US20150248379A1 (enExample) |
| EP (1) | EP2898426A1 (enExample) |
| JP (1) | JP6273285B2 (enExample) |
| CN (1) | CN104641367B (enExample) |
| GB (1) | GB201216640D0 (enExample) |
| WO (1) | WO2014045032A1 (enExample) |
Families Citing this family (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2014122733A1 (ja) * | 2013-02-06 | 2014-08-14 | 株式会社日立製作所 | 計算機、データアクセス管理方法及び記録媒体 |
| CN106909296A (zh) | 2016-06-07 | 2017-06-30 | 阿里巴巴集团控股有限公司 | 数据的提取方法、装置及终端设备 |
| JP7566520B2 (ja) * | 2020-07-17 | 2024-10-15 | キヤノン株式会社 | 画像処理装置、方法、プログラム |
| JP7724676B2 (ja) * | 2021-10-05 | 2025-08-18 | 株式会社日本総合研究所 | 情報処理方法、プログラム及び情報処理装置 |
Family Cites Families (35)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US4247906A (en) | 1978-11-13 | 1981-01-27 | Wang Laboratories, Inc. | Text editing system having flexible repetitive operation capability |
| US4783803A (en) * | 1985-11-12 | 1988-11-08 | Dragon Systems, Inc. | Speech recognition apparatus and method |
| JPH0762841B2 (ja) | 1986-06-27 | 1995-07-05 | 横河・ヒユ−レツト・パツカ−ド株式会社 | 文書清書装置 |
| US5222225A (en) * | 1988-10-07 | 1993-06-22 | International Business Machines Corporation | Apparatus for processing character string moves in a data processing system |
| US5062143A (en) * | 1990-02-23 | 1991-10-29 | Harris Corporation | Trigram-based method of language identification |
| US5937420A (en) * | 1996-07-23 | 1999-08-10 | Adobe Systems Incorporated | Pointsize-variable character spacing |
| KR100213910B1 (ko) * | 1997-03-26 | 1999-08-02 | 윤종용 | 한영 자동 변환기 및 방법 |
| US6513002B1 (en) * | 1998-02-11 | 2003-01-28 | International Business Machines Corporation | Rule-based number formatter |
| US6529864B1 (en) * | 1999-08-11 | 2003-03-04 | Roedy-Black Publishing, Inc. | Interactive connotative dictionary system |
| US6374242B1 (en) * | 1999-09-29 | 2002-04-16 | Lockheed Martin Corporation | Natural-language information processor with association searches limited within blocks |
| US20020123994A1 (en) | 2000-04-26 | 2002-09-05 | Yves Schabes | System for fulfilling an information need using extended matching techniques |
| US20040078191A1 (en) * | 2002-10-22 | 2004-04-22 | Nokia Corporation | Scalable neural network-based language identification from written text |
| US7580838B2 (en) * | 2002-11-22 | 2009-08-25 | Scansoft, Inc. | Automatic insertion of non-verbalized punctuation |
| US20060184878A1 (en) * | 2005-02-11 | 2006-08-17 | Microsoft Corporation | Using a description language to provide a user interface presentation |
| US8027832B2 (en) * | 2005-02-11 | 2011-09-27 | Microsoft Corporation | Efficient language identification |
| JP4135950B2 (ja) * | 2005-06-09 | 2008-08-20 | インターナショナル・ビジネス・マシーンズ・コーポレーション | アクセス管理装置、アクセス管理方法、およびプログラム |
| CN100382022C (zh) * | 2005-09-09 | 2008-04-16 | 华为技术有限公司 | 一种接口数据文法分析处理系统及其分析处理方法 |
| US7552045B2 (en) * | 2006-12-18 | 2009-06-23 | Nokia Corporation | Method, apparatus and computer program product for providing flexible text based language identification |
| US8527262B2 (en) | 2007-06-22 | 2013-09-03 | International Business Machines Corporation | Systems and methods for automatic semantic role labeling of high morphological text for natural language processing applications |
| US8783570B2 (en) * | 2007-08-21 | 2014-07-22 | Symbol Technologies, Inc. | Reader with optical character recognition |
| US8306356B1 (en) * | 2007-09-28 | 2012-11-06 | Language Technologies, Inc. | System, plug-in, and method for improving text composition by modifying character prominence according to assigned character information measures |
| US8706474B2 (en) * | 2008-02-23 | 2014-04-22 | Fair Isaac Corporation | Translation of entity names based on source document publication date, and frequency and co-occurrence of the entity names |
| KR101496885B1 (ko) | 2008-04-07 | 2015-02-27 | 삼성전자주식회사 | 문장 띄어쓰기 시스템 및 방법 |
| US8224641B2 (en) * | 2008-11-19 | 2012-07-17 | Stratify, Inc. | Language identification for documents containing multiple languages |
| JP4701292B2 (ja) * | 2009-01-05 | 2011-06-15 | インターナショナル・ビジネス・マシーンズ・コーポレーション | テキスト・データに含まれる固有表現又は専門用語から用語辞書を作成するためのコンピュータ・システム、並びにその方法及びコンピュータ・プログラム |
| US8879846B2 (en) * | 2009-02-10 | 2014-11-04 | Kofax, Inc. | Systems, methods and computer program products for processing financial documents |
| GB0905457D0 (en) | 2009-03-30 | 2009-05-13 | Touchtype Ltd | System and method for inputting text into electronic devices |
| GB201016385D0 (en) * | 2010-09-29 | 2010-11-10 | Touchtype Ltd | System and method for inputting text into electronic devices |
| KR101638594B1 (ko) * | 2010-05-26 | 2016-07-20 | 삼성전자주식회사 | Dna 서열 검색 방법 및 장치 |
| WO2012098544A2 (en) | 2011-01-19 | 2012-07-26 | Keyless Systems, Ltd. | Improved data entry systems |
| US20120262461A1 (en) * | 2011-02-17 | 2012-10-18 | Conversive, Inc. | System and Method for the Normalization of Text |
| WO2014042976A2 (en) * | 2012-09-15 | 2014-03-20 | Numbergun Llc, A Utah Limited Liability Company | Flexible high-speed generation and formatting of application-specified strings |
| US20140136967A1 (en) | 2012-11-09 | 2014-05-15 | Research In Motion Limited | Method of providing predictive text |
| US9811517B2 (en) | 2013-01-29 | 2017-11-07 | Tencent Technology (Shenzhen) Company Limited | Method and system of adding punctuation and establishing language model using a punctuation weighting applied to chinese speech recognized text |
| US8943405B1 (en) | 2013-11-27 | 2015-01-27 | Google Inc. | Assisted punctuation of character strings |
-
2012
- 2012-09-18 GB GBGB1216640.1A patent/GB201216640D0/en not_active Ceased
-
2013
- 2013-09-18 CN CN201380048564.6A patent/CN104641367B/zh active Active
- 2013-09-18 WO PCT/GB2013/052443 patent/WO2014045032A1/en not_active Ceased
- 2013-09-18 US US14/428,972 patent/US20150248379A1/en not_active Abandoned
- 2013-09-18 EP EP13771173.5A patent/EP2898426A1/en not_active Ceased
- 2013-09-18 JP JP2015531650A patent/JP6273285B2/ja not_active Expired - Fee Related
-
2023
- 2023-04-19 US US18/136,730 patent/US12182496B2/en active Active
Also Published As
| Publication number | Publication date |
|---|---|
| CN104641367B (zh) | 2019-01-11 |
| JP2015534171A (ja) | 2015-11-26 |
| US12182496B2 (en) | 2024-12-31 |
| US20150248379A1 (en) | 2015-09-03 |
| US20230252222A1 (en) | 2023-08-10 |
| EP2898426A1 (en) | 2015-07-29 |
| GB201216640D0 (en) | 2012-10-31 |
| CN104641367A (zh) | 2015-05-20 |
| WO2014045032A1 (en) | 2014-03-27 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| Kenton et al. | Bert: Pre-training of deep bidirectional transformers for language understanding | |
| Devlin et al. | Bert: Pre-training of deep bidirectional transformers for language understanding | |
| KR101524740B1 (ko) | 입력 방법 편집기 | |
| KR102268875B1 (ko) | 전자 장치에 텍스트를 입력하는 시스템 및 방법 | |
| US12182496B2 (en) | Formatting module, system and method for formatting an electronic character sequence | |
| WO2020051192A1 (en) | Dialogue systems | |
| CN103703459A (zh) | 基于字符变换和无监督网络数据的文本消息规格化方法和系统 | |
| WO2018005203A1 (en) | Leveraging information available in a corpus for data parsing and predicting | |
| WO2011131785A1 (en) | Normalisation of noisy typewritten texts | |
| WO2008103894A1 (en) | Automated word-form transformation and part of speech tag assignment | |
| Luong et al. | Lig system for word level qe task at wmt14 | |
| Fischbach et al. | Fine-grained causality extraction from natural language requirements using recursive neural tensor networks | |
| Xiao et al. | Information extraction from the web: System and techniques | |
| CN114661917B (zh) | 文本扩增方法、系统、计算机设备及可读存储介质 | |
| KR20100062834A (ko) | 번역 오류 후처리 보정 장치 및 방법 | |
| Bhargava et al. | Query Labelling for Indic Languages using a hybrid approach. | |
| CN113688615A (zh) | 一种字段注释生成、字符串理解方法、设备及存储介质 | |
| Ding et al. | Span-Oriented Information Extraction--A unifying perspective on information extraction | |
| JPWO2012124301A1 (ja) | 関連仕様対応付けシステム、関連仕様対応付け方法およびプログラム | |
| CN107203512B (zh) | 用于从用户的自然语言输入中提取关键元素的方法 | |
| Ding et al. | Span-Oriented Information Extraction: A Unified Framework | |
| Ullah et al. | Part-Of-Speech Tagging for Balochi Language: A Data driven application of Conditional Random Fields | |
| Faiz et al. | Semantic event extraction from biological texts using a Kernel-based method | |
| Mahte et al. | Emoticon Suggestion with Word Prediction using Natural Language Processing | |
| Kalykulova et al. | T-Extractor: A Hybrid Unsupervised Approach for Term and Named Entity Extraction Using Rules, Statistical, and Semantic Methods |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| RD03 | Notification of appointment of power of attorney |
Free format text: JAPANESE INTERMEDIATE CODE: A7423 Effective date: 20160421 |
|
| RD04 | Notification of resignation of power of attorney |
Free format text: JAPANESE INTERMEDIATE CODE: A7424 Effective date: 20160421 |
|
| A521 | Request for written amendment filed |
Free format text: JAPANESE INTERMEDIATE CODE: A523 Effective date: 20160815 |
|
| A621 | Written request for application examination |
Free format text: JAPANESE INTERMEDIATE CODE: A621 Effective date: 20160815 |
|
| A977 | Report on retrieval |
Free format text: JAPANESE INTERMEDIATE CODE: A971007 Effective date: 20170810 |
|
| A131 | Notification of reasons for refusal |
Free format text: JAPANESE INTERMEDIATE CODE: A131 Effective date: 20170831 |
|
| A521 | Request for written amendment filed |
Free format text: JAPANESE INTERMEDIATE CODE: A523 Effective date: 20171128 |
|
| TRDD | Decision of grant or rejection written | ||
| A01 | Written decision to grant a patent or to grant a registration (utility model) |
Free format text: JAPANESE INTERMEDIATE CODE: A01 Effective date: 20171211 |
|
| A61 | First payment of annual fees (during grant procedure) |
Free format text: JAPANESE INTERMEDIATE CODE: A61 Effective date: 20180105 |
|
| R150 | Certificate of patent or registration of utility model |
Ref document number: 6273285 Country of ref document: JP Free format text: JAPANESE INTERMEDIATE CODE: R150 |
|
| S111 | Request for change of ownership or part of ownership |
Free format text: JAPANESE INTERMEDIATE CODE: R313113 |
|
| R350 | Written notification of registration of transfer |
Free format text: JAPANESE INTERMEDIATE CODE: R350 |
|
| R250 | Receipt of annual fees |
Free format text: JAPANESE INTERMEDIATE CODE: R250 |
|
| R250 | Receipt of annual fees |
Free format text: JAPANESE INTERMEDIATE CODE: R250 |
|
| R250 | Receipt of annual fees |
Free format text: JAPANESE INTERMEDIATE CODE: R250 |
|
| R250 | Receipt of annual fees |
Free format text: JAPANESE INTERMEDIATE CODE: R250 |
|
| LAPS | Cancellation because of no payment of annual fees |