JP2005302023A5 - Google Patents

Publication number
JP2005302023A5
Authority
JP
Japan
Prior art keywords
capitalization
training
word
computer
procedure
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
JP2005110069A
Other languages
English (en)
Japanese (ja)
Other versions
JP4672418B2 (ja)
JP2005302023A (ja)
Priority claimed from US10/819,023 (US7827025B2)
Application filed
Publication of JP2005302023A
Publication of JP2005302023A5
Application granted
Publication of JP4672418B2
Anticipated expiration
Expired - Fee Related (current legal status)

JP2005110069A 2004-04-06 2005-04-06 Efficient capitalization through user modeling Expired - Fee Related JP4672418B2 (ja)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
US10/819,023 US7827025B2 (en) 2004-04-06 2004-04-06 Efficient capitalization through user modeling

Publications (3)

Publication Number Publication Date
JP2005302023A JP2005302023A (ja) 2005-10-27
JP2005302023A5 JP2005302023A5 2008-05-22
JP4672418B2 JP4672418B2 (ja) 2011-04-20

Family

ID=34912696

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2005110069A Expired - Fee Related JP4672418B2 (ja) 2004-04-06 2005-04-06 Efficient capitalization through user modeling

Country Status (7)

Country Link
US (1) US7827025B2
EP (1) EP1585030B1
JP (1) JP4672418B2
KR (1) KR101122887B1
CN (1) CN1680935B
AT (1) ATE497213T1
DE (1) DE602005026077D1

Families Citing this family (28)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
AU2002316581A1 (en) 2001-07-03 2003-01-21 University Of Southern California A syntax-based statistical translation model
US7620538B2 (en) 2002-03-26 2009-11-17 University Of Southern California Constructing a translation lexicon from comparable, non-parallel corpora
US8548794B2 (en) 2003-07-02 2013-10-01 University Of Southern California Statistical noun phrase translation
US8296127B2 (en) 2004-03-23 2012-10-23 University Of Southern California Discovery of parallel text portions in comparable collections of corpora and training using comparable texts
US8666725B2 (en) 2004-04-16 2014-03-04 University Of Southern California Selection and use of nonstatistical translation components in a statistical machine translation framework
US8886517B2 (en) 2005-06-17 2014-11-11 Language Weaver, Inc. Trust scoring for language translation systems
US8676563B2 (en) 2009-10-01 2014-03-18 Language Weaver, Inc. Providing human-generated and machine-generated trusted translations
US10319252B2 (en) 2005-11-09 2019-06-11 Sdl Inc. Language capability assessment and training apparatus and techniques
US8943080B2 (en) 2006-04-07 2015-01-27 University Of Southern California Systems and methods for identifying parallel documents and sentence fragments in multilingual document collections
US8886518B1 (en) * 2006-08-07 2014-11-11 Language Weaver, Inc. System and method for capitalizing machine translated text
US8433556B2 (en) 2006-11-02 2013-04-30 University Of Southern California Semi-supervised training for statistical word alignment
US9122674B1 (en) 2006-12-15 2015-09-01 Language Weaver, Inc. Use of annotations in statistical machine translation
US8468149B1 (en) 2007-01-26 2013-06-18 Language Weaver, Inc. Multi-lingual online community
US8615389B1 (en) 2007-03-16 2013-12-24 Language Weaver, Inc. Generation and exploitation of an approximate language model
US8831928B2 (en) 2007-04-04 2014-09-09 Language Weaver, Inc. Customizable machine translation service
US8825466B1 (en) 2007-06-08 2014-09-02 Language Weaver, Inc. Modification of annotated bilingual segment pairs in syntax-based machine translation
US8972855B2 (en) * 2008-12-16 2015-03-03 At&T Intellectual Property I, L.P. Method and apparatus for providing case restoration
US8990064B2 (en) 2009-07-28 2015-03-24 Language Weaver, Inc. Translating documents based on content
US8380486B2 (en) 2009-10-01 2013-02-19 Language Weaver, Inc. Providing machine-generated translations and corresponding trust levels
US10417646B2 (en) 2010-03-09 2019-09-17 Sdl Inc. Predicting the cost associated with translating textual content
US11003838B2 (en) 2011-04-18 2021-05-11 Sdl Inc. Systems and methods for monitoring post translation editing
US8694303B2 (en) 2011-06-15 2014-04-08 Language Weaver, Inc. Systems and methods for tuning parameters in statistical machine translation
US8886515B2 (en) 2011-10-19 2014-11-11 Language Weaver, Inc. Systems and methods for enhancing machine translation post edit review processes
US8942973B2 (en) 2012-03-09 2015-01-27 Language Weaver, Inc. Content page URL translation
US10261994B2 (en) 2012-05-25 2019-04-16 Sdl Inc. Method and system for automatic management of reputation of translators
US9152622B2 (en) 2012-11-26 2015-10-06 Language Weaver, Inc. Personalized machine translation via online adaptation
US9213694B2 (en) 2013-10-10 2015-12-15 Language Weaver, Inc. Efficient online domain adaptation
US10733235B2 (en) * 2015-06-09 2020-08-04 Patricia Henery Aid for dyslexic readers

Family Cites Families (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2644995B2 (ja) 1986-09-09 1997-08-25 Toshiba Corp Document processing method
DE4323241A1 (de) * 1993-07-12 1995-02-02 Ibm Method and computer system for finding erroneous character strings in a text
US5761689A (en) * 1994-09-01 1998-06-02 Microsoft Corporation Autocorrecting text typed into a word processing document
CN1180204A (zh) * 1996-05-02 1998-04-29 Microsoft Corporation Method and system for dictionary processing of capitalized and unaccented text
US5819265A (en) 1996-07-12 1998-10-06 International Business Machines Corporation Processing names in a text
US6618697B1 (en) * 1999-05-14 2003-09-09 Justsystem Corporation Method for rule-based correction of spelling and grammar errors
US6981040B1 (en) * 1999-12-28 2005-12-27 Utopy, Inc. Automatic, personalized online information and product services
US6490549B1 (en) * 2000-03-30 2002-12-03 Scansoft, Inc. Automatic orthographic transformation of a text stream
JP2002169834A (ja) 2000-11-20 2002-06-14 Hewlett Packard Co <Hp> Computer and method for performing vector analysis of documents
US6922809B2 (en) * 2001-01-25 2005-07-26 International Business Machines Corporation Method and apparatus providing capitalization recovery for text
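JP2003167901A (ja) 2001-11-29 2003-06-13 Kddi Corp Collaborative filtering method, collaborative filtering apparatus, and collaborative filtering program
CN1679022B (zh) * 2002-07-23 2010-06-09 Research In Motion Limited System and method for building and using a customized word list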
US6873996B2 (en) * 2003-04-16 2005-03-29 Yahoo! Inc. Affinity analysis method and article of manufacture
US7447627B2 (en) * 2003-10-23 2008-11-04 Microsoft Corporation Compound word breaker and spell checker

Similar Documents

Publication Publication Date Title
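JP2005302023A5
CN110705260B (zh) Text vector generation method based on an unsupervised graph neural network structure
CN103744905B (zh) Spam email determination method and apparatus
CN104615767B (zh) Training method for a search ranking model, search processing method, and apparatus
CN104281653B (zh) Opinion mining method for microblog text at the scale of tens of millions
US8983963B2 (en) Techniques for comparing and clustering documents
CN105426360A (zh) Keyword extraction method and apparatus
CN103593336B (zh) Knowledge push system and method based on semantic analysis
JP2006004411A5
CN107562831A (zh) Exact search method based on full-text retrieval
CN105844424A (zh) Method for product quality problem discovery and risk assessment based on online reviews
CN107766318A (zh) Keyword extraction method, apparatus, and electronic device
CN106933787A (zh) Method for calculating similarity of judgment documents, search apparatus, and computer device
CN104361059B (zh) Harmful information identification and web page classification method based on multiple-instance learning
CN106021383A (zh) Web page similarity calculation method and apparatus
CN105956095B (zh) Method for constructing a psychological early-warning model based on a fine-grained sentiment lexicon
CN105740229A (zh) Keyword extraction method and apparatus
CN107402960B (zh) Inverted index optimization algorithm based on semantic and tone weighting
CN104462399B (zh) Search result processing method and apparatus
CN102789449A (zh) Method and apparatus for evaluating review text
CN106407473B (zh) Method and system for obtaining an event timeline based on event similarity modeling
CN105956192A (zh) Method and system for obtaining abbreviated organization names based on website home page information
CN105740448B (zh) Topic-oriented multi-microblog temporal summarization method
CN103324641B (zh) Information record recommendation method and apparatus
CN107895024A (zh) User model construction method and recommendation method for web news classification and recommendation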