KR101122887B1 - 사용자 모델링을 통한 효율적인 대문자화 - Google Patents

사용자 모델링을 통한 효율적인 대문자화 Download PDF

Info

Publication number
KR101122887B1
KR101122887B1 KR1020050028607A KR20050028607A KR101122887B1 KR 101122887 B1 KR101122887 B1 KR 101122887B1 KR 1020050028607 A KR1020050028607 A KR 1020050028607A KR 20050028607 A KR20050028607 A KR 20050028607A KR 101122887 B1 KR101122887 B1 KR 101122887B1
Authority
KR
South Korea
Prior art keywords
capitalization
word
model
training
user
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
KR1020050028607A
Other languages
English (en)
Korean (ko)
Other versions
KR20060045535A (ko
Inventor
동 유
피터 케이.엘. 마우
Original Assignee
마이크로소프트 코포레이션
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 마이크로소프트 코포레이션 filed Critical 마이크로소프트 코포레이션
Publication of KR20060045535A publication Critical patent/KR20060045535A/ko
Application granted granted Critical
Publication of KR101122887B1 publication Critical patent/KR101122887B1/ko
Anticipated expiration legal-status Critical
Expired - Fee Related legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/232Orthographic correction, e.g. spell checking or vowelisation
    • EFIXED CONSTRUCTIONS
    • E03WATER SUPPLY; SEWERAGE
    • E03FSEWERS; CESSPOOLS
    • E03F5/00Sewerage structures
    • E03F5/04Gullies inlets, road sinks, floor drains with or without odour seals or sediment traps
    • E03F5/042Arrangements of means against overflow of water, backing-up from the drain
    • EFIXED CONSTRUCTIONS
    • E03WATER SUPPLY; SEWERAGE
    • E03FSEWERS; CESSPOOLS
    • E03F5/00Sewerage structures
    • E03F5/04Gullies inlets, road sinks, floor drains with or without odour seals or sediment traps
    • E03F2005/0416Gullies inlets, road sinks, floor drains with or without odour seals or sediment traps with an odour seal
    • E03F2005/0417Gullies inlets, road sinks, floor drains with or without odour seals or sediment traps with an odour seal in the form of a valve

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Artificial Intelligence (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • General Health & Medical Sciences (AREA)
  • Water Supply & Treatment (AREA)
  • Public Health (AREA)
  • Hydrology & Water Resources (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Machine Translation (AREA)
  • Document Processing Apparatus (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)
  • Organic Low-Molecular-Weight Compounds And Preparation Thereof (AREA)
  • Acyclic And Carbocyclic Compounds In Medicinal Compositions (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
KR1020050028607A 2004-04-06 2005-04-06 사용자 모델링을 통한 효율적인 대문자화 Expired - Fee Related KR101122887B1 (ko)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US10/819,023 2004-04-06
US10/819,023 US7827025B2 (en) 2004-04-06 2004-04-06 Efficient capitalization through user modeling

Publications (2)

Publication Number Publication Date
KR20060045535A KR20060045535A (ko) 2006-05-17
KR101122887B1 true KR101122887B1 (ko) 2012-03-22

Family

ID=34912696

Family Applications (1)

Application Number Title Priority Date Filing Date
KR1020050028607A Expired - Fee Related KR101122887B1 (ko) 2004-04-06 2005-04-06 사용자 모델링을 통한 효율적인 대문자화

Country Status (7)

Country Link
US (1) US7827025B2 (enExample)
EP (1) EP1585030B1 (enExample)
JP (1) JP4672418B2 (enExample)
KR (1) KR101122887B1 (enExample)
CN (1) CN1680935B (enExample)
AT (1) ATE497213T1 (enExample)
DE (1) DE602005026077D1 (enExample)

Families Citing this family (28)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
AU2002316581A1 (en) 2001-07-03 2003-01-21 University Of Southern California A syntax-based statistical translation model
US7620538B2 (en) 2002-03-26 2009-11-17 University Of Southern California Constructing a translation lexicon from comparable, non-parallel corpora
US8548794B2 (en) 2003-07-02 2013-10-01 University Of Southern California Statistical noun phrase translation
US8296127B2 (en) 2004-03-23 2012-10-23 University Of Southern California Discovery of parallel text portions in comparable collections of corpora and training using comparable texts
US8666725B2 (en) 2004-04-16 2014-03-04 University Of Southern California Selection and use of nonstatistical translation components in a statistical machine translation framework
US8886517B2 (en) 2005-06-17 2014-11-11 Language Weaver, Inc. Trust scoring for language translation systems
US8676563B2 (en) 2009-10-01 2014-03-18 Language Weaver, Inc. Providing human-generated and machine-generated trusted translations
US10319252B2 (en) 2005-11-09 2019-06-11 Sdl Inc. Language capability assessment and training apparatus and techniques
US8943080B2 (en) 2006-04-07 2015-01-27 University Of Southern California Systems and methods for identifying parallel documents and sentence fragments in multilingual document collections
US8886518B1 (en) * 2006-08-07 2014-11-11 Language Weaver, Inc. System and method for capitalizing machine translated text
US8433556B2 (en) 2006-11-02 2013-04-30 University Of Southern California Semi-supervised training for statistical word alignment
US9122674B1 (en) 2006-12-15 2015-09-01 Language Weaver, Inc. Use of annotations in statistical machine translation
US8468149B1 (en) 2007-01-26 2013-06-18 Language Weaver, Inc. Multi-lingual online community
US8615389B1 (en) 2007-03-16 2013-12-24 Language Weaver, Inc. Generation and exploitation of an approximate language model
US8831928B2 (en) 2007-04-04 2014-09-09 Language Weaver, Inc. Customizable machine translation service
US8825466B1 (en) 2007-06-08 2014-09-02 Language Weaver, Inc. Modification of annotated bilingual segment pairs in syntax-based machine translation
US8972855B2 (en) * 2008-12-16 2015-03-03 At&T Intellectual Property I, L.P. Method and apparatus for providing case restoration
US8990064B2 (en) 2009-07-28 2015-03-24 Language Weaver, Inc. Translating documents based on content
US8380486B2 (en) 2009-10-01 2013-02-19 Language Weaver, Inc. Providing machine-generated translations and corresponding trust levels
US10417646B2 (en) 2010-03-09 2019-09-17 Sdl Inc. Predicting the cost associated with translating textual content
US11003838B2 (en) 2011-04-18 2021-05-11 Sdl Inc. Systems and methods for monitoring post translation editing
US8694303B2 (en) 2011-06-15 2014-04-08 Language Weaver, Inc. Systems and methods for tuning parameters in statistical machine translation
US8886515B2 (en) 2011-10-19 2014-11-11 Language Weaver, Inc. Systems and methods for enhancing machine translation post edit review processes
US8942973B2 (en) 2012-03-09 2015-01-27 Language Weaver, Inc. Content page URL translation
US10261994B2 (en) 2012-05-25 2019-04-16 Sdl Inc. Method and system for automatic management of reputation of translators
US9152622B2 (en) 2012-11-26 2015-10-06 Language Weaver, Inc. Personalized machine translation via online adaptation
US9213694B2 (en) 2013-10-10 2015-12-15 Language Weaver, Inc. Efficient online domain adaptation
US10733235B2 (en) * 2015-06-09 2020-08-04 Patricia Henery Aid for dyslexic readers

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20020099744A1 (en) * 2001-01-25 2002-07-25 International Business Machines Corporation Method and apparatus providing capitalization recovery for text
JP2003167901A (ja) 2001-11-29 2003-06-13 Kddi Corp 協調フィルタリング方法、協調フィルタリング装置及び協調フィルタリングプログラム

Family Cites Families (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2644995B2 (ja) 1986-09-09 1997-08-25 株式会社東芝 文書処理方法
DE4323241A1 (de) * 1993-07-12 1995-02-02 Ibm Verfahren und Computersystem zur Suche fehlerhafter Zeichenketten in einem Text
US5761689A (en) * 1994-09-01 1998-06-02 Microsoft Corporation Autocorrecting text typed into a word processing document
CN1180204A (zh) * 1996-05-02 1998-04-29 微软公司 大写和无重音文本的词典处理的方法和系统
US5819265A (en) 1996-07-12 1998-10-06 International Business Machines Corporation Processing names in a text
US6618697B1 (en) * 1999-05-14 2003-09-09 Justsystem Corporation Method for rule-based correction of spelling and grammar errors
US6981040B1 (en) * 1999-12-28 2005-12-27 Utopy, Inc. Automatic, personalized online information and product services
US6490549B1 (en) * 2000-03-30 2002-12-03 Scansoft, Inc. Automatic orthographic transformation of a text stream
JP2002169834A (ja) 2000-11-20 2002-06-14 Hewlett Packard Co <Hp> 文書のベクトル解析を行うコンピュータおよび方法
CN1679022B (zh) * 2002-07-23 2010-06-09 捷讯研究有限公司 用于构建和使用定制单词列表的系统和方法
US6873996B2 (en) * 2003-04-16 2005-03-29 Yahoo! Inc. Affinity analysis method and article of manufacture
US7447627B2 (en) * 2003-10-23 2008-11-04 Microsoft Corporation Compound word breaker and spell checker

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20020099744A1 (en) * 2001-01-25 2002-07-25 International Business Machines Corporation Method and apparatus providing capitalization recovery for text
JP2003167901A (ja) 2001-11-29 2003-06-13 Kddi Corp 協調フィルタリング方法、協調フィルタリング装置及び協調フィルタリングプログラム

Also Published As

Publication number Publication date
JP4672418B2 (ja) 2011-04-20
CN1680935A (zh) 2005-10-12
DE602005026077D1 (de) 2011-03-10
KR20060045535A (ko) 2006-05-17
US20050228642A1 (en) 2005-10-13
JP2005302023A (ja) 2005-10-27
EP1585030A3 (en) 2006-07-12
ATE497213T1 (de) 2011-02-15
US7827025B2 (en) 2010-11-02
EP1585030A2 (en) 2005-10-12
CN1680935B (zh) 2011-05-11
EP1585030B1 (en) 2011-01-26

Similar Documents

Publication Publication Date Title
KR101122887B1 (ko) 사용자 모델링을 통한 효율적인 대문자화
US7831911B2 (en) Spell checking system including a phonetic speller
JP4173774B2 (ja) 重み付き編集距離に基づく例文の自動検索用システムおよび方法
US7493251B2 (en) Using source-channel models for word segmentation
US7076731B2 (en) Spelling correction system and method for phrasal strings using dictionary looping
CN100483416C (zh) 一种字符输入的方法、输入法系统及词库更新的方法
CN100483417C (zh) 获取限制词信息的方法、优化输出的方法和输入法系统
CN108304375A (zh) 一种信息识别方法及其设备、存储介质、终端
US20030120479A1 (en) Method and apparatus for determining unbounded dependencies during syntactic parsing
US20090055386A1 (en) System and Method for Enhanced In-Document Searching for Text Applications in a Data Processing System
US20100094855A1 (en) System for transforming queries using object identification
US7756859B2 (en) Multi-segment string search
CN101639830A (zh) 一种输入过程中的中文术语自动纠错方法
US7599828B2 (en) Grammatically correct contraction spelling suggestions for french
GB2569858A (en) Constructing content based on multi-sentence compression of source content
JP7520500B2 (ja) データ生成装置およびデータ生成方法
CN112925882B (zh) 一种信息处理方法及装置
CN110795617A (zh) 一种搜索词的纠错方法及相关装置
WO2007088902A1 (ja) 文字処理装置、方法、プログラムおよび記録媒体
JP4754849B2 (ja) 文書検索装置、文書検索方法、および文書検索プログラム
CN116932732A (zh) 确定目标关键词的方法、装置、电子设备及存储介质
CN113641783B (zh) 基于关键语句的内容块检索方法、装置、设备和介质
JP5557469B2 (ja) 文字検索装置、文字検索システム、文字検索方法、入力端末装置、検索サーバおよびプログラム
US20040243411A1 (en) Method and apparatus for compressing asymmetric clustering language models
JPH07325837A (ja) 抽象単語による通信文検索装置及び抽象単語による通信文検索方法

Legal Events

Date Code Title Description
PA0109 Patent application

St.27 status event code: A-0-1-A10-A12-nap-PA0109

PG1501 Laying open of application

St.27 status event code: A-1-1-Q10-Q12-nap-PG1501

A201 Request for examination
P11-X000 Amendment of application requested

St.27 status event code: A-2-2-P10-P11-nap-X000

P13-X000 Application amended

St.27 status event code: A-2-2-P10-P13-nap-X000

PA0201 Request for examination

St.27 status event code: A-1-2-D10-D11-exm-PA0201

E902 Notification of reason for refusal
PE0902 Notice of grounds for rejection

St.27 status event code: A-1-2-D10-D21-exm-PE0902

E13-X000 Pre-grant limitation requested

St.27 status event code: A-2-3-E10-E13-lim-X000

P11-X000 Amendment of application requested

St.27 status event code: A-2-2-P10-P11-nap-X000

P13-X000 Application amended

St.27 status event code: A-2-2-P10-P13-nap-X000

R17-X000 Change to representative recorded

St.27 status event code: A-3-3-R10-R17-oth-X000

R17-X000 Change to representative recorded

St.27 status event code: A-3-3-R10-R17-oth-X000

E701 Decision to grant or registration of patent right
PE0701 Decision of registration

St.27 status event code: A-1-2-D10-D22-exm-PE0701

GRNT Written decision to grant
PR0701 Registration of establishment

St.27 status event code: A-2-4-F10-F11-exm-PR0701

PR1002 Payment of registration fee

St.27 status event code: A-2-2-U10-U11-oth-PR1002

Fee payment year number: 1

PG1601 Publication of registration

St.27 status event code: A-4-4-Q10-Q13-nap-PG1601

PN2301 Change of applicant

St.27 status event code: A-5-5-R10-R13-asn-PN2301

St.27 status event code: A-5-5-R10-R11-asn-PN2301

FPAY Annual fee payment

Payment date: 20150121

Year of fee payment: 4

PR1001 Payment of annual fee

St.27 status event code: A-4-4-U10-U11-oth-PR1001

Fee payment year number: 4

PN2301 Change of applicant

St.27 status event code: A-5-5-R10-R11-asn-PN2301

PN2301 Change of applicant

St.27 status event code: A-5-5-R10-R14-asn-PN2301

FPAY Annual fee payment

Payment date: 20160127

Year of fee payment: 5

PR1001 Payment of annual fee

St.27 status event code: A-4-4-U10-U11-oth-PR1001

Fee payment year number: 5

FPAY Annual fee payment

Payment date: 20170201

Year of fee payment: 6

PR1001 Payment of annual fee

St.27 status event code: A-4-4-U10-U11-oth-PR1001

Fee payment year number: 6

FPAY Annual fee payment

Payment date: 20180201

Year of fee payment: 7

PR1001 Payment of annual fee

St.27 status event code: A-4-4-U10-U11-oth-PR1001

Fee payment year number: 7

R18-X000 Changes to party contact information recorded

St.27 status event code: A-5-5-R10-R18-oth-X000

LAPS Lapse due to unpaid annual fee
PC1903 Unpaid annual fee

St.27 status event code: A-4-4-U10-U13-oth-PC1903

Not in force date: 20190225

Payment event data comment text: Termination Category : DEFAULT_OF_REGISTRATION_FEE

PC1903 Unpaid annual fee

St.27 status event code: N-4-6-H10-H13-oth-PC1903

Ip right cessation event data comment text: Termination Category : DEFAULT_OF_REGISTRATION_FEE

Not in force date: 20190225