WO2018067440A1 - Systems and methods for language detection - Google Patents

Systems and methods for language detection Download PDF

Info

Publication number
WO2018067440A1
WO2018067440A1 PCT/US2017/054722 US2017054722W WO2018067440A1 WO 2018067440 A1 WO2018067440 A1 WO 2018067440A1 US 2017054722 W US2017054722 W US 2017054722W WO 2018067440 A1 WO2018067440 A1 WO 2018067440A1
Authority
WO
WIPO (PCT)
Prior art keywords
language
scores
text message
module
alphabet
Prior art date
Application number
PCT/US2017/054722
Other languages
English (en)
French (fr)
Inventor
Nikhil BOJJA
Pidong WANG
Shiman Guo
Original Assignee
Machine Zone, Inc.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from US15/283,646 external-priority patent/US10162811B2/en
Application filed by Machine Zone, Inc. filed Critical Machine Zone, Inc.
Priority to AU2017339433A priority Critical patent/AU2017339433A1/en
Priority to CA3039085A priority patent/CA3039085A1/en
Priority to EP17788004.4A priority patent/EP3519984A1/en
Priority to CN201780074219.8A priority patent/CN110023931A/zh
Priority to JP2019517966A priority patent/JP2019535082A/ja
Publication of WO2018067440A1 publication Critical patent/WO2018067440A1/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/263Language identification
PCT/US2017/054722 2016-10-03 2017-10-02 Systems and methods for language detection WO2018067440A1 (en)

Priority Applications (5)

Application Number Priority Date Filing Date Title
AU2017339433A AU2017339433A1 (en) 2016-10-03 2017-10-02 Systems and methods for language detection
CA3039085A CA3039085A1 (en) 2016-10-03 2017-10-02 Systems and methods for language detection
EP17788004.4A EP3519984A1 (en) 2016-10-03 2017-10-02 Systems and methods for language detection
CN201780074219.8A CN110023931A (zh) 2016-10-03 2017-10-02 用于语言检测的系统和方法
JP2019517966A JP2019535082A (ja) 2016-10-03 2017-10-02 言語検出のためのシステムおよび方法

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US15/283,646 2016-10-03
US15/283,646 US10162811B2 (en) 2014-10-17 2016-10-03 Systems and methods for language detection

Publications (1)

Publication Number Publication Date
WO2018067440A1 true WO2018067440A1 (en) 2018-04-12

Family

ID=60162256

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2017/054722 WO2018067440A1 (en) 2016-10-03 2017-10-02 Systems and methods for language detection

Country Status (6)

Country Link
EP (1) EP3519984A1 (zh)
JP (1) JP2019535082A (zh)
CN (1) CN110023931A (zh)
AU (1) AU2017339433A1 (zh)
CA (1) CA3039085A1 (zh)
WO (1) WO2018067440A1 (zh)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11551461B2 (en) * 2020-04-10 2023-01-10 I.R.I.S. Text classification

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20080147380A1 (en) * 2006-12-18 2008-06-19 Nokia Corporation Method, Apparatus and Computer Program Product for Providing Flexible Text Based Language Identification
US20090324005A1 (en) * 2008-06-26 2009-12-31 Microsoft Corporation Script Detection Service
US20100312545A1 (en) * 2009-06-05 2010-12-09 Google Inc. Detecting Writing Systems and Languages
WO2016060687A1 (en) * 2014-10-17 2016-04-21 Machine Zone, Inc. System and method for language detection

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20080147380A1 (en) * 2006-12-18 2008-06-19 Nokia Corporation Method, Apparatus and Computer Program Product for Providing Flexible Text Based Language Identification
US20090324005A1 (en) * 2008-06-26 2009-12-31 Microsoft Corporation Script Detection Service
US20100312545A1 (en) * 2009-06-05 2010-12-09 Google Inc. Detecting Writing Systems and Languages
WO2016060687A1 (en) * 2014-10-17 2016-04-21 Machine Zone, Inc. System and method for language detection

Also Published As

Publication number Publication date
CN110023931A (zh) 2019-07-16
AU2017339433A1 (en) 2019-05-02
CA3039085A1 (en) 2018-04-12
JP2019535082A (ja) 2019-12-05
EP3519984A1 (en) 2019-08-07

Similar Documents

Publication Publication Date Title
US9535896B2 (en) Systems and methods for language detection
US10699073B2 (en) Systems and methods for language detection
Kim et al. Two-stage multi-intent detection for spoken language understanding
US9971763B2 (en) Named entity recognition
US8380488B1 (en) Identifying a property of a document
JP5475795B2 (ja) カスタム言語モデル
US20170185581A1 (en) Systems and methods for suggesting emoji
JP5379138B2 (ja) 領域辞書の作成
Sazzed et al. A sentiment classification in bengali and machine translated english corpus
CN107111607B (zh) 用于语言检测的系统和方法
Atia et al. Increasing the accuracy of opinion mining in Arabic
Dutta et al. Text normalization in code-mixed social media text
Habib et al. An exploratory approach to find a novel metric based optimum language model for automatic bangla word prediction
Balazevic et al. Language detection for short text messages in social media
EP3704660A1 (en) Techniques for ranking posts in community forums
EP3519984A1 (en) Systems and methods for language detection
Wisniewski et al. Limsi submission for wmt’14 qe task
Kamath et al. Sarcasm detection approaches survey
Sharma et al. Language identification for hindi language transliterated text in roman script using generative adversarial networks
JP2019215876A (ja) 言語検出を行うためのシステムおよび方法
Sonnadara et al. Sinhala spell correction: A novel benchmark with neural spell correction
Hemmer et al. Estimating Post-OCR Denoising Complexity on Numerical Texts
JP5450276B2 (ja) 読み推定装置、読み推定方法、および読み推定プログラム
Ramanna et al. Japanese Language Review Mining using Translators, Word Embedding and ML Techniques
CN111859940A (zh) 一种关键词提取方法、装置、电子设备及存储介质

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 17788004

Country of ref document: EP

Kind code of ref document: A1

ENP Entry into the national phase

Ref document number: 3039085

Country of ref document: CA

ENP Entry into the national phase

Ref document number: 2019517966

Country of ref document: JP

Kind code of ref document: A

NENP Non-entry into the national phase

Ref country code: DE

ENP Entry into the national phase

Ref document number: 2017339433

Country of ref document: AU

Date of ref document: 20171002

Kind code of ref document: A

WWE Wipo information: entry into national phase

Ref document number: 2017788004

Country of ref document: EP