KR20240011718A - Wordbreak algorithm with offset mapping - Google Patents

Wordbreak algorithm with offset mapping

Info

Publication number
KR20240011718A
Authority
KR
South Korea
Prior art keywords
string
character
index value
target
offset index
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
KR1020237040866A
Other languages
English (en)
Korean (ko)
Inventor
마노즈 굽타
카빈 모트라니
Original Assignee
Microsoft Technology Licensing, LLC
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from US17/444,347 (US11899698B2)
Application filed by Microsoft Technology Licensing, LLC
Publication of KR20240011718A
Current legal status: Pending

Classifications

    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/20 Natural language analysis
    • G06F40/279 Recognition of textual entities
    • G06F40/284 Lexical analysis, e.g. tokenisation or collocates
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F21/00 Security arrangements for protecting computers, components thereof, programs or data against unauthorised activity
    • G06F21/60 Protecting data
    • G06F21/62 Protecting access to data via a platform, e.g. using keys or access control rules
    • G06F21/6218 Protecting access to data via a platform, e.g. using keys or access control rules to a system of files or objects, e.g. local or distributed file system or database
    • G06F21/6245 Protecting personal data, e.g. for financial or medical purposes
    • G06F21/6254 Protecting personal data, e.g. for financial or medical purposes by anonymising data, e.g. decorrelating personal data from the owner's identification
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/10 Text processing
    • G06F40/103 Formatting, i.e. changing of presentation of documents
    • G06F40/109 Font handling; Temporal or kinetic typography
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/10 Text processing
    • G06F40/12 Use of codes for handling textual entities
    • G06F40/163 Handling of whitespace
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/10 Text processing
    • G06F40/166 Editing, e.g. inserting or deleting
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/40 Processing or translation of natural language
    • G06F40/53 Processing of non-Latin text

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • General Health & Medical Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • General Physics & Mathematics (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Artificial Intelligence (AREA)
  • Bioethics (AREA)
  • Medical Informatics (AREA)
  • Software Systems (AREA)
  • Computer Security & Cryptography (AREA)
  • Computer Hardware Design (AREA)
  • Databases & Information Systems (AREA)
  • Machine Translation (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
KR1020237040866A 2021-05-28 2022-05-05 Wordbreak algorithm with offset mapping Pending KR20240011718A (ko)

Applications Claiming Priority (5)

Application Number Priority Date Filing Date Title
IN202141023933 2021-05-28
IN202141023933 2021-05-28
US17/444,347 2021-08-03
US17/444,347 US11899698B2 (en) 2021-05-28 2021-08-03 Wordbreak algorithm with offset mapping
PCT/IB2022/000257 WO2022248933A1 (en) 2021-05-28 2022-05-05 Wordbreak algorithm with offset mapping

Publications (1)

Publication Number Publication Date
KR20240011718A (ko) 2024-01-26

Family

ID=82846495

Family Applications (1)

Application Number Title Priority Date Filing Date
KR1020237040866A Pending KR20240011718A (ko) 2021-05-28 2022-05-05 Wordbreak algorithm with offset mapping

Country Status (4)

Country Link
EP (1) EP4348490A1
JP (1) JP2024521833A
KR (1) KR20240011718A
WO (1) WO2022248933A1

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20240338186A1 (en) * 2023-04-06 2024-10-10 Oracle International Corporation Compile-Time Checking For Exhaustive Switch Statements And Expressions

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5963893A (en) * 1996-06-28 1999-10-05 Microsoft Corporation Identification of words in Japanese text by a computer system
US20200334381A1 (en) * 2019-04-16 2020-10-22 3M Innovative Properties Company Systems and methods for natural pseudonymization of text

Also Published As

Publication number Publication date
EP4348490A1 (en) 2024-04-10
JP2024521833A (ja) 2024-06-04
WO2022248933A1 (en) 2022-12-01

Similar Documents

Publication Publication Date Title
US10552462B1 (en) Systems and methods for tokenizing user-annotated names
US10303689B2 (en) Answering natural language table queries through semantic table representation
US9904673B2 (en) Conversation advisor
US8875302B2 (en) Classification of an electronic document
US11620304B2 (en) Example management for string transformation
CN108090351B (zh) Method and apparatus for processing request messages
US10528675B2 (en) Context-aware translation memory to facilitate more accurate translation
US10354078B2 (en) Multi-focused fine-grained security framework
US9971809B1 (en) Systems and methods for searching unstructured documents for structured data
US10606957B1 (en) Method and system for translating natural language policy to logical access control policy
US20160344773A1 (en) Integrated Development Environment (IDE) for Network Security Configuration Files
US11062129B2 (en) Systems and methods for enabling search services to highlight documents
US20200081961A1 (en) Estimation of document structure
US20160179954A1 (en) Systems and methods for culling search results in electronic discovery
US20250165590A1 (en) Preventing attacks on generative models
US11899698B2 (en) Wordbreak algorithm with offset mapping
US10776500B2 (en) Autonomous hint generator
KR20240011718A (ko) Wordbreak algorithm with offset mapping
US9483535B1 (en) Systems and methods for expanding search results
US20160124961A1 (en) Using Priority Scores for Iterative Precision Reduction in Structured Lookups for Questions
US20150006498A1 (en) Dynamic search system
US11132400B2 (en) Data classification using probabilistic data structures
CN117396878A (zh) Wordbreak algorithm with offset mapping
US20260038306A1 (en) Dynamic signature identification from handwritten elements
KR102417236B1 (ko) Method for identifying a content author, and apparatus for implementing the same

Legal Events

Date Code Title Description
PA0105 International application

St.27 status event code: A-0-1-A10-A15-nap-PA0105

PG1501 Laying open of application

St.27 status event code: A-1-1-Q10-Q12-nap-PG1501

A201 Request for examination
P11-X000 Amendment of application requested

St.27 status event code: A-2-2-P10-P11-nap-X000

D22 Grant of IP right intended

Free format text: ST27 STATUS EVENT CODE: A-1-2-D10-D22-EXM-PE0701 (AS PROVIDED BY THE NATIONAL OFFICE)

PE0701 Decision of registration

St.27 status event code: A-1-2-D10-D22-exm-PE0701