WO2023172162A1 - Procédé de protection d'informations lors de l'impression de documents - Google Patents

Procédé de protection d'informations lors de l'impression de documents Download PDF

Info

Publication number
WO2023172162A1
WO2023172162A1 PCT/RU2022/000383 RU2022000383W WO2023172162A1 WO 2023172162 A1 WO2023172162 A1 WO 2023172162A1 RU 2022000383 W RU2022000383 W RU 2022000383W WO 2023172162 A1 WO2023172162 A1 WO 2023172162A1
Authority
WO
WIPO (PCT)
Prior art keywords
uid
digital
digital document
printing
document
Prior art date
Application number
PCT/RU2022/000383
Other languages
English (en)
Russian (ru)
Inventor
Михаил Артурович АНИСТРАТЕНКО
Валентин Валерьевич СЫСОЕВ
Иван Александрович ОБОЛЕНСКИЙ
Дмитрий Алексеевич БОРИСОВ
Александр Артурович АНИСТРАТЕНКО
Original Assignee
Публичное Акционерное Общество "Сбербанк России"
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from RU2022106206A external-priority patent/RU2790938C1/ru
Application filed by Публичное Акционерное Общество "Сбербанк России" filed Critical Публичное Акционерное Общество "Сбербанк России"
Publication of WO2023172162A1 publication Critical patent/WO2023172162A1/fr

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F21/00Security arrangements for protecting computers, components thereof, programs or data against unauthorised activity
    • G06F21/60Protecting data
    • G06F21/64Protecting data integrity, e.g. using checksums, certificates or signatures
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/10Text processing
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N1/00Scanning, transmission or reproduction of documents or the like, e.g. facsimile transmission; Details thereof

Definitions

  • the claimed solution relates to the field of information security, in particular to solutions for preventing information leakage when printing documents.
  • DLP Data Leak Prevention
  • the claimed invention is aimed at solving a technical problem, which is to create an effective means for protecting digital information from leakage during printing.
  • the technical result is to increase the efficiency of data protection from leakage by introducing digital tags into the document that encode a unique user identifier for subsequent identification when analyzing printed documents.
  • each user UID character is encoded into binary code.
  • the area of placement of digital marks is determined based on the bit of the binary code.
  • the claimed technical result is also achieved by implementing a method for protecting information from leaks on printed documents, performed using a processor of a computer device, the method comprising the steps of: obtaining at least part of an image of a printed document with an encoded user UID using the above method ; perform recognition of the resulting image; identify letters containing digital marks in their vicinity; performing determination and extraction of the encoded UID.
  • digital document recognition is performed using OCR.
  • FIG. 1 illustrates a flowchart of a digital mark encoding method.
  • FIG. 3 illustrates a block diagram of digital mark decoding.
  • FIG. 4 illustrates a diagram of the hour of disclosure of UID positions.
  • FIG. 5 illustrates a general view of a computing device.
  • FIG. 1 presents a method (100) for protecting information in digital documents from leakage by encoding the user UID in the form of digital marks into the document.
  • information about printing of the digital document is obtained.
  • the method (100) is carried out on a computer device of a user, for example, an employee, and a user UID is associated with the device, allowing him to be identified.
  • the execution of step (101) is animated by software logic executed by a computer device and can be implemented, for example, in the form of a software agent or module that provides signals from the processor indicating that a digital document is being sent for printing.
  • a digital document is typically a file and can contain text, graphics, or a combination of both.
  • step (102) After receiving a command on the device to intercept and analyze the document before sending it to the printer, at step (102) recognition of the mentioned digital document is performed. Document processing is done using OCR technology to ensure recognition of letters and symbols in a digital document.
  • the UID encoding process is carried out at step (103).
  • the UID is, for example, a numeric personnel number of an employee - a digital TAB code, consisting, for example, of 8 digits.
  • a schematic view of the code is presented in Table 1. Table 1. Schematic representation of the personnel number:
  • Table 3 Schematic division of a binary number into digits.
  • Each Pos array is filled with those characters from Wrus p . , which correspond to the position from table 4.
  • Po5 is filled with all the characters from Wrus p ., which have the values ⁇ a, z, p, h ⁇ , regardless of case.
  • the resulting arrays Pos , Pos 2 ... Pos 8 are used to apply digital marks in the manner described above.
  • Digital tagging is done by cutting out letters using OCR, adding the tagging to pixel coordinates, and adding the digitally tagged letters back into the document to be printed. After introducing all the labels (21, 22) on the desired page p t , the same is done for the next page p i+1 and so on until the end of the document p g .
  • Table 5 shows an example of label encoding for the user UID - 00013400.
  • Table 7 Table of frequency of disclosure of personnel number positions.
  • Printing was carried out on a Lexmark MX71 Ide office black and white laser printer on “Snow Maiden” office paper with CIE 146 whiteness according to ISO 11475.
  • Photographing was done on a Samsung A51 phone under office lighting, the paper lies horizontally on the table, photographing is random at different, slight angles, about 2-4% in 3 dimensions.
  • the Telegram messenger was used with image compression when sending.
  • FIG. 5 is an overview of a computing device (500) suitable for performing the above methods.
  • the device (500) may be, for example, a computer, a server, or other type of suitable computing device.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Computer Security & Cryptography (AREA)
  • Health & Medical Sciences (AREA)
  • General Physics & Mathematics (AREA)
  • General Health & Medical Sciences (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Artificial Intelligence (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Bioethics (AREA)
  • Computer Hardware Design (AREA)
  • Software Systems (AREA)
  • Character Discrimination (AREA)

Abstract

La présente invention concerne des solutions pour empêcher la fuite d'informations lors de l'impression de documents. Lors de son exécution sur un dispositif informatique d'utilisateur, le procédé consiste à: obtenir des informations sur l'impression d'un document numérique (101) contenant du texte, le dispositif informatique étant associé à un identifiant unique (UID) d'utilisateur; effectuer un traitement du document numérique avant son envoi vers l'impression, au cours duquel on reconnaît des lettres (102) contenues dans le document numérique; coder l'UID d'utilisateur en un ensemble de marques numériques qui sont disposées sur les contours des lettres et/ou à proximité du contour des lettres du document numérique (103); transmettre le document numérique vers l'impression avec l'UID codé d'utilisateur. Cette invention permet d'augmenter l'efficacité de protection des données contre les fuites.
PCT/RU2022/000383 2022-03-10 2022-12-20 Procédé de protection d'informations lors de l'impression de documents WO2023172162A1 (fr)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
RU2022106206A RU2790938C1 (ru) 2022-03-10 Способ и система защиты информации от утечки при печати документов с помощью внедрения цифровых меток
RU2022106206 2022-03-10

Publications (1)

Publication Number Publication Date
WO2023172162A1 true WO2023172162A1 (fr) 2023-09-14

Family

ID=87935599

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/RU2022/000383 WO2023172162A1 (fr) 2022-03-10 2022-12-20 Procédé de protection d'informations lors de l'impression de documents

Country Status (1)

Country Link
WO (1) WO2023172162A1 (fr)

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20040001606A1 (en) * 2002-06-28 2004-01-01 Levy Kenneth L. Watermark fonts
US20070047818A1 (en) * 2005-08-23 2007-03-01 Hull Jonathan J Embedding Hot Spots in Imaged Documents
US20080205699A1 (en) * 2005-10-25 2008-08-28 Fujitsu Limited Digital watermark embedding and detection
RU2446464C2 (ru) * 2010-05-06 2012-03-27 Корпорация "САМСУНГ ЭЛЕКТРОНИКС Ко., Лтд." Способ и система встраивания и извлечения скрытых данных в печатаемых документах
US20130028466A1 (en) * 2005-09-16 2013-01-31 Sursen Corp. Embedding and Detecting Hidden Information
RU2758666C1 (ru) * 2021-03-25 2021-11-01 Публичное Акционерное Общество "Сбербанк России" (Пао Сбербанк) Способ и система защиты цифровой информации, отображаемой на экране электронных устройств, с помощью динамических цифровых меток

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20040001606A1 (en) * 2002-06-28 2004-01-01 Levy Kenneth L. Watermark fonts
US20070047818A1 (en) * 2005-08-23 2007-03-01 Hull Jonathan J Embedding Hot Spots in Imaged Documents
US20130028466A1 (en) * 2005-09-16 2013-01-31 Sursen Corp. Embedding and Detecting Hidden Information
US20080205699A1 (en) * 2005-10-25 2008-08-28 Fujitsu Limited Digital watermark embedding and detection
RU2446464C2 (ru) * 2010-05-06 2012-03-27 Корпорация "САМСУНГ ЭЛЕКТРОНИКС Ко., Лтд." Способ и система встраивания и извлечения скрытых данных в печатаемых документах
RU2758666C1 (ru) * 2021-03-25 2021-11-01 Публичное Акционерное Общество "Сбербанк России" (Пао Сбербанк) Способ и система защиты цифровой информации, отображаемой на экране электронных устройств, с помощью динамических цифровых меток

Similar Documents

Publication Publication Date Title
US10339378B2 (en) Method and apparatus for finding differences in documents
CN107239666B (zh) 一种对医疗影像数据进行脱敏处理的方法及系统
US9626555B2 (en) Content-based document image classification
CN112016273B (zh) 文档目录生成方法、装置、电子设备及可读存储介质
US20200097713A1 (en) Method and System for Accurately Detecting, Extracting and Representing Redacted Text Blocks in a Document
KR102503880B1 (ko) 머신 판독 가능 보안 마크 및 이를 생성하는 프로세스
CN108805787A (zh) 一种纸质文档篡改鉴真的方法和装置
JP2016048444A (ja) 帳票識別プログラム、帳票識別装置、帳票識別システム、および帳票識別方法
US20190384971A1 (en) System and method for optical character recognition
CN112102402A (zh) 闪光灯光斑位置识别方法、装置、电子设备及存储介质
US7596270B2 (en) Method of shuffling text in an Asian document image
US10867170B2 (en) System and method of identifying an image containing an identification document
WO2022103564A1 (fr) Détection de fraude par regroupement automatique d'écriture manuscrite
JP2011178075A (ja) 真贋判定装置及び真贋判定方法
RU2790938C1 (ru) Способ и система защиты информации от утечки при печати документов с помощью внедрения цифровых меток
Eskenazi et al. When document security brings new challenges to document analysis
WO2023172162A1 (fr) Procédé de protection d'informations lors de l'impression de documents
US20080279374A1 (en) Pixel-Based Method for Encryption and Decryption of Data
RU2793611C1 (ru) Способ и система защиты информации от утечки при печати документов с помощью смещения символов
EA044732B1 (ru) Способ и система защиты информации от утечки при печати документов с помощью внедрения цифровых меток
WO2023172161A1 (fr) Procédé et système de protection d'informations lors de l'impression de documents
CN110942075A (zh) 信息处理装置、存储介质及信息处理方法
US20170337165A1 (en) System and method of embedding symbology in alphabetic letters and then linking the letters to a site or sites on the global computer network
EA045968B1 (ru) Способ и система защиты информации от утечки при печати документов с помощью смещения символов
RU2739936C1 (ru) Способ внесения цифровых меток в цифровое изображение и устройство для осуществления способа

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 22931154

Country of ref document: EP

Kind code of ref document: A1