WO2023002748A1 - Document management server, document management method, and computer program - Google Patents

Document management server, document management method, and computer program Download PDF

Info

Publication number
WO2023002748A1
WO2023002748A1 PCT/JP2022/021282 JP2022021282W WO2023002748A1 WO 2023002748 A1 WO2023002748 A1 WO 2023002748A1 JP 2022021282 W JP2022021282 W JP 2022021282W WO 2023002748 A1 WO2023002748 A1 WO 2023002748A1
Authority
WO
WIPO (PCT)
Prior art keywords
document
unit
words
document management
management server
Prior art date
Application number
PCT/JP2022/021282
Other languages
French (fr)
Japanese (ja)
Inventor
康幸 日下
Original Assignee
アストロラボ株式会社
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by アストロラボ株式会社 filed Critical アストロラボ株式会社
Publication of WO2023002748A1 publication Critical patent/WO2023002748A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/31Indexing; Data structures therefor; Storage structures
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F21/00Security arrangements for protecting computers, components thereof, programs or data against unauthorised activity
    • G06F21/60Protecting data
    • G06F21/62Protecting access to data via a platform, e.g. using keys or access control rules

Definitions

  • the present invention relates to a document management server, document management method, and computer program.
  • This application claims priority based on Japanese Patent Application No. 2021-120640 filed in Japan on July 21, 2021, the contents of which are incorporated herein.
  • the contract is exchanged as follows. After agreeing on the contents of the contract between the contractors, one contractor prepares two copies of the contract, signs them, and sends them to the other contractor. When the other contracting party receives the contract, he/she signs both copies, keeps one copy as evidence, and returns the other copy to the other contracting party. In this case, the contract that is kept on hand as evidence is stored and managed in a document shelf.
  • the document management apparatus includes a document information acquisition unit, a container information acquisition unit, a document input/output information recording unit, a processing status determination unit, a processing status information output unit, and a document input/output information management DB.
  • the document information acquisition unit acquires information on a predetermined document.
  • the container information acquisition unit acquires information about a predetermined container.
  • the document entry/exit information recording section records information relating to entry/exit of documents in a predetermined recording medium.
  • the processing status determination unit determines the processing status of a given document.
  • the processing status information output unit outputs information about the processing status of the document.
  • An object of the present invention is to provide a document management server, a document management method, and a computer program that can improve the convenience of document management.
  • One aspect of the present invention includes: a first receiving unit that receives text data of a document; a first dividing unit that divides the text data received by the first receiving unit to obtain a plurality of words; a first encryption unit for encrypting each of the plurality of words obtained by the first division unit; and a registration for registering each of the plurality of words encrypted by the first encryption unit in an index of the document.
  • a processing unit that associates the index of the document in which each of the plurality of words encrypted by the registration unit is registered with the document and stores the document as document data in a storage unit; a second receiving unit that receives, a second dividing unit that divides the search character string received by the second receiving unit to obtain one or more words, and one or more words that the second dividing unit obtains a second encryption unit that encrypts each of the words; and a search unit that searches for a plurality of the document data based on each of the one or more words encrypted by the second encryption unit.
  • a document management server that assigns the index of the document in which each of the plurality of words encrypted by the registration unit is registered with the document and stores the document as document data in a storage unit
  • a second receiving unit that receives, a second dividing unit that divides the search character string received by the second receiving unit to obtain one or more words, and one or more words that the second dividing unit obtains a second encryption unit that encrypts each of the words
  • the first dividing unit converts a character string included in each of the plurality of words into a predetermined format
  • 1 encryption unit encrypts the character string converted into the predetermined format by the first division unit.
  • the first reception unit receives one or more predetermined items included in the text data of the document.
  • the first encryption unit encrypts each of the one or more predetermined items extracted by the first reception unit; and the registration unit encrypts the one or more predetermined items.
  • the processing unit associates the document with the index of the document in which the result of encrypting one or more predetermined items by the registration unit is further registered to the document. It is stored in the storage unit as data.
  • the registration unit arranges the words in the order of the plurality of words included in the text data. Based on this, each of the plurality of words encrypted by the first encryption unit is registered in the index of the document.
  • the first encryption unit stores a plurality of Among the words, words corresponding to a predetermined part of speech are encrypted.
  • the search unit is encrypted by the second encryption unit.
  • Each of the plurality of words is searched for a match of the plurality of encrypted words contained in each of the indices of the plurality of documents.
  • the search unit is encrypted by the second encryption unit or Deriving a match rate between each of the plurality of words and the encrypted plurality of words contained in each of the indices of the plurality of documents.
  • An aspect of the present invention is a document management method executed by a document management server, comprising: receiving text data of a document; dividing the text data to obtain a plurality of words; encrypting each of said words; registering each of said plurality of words encrypted in said step of encrypting in said document index; and registering each of said plurality of encrypted words.
  • One aspect of the present invention provides a computer with the steps of receiving text data of a document, dividing the text data to obtain a plurality of words, and encrypting each of the plurality of words. registering each of the plurality of words encrypted in the step of encrypting in an index of the document; and the index of the document and the document in which each of the plurality of encrypted words is registered. a step of associating and storing in a storage unit as document data; a step of receiving a search character string; a step of dividing the search character string to obtain one or more words; encrypting each of the divided one or more words; and searching the plurality of document data based on each of the one or more divided words of the encrypted search character string.
  • a computer program that is executed.
  • the present invention it is possible to provide a document management server, a document management method, and a computer program that can improve the convenience of document management.
  • FIG. 1 is a diagram showing a configuration example of a document management system according to an embodiment of the present invention
  • FIG. 3 is a diagram showing details of a document management server and terminal devices included in the document management system according to the embodiment
  • FIG. FIG. 4 is a diagram showing an example 1 of operation of the document management system according to the present embodiment
  • FIG. 4 is a diagram showing an example 1 of a document management request screen displayed by the terminal device according to the embodiment
  • FIG. 10 is a diagram showing an example 2 of a document management request screen displayed by the terminal device according to the embodiment
  • FIG. 10 is a diagram showing an example 3 of a document management request screen displayed by the terminal device according to the embodiment; This is for explaining document scanning processing by the terminal device according to the present embodiment.
  • FIG. 3 is a diagram showing details of a document management server and terminal devices included in the document management system according to the embodiment
  • FIG. 4 is a diagram showing an example 1 of operation of the document management system according to the present embodiment
  • FIG. 4 is a diagram showing an
  • FIG. 4 is a diagram showing an example 1 of information displayed on the terminal device according to the embodiment; It is a figure which shows the example 2 of the information displayed on the terminal device which concerns on this embodiment.
  • FIG. 10 is a diagram showing an example 2 of the operation of the document management system according to this embodiment;
  • FIG. 4 is a diagram showing an example 1 of information displayed on the terminal device according to the embodiment; It is a figure which shows the example 2 of the information displayed on the terminal device which concerns on this embodiment.
  • FIG. 1 is a diagram showing a configuration example of a document management system according to an embodiment of the present invention.
  • the document management system 1 has a document management server 100 .
  • FIG. 1 shows a terminal device 200 used by a customer such as a user U who requests document management.
  • the document management server 100 and the terminal device 200 communicate via the network NW.
  • the network NW includes, for example, the Internet, a WAN (Wide Area Network), a LAN (Local Area Network), a provider device, a radio base station, and the like.
  • a user U who requests document management is registered in the document management system 1 , and user identification information (hereinafter referred to as “user ID”) and password are set in the document management system 1 .
  • a user U accesses the document management server 100 by operating the terminal device 200 .
  • the user U logs into the document management system 1 by entering a user ID and a password into the terminal device 200 and performing a login operation.
  • the user U refers to the initial screen of the document management system 1 and presses the document management request button to display the document management request screen on the terminal device 200 .
  • An example of a document is a contract. As an example, the description will be continued below for the case where the document is a contract.
  • the user U operates the terminal device 200 to enter document-related items such as the title and contract type on the document management request screen.
  • a user U scans a document by operating the terminal device 200 .
  • the terminal device 200 creates a document management request addressed to the document management server 100, which includes a user ID, document-related items, and document scan data.
  • the terminal device 200 transmits the created document management request to the document management server 100 .
  • the document management server 100 receives the document management request transmitted by the terminal device 200 .
  • the document management server 100 acquires the user ID, document-related items, and document scan data included in the received document management request.
  • the document management server 100 creates information (hereinafter referred to as “document identification information”) that uniquely identifies the acquired document-related matters and the scan data of the document.
  • the document management server 100 converts the scanned data of the acquired document into text data by processing it.
  • the document management server 100 extracts one or more predetermined items such as a title included in the document based on the text data of the document.
  • the document management server 100 encrypts the obtained one or more predetermined items.
  • the document management server 100 obtains one or more words by dividing the character string included in the document based on the text data of the document.
  • the document management server 100 performs a process of aligning the display formats of alphanumeric characters contained in each of the acquired one or more words. Specifically, the document management server 100 may align the display format of alphanumeric characters included in each of one or more words to full-width or half-width.
  • the document management server 100 performs a process of aligning the expressions of alphabetic characters included in each of the acquired one or more words. Specifically, the document management server 100 may align the representation format of the alphabetic characters included in each of one or more words to lowercase letters or uppercase letters. As an example, a case where the document management server 100 converts the representation format of alphanumeric characters contained in each of one or more words into half-width characters and converts the representation format of alphabetic characters into lowercase letters will be continued.
  • the document management server 100 converts the display format of alphanumeric characters into half-width characters and converts the representation format of alphabetic characters into lowercase letters, and encrypts each of one or more words.
  • the document management server 100 registers one or more encrypted predetermined items in a document index (index, headline) in order of appearance.
  • the document management server 100 registers each of the one or more encrypted words in the document index (index, headline) in order of appearance.
  • the document management server 100 has a user ID, document identification information, a document index in which each of one or more encrypted predetermined items and each of one or more encrypted words are registered, The data is stored in the storage unit in association with the scan data of the document.
  • the document management server 100 creates a document management response addressed to the terminal device 200 and including information indicating that document management has started.
  • the document management server 100 transmits the created document management response to the terminal device 200 .
  • the terminal device 200 receives the document management response sent by the document management server 100 .
  • the terminal device 200 acquires information indicating that management of the document contained in the received document management response has started.
  • the terminal device 200 displays information indicating that management of the acquired document has started.
  • a user U requesting a document search accesses the document management server 100 by operating the terminal device 200 .
  • the user U logs into the document management system 1 by entering a user ID and a password into the terminal device 200 and performing a login operation.
  • the user U refers to the initial screen of the document management system 1 and pushes the document search request button to display the document search request screen on the terminal device 200 .
  • the user U operates the terminal device 200 to input a search character string such as a search keyword on the document search request screen and press the OK button.
  • the terminal device 200 creates a document search request addressed to the document management server 100, which includes the user ID and information specifying the search character string.
  • the terminal device 200 transmits the created document search request to the document management server
  • the document management server 100 receives the document search request sent by the terminal device 200 .
  • the document management server 100 acquires the user ID and the information specifying the search character string included in the received document search request. Based on the text data of the search character string, the document management server 100 obtains one or more words by dividing the character string included in the search character string.
  • the document management server 100 performs a process of aligning the display formats of alphanumeric characters contained in each of the acquired one or more words. Specifically, the document management server 100 may align the display format of alphanumeric characters included in each of one or more words to full-width or half-width.
  • the document management server 100 performs a process of aligning the expressions of alphabetic characters included in each of the acquired one or more words.
  • the document management server 100 may align the representation format of the alphabetic characters included in each of one or more words to lowercase letters or uppercase letters.
  • the document management server 100 converts the representation format of alphanumeric characters contained in each of one or more words into half-width characters and converts the representation format of alphabetic characters into lowercase letters will be continued.
  • the document management server 100 converts the display format of alphanumeric characters into half-width characters and converts the representation format of alphabetic characters into lowercase letters, and encrypts each of one or more words.
  • the document management server 100 searches for a document index containing one or more encrypted words from among one or more document indexes stored in the storage unit in association with the acquired user ID. .
  • the document management server 100 retrieves one or more indexes of documents containing either one or more of the encrypted words by searching.
  • the document management server 100 rearranges the indexes of the obtained one or more documents in descending order of matching rate of one or more encrypted words.
  • the document management server 100 acquires document identification information associated with each of the rearranged one or more document indexes.
  • the document management server 100 creates a document search response addressed to the terminal device 200 including one or more pieces of acquired document identification information.
  • the document management server 100 transmits the created document search response to the terminal device 200 .
  • the terminal device 200 receives the document search response sent by the document management server 100 .
  • the terminal device 200 acquires one or more pieces of document identification information included in the received document search response.
  • the terminal device 200 displays the acquired one or more pieces of document specifying information.
  • the user U refers to one or a plurality of pieces of document specifying information displayed on the terminal device 200, and selects a document to be browsed by pressing a portion corresponding to the document specifying information to be browsed.
  • the terminal device 200 creates a document request addressed to the document management server 100, including the document identification information corresponding to the part pressed by the user U.
  • the document management server 100 transmits the created document request to the document management server 100 .
  • the document management server 100 receives the document request sent by the terminal device 200 .
  • the document management server 100 acquires document specifying information included in the received document request.
  • the document management server 100 acquires document scan data and document text data stored in association with the acquired document specifying information.
  • the document management server 100 obtains one or more words by dividing the character string included in the document based on the text data of the obtained document.
  • the document management server 100 extracts document-related matters based on predetermined rules such as the format of each of the acquired one or more words and the arrangement of parts of speech.
  • the document management server 100 creates a document response destined for the terminal device 200, which includes the acquired scan data of the document, text data of the document, and document-related items.
  • the document management server 100 transmits the created document response to the terminal device 200 .
  • the terminal device 200 receives the document response sent by the document management server 100 .
  • the terminal device 200 acquires the document scan data, the document text data, and the document-related items included in the received document response.
  • the terminal device 200 displays the selected one of the acquired scan data of the document, text data of the document, and document-related items.
  • FIG. 2 is a diagram showing the details of the document management server and the terminal device included in the document management system according to this embodiment.
  • the document management server 100 is implemented by a device such as a personal computer, server, smart phone, tablet computer, or industrial computer.
  • the document management server 100 may be a cloud server.
  • the document management server 100 includes, for example, a conversion unit 101, a communication unit 102, a first reception unit 103-1, a second reception unit 103-2, a first division unit 104-1, a second division unit 104 -2, a first encryption unit 105-1, a second encryption unit 105-2, a registration unit 106, a processing unit 107, a search unit 108, and a creation unit 110.
  • the communication unit 102 is implemented by a communication module.
  • the communication unit 102 communicates with an external communication device via the network NW.
  • the communication unit 102 may communicate using a communication method such as a wired LAN, for example. Further, the communication unit 102 may communicate by a wireless communication method such as a wireless LAN, Bluetooth (registered trademark), or LTE (registered trademark).
  • the communication unit 102 holds communication information necessary for communicating with the terminal device 200 via the network NW.
  • the communication unit 102 receives the login request, the document management request, the document search request, and the document request transmitted by the terminal device 200 .
  • the communication unit 102 acquires the login response, the document management response, the document search response, and the document response output by the creation unit 110, and transmits the acquired document management response, document search response, and document response to the terminal. Send to device 200 .
  • the storage unit 109 is implemented by a HDD (Hard Disk Drive), flash memory, RAM (Random Access Memory), ROM (Read Only Memory), or the like.
  • the storage unit 109 stores one or more user IDs and passwords for each of the one or more user IDs in association with each other.
  • the processing unit 107 acquires the user ID and password included in the login request received by the communication unit 102 .
  • the processing unit 107 determines that the login is successful when the acquired combination of the user ID and password is registered in advance in the storage unit 109, and determines that the login is unsuccessful when the combination is not registered.
  • the creating unit 110 creates a login response addressed to the terminal device 200 including an initial screen when the processing unit 107 determines that the login is successful.
  • the processing unit 107 outputs the created login response to the communication unit 102 .
  • the conversion unit 101 acquires the document management request received by the communication unit 102 .
  • the conversion unit 101 acquires the user ID and the document scan data included in the acquired document management request.
  • the conversion unit 101 converts the acquired scan data of the document into text data of the document.
  • the conversion unit 101 uses optical character recognition (OCR) technology to convert images of printed characters and handwritten characters included in document scan data into character code strings.
  • OCR optical character recognition
  • the first reception unit 103-1 acquires the document management request received by the communication unit 102.
  • the first reception unit 103-1 receives the user ID, document-related items, and document scan data included in the acquired document management request.
  • the first reception unit 103-1 creates document identification information that uniquely identifies the acquired document-related matters and the scan data of the document.
  • the first reception unit 103-1 acquires from the conversion unit 101 the text data of the document corresponding to the user ID included in the acquired document management request.
  • First accepting unit 103-1 accepts the acquired user ID and the text data of the document.
  • the first receiving unit 103-1 extracts one or a plurality of predetermined items such as a title included in the document based on the text data of the received document.
  • First dividing unit 104-1 acquires the user ID and text data of the document accepted by first accepting unit 103-1.
  • the first dividing unit 104-1 obtains one or more words by dividing the character string included in the document based on the text data of the obtained document.
  • the first dividing unit 104-1 divides a character string included in a document into morphemes by morphologically analyzing the text data of the document.
  • morphological analysis natural language text data (sentences) without grammatical information notes are divided into strings of morphemes based on information such as the grammar of the target language and the parts of speech of words called a dictionary. It is to discriminate the part of speech of the morpheme.
  • a morpheme is the smallest unit of a meaningful expression element.
  • a morpheme refers to each group of phonemes that are separated and extracted to the point where they become meaningless when further broken down in a certain language.
  • the first dividing unit 104-1 converts the display format of alphanumeric characters contained in each of the acquired one or more words into single-byte characters, and converts the representation format of alphabetic characters into lowercase letters.
  • First encryption unit 105-1 acquires from first division unit 104-1 the user ID and one or more words obtained by converting the display format of alphanumeric characters into half-size characters and converting the representation format of alphabetic characters into lowercase letters. .
  • First encryption unit 105-1 acquires one or more predetermined items such as a title included in the document corresponding to the acquired user ID from first reception unit 103-1.
  • the first encryption unit 105-1 converts the display format of the acquired alphanumeric characters to half-size characters, converts the expression format of the alphabetic characters to lower case letters, and converts each of one or more words to one or more predetermined items. Encrypt.
  • the first encryption unit 105-1 uses the Advanced Encryption Standard algorithm to convert one or more predetermined items and alphanumeric display format into half-width characters, are converted to lowercase and perform encryption of each of the one or more words.
  • the registration unit 106 converts the result of encrypting the user ID and one or more predetermined items from the first encryption unit 105-1, the display format of alphanumeric characters into single-byte characters, and the representation format of alphabetic characters into lowercase letters. and obtaining a result of encrypting each of the one or more words obtained.
  • the registration unit 106 encrypts each of the results obtained by encrypting one or more predetermined items and one or more words obtained by converting the display format of alphanumeric characters into half-width characters and converting the representation format of alphabetic characters into lowercase letters. index the document in order of appearance.
  • the processing unit 107 converts the user ID and one or more predetermined items from the registration unit 106 into encrypted data, converts the display format of alphanumeric characters to half size, and converts the representation format of alphabetic characters to lower case. Get the index of the document that registered the result of encrypting each of the words.
  • the processing unit 107 acquires document identification information corresponding to the acquired user ID and scan data of the document.
  • the processing unit 107 converts the obtained user ID, the document identification information, the result of encrypting one or a plurality of predetermined items, and the alphanumeric display format to half-size, and converts the alphabetic representation format to lowercase.
  • the index of the document in which the result of encrypting each of one or more words and the scanned data of the document are associated with each other and stored in the storage unit 109 .
  • the creation unit 110 converts the user ID, the document identification information, the result of the encryption of one or more predetermined items by the processing unit 107, and the alphanumeric display format into single-byte characters, and converts the alphabetic representation format into lowercase letters.
  • Information indicating that management of the document is started after the index of the document in which the result of encrypting each of the converted one or more words is registered and the scanned data of the document are associated with each other and stored in the storage unit 109.
  • a document management response addressed to the terminal device 200 is created. Creation unit 110 outputs the created document management response to communication unit 102 .
  • the second reception unit 103-2 acquires the document search request received by the communication unit 102.
  • FIG. The second reception unit 103-2 receives information specifying the user ID and search character string included in the acquired document search request.
  • Second dividing unit 104-2 acquires the user ID and the text data of the search character string accepted by second accepting unit 103-2.
  • the second dividing unit 104-2 obtains one or more words by dividing the character string included in the search character string based on the obtained text data of the search character string. For example, the second dividing unit 104-2 divides the character string included in the search character string into morphemes by morphologically analyzing the text data of the search character string.
  • the second dividing unit 104-2 converts the representation format of alphanumeric characters contained in each of the acquired one or more words into half-width characters, and converts the representation format of alphabetic characters into lowercase letters.
  • Second encryption unit 105-2 converts the display format of alphanumeric characters from second division unit 104-2 to the user ID and search character string, converts the display format of alphanumeric characters to half size, and converts the representation format of alphabetic characters to lower case. Or get multiple words.
  • the second encryption unit 105-2 encrypts each of one or a plurality of words included in the obtained search character string, converted from alphanumeric display format to half-size, and from alphabetic representation format to lowercase. do.
  • the second encryption unit 105-2 is included in the search character string using the Advanced Encryption Standard algorithm, converts the display format of alphanumeric characters to half size, and converts the representation format of alphabets to lower case. Perform encryption of each of the multiple words.
  • Search unit 108 acquires the user ID and the result of encrypting each of the one or more words from second encryption unit 105-2.
  • the search unit 108 retrieves an index of a document that includes one or more of the results of encrypting one or more words obtained from among one or more document indexes stored in the storage unit 109 in association with the obtained user ID.
  • Search for The search unit 108 obtains one or more indexes of documents including any of the results of encrypting one or more words by searching.
  • the search unit 108 rearranges the obtained indexes of the one or more documents in descending order of the matching rate of the encrypted one or more words.
  • the search unit 108 acquires document identification information associated with each of the rearranged indexes of one or more documents.
  • the creation unit 110 creates a document search response addressed to the terminal device 200 and including one or more pieces of document identification information acquired by the search unit 108 .
  • the document management server 100 outputs the created document search response to the communication unit 102 .
  • the processing unit 107 acquires the document request received by the communication unit 102 .
  • the processing unit 107 acquires document specifying information included in the acquired document request.
  • the processing unit 107 acquires from the storage unit 109 the scan data of the document and the text data of the document stored in association with the acquired document specifying information.
  • the processing unit 107 obtains one or more words by dividing the character string included in the document based on the text data of the obtained document.
  • the processing unit 107 extracts document-related matters based on predetermined rules such as the format of each of the acquired one or more words and the arrangement of parts of speech.
  • the creation unit 110 acquires the document scan data and the document text data acquired by the processing unit 107, and the extracted document-related items.
  • the preparation unit 110 prepares a document response destined for the terminal device 200 including the acquired scan data of the document, the text data of the document, and the extracted document-related items.
  • the document management server 100 outputs the created document response to the communication unit 102 .
  • Conversion unit 101 executes a computer program (software) stored in the storage unit 109 by a hardware processor such as a CPU (Central Processing Unit). It is realized by Some or all of these functional units are hardware (circuits) such as LSI (Large Scale Integration), ASIC (Application Specific Integrated Circuit), FPGA (Field-Programmable Gate Array), GPU (Graphics Processing Unit) (including circuitry), or by cooperation between software and hardware.
  • LSI Large Scale Integration
  • ASIC Application Specific Integrated Circuit
  • FPGA Field-Programmable Gate Array
  • GPU Graphics Processing Unit
  • the computer program may be stored in advance in a storage device such as a HDD (Hard Disk Drive) or flash memory, or may be stored in a removable storage medium such as a DVD or CD-ROM, and the storage medium may be a drive device. may be installed by being attached to the storage device.
  • a storage device such as a HDD (Hard Disk Drive) or flash memory
  • a removable storage medium such as a DVD or CD-ROM
  • the storage medium may be a drive device.
  • the terminal device 200 is realized as a smart phone, a mobile terminal, a personal computer, a tablet terminal device, a wristwatch type terminal device, or other information processing equipment.
  • the terminal device 200 includes, for example, a display unit 201, a communication unit 202, an imaging unit 203, a creation unit 206, an operation detection unit 207, a display processing unit 208, and a storage unit 209.
  • the display unit 201 displays the screen output by the display processing unit 208 .
  • the display unit 201 displays an initial screen output by the display processing unit 208, a document management request screen, information indicating the start of document management, one or more pieces of document identification information, document scan data, and the like. indicate.
  • the imaging unit 203 scans a document and converts the document into digital still image information to create scan data of the document.
  • the communication unit 202 is implemented by a communication module.
  • the communication unit 202 communicates with other devices such as the document management server 100 via the network NW.
  • the communication unit 202 may communicate using a wireless communication method, for example.
  • the communication unit 202 may communicate using a communication method such as a wired LAN.
  • the communication unit 202 transmits the document management request, the document search request, and the document request output by the creation unit 206 to the document management server 100 .
  • the communication unit 202 receives the document management response, the document search response, and the document response transmitted by the document management server 100 .
  • the operation detection unit 207 is configured by, for example, a touch panel or the like, detects a touch operation on the screen displayed on the display unit 201 , and outputs the detection result of the touch operation to the creation unit 206 .
  • Touch operations include an operation of inputting a user ID and password, an operation of inputting items related to documents, an operation of pressing a document management request button, an operation of pressing a document search request button, an operation of inputting a search character string, and an OK button. This includes an operation of pressing , an operation of pressing a portion corresponding to the document specifying information to be browsed, and the like.
  • the storage unit 209 is implemented by, for example, a RAM (Random Access Memory), a ROM (Read Only Memory), a flash memory, or a hybrid storage device in which a plurality of these are combined.
  • a creation unit 206 creates a login request for logging into the document management system 1 . Specifically, when the creation unit 206 acquires a detection result of an operation of inputting a user ID and a password from the operation detection unit 207, the creation unit 206 generates a log-in information including the user ID and the password whose destination is the document management server 100. create a request; Creation unit 206 outputs the created login request to communication unit 202 .
  • the display processing unit 208 causes the display unit 201 to display an initial screen when the terminal device 200 logs in to the document management system 1 .
  • the initial screen includes a document management request button that is pressed when requesting document management, and a document search request button that is pressed when requesting document retrieval.
  • the creating unit 206 creates a document management request for requesting the document management system 1 to manage documents. Specifically, the creation unit 206 receives the detection result of the operation of pressing the document management request button from the operation detection unit 207 and the detection result of the operation of inputting the document-related items. The creation unit 206 acquires document scan data from the imaging unit 203 when the user U performs an operation to scan the document. The creation unit 206 creates a document management request addressed to the document management server 100, which includes a user ID, document-related items, and document scan data. The creation unit 206 outputs the created document management request to the communication unit 202 .
  • the display processing unit 208 acquires information indicating that management of the document contained in the document management response received by the communication unit 202 has started.
  • the display processing unit 208 causes the display unit 201 to display information indicating that management of the acquired document has started.
  • the creation unit 206 creates a document search request for requesting the document management system 1 to search for documents. Specifically, the creation unit 206 receives the detection result of the operation of pressing the document search request button from the operation detection unit 207, receives the detection result of the operation of inputting a search character string, and receives the detection result of the operation of pressing the OK button. When the detection result is input, a document search request addressed to the document management server 100 and including the user ID and information specifying the search character string is created. The creation unit 206 outputs the created document search request to the communication unit 202 .
  • the display processing unit 208 acquires one or more pieces of document identification information included in the document search response received by the communication unit 202 .
  • the display processing unit 208 causes the display unit 201 to display one or more pieces of acquired document specifying information.
  • a creation unit 206 creates a document request for requesting a document from the document management system 1 . Specifically, when the operation detecting unit 207 inputs a detection result of an operation of pressing a portion corresponding to the document specifying information to be browsed, the creation unit 206 detects the document specifying information corresponding to the portion pressed by the user U. A document request addressed to the document management server 100 is created. The creation unit 206 outputs the created document request to the communication unit 202 .
  • the display processing unit 208 acquires the document scan data, the document text data, and the document-related items included in the document response received by the communication unit 202 .
  • the display processing unit 208 causes the display unit 201 to display the selected one of the acquired scan data of the document, text data of the document, and document-related items.
  • All or part of the creation unit 206, the operation detection unit 207, and the display processing unit 208 are functional units realized by a processor such as a CPU executing a program stored in the storage unit 209 (hereinafter , software functions). All or part of these functional units may be implemented by hardware such as LSI, ASIC, or FPGA, or may be implemented by a combination of software functional units and hardware.
  • the computer program may be stored in advance in a storage device such as an HDD or flash memory, or may be stored in a removable storage medium such as a DVD or CD-ROM, and the storage medium may be loaded into a drive device. may be installed with
  • FIG. 3 is a diagram showing Example 1 of the operation of the document management system according to this embodiment. Processing for registering a document in the document management server 100 will be described with reference to FIG. (Step S1-1) Between the terminal device 200 and the document management server 100, processing is performed for the terminal device 200 to log in to the document management system 1. FIG. Here, the explanation will be continued for the case where the terminal device 200 has successfully logged into the document management system 1 . When the terminal device 200 fails to log in to the document management system 1, predetermined error processing is performed.
  • the display processing section 208 causes the display section 201 to display an initial screen when the terminal device 200 logs in to the document management system 1 .
  • the display processing unit 208 causes the display unit 201 to display the document management request screen when the detection result of the operation of pressing the document management request button is input from the operation detection unit 207 .
  • FIG. 4 is a diagram showing an example 1 of the document management request screen displayed by the terminal device according to the present embodiment.
  • the document request screen includes a field for entering the title of the document, a pull-down menu for selecting the type of contract, and radio buttons for selecting whether to enter in the Western calendar or the Japanese calendar. , and fields for entering the execution date, issue date, and expiration date.
  • the contract type can be selected from a pull-down menu.
  • the conclusion date, issue date, and expiration date can be entered directly or selected from the calendar.
  • FIG. 5 is a diagram showing an example 2 of the document management request screen displayed by the terminal device according to the present embodiment.
  • FIG. 5 shows a continuation of the document management request screen displayed by sliding the document management request screen shown in FIG. 4 in the vertical direction (downward).
  • the document request screen includes a pull-down menu for entering or selecting the contractor (company name, trade name, individual name) and a field for entering the name of the person in charge. .
  • the contractor can enter directly or select from a pull-down menu.
  • FIG. 6 is a diagram showing an example 3 of the document management request screen displayed by the terminal device according to the present embodiment.
  • FIG. 6 shows a continuation of the document management request screen displayed by further sliding the document management request screen shown in FIG. 5 in the vertical direction (downward).
  • the document request screen includes buttons for selecting whether to photograph the latest contract and non-disclosure agreement with a camera or to select from photographs.
  • the creation unit 206 acquires the detection result of the operation of inputting the document-related item from the operation detection unit 207 .
  • Step S3-1 In the terminal device 200, the imaging unit 203 scans a document.
  • FIG. 7 is for explaining document scanning processing by the terminal device according to the present embodiment.
  • the screen SC of the terminal device 200 displays the scan area SCA.
  • the scan area SCA can be adjusted by moving the positions of the upper left corner CUL, the upper right corner CUR, the lower left corner CLL, and the lower right corner CLR indicated by circles at the four corners of the scan area SCA.
  • the user U adjusts the distance between the terminal device 200 and the document DOC and the angle with respect to the document DOC so that the document DOC is within the scan area SCA.
  • Scan data of the document is created when the user U presses the scan button SCB.
  • the creation unit 206 receives the detection result of the operation of inputting document-related items from the operation detection unit 207, and acquires the document scan data from the imaging unit 203 when the user U performs an operation to scan the document.
  • Step S4-1) In the terminal device 200, the creation unit 206 creates a document management request addressed to the document management server 100, including the user ID, document-related items, and document scan data.
  • Step S5-1) In terminal device 200 , creation unit 206 outputs the created document management request to communication unit 102 .
  • the communication unit 102 acquires the document management request output by the creation unit 206 and transmits the acquired document management request to the document management server 100 .
  • Step S6-1) In the document management server 100 , the communication section 102 receives the document management request transmitted by the terminal device 200 .
  • the conversion unit 101 acquires the user ID and the document scan data included in the document management request received by the communication unit 102 .
  • the conversion unit 101 converts the acquired scan data of the document into text data of the document.
  • Step S7-1) In the document management server 100, the first reception section 103-1 acquires the document management request received by the communication section .
  • the first reception unit 103-1 receives the user ID, the document-related items, the scan data of the document, and the text data of the document included in the acquired document management request.
  • the first reception unit 103-1 creates document identification information that uniquely identifies the acquired document-related matters and the scan data of the document.
  • the first reception unit 103-1 acquires from the conversion unit 101 the text data of the document corresponding to the user ID included in the acquired document management request. First accepting unit 103-1 accepts the acquired user ID and the text data of the document. The first receiving unit 103-1 extracts one or a plurality of predetermined items such as a title included in the document based on the text data of the received document.
  • Step S8-1) In the document management server 100, the first dividing unit 104-1 acquires the user ID and text data of the document accepted by the first accepting unit 103-1. The first dividing unit 104-1 obtains one or more words by dividing the character string included in the document based on the text data of the obtained document.
  • Step S9-1) In the document management server 100, the first division unit 104-1 converts the display format of the alphanumeric characters contained in each of the acquired one or more words into half-size characters, and converts the representation format of the alphabetic characters into lowercase letters. Process to align the format.
  • Step S10-1 In the document management server 100, the first encryption unit 105-1 converts the display format of the user ID and alphanumeric characters from the first division unit 104-1 to half-width characters, and converts the representation format of alphabetic characters to lower case. word and get.
  • First encryption unit 105-1 acquires one or more predetermined items such as a title included in a document corresponding to the user ID acquired from first reception unit 103-1.
  • the first encryption unit 105-1 converts the display format of the acquired alphanumeric characters to half-size characters, converts the expression format of the alphabetic characters to lower case letters, and converts each of one or more words to one or more predetermined items. Encrypt.
  • Step S11-1 In the document management server 100, the registration unit 106 converts the user ID from the first encryption unit 105-1, the result of encrypting one or more predetermined items, and the alphanumeric display format into single-byte characters, and the result of encrypting each of the one or more words obtained by converting the expression form of to lower case.
  • the registration unit 106 encrypts each of the results obtained by encrypting one or more predetermined items and one or more words obtained by converting the display format of alphanumeric characters into half-width characters and converting the representation format of alphabetic characters into lowercase letters. index the document in order of appearance.
  • the processing unit 107 receives the user ID from the registration unit 106, the result of encrypting one or more predetermined items, and one or more items obtained by converting the display format of alphanumeric characters into single-byte characters and converting the representation format of alphabetic characters into lowercase characters. Get the index of the document that registered the results of encrypting each of the words in .
  • the processing unit 107 acquires document identification information corresponding to the acquired user ID and scan data of the document.
  • the processing unit 107 converts the obtained user ID, the document identification information, the result of encrypting one or a plurality of predetermined items, and the alphanumeric display format to half-size, and converts the alphabetic representation format to lowercase.
  • the index of the document in which the result of encrypting each of one or more words and the scanned data of the document are associated with each other and stored in the storage unit 109 .
  • Step S12-1) In the document management server 100, the creation unit 110 creates a document management response addressed to the terminal device 200, including information indicating that document management has started.
  • Step S13-1) In the document management server 100 , the creation unit 110 outputs the created document management response to the communication unit 102 .
  • the communication unit 102 acquires the document management response output by the creation unit 110 and transmits the acquired document management response to the terminal device 200 .
  • Step S14-1 In the terminal device 200 , the communication section 202 receives the document management response transmitted by the document management server 100 .
  • the display processing unit 208 acquires information indicating that management of the document contained in the document management response received by the communication unit 202 has started.
  • the display processing unit 208 causes the display unit 201 to display information indicating that management of the acquired document has started.
  • FIG. 8 is a diagram showing example 1 of information displayed on the terminal device according to the present embodiment. As shown in FIG. 8, the terminal device 200 displays, as an example of information indicating that document management has started, that the contract has been successfully uploaded and the text has been analyzed, and that the content of the contract has been confirmed. A button labeled .
  • FIG. 9 is a diagram showing Example 2 of information displayed on the terminal device according to the present embodiment. As shown in FIG. 9, the terminal device 200 displays "encrypted and stored in the cloud" as an example of information indicating that document management has started.
  • FIG. 10 is a diagram showing Example 2 of the operation of the document management system according to this embodiment. Referring to FIG. 10, the operation of referring to an arbitrary document from the documents stored in document management server 100 will be described. (Step S1-2) Between the terminal device 200 and the document management server 100, processing is performed for the terminal device 200 to log in to the document management system 1. FIG. Here, the explanation will be continued for the case where the terminal device 200 has successfully logged into the document management system 1 . When the terminal device 200 fails to log in to the document management system 1, predetermined error processing is performed.
  • the display processing section 208 causes the display section 201 to display an initial screen when the terminal device 200 logs in to the document management system 1 .
  • the display processing unit 208 causes the display unit 201 to display a document search request screen when the detection result of the operation of pressing the document search request button is input from the operation detection unit 207 .
  • FIG. 11 is a diagram showing Example 1 of information displayed on the terminal device according to the present embodiment.
  • FIG. 11 shows an example of a document search request screen.
  • the document search request screen includes a field for entering a search character string.
  • the document search request screen includes a field for entering a search character string when searching for documents containing the search character string.
  • the document search request screen includes a field for entering a search character string when searching for documents that do not contain the search character string.
  • FIG. 11 shows, as an example of the document search request screen, fields for entering search character strings when searching for documents containing search character strings, and fields for entering search character strings when searching for documents that do not contain search character strings. A case where a field for entering a character string for search is included is shown. Returning to FIG. 10, the description is continued.
  • the creation unit 206 receives from the operation detection unit 207 the detection result of the operation of pressing the document search request button, the detection result of the operation of entering a search character string, and the detection result of the operation of pressing the OK button.
  • a document search request addressed to the document management server 100 is created, which includes the user ID and information specifying the search character string.
  • a search character string is entered in a field for entering a search character string when searching for a document containing the search character string will be described below.
  • Step S3-2 In terminal device 200 , creation unit 206 outputs the created document search request to communication unit 102 .
  • the communication unit 102 acquires the document search request created by the creation unit 206 and transmits the acquired document search request to the document management server 100 .
  • Step S4-2) In the document management server 100 , the communication section 102 receives the document search request transmitted by the terminal device 200 .
  • the second reception unit 103-2 acquires the document search request received by the communication unit 102.
  • the second reception unit 103-2 receives information specifying the user ID and search character string included in the acquired document search request.
  • Second dividing unit 104-2 acquires the user ID and the text data of the search character string accepted by second accepting unit 103-2.
  • the second dividing unit 104-2 obtains one or more words by dividing the character string included in the search character string based on the obtained text data of the search character string.
  • Step S5-2 In the document management server 100, the second dividing unit 104-2 converts the display format of alphanumeric characters contained in each of the acquired one or more words into half-width characters, and converts the representation format of alphabetic characters into lowercase letters.
  • Step S6-2 In the document management server 100, the second encryption unit 105-2 converts the display format of the alphanumeric characters included in the user ID and search character string from the second dividing unit 104-2 into half-width characters, Gets one or more words converted to lowercase. The second encryption unit 105-2 encrypts each of one or a plurality of words included in the obtained search character string, converted from alphanumeric display format to half-size, and from alphabetic representation format to lowercase. do.
  • Step S7-2 In the document management server 100, the search unit 108 acquires the user ID and the result of encrypting each of one or more words from the second encryption unit 105-2.
  • the search unit 108 retrieves an index of a document that includes one or more of the results of encrypting one or more words obtained from among one or more document indexes stored in the storage unit 109 in association with the obtained user ID.
  • Search for The search unit 108 obtains one or more indexes of documents including any of the results of encrypting one or more words by searching.
  • the search unit 108 rearranges the obtained indexes of the one or more documents in descending order of the matching rate of the encrypted one or more words.
  • the search unit 108 acquires document identification information associated with each of the rearranged indexes of one or more documents.
  • Step S8-2) In the document management server 100 , the creating unit 110 creates a document search response addressed to the terminal device 200 , which includes one or more pieces of document specifying information acquired by the searching unit 108 .
  • the document management server 100 outputs the created document search response to the communication unit 102 .
  • the communication unit 102 acquires the document search response output by the creation unit 110 and transmits the acquired document search response to the terminal device 200 .
  • the communication section 202 receives the document search response transmitted by the document management server 100 .
  • the display processing unit 208 acquires one or more pieces of document identification information included in the document search response received by the communication unit 202 .
  • the display processing unit 208 causes the display unit 201 to display one or more pieces of acquired document specifying information.
  • Step S11-2) In the terminal device 200, the creation unit 206 receives from the operation detection unit 207 a detection result of an operation of pressing a portion corresponding to the document specifying information to be browsed. A document request addressed to the management server 100 is created.
  • Step S12-2) In terminal device 200 , creation unit 206 outputs the created document request to communication unit 102 .
  • the communication unit 102 acquires the document request output by the creation unit 206 and transmits the acquired document request to the document management server 100 .
  • Step S13-2) In the document management server 100 , the communication section 102 receives the document request transmitted by the terminal device 200 .
  • the processing unit 107 acquires the document request received by the communication unit 102 .
  • the processing unit 107 acquires document specifying information included in the acquired document request.
  • the processing unit 107 acquires from the storage unit 109 the scan data of the document and the text data of the document stored in association with the acquired document specifying information.
  • Step S14-2 In the document management server 100, the processing unit 107 obtains one or more words by dividing the character string included in the document based on the text data of the obtained document.
  • Step S15-2) In the document management server 100, the processing unit 107 extracts document-related matters based on predetermined rules such as the format of each of the acquired one or more words and the arrangement of parts of speech.
  • Step S16-2) In the document management server 100, the creation unit 110 acquires the scan data of the document acquired by the processing unit 107, the text data of the document, and the extracted document-related items. The preparation unit 110 prepares a document response destined for the terminal device 200 including the acquired scan data of the document, the text data of the document, and the extracted document-related items.
  • Step S17-2) In the document management server 100 , the creation unit 110 outputs the created document response to the communication unit 102 .
  • the communication unit 102 acquires the document response output by the creation unit 110 and transmits the acquired document response to the terminal device 200 .
  • Step S18-2 In the terminal device 200 , the communication section 202 receives the document response transmitted by the document management server 100 .
  • the display processing unit 208 acquires the document scan data, the document text data, and the document-related items included in the document response received by the communication unit 202 .
  • the display processing unit 208 causes the display unit 201 to display the selected one of the acquired scan data of the document, text data of the document, and document-related items.
  • FIG. 12 is a diagram showing example 2 of information displayed on the terminal device according to the present embodiment.
  • FIG. 12 shows a confidentiality agreement as an example of display.
  • contract information document-related items
  • document-related items such as the document title, contract type, conclusion date, issue date, and expiration date are displayed as shown in FIGS.
  • the terminal device 200 may convert document scan data into document text data, and extract document-related items from the document text data.
  • the terminal device 200 may display the extracted document-related items on the document management request screen.
  • the terminal device 200 includes a conversion unit and a processing unit.
  • the conversion unit converts the document scan data into document text data.
  • the processing unit extracts document-related items from the document text data.
  • the display processing unit 208 displays items corresponding to the document-related items extracted by the processing unit on the document management request screen.
  • the document index contains the result of encrypting one or more predetermined items and one or more words in which the alphanumeric representation format is converted to half-width and the alphabetic representation format is converted to lowercase.
  • the result of encrypting each of is registered has been described, the present invention is not limited to this example.
  • the result of encrypting one or more predetermined items and one or more words converted from alphanumeric display formats to half-width characters and from alphabetic representation formats to lowercase are encrypted. Either one of the converted result and the converted result may be registered.
  • the document management server 100 encrypts each of one or more words obtained by dividing the character string included in the document
  • the present invention is not limited to this example.
  • the first encryption unit 105-1 may encrypt one or more words that correspond to a predetermined part of speech such as a noun or a verb. By configuring in this way, the processing load on the document management server 100 can be reduced compared to the case where all one or more words are encrypted.
  • the document management server 100 encrypts each of one or more words obtained by dividing the character string included in the search character string
  • the present invention is limited to this example. Absent.
  • the second encryption unit 105-2 may encrypt words corresponding to a predetermined part of speech such as nouns and verbs among one or more words. By configuring in this way, the processing load on the document management server 100 can be reduced compared to the case where all one or more words are encrypted.
  • the document management server 100 includes a first reception section 103-1, a first division section 104-1, a first encryption section 105-1, a registration section 106, a processing unit 107, a second reception unit 103-2, a second division unit 104-2, a second encryption unit 105-2, and a search unit .
  • First reception unit 103-1 receives text data of a document.
  • First dividing unit 104-1 divides the text data received by first receiving unit 103-1 to obtain a plurality of words.
  • First encryption unit 105-1 encrypts each of the plurality of words acquired by first division unit 104-1.
  • Registration unit 106 registers each of the plurality of words encrypted by first encryption unit 105-1 in the document index.
  • the processing unit 107 associates the index of the document in which each of the plurality of words encrypted by the registration unit 106 is registered with the document, and stores them in the storage unit 109 as document data.
  • Second accepting unit 103-2 accepts a search character string.
  • Second division unit 104-2 divides the character string for search received by second reception unit 103-2 to acquire one or more words.
  • Second encryption unit 105-2 encrypts each of the one or more words acquired by second division unit 104-2.
  • Search unit 108 searches for a plurality of document data based on each of the one or more words encrypted by second encryption unit 105-2. By configuring in this way, each of the plurality of words obtained by dividing the text data of the document can be encrypted, and each of the plurality of encrypted words can be registered in the index of the document.
  • I can get the document. Therefore, the convenience of document management can be improved.
  • First dividing unit 104-1 converts a character string included in each of a plurality of words into a predetermined format
  • first encryption unit 105-1 converts first dividing unit 104-1 into a predetermined format. encrypted string.
  • First receiving unit 103-1 extracts one or more predetermined items included in the text data of the document, and first encryption unit 105-1 encrypts the one or more items extracted by first receiving unit 103-1.
  • Each of the predetermined items is encrypted, the registration unit 106 registers the encryption result of the one or more predetermined items in the index of the document, and the processing unit 107 causes the registration unit 106 to encrypt the one or more predetermined items.
  • the registration unit 106 registers each of the multiple words encrypted by the first encryption unit 105-1 in the document index based on the order of the multiple words included in the text data.
  • a document such as a contract
  • each of the plurality of words encrypted by the first encryption unit 105-1 can be registered in the document index based on the order of the plurality of words included in the text data. You can reduce the time to find a hit when searching the index for documents that contain one of multiple encrypted words.
  • the first encryption unit 105-1 encrypts words corresponding to a predetermined part of speech among the plurality of words acquired by the first division unit 104-1. By configuring in this way, the processing load on the document management server 100 can be reduced compared to the case where all one or more words are encrypted. It is assumed that particles such as particles cannot be used for document retrieval even if they are encrypted.
  • the search unit 108 searches for a plurality of encrypted words included in each index of a plurality of documents that match each of the one or more words encrypted by the second encryption unit 105-2. do. By configuring in this way, it is possible to search for documents containing search character strings.
  • the search unit 108 derives the match rate between each of the one or more words encrypted by the second encryption unit 105-2 and the plurality of encrypted words included in each of the indexes of the plurality of documents. do.
  • the match rate derives how many words (percentage) the encrypted word or words in the search string match the encrypted words in the index. can be anything.
  • a computer program for realizing the functions of the devices described above may be recorded on a computer-readable recording medium, and the computer program recorded on this recording medium may be read and executed by a computer system.
  • the “computer system” referred to here may include hardware such as an OS and peripheral devices.
  • “computer-readable recording medium” includes writable nonvolatile memories such as flexible discs, magneto-optical discs, ROMs and flash memories, portable media such as DVDs (Digital Versatile Discs), and computer system built-in media.
  • a storage device such as a hard disk that
  • “computer-readable recording medium” means a volatile memory (e.g., DRAM ( It also includes those that hold programs for a certain period of time, such as Dynamic Random Access Memory)).
  • the above program may be transmitted from a computer system storing this program in a storage device or the like to another computer system via a transmission medium or by a transmission wave in a transmission medium.
  • the "transmission medium” for transmitting the program refers to a medium having a function of transmitting information, such as a network (communication network) such as the Internet or a communication line (communication line) such as a telephone line.
  • the program may be for realizing part of the functions described above. Further, it may be a so-called difference file (difference program) that can realize the above-described functions in combination with a program already recorded in the computer system.
  • REFERENCE SIGNS LIST 1 document management system 100 document management server 101 conversion unit 102 communication unit 103-1 first reception unit 103-2 second reception unit 104-1 first division unit 104 -2... Second dividing unit 105-1... First encryption unit 105-2... Second encryption unit 106... Registration unit 107... Processing unit 108... Searching unit 109... Storage unit 110... Creation unit 200 Terminal device 201 Display unit 202 Communication unit 203 Imaging unit 206 Creation unit 207 Operation detection unit 208 Display processing unit 209 Storage unit

Abstract

This document management server comprises: a first reception unit that receives text data from a document; a first division unit that divides the received text data and acquires a plurality of words; a first encryption unit that encrypts each of the plurality of acquired words; a registration unit that registers each of the plurality of encrypted words in a document index; a processing unit that associates the document and the document index in which each of the plurality of encrypted words is registered and stores the result as document data in a storage unit; a second reception unit that receives a character string for a search; a second division unit that divides the received character string for a search and acquires one or a plurality of words; a second encryption unit that encrypts each of the one or more acquired words; and a search unit that searches a plurality of pieces of document data on the basis of each of the one or more encrypted words.

Description

文書管理サーバ、文書管理方法及びコンピュータプログラムDocument management server, document management method and computer program
 本発明は、文書管理サーバ、文書管理方法及びコンピュータプログラムに関する。
 本願は、2021年7月21日に、日本に出願された特願2021-120640号に基づき優先権を主張し、その内容をここに援用する。
The present invention relates to a document management server, document management method, and computer program.
This application claims priority based on Japanese Patent Application No. 2021-120640 filed in Japan on July 21, 2021, the contents of which are incorporated herein.
 例えば、契約書は、以下のように取り交わされる。契約者間で契約内容を合意した後、一方の契約者は、契約書を二通作成してサインを行い、他方の契約者に送付する。他方の契約者は契約書を受け取ると、二通ともサインをした後、一通を証拠として手元に保管し、もう一通を一方の契約相手に返送する。この場合、証拠として手元に保管される契約書は、書類棚に保管されて管理される。 For example, the contract is exchanged as follows. After agreeing on the contents of the contract between the contractors, one contractor prepares two copies of the contract, signs them, and sends them to the other contractor. When the other contracting party receives the contract, he/she signs both copies, keeps one copy as evidence, and returns the other copy to the other contracting party. In this case, the contract that is kept on hand as evidence is stored and managed in a document shelf.
 契約書などの文書を管理する技術に関して、効率的かつ確実に文書管理作業を行う技術が知られている(例えば特許文献1参照)。この技術によれば、文書管理装置は、文書情報取得部と、容器情報取得部と、文書入出情報記録部と、処理状況判定部と、処理状況情報出力部と、文書入出情報管理DBとを備える。文書情報取得部は、所定の文書に関する情報を取得する。容器情報取得部は、所定の容器に関する情報を取得する。文書入出情報記録部は、文書の入出に関する情報を所定の記録媒体に記録する。処理状況判定部は、所定の文書の処理状況を判定する。処理状況情報出力部は、文書の処理状況に関する情報を出力する。 Regarding techniques for managing documents such as contracts, techniques for efficiently and reliably managing documents are known (see Patent Document 1, for example). According to this technique, the document management apparatus includes a document information acquisition unit, a container information acquisition unit, a document input/output information recording unit, a processing status determination unit, a processing status information output unit, and a document input/output information management DB. Prepare. The document information acquisition unit acquires information on a predetermined document. The container information acquisition unit acquires information about a predetermined container. The document entry/exit information recording section records information relating to entry/exit of documents in a predetermined recording medium. The processing status determination unit determines the processing status of a given document. The processing status information output unit outputs information about the processing status of the document.
特開2007-265280号公報Japanese Patent Application Laid-Open No. 2007-265280
 契約書が書類棚に保管される場合には、目当ての契約書を書類棚の契約書の束から探し出すのが大変である。また、契約書の確認が必要になった場合には、契約書を確認するために出社しなければならない。また、海外支社などの離れた場所で管理されている契約書については、見返すことができない。仮に、ある契約に不備が発覚し、同様の契約書を参照したい場合に、その契約書を書類棚に保管された複数の契約書の束から探し出すのは大変である。 When contracts are stored in a document shelf, it is difficult to find the desired contract from the bundle of contracts in the document shelf. In addition, when it becomes necessary to confirm the contract, the employee must come to the office to confirm the contract. In addition, it is not possible to review contracts that are managed at remote locations such as overseas branch offices. If a defect is discovered in a certain contract and the user wants to refer to the same contract, it is difficult to find the contract from the bundle of contracts stored in the document shelf.
 また、不利な契約があったとしても、その契約を見直す機会がないため、いつの間にか自動更新済みになってしまうおそれがある。仮に見返す必要がある契約書があった場合には、その契約書を手元に置いておく必要があるため、保管場所を倉庫に移せない。
 契約書を電子データ化して保管する場合には、製本された契約書をスキャンするのは大変である。紙の書面や複数の電子契約サービスの利用で管理がバラバラになるおそれがある。以上は、契約書などの文書に当てはまる。
 本発明の目的は、文書管理の利便性を向上できる文書管理サーバ、文書管理方法及びコンピュータプログラムを提供することにある。
Also, even if there is an unfavorable contract, there is no opportunity to review the contract, so there is a risk that the contract will automatically be renewed. If there is a contract that needs to be reviewed, it is necessary to keep the contract at hand, so the storage location cannot be moved to the warehouse.
When the contract is converted into electronic data and stored, it is difficult to scan the bound contract. The use of paper documents and multiple electronic contract services may lead to disjointed management. The above applies to documents such as contracts.
SUMMARY OF THE INVENTION An object of the present invention is to provide a document management server, a document management method, and a computer program that can improve the convenience of document management.
(1)本発明の一態様は、文書のテキストデータを受け付ける第1受付部と、前記第1受付部が受け付けた前記テキストデータを分割し、複数の単語を取得する第1分割部と、前記第1分割部が取得した複数の前記単語の各々を暗号化する第1暗号化部と、前記第1暗号化部によって暗号化された複数の前記単語の各々を前記文書のインデックスに登録する登録部と、前記登録部によって暗号化された複数の前記単語の各々が登録された前記文書の前記インデックスと前記文書とを関連付けて文書データとして記憶部に記憶させる処理部と、検索用文字列を受け付ける第2受付部と、前記第2受付部が受け付けた前記検索用文字列を分割し、一又は複数の単語を取得する第2分割部と、前記第2分割部が取得した一又は複数の前記単語の各々を暗号化する第2暗号化部と、前記第2暗号化部によって暗号化された一又は複数の前記単語の各々に基づいて複数の前記文書データを検索する検索部とを備える、文書管理サーバである。
(2)本発明の一態様は、上記(1)に記載の文書管理サーバにおいて、前記第1分割部は、複数の前記単語の各々に含まれる文字列を所定の形式に変換し、前記第1暗号化部は、前記第1分割部が所定の前記形式に変換した前記文字列を暗号化する。
(3)本発明の一態様は、上記(1)又は上記(2)に記載の文書管理サーバにおいて、前記第1受付部は、前記文書の前記テキストデータに含まれる所定の項目を一又は複数抽出し、前記第1暗号化部は、前記第1受付部が抽出した一又は複数の所定の前記項目の各々を暗号化し、前記登録部は、一又は複数の所定の前記項目を暗号化した結果を前記文書のインデックスに登録し、前記処理部は、前記登録部によって一又は複数の所定の前記項目を暗号化した結果がさらに登録された前記文書の前記インデックスと前記文書とを関連付けて文書データとして記憶部に記憶させる。
(4)本発明の一態様は、上記(1)から上記(3)のいずれか一項に記載の文書管理サーバにおいて、前記登録部は、前記テキストデータに含まれる複数の前記単語の順序に基づいて、前記第1暗号化部によって暗号化された複数の前記単語の各々を前記文書の前記インデックスに登録する。
(5)本発明の一態様は、上記(1)から上記(4)のいずれか一項に記載の文書管理サーバにおいて、前記第1暗号化部は、前記第1分割部が取得した複数の前記単語のうち、所定の品詞に該当する単語を暗号化する。
(6)本発明の一態様は、上記(1)から上記(5)のいずれか一項に記載の文書管理サーバにおいて、前記検索部は、前記第2暗号化部によって暗号化された一又は複数の前記単語の各々に、複数の文書のインデックスの各々に含まれる暗号化された複数の前記単語が一致するものを検索する。
(7)本発明の一態様は、上記(1)から上記(6)のいずれか一項に記載の文書管理サーバにおいて、前記検索部は、前記第2暗号化部によって暗号化された一又は複数の前記単語の各々と、複数の文書のインデックスの各々に含まれる暗号化された複数の前記単語との一致率を導出する。
(1) One aspect of the present invention includes: a first receiving unit that receives text data of a document; a first dividing unit that divides the text data received by the first receiving unit to obtain a plurality of words; a first encryption unit for encrypting each of the plurality of words obtained by the first division unit; and a registration for registering each of the plurality of words encrypted by the first encryption unit in an index of the document. a processing unit that associates the index of the document in which each of the plurality of words encrypted by the registration unit is registered with the document and stores the document as document data in a storage unit; a second receiving unit that receives, a second dividing unit that divides the search character string received by the second receiving unit to obtain one or more words, and one or more words that the second dividing unit obtains a second encryption unit that encrypts each of the words; and a search unit that searches for a plurality of the document data based on each of the one or more words encrypted by the second encryption unit. , is a document management server.
(2) In one aspect of the present invention, in the document management server according to (1) above, the first dividing unit converts a character string included in each of the plurality of words into a predetermined format, 1 encryption unit encrypts the character string converted into the predetermined format by the first division unit.
(3) In one aspect of the present invention, in the document management server according to (1) or (2) above, the first reception unit receives one or more predetermined items included in the text data of the document. the first encryption unit encrypts each of the one or more predetermined items extracted by the first reception unit; and the registration unit encrypts the one or more predetermined items. The result is registered in the index of the document, and the processing unit associates the document with the index of the document in which the result of encrypting one or more predetermined items by the registration unit is further registered to the document. It is stored in the storage unit as data.
(4) In one aspect of the present invention, in the document management server according to any one of (1) to (3) above, the registration unit arranges the words in the order of the plurality of words included in the text data. Based on this, each of the plurality of words encrypted by the first encryption unit is registered in the index of the document.
(5) In one aspect of the present invention, in the document management server according to any one of (1) to (4) above, the first encryption unit stores a plurality of Among the words, words corresponding to a predetermined part of speech are encrypted.
(6) In one aspect of the present invention, in the document management server according to any one of (1) to (5) above, the search unit is encrypted by the second encryption unit. Each of the plurality of words is searched for a match of the plurality of encrypted words contained in each of the indices of the plurality of documents.
(7) In one aspect of the present invention, in the document management server according to any one of (1) to (6) above, the search unit is encrypted by the second encryption unit or Deriving a match rate between each of the plurality of words and the encrypted plurality of words contained in each of the indices of the plurality of documents.
(8)本発明の一態様は、文書管理サーバが実行する文書管理方法であって、文書のテキストデータを受け付けるステップと、前記テキストデータを分割し、複数の単語を取得するステップと、複数の前記単語の各々を暗号化するステップと、暗号化する前記ステップで暗号化された複数の前記単語の各々を前記文書のインデックスに登録するステップと、暗号化された複数の前記単語の各々が登録された前記文書の前記インデックスと前記文書とを関連付けて文書データとして記憶部に記憶させるステップと、検索用文字列を受け付けるステップと、前記検索用文字列を分割し、一又は複数の単語を取得するステップと、前記検索用文字列を分割した一又は複数の単語の各々を暗号化するステップと、暗号化された前記検索用文字列を分割した一又は複数の前記単語の各々に基づいて複数の前記文書データを検索するステップとを有する、文書管理方法である。 (8) An aspect of the present invention is a document management method executed by a document management server, comprising: receiving text data of a document; dividing the text data to obtain a plurality of words; encrypting each of said words; registering each of said plurality of words encrypted in said step of encrypting in said document index; and registering each of said plurality of encrypted words. a step of associating the index of the document obtained with the document and storing it in a storage unit as document data; a step of receiving a character string for search; dividing the character string for search and acquiring one or more words; encrypting each of the one or more words obtained by dividing the search string; and based on each of the one or more words obtained by dividing the encrypted search string, a plurality of and retrieving the document data of the document management method.
(9)本発明の一態様は、コンピュータに、文書のテキストデータを受け付けるステップと、前記テキストデータを分割し、複数の単語を取得するステップと、複数の前記単語の各々を暗号化するステップと、暗号化する前記ステップで暗号化された複数の前記単語の各々を前記文書のインデックスに登録するステップと、暗号化された複数の前記単語の各々が登録された前記文書の前記インデックスと前記文書とを関連付けて文書データとして記憶部に記憶させるステップと、検索用文字列を受け付けるステップと、前記検索用文字列を分割し、一又は複数の単語を取得するステップと、前記検索用文字列を分割した一又は複数の単語の各々を暗号化するステップと、暗号化された前記検索用文字列を分割した一又は複数の前記単語の各々に基づいて複数の前記文書データを検索するステップとを実行させる、コンピュータプログラムである。 (9) One aspect of the present invention provides a computer with the steps of receiving text data of a document, dividing the text data to obtain a plurality of words, and encrypting each of the plurality of words. registering each of the plurality of words encrypted in the step of encrypting in an index of the document; and the index of the document and the document in which each of the plurality of encrypted words is registered. a step of associating and storing in a storage unit as document data; a step of receiving a search character string; a step of dividing the search character string to obtain one or more words; encrypting each of the divided one or more words; and searching the plurality of document data based on each of the one or more divided words of the encrypted search character string. A computer program that is executed.
 本発明によれば、文書管理の利便性を向上できる文書管理サーバ、文書管理方法及びコンピュータプログラムを提供できる。 According to the present invention, it is possible to provide a document management server, a document management method, and a computer program that can improve the convenience of document management.
本発明の実施形態の文書管理システムの構成例を示す図である。1 is a diagram showing a configuration example of a document management system according to an embodiment of the present invention; FIG. 本実施形態に係る文書管理システムに含まれる文書管理サーバと、端末装置との詳細を示す図である。3 is a diagram showing details of a document management server and terminal devices included in the document management system according to the embodiment; FIG. 本実施形態に係る文書管理システムの動作の例1を示す図である。FIG. 4 is a diagram showing an example 1 of operation of the document management system according to the present embodiment; 本実施形態に係る端末装置が表示する文書管理依頼画面の例1を示す図である。FIG. 4 is a diagram showing an example 1 of a document management request screen displayed by the terminal device according to the embodiment; 本実施形態に係る端末装置が表示する文書管理依頼画面の例2を示す図である。FIG. 10 is a diagram showing an example 2 of a document management request screen displayed by the terminal device according to the embodiment; 本実施形態に係る端末装置が表示する文書管理依頼画面の例3を示す図である。FIG. 10 is a diagram showing an example 3 of a document management request screen displayed by the terminal device according to the embodiment; 本実施形態に係る端末装置による文書のスキャン処理を説明するためのである。This is for explaining document scanning processing by the terminal device according to the present embodiment. 本実施形態に係る端末装置に表示される情報の例1を示す図である。FIG. 4 is a diagram showing an example 1 of information displayed on the terminal device according to the embodiment; 本実施形態に係る端末装置に表示される情報の例2を示す図である。It is a figure which shows the example 2 of the information displayed on the terminal device which concerns on this embodiment. 本実施形態に係る文書管理システムの動作の例2を示す図である。FIG. 10 is a diagram showing an example 2 of the operation of the document management system according to this embodiment; 本実施形態に係る端末装置に表示される情報の例1を示す図である。FIG. 4 is a diagram showing an example 1 of information displayed on the terminal device according to the embodiment; 本実施形態に係る端末装置に表示される情報の例2を示す図である。It is a figure which shows the example 2 of the information displayed on the terminal device which concerns on this embodiment.
 次に、本実施形態の文書管理サーバ、文書管理方法及びコンピュータプログラムを、図面を参照しつつ説明する。以下で説明する実施形態は一例に過ぎず、本発明が適用される実施形態は、以下の実施形態に限られない。
 なお、実施形態を説明するための全図において、同一の機能を有するものは同一符号を用い、繰り返しの説明は省略する。
 また、本願でいう「XXに基づいて」とは、「少なくともXXに基づく」ことを意味し、XXに加えて別の要素に基づく場合も含む。また、「XXに基づいて」とは、XXを直接に用いる場合に限定されず、XXに対して演算や加工が行われたものに基づく場合も含む。「XX」は、任意の要素(例えば、任意の情報)である。
Next, the document management server, document management method, and computer program of this embodiment will be described with reference to the drawings. The embodiments described below are merely examples, and embodiments to which the present invention is applied are not limited to the following embodiments.
In addition, in all the drawings for explaining the embodiments, the same reference numerals are used for the parts having the same functions, and repeated explanations are omitted.
In addition, "based on XX" in the present application means "based on at least XX", and includes cases based on other elements in addition to XX. Moreover, "based on XX" is not limited to the case of using XX directly, but also includes the case of being based on what has been calculated or processed with respect to XX. "XX" is an arbitrary element (for example, arbitrary information).
 (第1の実施形態)
 (文書管理システム)
 図1は、本発明の実施形態の文書管理システムの構成例を示す図である。図1において、文書管理システム1は、文書管理サーバ100を備える。図1には、文書管理サーバ100に加え、ユーザーUなどの文書の管理を依頼する顧客が使用する端末装置200が示されている。
 文書管理サーバ100と端末装置200とは、ネットワークNWを介して通信する。ネットワークNWは、例えば、インターネット、WAN(Wide Area Network)、LAN(Local Area Network)、プロバイダ装置、無線基地局などを含む。
(First embodiment)
(Document management system)
FIG. 1 is a diagram showing a configuration example of a document management system according to an embodiment of the present invention. In FIG. 1, the document management system 1 has a document management server 100 . In addition to the document management server 100, FIG. 1 shows a terminal device 200 used by a customer such as a user U who requests document management.
The document management server 100 and the terminal device 200 communicate via the network NW. The network NW includes, for example, the Internet, a WAN (Wide Area Network), a LAN (Local Area Network), a provider device, a radio base station, and the like.
 文書の管理を依頼するユーザーUは、文書管理システム1への登録を行っており、ユーザーの識別情報(以下「ユーザーID」という)とパスワードとが、文書管理システム1に設定されている。ユーザーUは、端末装置200を操作することによって文書管理サーバ100にアクセスする。ユーザーUは、端末装置200にユーザーIDとパスワードとを入力し、ログイン操作を行うことによって、文書管理システム1にログインする。ユーザーUは、文書管理システム1の初期画面を参照し、文書管理要求ボタンを押すことによって端末装置200に文書管理依頼画面を表示させる。文書の一例は、契約書である。以下、一例として、文書が契約書である場合について説明を続ける。
 ユーザーUは、端末装置200を操作することによって、文書管理依頼画面にタイトル、契約種別などの文書関連事項を入力する。ユーザーUは、端末装置200を操作することによって、文書をスキャンする。端末装置200は、ユーザーIDと文書関連事項と文書のスキャンデータとを含む、文書管理サーバ100を宛先とする、文書管理要求を作成する。端末装置200は、作成した文書管理要求を文書管理サーバ100へ送信する。
A user U who requests document management is registered in the document management system 1 , and user identification information (hereinafter referred to as “user ID”) and password are set in the document management system 1 . A user U accesses the document management server 100 by operating the terminal device 200 . The user U logs into the document management system 1 by entering a user ID and a password into the terminal device 200 and performing a login operation. The user U refers to the initial screen of the document management system 1 and presses the document management request button to display the document management request screen on the terminal device 200 . An example of a document is a contract. As an example, the description will be continued below for the case where the document is a contract.
The user U operates the terminal device 200 to enter document-related items such as the title and contract type on the document management request screen. A user U scans a document by operating the terminal device 200 . The terminal device 200 creates a document management request addressed to the document management server 100, which includes a user ID, document-related items, and document scan data. The terminal device 200 transmits the created document management request to the document management server 100 .
 文書管理サーバ100は、端末装置200が送信した文書管理要求を受信する。文書管理サーバ100は、受信した文書管理要求に含まれるユーザーIDと文書関連事項と文書のスキャンデータとを取得する。文書管理サーバ100は、取得した文書関連事項と文書のスキャンデータとを一意に特定する情報(以下「文書特定情報」という)を作成する。
 文書管理サーバ100は、取得した文書のスキャンデータを処理することによってテキストデータに変換する。文書管理サーバ100は、文書のテキストデータに基づいて、文書に含まれるタイトルなどの所定の項目を一又は複数抽出する。文書管理サーバ100は、取得した一又は複数の所定の項目を暗号化する。
 文書管理サーバ100は、文書のテキストデータに基づいて、文書に含まれる文字列を分割することによって一又は複数の単語を取得する。文書管理サーバ100は、取得した一又は複数の単語の各々に含まれる英数字の表示形式を揃える処理を行う。具体的には、文書管理サーバ100は、一又は複数の単語の各々に含まれる英数字の表示形式を全角に揃えてもよいし、半角に揃えてもよい。
The document management server 100 receives the document management request transmitted by the terminal device 200 . The document management server 100 acquires the user ID, document-related items, and document scan data included in the received document management request. The document management server 100 creates information (hereinafter referred to as “document identification information”) that uniquely identifies the acquired document-related matters and the scan data of the document.
The document management server 100 converts the scanned data of the acquired document into text data by processing it. The document management server 100 extracts one or more predetermined items such as a title included in the document based on the text data of the document. The document management server 100 encrypts the obtained one or more predetermined items.
The document management server 100 obtains one or more words by dividing the character string included in the document based on the text data of the document. The document management server 100 performs a process of aligning the display formats of alphanumeric characters contained in each of the acquired one or more words. Specifically, the document management server 100 may align the display format of alphanumeric characters included in each of one or more words to full-width or half-width.
 文書管理サーバ100は、取得した一又は複数の単語の各々に含まれる英字の表現形式を揃える処理を行う。具体的には、文書管理サーバ100は、一又は複数の単語の各々に含まれる英字の表現形式を小文字に揃えてもよいし、大文字に揃えてもよい。一例として、文書管理サーバ100が、一又は複数の単語の各々に含まれる英数字の表示形式を半角に変換し、英字の表現形式を小文字に変換する場合について説明を続ける。
 文書管理サーバ100は、英数字の表示形式を半角に変換し、英字の表現形式を小文字に変換した一又は複数の単語の各々を暗号化する。文書管理サーバ100は、暗号化された一又は複数の所定の項目を出現順に文書のインデックス(索引、見出し)に登録する。文書管理サーバ100は、一又は複数の暗号化された単語の各々を出現順に文書のインデックス(索引、見出し)に登録する。
The document management server 100 performs a process of aligning the expressions of alphabetic characters included in each of the acquired one or more words. Specifically, the document management server 100 may align the representation format of the alphabetic characters included in each of one or more words to lowercase letters or uppercase letters. As an example, a case where the document management server 100 converts the representation format of alphanumeric characters contained in each of one or more words into half-width characters and converts the representation format of alphabetic characters into lowercase letters will be continued.
The document management server 100 converts the display format of alphanumeric characters into half-width characters and converts the representation format of alphabetic characters into lowercase letters, and encrypts each of one or more words. The document management server 100 registers one or more encrypted predetermined items in a document index (index, headline) in order of appearance. The document management server 100 registers each of the one or more encrypted words in the document index (index, headline) in order of appearance.
 文書管理サーバ100は、ユーザーIDと、文書特定情報と、一又は複数の暗号化された所定の項目の各々と一又は複数の暗号化された単語の各々とが登録された文書のインデックスと、文書のスキャンデータとを関連付けて記憶部に記憶させる。文書管理サーバ100は、文書の管理を開始したことを示す情報を含む、端末装置200を宛先とする文書管理応答を作成する。文書管理サーバ100は、作成した文書管理応答を端末装置200へ送信する。 The document management server 100 has a user ID, document identification information, a document index in which each of one or more encrypted predetermined items and each of one or more encrypted words are registered, The data is stored in the storage unit in association with the scan data of the document. The document management server 100 creates a document management response addressed to the terminal device 200 and including information indicating that document management has started. The document management server 100 transmits the created document management response to the terminal device 200 .
 端末装置200は、文書管理サーバ100が送信した文書管理応答を受信する。端末装置200は、受信した文書管理応答に含まれる文書の管理を開始したことを示す情報を取得する。端末装置200は、取得した文書の管理を開始したことを示す情報を表示する。
 文書の検索を依頼するユーザーUは、端末装置200を操作することによって文書管理サーバ100にアクセスする。ユーザーUは、端末装置200にユーザーIDとパスワードとを入力し、ログイン操作を行うことによって、文書管理システム1にログインする。
 ユーザーUは、文書管理システム1の初期画面を参照し、文書検索依頼ボタンを押すことによって端末装置200に文書検索依頼画面を表示させる。ユーザーUは、端末装置200を操作することによって、文書検索依頼画面に検索キーワードなどの検索用文字列を入力し、OKボタンを押す操作を行う。端末装置200は、ユーザーIDと検索用文字列を特定する情報とを含む、文書管理サーバ100を宛先とする、文書検索要求を作成する。端末装置200は、作成した文書検索要求を文書管理サーバ100へ送信する。
The terminal device 200 receives the document management response sent by the document management server 100 . The terminal device 200 acquires information indicating that management of the document contained in the received document management response has started. The terminal device 200 displays information indicating that management of the acquired document has started.
A user U requesting a document search accesses the document management server 100 by operating the terminal device 200 . The user U logs into the document management system 1 by entering a user ID and a password into the terminal device 200 and performing a login operation.
The user U refers to the initial screen of the document management system 1 and pushes the document search request button to display the document search request screen on the terminal device 200 . The user U operates the terminal device 200 to input a search character string such as a search keyword on the document search request screen and press the OK button. The terminal device 200 creates a document search request addressed to the document management server 100, which includes the user ID and information specifying the search character string. The terminal device 200 transmits the created document search request to the document management server 100 .
 文書管理サーバ100は、端末装置200が送信した文書検索要求を受信する。文書管理サーバ100は、受信した文書検索要求に含まれるユーザーIDと検索用文字列を特定する情報とを取得する。文書管理サーバ100は、検索用文字列のテキストデータに基づいて、検索用文字列に含まれる文字列を分割することによって一又は複数の単語を取得する。文書管理サーバ100は、取得した一又は複数の単語の各々に含まれる英数字の表示形式を揃える処理を行う。具体的には、文書管理サーバ100は、一又は複数の単語の各々に含まれる英数字の表示形式を全角に揃えてもよいし、半角に揃えてもよい。
 文書管理サーバ100は、取得した一又は複数の単語の各々に含まれる英字の表現形式を揃える処理を行う。具体的には、文書管理サーバ100は、一又は複数の単語の各々に含まれる英字の表現形式を小文字に揃えてもよいし、大文字に揃えてもよい。一例として、文書管理サーバ100が、一又は複数の単語の各々に含まれる英数字の表示形式を半角に変換し、英字の表現形式を小文字に変換する場合について説明を続ける。
 文書管理サーバ100は、英数字の表示形式を半角に変換し、英字の表現形式を小文字に変換した一又は複数の単語の各々を暗号化する。
The document management server 100 receives the document search request sent by the terminal device 200 . The document management server 100 acquires the user ID and the information specifying the search character string included in the received document search request. Based on the text data of the search character string, the document management server 100 obtains one or more words by dividing the character string included in the search character string. The document management server 100 performs a process of aligning the display formats of alphanumeric characters contained in each of the acquired one or more words. Specifically, the document management server 100 may align the display format of alphanumeric characters included in each of one or more words to full-width or half-width.
The document management server 100 performs a process of aligning the expressions of alphabetic characters included in each of the acquired one or more words. Specifically, the document management server 100 may align the representation format of the alphabetic characters included in each of one or more words to lowercase letters or uppercase letters. As an example, a case where the document management server 100 converts the representation format of alphanumeric characters contained in each of one or more words into half-width characters and converts the representation format of alphabetic characters into lowercase letters will be continued.
The document management server 100 converts the display format of alphanumeric characters into half-width characters and converts the representation format of alphabetic characters into lowercase letters, and encrypts each of one or more words.
 文書管理サーバ100は、取得したユーザーIDに関連付けて記憶部に記憶されている一又は複数の文書のインデックスのうち、一又は複数の暗号化された単語のいずれかを含む文書のインデックスを検索する。文書管理サーバ100は、検索することによって一又は複数の暗号化された単語のいずれかを含む文書のインデックスを一又は複数取得する。文書管理サーバ100は、一又は複数の暗号化された単語の一致率が高い順に、取得した一又は複数の文書のインデックスを並べ変える。文書管理サーバ100は、並べ替えた一又は複数の文書のインデックスの各々に関連付けられている文書特定情報を取得する。文書管理サーバ100は、取得した一又は複数の文書特定情報を含む、端末装置200を宛先とする文書検索応答を作成する。文書管理サーバ100は、作成した文書検索応答を、端末装置200へ送信する。 The document management server 100 searches for a document index containing one or more encrypted words from among one or more document indexes stored in the storage unit in association with the acquired user ID. . The document management server 100 retrieves one or more indexes of documents containing either one or more of the encrypted words by searching. The document management server 100 rearranges the indexes of the obtained one or more documents in descending order of matching rate of one or more encrypted words. The document management server 100 acquires document identification information associated with each of the rearranged one or more document indexes. The document management server 100 creates a document search response addressed to the terminal device 200 including one or more pieces of acquired document identification information. The document management server 100 transmits the created document search response to the terminal device 200 .
 端末装置200は、文書管理サーバ100が送信した文書検索応答を受信する。端末装置200は、受信した文書検索応答に含まれる一又は複数の文書特定情報を取得する。端末装置200は、取得した一又は複数の文書特定情報を表示する。
 ユーザーUは、端末装置200に表示された一又は複数の文書特定情報を参照し、閲覧する文書特定情報に該当する部分を押すことによって、閲覧する文書を選択する。端末装置200は、ユーザーUによって押された部分に該当する文書特定情報を含む、文書管理サーバ100を宛先とする文書要求を作成する。文書管理サーバ100は、作成した文書要求を文書管理サーバ100へ送信する。
The terminal device 200 receives the document search response sent by the document management server 100 . The terminal device 200 acquires one or more pieces of document identification information included in the received document search response. The terminal device 200 displays the acquired one or more pieces of document specifying information.
The user U refers to one or a plurality of pieces of document specifying information displayed on the terminal device 200, and selects a document to be browsed by pressing a portion corresponding to the document specifying information to be browsed. The terminal device 200 creates a document request addressed to the document management server 100, including the document identification information corresponding to the part pressed by the user U. FIG. The document management server 100 transmits the created document request to the document management server 100 .
 文書管理サーバ100は、端末装置200が送信した文書要求を受信する。文書管理サーバ100は、受信した文書要求に含まれる文書特定情報を取得する。文書管理サーバ100は、取得した文書特定情報に関連付けて記憶されている文書のスキャンデータと文書のテキストデータを取得する。文書管理サーバ100は、取得した文書のテキストデータに基づいて、文書に含まれる文字列を分割することによって一又は複数の単語を取得する。文書管理サーバ100は、取得した一又は複数の単語の各々の書式、品詞の並びなどの所定の規則に基づいて、文書関連事項を抽出する。文書管理サーバ100は、取得した文書のスキャンデータと文書のテキストデータと文書関連事項とを含む端末装置200を宛先とする文書応答を作成する。文書管理サーバ100は、作成した文書応答を、端末装置200へ送信する。
 端末装置200は、文書管理サーバ100が送信した文書応答を受信する。端末装置200は、受信した文書応答に含まれる文書のスキャンデータと文書のテキストデータと文書関連事項とを取得する。端末装置200は、取得した文書のスキャンデータと文書のテキストデータと文書関連事項とのうち選択されたものを表示する。
The document management server 100 receives the document request sent by the terminal device 200 . The document management server 100 acquires document specifying information included in the received document request. The document management server 100 acquires document scan data and document text data stored in association with the acquired document specifying information. The document management server 100 obtains one or more words by dividing the character string included in the document based on the text data of the obtained document. The document management server 100 extracts document-related matters based on predetermined rules such as the format of each of the acquired one or more words and the arrangement of parts of speech. The document management server 100 creates a document response destined for the terminal device 200, which includes the acquired scan data of the document, text data of the document, and document-related items. The document management server 100 transmits the created document response to the terminal device 200 .
The terminal device 200 receives the document response sent by the document management server 100 . The terminal device 200 acquires the document scan data, the document text data, and the document-related items included in the received document response. The terminal device 200 displays the selected one of the acquired scan data of the document, text data of the document, and document-related items.
 以下、文書管理システム1に含まれる文書管理サーバ100と、端末装置200とについて、順次説明する。
 図2は、本実施形態に係る文書管理システムに含まれる文書管理サーバと、端末装置との詳細を示す図である。
 (文書管理サーバ100)
 文書管理サーバ100は、パーソナルコンピュータ、サーバ、スマートフォン、タブレットコンピュータ又は産業用コンピュータ等の装置によって実現される。文書管理サーバ100がクラウドサーバであってもよい。
 文書管理サーバ100は、例えば、変換部101と、通信部102と、第1受付部103-1と、第2受付部103-2と、第1分割部104-1と、第2分割部104-2と、第1暗号化部105-1と、第2暗号化部105-2と、登録部106と、処理部107と、検索部108と、作成部110とを備える。
The document management server 100 and the terminal device 200 included in the document management system 1 will be sequentially described below.
FIG. 2 is a diagram showing the details of the document management server and the terminal device included in the document management system according to this embodiment.
(Document management server 100)
The document management server 100 is implemented by a device such as a personal computer, server, smart phone, tablet computer, or industrial computer. The document management server 100 may be a cloud server.
The document management server 100 includes, for example, a conversion unit 101, a communication unit 102, a first reception unit 103-1, a second reception unit 103-2, a first division unit 104-1, a second division unit 104 -2, a first encryption unit 105-1, a second encryption unit 105-2, a registration unit 106, a processing unit 107, a search unit 108, and a creation unit 110.
 通信部102は、通信モジュールによって実現される。通信部102は、ネットワークNWを介して、外部の通信装置と通信する。通信部102は、例えば有線LANなどの通信方式で通信してもよい。また、通信部102は、例えば無線LAN、ブルートゥース(登録商標)又はLTE(登録商標)などの無線通信方式で通信してもよい。通信部102は、ネットワークNWを介して端末装置200と通信するために必要な通信情報を保持する。
 通信部102は、端末装置200が送信したログイン要求と、文書管理要求と、文書検索要求と、文書要求とを受信する。通信部102は、作成部110が出力したログイン応答と、文書管理応答と、文書検索応答と、文書応答とを取得し、取得した文書管理応答と、文書検索応答と、文書応答とを、端末装置200へ送信する。
The communication unit 102 is implemented by a communication module. The communication unit 102 communicates with an external communication device via the network NW. The communication unit 102 may communicate using a communication method such as a wired LAN, for example. Further, the communication unit 102 may communicate by a wireless communication method such as a wireless LAN, Bluetooth (registered trademark), or LTE (registered trademark). The communication unit 102 holds communication information necessary for communicating with the terminal device 200 via the network NW.
The communication unit 102 receives the login request, the document management request, the document search request, and the document request transmitted by the terminal device 200 . The communication unit 102 acquires the login response, the document management response, the document search response, and the document response output by the creation unit 110, and transmits the acquired document management response, document search response, and document response to the terminal. Send to device 200 .
 記憶部109は、HDD(Hard Disk Drive)やフラッシュメモリ、RAM(Random Access Memory)、ROM(Read Only Memory)などにより実現される。記憶部109は、一又は複数のユーザーIDと、一又は複数のユーザーIDの各々についてパスワードをと関連付けて記憶する。
 処理部107は、通信部102が受信したログイン要求に含まれるユーザーIDとパスワードとを取得する。処理部107は、取得したユーザーIDとパスワードとの組み合わせが記憶部109に予め登録されている場合にはログインが成功とし、登録されていない場合にはログインが失敗とする。
 作成部110は、処理部107によってログインが成功とされた場合には初期画面を含む、端末装置200を宛先とするログイン応答を作成する。処理部107は、作成したログイン応答を通信部102へ出力する。
 変換部101は、通信部102が受信した文書管理要求を取得する。変換部101は、取得した文書管理要求に含まれるユーザーIDと、文書のスキャンデータとを取得する。変換部101は、取得した文書のスキャンデータを、文書のテキストデータに変換する。例えば、変換部101は、光学文字認識(OCR: Optical character recognition)技術を使用して、文書のスキャンデータに含まれる活字、手書き文字の画像を文字コードの列に変換する。
The storage unit 109 is implemented by a HDD (Hard Disk Drive), flash memory, RAM (Random Access Memory), ROM (Read Only Memory), or the like. The storage unit 109 stores one or more user IDs and passwords for each of the one or more user IDs in association with each other.
The processing unit 107 acquires the user ID and password included in the login request received by the communication unit 102 . The processing unit 107 determines that the login is successful when the acquired combination of the user ID and password is registered in advance in the storage unit 109, and determines that the login is unsuccessful when the combination is not registered.
The creating unit 110 creates a login response addressed to the terminal device 200 including an initial screen when the processing unit 107 determines that the login is successful. The processing unit 107 outputs the created login response to the communication unit 102 .
The conversion unit 101 acquires the document management request received by the communication unit 102 . The conversion unit 101 acquires the user ID and the document scan data included in the acquired document management request. The conversion unit 101 converts the acquired scan data of the document into text data of the document. For example, the conversion unit 101 uses optical character recognition (OCR) technology to convert images of printed characters and handwritten characters included in document scan data into character code strings.
 第1受付部103-1は、通信部102が受信した文書管理要求を取得する。第1受付部103-1は、取得した文書管理要求に含まれるユーザーIDと文書関連事項と文書のスキャンデータとを受け付ける。第1受付部103-1は、取得した文書関連事項と文書のスキャンデータとを一意に特定する文書特定情報を作成する。第1受付部103-1は、取得した文書管理要求に含まれるユーザーIDに該当する文書のテキストデータを、変換部101から取得する。第1受付部103-1は、取得したユーザーIDと文書のテキストデータとを受け付ける。第1受付部103-1は、受け付けた文書のテキストデータに基づいて、文書に含まれるタイトルなどの所定の項目を一又は複数抽出する。 The first reception unit 103-1 acquires the document management request received by the communication unit 102. The first reception unit 103-1 receives the user ID, document-related items, and document scan data included in the acquired document management request. The first reception unit 103-1 creates document identification information that uniquely identifies the acquired document-related matters and the scan data of the document. The first reception unit 103-1 acquires from the conversion unit 101 the text data of the document corresponding to the user ID included in the acquired document management request. First accepting unit 103-1 accepts the acquired user ID and the text data of the document. The first receiving unit 103-1 extracts one or a plurality of predetermined items such as a title included in the document based on the text data of the received document.
 第1分割部104-1は、第1受付部103-1が受け付けたユーザーIDと文書のテキストデータとを取得する。第1分割部104-1は、取得した文書のテキストデータに基づいて、文書に含まれる文字列を分割することによって一又は複数の単語を取得する。例えば、第1分割部104-1は、文書のテキストデータを形態素解析することによって、文書に含まれる文字列を形態素に分割する。
 形態素解析とは、文法的な情報の注記の無い自然言語のテキストデータ(文)から、対象言語の文法や、辞書と呼ばれる単語の品詞等の情報にもとづき、形態素の列に分割し、それぞれの形態素の品詞等を判別することである。形態素とは、意味をもつ表現要素の最小単位である。形態素とは、ある言語においてそれ以上分解したら意味をなさなくなるところまで分割して抽出された、音素のまとまりの一つ一つを指す。
 第1分割部104-1は、取得した一又は複数の単語の各々に含まれる英数字の表示形式を半角に変換し、英字の表現形式を小文字に変換する。
First dividing unit 104-1 acquires the user ID and text data of the document accepted by first accepting unit 103-1. The first dividing unit 104-1 obtains one or more words by dividing the character string included in the document based on the text data of the obtained document. For example, the first dividing unit 104-1 divides a character string included in a document into morphemes by morphologically analyzing the text data of the document.
In morphological analysis, natural language text data (sentences) without grammatical information notes are divided into strings of morphemes based on information such as the grammar of the target language and the parts of speech of words called a dictionary. It is to discriminate the part of speech of the morpheme. A morpheme is the smallest unit of a meaningful expression element. A morpheme refers to each group of phonemes that are separated and extracted to the point where they become meaningless when further broken down in a certain language.
The first dividing unit 104-1 converts the display format of alphanumeric characters contained in each of the acquired one or more words into single-byte characters, and converts the representation format of alphabetic characters into lowercase letters.
 第1暗号化部105-1は、第1分割部104-1からユーザーIDと英数字の表示形式を半角に変換し、英字の表現形式を小文字に変換した一又は複数の単語とを取得する。第1暗号化部105-1は、第1受付部103-1から、取得したユーザーIDに該当する文書に含まれるタイトルなどの一又は複数の所定の項目を取得する。第1暗号化部105-1は、取得した英数字の表示形式を半角に変換し、英字の表現形式を小文字に変換した一又は複数の単語の各々と、一又は複数の所定の項目とを暗号化する。例えば、第1暗号化部105-1は、高度暗号化標準(Advanced Encryption Standard)アルゴリズムを使用して一又は複数の所定の項目と、英数字の表示形式を半角に変換し、英字の表現形式を小文字に変換した一又は複数の単語の各々の暗号化を実行する。
 登録部106は、第1暗号化部105-1からユーザーIDと一又は複数の所定の項目を暗号化した結果と、英数字の表示形式を半角に変換し、英字の表現形式を小文字に変換した一又は複数の単語の各々を暗号化した結果とを取得する。登録部106は、取得した一又は複数の所定の項目を暗号化した結果と、英数字の表示形式を半角に変換し、英字の表現形式を小文字に変換した一又は複数の単語の各々を暗号化した結果とを出現順に文書のインデックスに登録する。
First encryption unit 105-1 acquires from first division unit 104-1 the user ID and one or more words obtained by converting the display format of alphanumeric characters into half-size characters and converting the representation format of alphabetic characters into lowercase letters. . First encryption unit 105-1 acquires one or more predetermined items such as a title included in the document corresponding to the acquired user ID from first reception unit 103-1. The first encryption unit 105-1 converts the display format of the acquired alphanumeric characters to half-size characters, converts the expression format of the alphabetic characters to lower case letters, and converts each of one or more words to one or more predetermined items. Encrypt. For example, the first encryption unit 105-1 uses the Advanced Encryption Standard algorithm to convert one or more predetermined items and alphanumeric display format into half-width characters, are converted to lowercase and perform encryption of each of the one or more words.
The registration unit 106 converts the result of encrypting the user ID and one or more predetermined items from the first encryption unit 105-1, the display format of alphanumeric characters into single-byte characters, and the representation format of alphabetic characters into lowercase letters. and obtaining a result of encrypting each of the one or more words obtained. The registration unit 106 encrypts each of the results obtained by encrypting one or more predetermined items and one or more words obtained by converting the display format of alphanumeric characters into half-width characters and converting the representation format of alphabetic characters into lowercase letters. index the document in order of appearance.
 処理部107は、登録部106からユーザーIDと一又は複数の所定の項目を暗号化した結果と、英数字の表示形式を半角に変換し、英字の表現形式を小文字に変換した一又は複数の単語の各々を暗号化した結果とを登録した文書のインデックスを取得する。処理部107は、取得したユーザーIDに該当する文書特定情報と、文書のスキャンデータとを取得する。処理部107は、取得したユーザーIDと、文書特定情報と、一又は複数の所定の項目を暗号化した結果と、英数字の表示形式を半角に変換し、英字の表現形式を小文字に変換した一又は複数の単語の各々を暗号化した結果とが登録された文書のインデックスと文書のスキャンデータとを関連付けて記憶部109に記憶させる。
 作成部110は、処理部107がユーザーIDと、文書特定情報と、一又は複数の所定の項目を暗号化した結果と、英数字の表示形式を半角に変換し、英字の表現形式を小文字に変換した一又は複数の単語の各々を暗号化した結果とが登録された文書のインデックスと文書のスキャンデータとを関連付けて記憶部109に記憶させた後に、文書の管理を開始したことを示す情報を含む、端末装置200を宛先とする文書管理応答を作成する。作成部110は、作成した文書管理応答を通信部102へ出力する。
The processing unit 107 converts the user ID and one or more predetermined items from the registration unit 106 into encrypted data, converts the display format of alphanumeric characters to half size, and converts the representation format of alphabetic characters to lower case. Get the index of the document that registered the result of encrypting each of the words. The processing unit 107 acquires document identification information corresponding to the acquired user ID and scan data of the document. The processing unit 107 converts the obtained user ID, the document identification information, the result of encrypting one or a plurality of predetermined items, and the alphanumeric display format to half-size, and converts the alphabetic representation format to lowercase. The index of the document in which the result of encrypting each of one or more words and the scanned data of the document are associated with each other and stored in the storage unit 109 .
The creation unit 110 converts the user ID, the document identification information, the result of the encryption of one or more predetermined items by the processing unit 107, and the alphanumeric display format into single-byte characters, and converts the alphabetic representation format into lowercase letters. Information indicating that management of the document is started after the index of the document in which the result of encrypting each of the converted one or more words is registered and the scanned data of the document are associated with each other and stored in the storage unit 109. A document management response addressed to the terminal device 200 is created. Creation unit 110 outputs the created document management response to communication unit 102 .
 第2受付部103-2は、通信部102が受信した文書検索要求を取得する。第2受付部103-2は、取得した文書検索要求に含まれるユーザーIDと検索用文字列を特定する情報とを受け付ける。
 第2分割部104-2は、第2受付部103-2が受け付けたユーザーIDと検索用文字列のテキストデータとを取得する。第2分割部104-2は、取得した検索用文字列のテキストデータに基づいて、検索用文字列に含まれる文字列を分割することによって一又は複数の単語を取得する。例えば、第2分割部104-2は、検索用文字列のテキストデータを形態素解析することによって、検索用文字列に含まれる文字列を形態素に分割する。
 第2分割部104-2は、取得した一又は複数の単語の各々に含まれる英数字の表示形式を半角に変換し、英字の表現形式を小文字に変換する。
The second reception unit 103-2 acquires the document search request received by the communication unit 102. FIG. The second reception unit 103-2 receives information specifying the user ID and search character string included in the acquired document search request.
Second dividing unit 104-2 acquires the user ID and the text data of the search character string accepted by second accepting unit 103-2. The second dividing unit 104-2 obtains one or more words by dividing the character string included in the search character string based on the obtained text data of the search character string. For example, the second dividing unit 104-2 divides the character string included in the search character string into morphemes by morphologically analyzing the text data of the search character string.
The second dividing unit 104-2 converts the representation format of alphanumeric characters contained in each of the acquired one or more words into half-width characters, and converts the representation format of alphabetic characters into lowercase letters.
 第2暗号化部105-2は、第2分割部104-2からユーザーIDと検索用文字列に含まれ、英数字の表示形式を半角に変換し、英字の表現形式を小文字に変換した一又は複数の単語とを取得する。第2暗号化部105-2は、取得した検索用文字列に含まれ、英数字の表示形式を半角に変換し、英字の表現形式を小文字に変換した一又は複数の単語の各々を暗号化する。例えば、第2暗号化部105-2は、Advanced Encryption Standardアルゴリズムを使用して検索用文字列に含まれ、英数字の表示形式を半角に変換し、英字の表現形式を小文字に変換した一又は複数の単語の各々の暗号化を実行する。 Second encryption unit 105-2 converts the display format of alphanumeric characters from second division unit 104-2 to the user ID and search character string, converts the display format of alphanumeric characters to half size, and converts the representation format of alphabetic characters to lower case. Or get multiple words. The second encryption unit 105-2 encrypts each of one or a plurality of words included in the obtained search character string, converted from alphanumeric display format to half-size, and from alphabetic representation format to lowercase. do. For example, the second encryption unit 105-2 is included in the search character string using the Advanced Encryption Standard algorithm, converts the display format of alphanumeric characters to half size, and converts the representation format of alphabets to lower case. Perform encryption of each of the multiple words.
 検索部108は、第2暗号化部105-2からユーザーIDと一又は複数の単語の各々を暗号化した結果とを取得する。検索部108は、取得したユーザーIDに関連付けて記憶部109に記憶されている一又は複数の文書のインデックスのうち、取得した一又は複数の単語を暗号化した結果のいずれかを含む文書のインデックスを検索する。検索部108は、検索することによって一又は複数の単語を暗号化した結果のいずれかを含む文書のインデックスを一又は複数取得する。
 検索部108は、一又は複数の単語を暗号化した結果の一致率が高い順に、取得した一又は複数の文書のインデックスを並べ変える。検索部108は、並べ替えた一又は複数の文書のインデックスの各々に関連付けられている文書特定情報を取得する。
 作成部110は、検索部108が取得した一又は複数の文書特定情報を含む、端末装置200を宛先とする文書検索応答を作成する。文書管理サーバ100は、作成した文書検索応答を通信部102へ出力する。
Search unit 108 acquires the user ID and the result of encrypting each of the one or more words from second encryption unit 105-2. The search unit 108 retrieves an index of a document that includes one or more of the results of encrypting one or more words obtained from among one or more document indexes stored in the storage unit 109 in association with the obtained user ID. Search for The search unit 108 obtains one or more indexes of documents including any of the results of encrypting one or more words by searching.
The search unit 108 rearranges the obtained indexes of the one or more documents in descending order of the matching rate of the encrypted one or more words. The search unit 108 acquires document identification information associated with each of the rearranged indexes of one or more documents.
The creation unit 110 creates a document search response addressed to the terminal device 200 and including one or more pieces of document identification information acquired by the search unit 108 . The document management server 100 outputs the created document search response to the communication unit 102 .
 処理部107は、通信部102が受信した文書要求を取得する。処理部107は、取得した文書要求に含まれる文書特定情報を取得する。処理部107は、取得した文書特定情報に関連付けて記憶されている文書のスキャンデータと文書のテキストデータとを記憶部109から取得する。処理部107は、取得した文書のテキストデータに基づいて、文書に含まれる文字列を分割することによって一又は複数の単語を取得する。処理部107は、取得した一又は複数の単語の各々の書式、品詞の並びなどの所定の規則に基づいて、文書関連事項を抽出する。
 作成部110は、処理部107が取得した文書のスキャンデータと文書のテキストデータと、抽出した文書関連事項とを取得する。作成部110は、取得した文書のスキャンデータと文書のテキストデータと、抽出した文書関連事項とを含む端末装置200を宛先とする文書応答を作成する。文書管理サーバ100は、作成した文書応答を、通信部102へ出力する。
The processing unit 107 acquires the document request received by the communication unit 102 . The processing unit 107 acquires document specifying information included in the acquired document request. The processing unit 107 acquires from the storage unit 109 the scan data of the document and the text data of the document stored in association with the acquired document specifying information. The processing unit 107 obtains one or more words by dividing the character string included in the document based on the text data of the obtained document. The processing unit 107 extracts document-related matters based on predetermined rules such as the format of each of the acquired one or more words and the arrangement of parts of speech.
The creation unit 110 acquires the document scan data and the document text data acquired by the processing unit 107, and the extracted document-related items. The preparation unit 110 prepares a document response destined for the terminal device 200 including the acquired scan data of the document, the text data of the document, and the extracted document-related items. The document management server 100 outputs the created document response to the communication unit 102 .
 変換部101、第1受付部103-1、第2受付部103-2、第1分割部104-1、第2分割部104-2、第1暗号化部105-1、第2暗号化部105-2、登録部106、処理部107、検索部108、及び作成部110は、例えば、CPU(Central Processing Unit)などのハードウェアプロセッサが記憶部109に格納されたコンピュータプログラム(ソフトウェア)を実行することにより実現される。
 また、これらの機能部のうち一部または全部は、LSI(Large Scale Integration)やASIC(Application Specific Integrated Circuit)、FPGA(Field-Programmable Gate Array)、GPU(Graphics Processing Unit)などのハードウェア(回路部;circuitryを含む)によって実現されてもよいし、ソフトウェアとハードウェアとの協働によって実現されてもよい。コンピュータプログラムは、予めHDD(Hard Disk Drive)やフラッシュメモリなどの記憶装置に格納されていてもよいし、DVDやCD-ROMなどの着脱可能な記憶媒体に格納されており、記憶媒体がドライブ装置に装着されることでインストールされてもよい。
Conversion unit 101, first reception unit 103-1, second reception unit 103-2, first division unit 104-1, second division unit 104-2, first encryption unit 105-1, second encryption unit 105-2, the registration unit 106, the processing unit 107, the search unit 108, and the creation unit 110 execute a computer program (software) stored in the storage unit 109 by a hardware processor such as a CPU (Central Processing Unit). It is realized by
Some or all of these functional units are hardware (circuits) such as LSI (Large Scale Integration), ASIC (Application Specific Integrated Circuit), FPGA (Field-Programmable Gate Array), GPU (Graphics Processing Unit) (including circuitry), or by cooperation between software and hardware. The computer program may be stored in advance in a storage device such as a HDD (Hard Disk Drive) or flash memory, or may be stored in a removable storage medium such as a DVD or CD-ROM, and the storage medium may be a drive device. may be installed by being attached to the
 (端末装置200)
 端末装置200は、スマートフォン、携帯端末、又はパーソナルコンピュータ、タブレット端末装置、腕時計型端末装置、あるいはその他の情報処理機器として実現される。
 端末装置200は、例えば、表示部201と、通信部202と、撮像部203と、作成部206と、操作検出部207と、表示処理部208と、記憶部209とを備える。
 表示部201は、表示処理部208が出力した画面を表示する。具体的には、表示部201は、表示処理部208が出力する初期画面、文書管理依頼画面、文書の管理を開始したことを示す情報、一又は複数の文書特定情報、文書のスキャンデータなどを表示する。
 撮像部203は、文書をスキャンし、文書をデジタル静止画像情報化することによって、文書のスキャンデータを作成する。
(Terminal device 200)
The terminal device 200 is realized as a smart phone, a mobile terminal, a personal computer, a tablet terminal device, a wristwatch type terminal device, or other information processing equipment.
The terminal device 200 includes, for example, a display unit 201, a communication unit 202, an imaging unit 203, a creation unit 206, an operation detection unit 207, a display processing unit 208, and a storage unit 209.
The display unit 201 displays the screen output by the display processing unit 208 . Specifically, the display unit 201 displays an initial screen output by the display processing unit 208, a document management request screen, information indicating the start of document management, one or more pieces of document identification information, document scan data, and the like. indicate.
The imaging unit 203 scans a document and converts the document into digital still image information to create scan data of the document.
 通信部202は、通信モジュールによって実現される。通信部202は、ネットワークNWを経由して、文書管理サーバ100などの他の装置と通信を行う。通信部202は、例えば、無線通信方式で通信してもよい。また、通信部202は、例えば有線LANなどの通信方式で通信してもよい。具体的には、通信部202は、作成部206が出力した文書管理要求と、文書検索要求と、文書要求とを文書管理サーバ100へ送信する。通信部202は、文書管理サーバ100が送信した文書管理応答と、文書検索応答と、文書応答とを受信する。 The communication unit 202 is implemented by a communication module. The communication unit 202 communicates with other devices such as the document management server 100 via the network NW. The communication unit 202 may communicate using a wireless communication method, for example. Also, the communication unit 202 may communicate using a communication method such as a wired LAN. Specifically, the communication unit 202 transmits the document management request, the document search request, and the document request output by the creation unit 206 to the document management server 100 . The communication unit 202 receives the document management response, the document search response, and the document response transmitted by the document management server 100 .
 操作検出部207は、例えば、タッチパネルなどによって構成され、表示部201に表示される画面に対するタッチ操作を検出し、タッチ操作の検出結果を、作成部206へ出力する。タッチ操作には、ユーザーIDとパスワードとを入力する操作、文書関連事項を入力する操作、文書管理依頼ボタンを押す操作、文書検索依頼ボタンを押す操作、検索用文字列を入力する操作、OKボタンを押す操作、閲覧する文書特定情報に該当する部分を押す操作などが含まれる。
 記憶部209は、例えば、RAM(Random Access Memory)、ROM(Read Only Memory)、フラッシュメモリ、またはこれらのうち複数が組み合わされたハイブリッド型記憶装置などにより実現される。
The operation detection unit 207 is configured by, for example, a touch panel or the like, detects a touch operation on the screen displayed on the display unit 201 , and outputs the detection result of the touch operation to the creation unit 206 . Touch operations include an operation of inputting a user ID and password, an operation of inputting items related to documents, an operation of pressing a document management request button, an operation of pressing a document search request button, an operation of inputting a search character string, and an OK button. This includes an operation of pressing , an operation of pressing a portion corresponding to the document specifying information to be browsed, and the like.
The storage unit 209 is implemented by, for example, a RAM (Random Access Memory), a ROM (Read Only Memory), a flash memory, or a hybrid storage device in which a plurality of these are combined.
 作成部206は、文書管理システム1にログインするためのログイン要求を作成する。具体的には、作成部206は、操作検出部207からユーザーIDとパスワードとを入力する操作の検出結果を取得した場合に、ユーザーIDとパスワードとを含む、文書管理サーバ100を宛先とするログイン要求を作成する。作成部206は、作成したログイン要求を通信部202に出力する。
 表示処理部208は、端末装置200が文書管理システム1にログインした場合に表示部201に初期画面を表示させる。初期画面には、文書の管理を依頼する場合に押される文書管理依頼ボタンと、文書の検索を依頼する場合に押される文書検索依頼ボタンとが含まれる。
A creation unit 206 creates a login request for logging into the document management system 1 . Specifically, when the creation unit 206 acquires a detection result of an operation of inputting a user ID and a password from the operation detection unit 207, the creation unit 206 generates a log-in information including the user ID and the password whose destination is the document management server 100. create a request; Creation unit 206 outputs the created login request to communication unit 202 .
The display processing unit 208 causes the display unit 201 to display an initial screen when the terminal device 200 logs in to the document management system 1 . The initial screen includes a document management request button that is pressed when requesting document management, and a document search request button that is pressed when requesting document retrieval.
 作成部206は、文書管理システム1に文書の管理を要求するための文書管理要求を作成する。具体的には、作成部206は、操作検出部207から文書管理依頼ボタンを押す操作の検出結果が入力され、文書関連事項を入力する操作の検出結果が入力される。作成部206は、ユーザーUによって文書をスキャンする操作が行われた場合に撮像部203から文書のスキャンデータを取得する。作成部206は、ユーザーIDと文書関連事項と文書のスキャンデータとを含む、文書管理サーバ100を宛先とする、文書管理要求を作成する。作成部206は、作成した文書管理要求を通信部202に出力する。 The creating unit 206 creates a document management request for requesting the document management system 1 to manage documents. Specifically, the creation unit 206 receives the detection result of the operation of pressing the document management request button from the operation detection unit 207 and the detection result of the operation of inputting the document-related items. The creation unit 206 acquires document scan data from the imaging unit 203 when the user U performs an operation to scan the document. The creation unit 206 creates a document management request addressed to the document management server 100, which includes a user ID, document-related items, and document scan data. The creation unit 206 outputs the created document management request to the communication unit 202 .
 表示処理部208は、通信部202が受信した文書管理応答に含まれる文書の管理を開始したことを示す情報を取得する。表示処理部208は、取得した文書の管理を開始したことを示す情報を表示部201に表示させる。
 作成部206は、文書管理システム1に文書の検索を要求するための文書検索要求を作成する。具体的には、作成部206は、操作検出部207から文書検索依頼ボタンを押す操作の検出結果が入力され、検索用文字列を入力する操作の検出結果が入力され、OKボタンを押す操作の検出結果が入力された場合に、ユーザーIDと検索用文字列を特定する情報とを含む、文書管理サーバ100を宛先とする、文書検索要求を作成する。作成部206は、作成した文書検索要求を通信部202に出力する。
The display processing unit 208 acquires information indicating that management of the document contained in the document management response received by the communication unit 202 has started. The display processing unit 208 causes the display unit 201 to display information indicating that management of the acquired document has started.
The creation unit 206 creates a document search request for requesting the document management system 1 to search for documents. Specifically, the creation unit 206 receives the detection result of the operation of pressing the document search request button from the operation detection unit 207, receives the detection result of the operation of inputting a search character string, and receives the detection result of the operation of pressing the OK button. When the detection result is input, a document search request addressed to the document management server 100 and including the user ID and information specifying the search character string is created. The creation unit 206 outputs the created document search request to the communication unit 202 .
 表示処理部208は、通信部202が受信した文書検索応答に含まれる一又は複数の文書特定情報を取得する。表示処理部208は、表示部201に、取得した一又は複数の文書特定情報を表示させる。
 作成部206は、文書管理システム1に文書を要求するための文書要求を作成する。具体的には、作成部206は、操作検出部207から閲覧する文書特定情報に該当する部分を押す操作の検出結果が入力された場合に、ユーザーUによって押された部分に該当する文書特定情報を含む、文書管理サーバ100を宛先とする文書要求を作成する。作成部206は、作成した文書要求を通信部202に出力する。
The display processing unit 208 acquires one or more pieces of document identification information included in the document search response received by the communication unit 202 . The display processing unit 208 causes the display unit 201 to display one or more pieces of acquired document specifying information.
A creation unit 206 creates a document request for requesting a document from the document management system 1 . Specifically, when the operation detecting unit 207 inputs a detection result of an operation of pressing a portion corresponding to the document specifying information to be browsed, the creation unit 206 detects the document specifying information corresponding to the portion pressed by the user U. A document request addressed to the document management server 100 is created. The creation unit 206 outputs the created document request to the communication unit 202 .
 表示処理部208は、通信部202が受信した文書応答に含まれる文書のスキャンデータと文書のテキストデータと文書関連事項とを取得する。表示処理部208は、表示部201に、取得した文書のスキャンデータと文書のテキストデータと文書関連事項とのうち選択されたものを表示させる。 The display processing unit 208 acquires the document scan data, the document text data, and the document-related items included in the document response received by the communication unit 202 . The display processing unit 208 causes the display unit 201 to display the selected one of the acquired scan data of the document, text data of the document, and document-related items.
 作成部206と、操作検出部207と、表示処理部208との全部または一部は、例えば、CPUなどのプロセッサが記憶部209に格納されたプログラムを実行することにより実現される機能部(以下、ソフトウェア機能部と称する)である。なお、これらの機能部のうち全部または一部は、LSI、ASIC、またはFPGAなどのハードウェアにより実現されてもよく、ソフトウェア機能部とハードウェアとの組み合わせによって実現されてもよい。コンピュータプログラムは、予めHDDやフラッシュメモリなどの記憶装置に格納されていてもよいし、DVDやCD-ROMなどの着脱可能な記憶媒体に格納されており、記憶媒体がドライブ装置に装着されることでインストールされてもよい。 All or part of the creation unit 206, the operation detection unit 207, and the display processing unit 208 are functional units realized by a processor such as a CPU executing a program stored in the storage unit 209 (hereinafter , software functions). All or part of these functional units may be implemented by hardware such as LSI, ASIC, or FPGA, or may be implemented by a combination of software functional units and hardware. The computer program may be stored in advance in a storage device such as an HDD or flash memory, or may be stored in a removable storage medium such as a DVD or CD-ROM, and the storage medium may be loaded into a drive device. may be installed with
 (文書管理システム1の動作)
 図3は、本実施形態に係る文書管理システムの動作の例1を示す図である。図3を参照して、文書を文書管理サーバ100に登録する処理について説明する。
 (ステップS1-1)
 端末装置200と文書管理サーバ100との間で、端末装置200が文書管理システム1にログインする処理が行われる。ここでは、端末装置200が文書管理システム1にログインする処理が成功した場合について説明を続ける。端末装置200が文書管理システム1にログインする処理が失敗した場合には所定のエラー処理が行われる。
(Operation of document management system 1)
FIG. 3 is a diagram showing Example 1 of the operation of the document management system according to this embodiment. Processing for registering a document in the document management server 100 will be described with reference to FIG.
(Step S1-1)
Between the terminal device 200 and the document management server 100, processing is performed for the terminal device 200 to log in to the document management system 1. FIG. Here, the explanation will be continued for the case where the terminal device 200 has successfully logged into the document management system 1 . When the terminal device 200 fails to log in to the document management system 1, predetermined error processing is performed.
 (ステップS2-1)
 端末装置200において、表示処理部208は、端末装置200が文書管理システム1にログインした場合に表示部201に初期画面を表示させる。表示処理部208は、操作検出部207から文書管理依頼ボタンを押す操作の検出結果が入力された場合に、文書管理依頼画面を表示部201に表示させる。
 図4は、本実施形態に係る端末装置が表示する文書管理依頼画面の例1を示す図である。図4に示されるように、文書依頼画面には、文書のタイトルを入力する欄と契約種別を選択するプルダウンメニューと、西暦で入力するか和暦で入力するかを選択するためのラジオボタンと、締結日と発行日と満了日とを入力する欄とが含まれる。例えば、契約種別は、プルダウンメニューから選択できる。締結日と発行日と満了日とは、直接入力できるし、カレンダーから選択できる。
(Step S2-1)
In the terminal device 200 , the display processing section 208 causes the display section 201 to display an initial screen when the terminal device 200 logs in to the document management system 1 . The display processing unit 208 causes the display unit 201 to display the document management request screen when the detection result of the operation of pressing the document management request button is input from the operation detection unit 207 .
FIG. 4 is a diagram showing an example 1 of the document management request screen displayed by the terminal device according to the present embodiment. As shown in FIG. 4, the document request screen includes a field for entering the title of the document, a pull-down menu for selecting the type of contract, and radio buttons for selecting whether to enter in the Western calendar or the Japanese calendar. , and fields for entering the execution date, issue date, and expiration date. For example, the contract type can be selected from a pull-down menu. The conclusion date, issue date, and expiration date can be entered directly or selected from the calendar.
 図5は、本実施形態に係る端末装置が表示する文書管理依頼画面の例2を示す図である。図5は、図4に示した文書管理依頼画面を垂直方向(下方向)にスライドさせることで表示される文書管理依頼画面の続きを示す。図5に示されるように、文書依頼画面には、契約の相手方について、契約者(会社名、屋号、個人名)を入力または選択するプルダウンメニューと、担当者氏名を入力する欄とが含まれる。例えば、契約者は、直接入力できるし、プルダウンメニューから選択できる。
 図6は、本実施形態に係る端末装置が表示する文書管理依頼画面の例3を示す図である。図6は、図5に示した文書管理依頼画面をさらに垂直方向(下方向)にスライドさせることで表示される文書管理依頼画面の続きを示す。図6に示されるように、文書依頼画面には、最近の契約書と、秘密保持契約書との各々について、カメラで撮影するか、写真から選択するかを選択するボタンが含まれる。図3に戻り説明を続ける。
 作成部206は、操作検出部207から文書関連事項を入力する操作の検出結果を取得する。
FIG. 5 is a diagram showing an example 2 of the document management request screen displayed by the terminal device according to the present embodiment. FIG. 5 shows a continuation of the document management request screen displayed by sliding the document management request screen shown in FIG. 4 in the vertical direction (downward). As shown in FIG. 5, the document request screen includes a pull-down menu for entering or selecting the contractor (company name, trade name, individual name) and a field for entering the name of the person in charge. . For example, the contractor can enter directly or select from a pull-down menu.
FIG. 6 is a diagram showing an example 3 of the document management request screen displayed by the terminal device according to the present embodiment. FIG. 6 shows a continuation of the document management request screen displayed by further sliding the document management request screen shown in FIG. 5 in the vertical direction (downward). As shown in FIG. 6, the document request screen includes buttons for selecting whether to photograph the latest contract and non-disclosure agreement with a camera or to select from photographs. Returning to FIG. 3, the description is continued.
The creation unit 206 acquires the detection result of the operation of inputting the document-related item from the operation detection unit 207 .
 (ステップS3-1)
 端末装置200において、撮像部203は、文書をスキャンする。
 図7は、本実施形態に係る端末装置による文書のスキャン処理を説明するためのである。図7に示されるように、端末装置200の画面SCには、スキャンエリアSCAが表示されている。スキャンエリアSCAは、スキャンエリアSCAの四隅に丸印で表示されている左上コーナーCULと右上コーナーCURと左下コーナーCLLと右下コーナーCLRとの位置を移動させることで調整可能である。
 ユーザーUは、文書DOCがスキャンエリアSCAの内部に収まるように端末装置200と文書DOCとの距離を調節したり、文書DOCに対する角度を調節したりする。ユーザーUがスキャンボタンSCB押すことで、文書のスキャンデータが作成される。図3に戻り説明を続ける。
 作成部206は、操作検出部207から文書関連事項を入力する操作の検出結果が入力され、ユーザーUによって文書をスキャンする操作が行われた場合に撮像部203から文書のスキャンデータを取得する。
(Step S3-1)
In the terminal device 200, the imaging unit 203 scans a document.
FIG. 7 is for explaining document scanning processing by the terminal device according to the present embodiment. As shown in FIG. 7, the screen SC of the terminal device 200 displays the scan area SCA. The scan area SCA can be adjusted by moving the positions of the upper left corner CUL, the upper right corner CUR, the lower left corner CLL, and the lower right corner CLR indicated by circles at the four corners of the scan area SCA.
The user U adjusts the distance between the terminal device 200 and the document DOC and the angle with respect to the document DOC so that the document DOC is within the scan area SCA. Scan data of the document is created when the user U presses the scan button SCB. Returning to FIG. 3, the description continues.
The creation unit 206 receives the detection result of the operation of inputting document-related items from the operation detection unit 207, and acquires the document scan data from the imaging unit 203 when the user U performs an operation to scan the document.
 (ステップS4-1)
 端末装置200において、作成部206は、ユーザーIDと文書関連事項と文書のスキャンデータとを含む、文書管理サーバ100を宛先とする、文書管理要求を作成する。
 (ステップS5-1)
 端末装置200において、作成部206は、作成した文書管理要求を通信部102に出力する。通信部102は、作成部206が出力した文書管理要求を取得し、取得した文書管理要求を文書管理サーバ100へ送信する。
(Step S4-1)
In the terminal device 200, the creation unit 206 creates a document management request addressed to the document management server 100, including the user ID, document-related items, and document scan data.
(Step S5-1)
In terminal device 200 , creation unit 206 outputs the created document management request to communication unit 102 . The communication unit 102 acquires the document management request output by the creation unit 206 and transmits the acquired document management request to the document management server 100 .
 (ステップS6-1)
 文書管理サーバ100において、通信部102は、端末装置200が送信した文書管理要求を受信する。変換部101は、通信部102が受信した文書管理要求に含まれるユーザーIDと、文書のスキャンデータとを取得する。変換部101は、取得した文書のスキャンデータを、文書のテキストデータに変換する。
 (ステップS7-1)
 文書管理サーバ100において、第1受付部103-1は、通信部102が受信した文書管理要求を取得する。第1受付部103-1は、取得した文書管理要求に含まれるユーザーIDと文書関連事項と文書のスキャンデータと文書のテキストデータとを受け付ける。第1受付部103-1は、取得した文書関連事項と文書のスキャンデータとを一意に特定する文書特定情報を作成する。第1受付部103-1は、取得した文書管理要求に含まれるユーザーIDに該当する文書のテキストデータを、変換部101から取得する。第1受付部103-1は、取得したユーザーIDと文書のテキストデータとを受け付ける。第1受付部103-1は、受け付けた文書のテキストデータに基づいて、文書に含まれるタイトルなどの所定の項目を一又は複数抽出する。
(Step S6-1)
In the document management server 100 , the communication section 102 receives the document management request transmitted by the terminal device 200 . The conversion unit 101 acquires the user ID and the document scan data included in the document management request received by the communication unit 102 . The conversion unit 101 converts the acquired scan data of the document into text data of the document.
(Step S7-1)
In the document management server 100, the first reception section 103-1 acquires the document management request received by the communication section . The first reception unit 103-1 receives the user ID, the document-related items, the scan data of the document, and the text data of the document included in the acquired document management request. The first reception unit 103-1 creates document identification information that uniquely identifies the acquired document-related matters and the scan data of the document. The first reception unit 103-1 acquires from the conversion unit 101 the text data of the document corresponding to the user ID included in the acquired document management request. First accepting unit 103-1 accepts the acquired user ID and the text data of the document. The first receiving unit 103-1 extracts one or a plurality of predetermined items such as a title included in the document based on the text data of the received document.
 (ステップS8-1)
 文書管理サーバ100において、第1分割部104-1は、第1受付部103-1が受け付けたユーザーIDと文書のテキストデータとを取得する。第1分割部104-1は、取得した文書のテキストデータに基づいて、文書に含まれる文字列を分割することによって一又は複数の単語を取得する。
 (ステップS9-1)
 文書管理サーバ100において、第1分割部104-1は、取得した一又は複数の単語の各々に含まれる英数字の表示形式を半角に変換し、英字の表現形式を小文字に変換することによって、形式を揃える処理を行う。
(Step S8-1)
In the document management server 100, the first dividing unit 104-1 acquires the user ID and text data of the document accepted by the first accepting unit 103-1. The first dividing unit 104-1 obtains one or more words by dividing the character string included in the document based on the text data of the obtained document.
(Step S9-1)
In the document management server 100, the first division unit 104-1 converts the display format of the alphanumeric characters contained in each of the acquired one or more words into half-size characters, and converts the representation format of the alphabetic characters into lowercase letters. Process to align the format.
 (ステップS10-1)
 文書管理サーバ100において、第1暗号化部105-1は、第1分割部104-1からユーザーIDと英数字の表示形式を半角に変換し、英字の表現形式を小文字に変換した一又は複数の単語とを取得する。第1暗号化部105-1は、第1受付部103-1から取得したユーザーIDに該当する文書に含まれるタイトルなどの一又は複数の所定の項目を取得する。第1暗号化部105-1は、取得した英数字の表示形式を半角に変換し、英字の表現形式を小文字に変換した一又は複数の単語の各々と、一又は複数の所定の項目とを暗号化する。
 (ステップS11-1)
 文書管理サーバ100において、登録部106は、第1暗号化部105-1からユーザーIDと、一又は複数の所定の項目を暗号化した結果と、英数字の表示形式を半角に変換し、英字の表現形式を小文字に変換した一又は複数の単語の各々を暗号化した結果とを取得する。登録部106は、取得した一又は複数の所定の項目を暗号化した結果と、英数字の表示形式を半角に変換し、英字の表現形式を小文字に変換した一又は複数の単語の各々を暗号化した結果とを出現順に文書のインデックスに登録する。
 処理部107は、登録部106からユーザーIDと、一又は複数の所定の項目を暗号化した結果と、英数字の表示形式を半角に変換し、英字の表現形式を小文字に変換した一又は複数の単語の各々を暗号化した結果とを登録した文書のインデックスを取得する。処理部107は、取得したユーザーIDに該当する文書特定情報と、文書のスキャンデータとを取得する。処理部107は、取得したユーザーIDと、文書特定情報と、一又は複数の所定の項目を暗号化した結果と、英数字の表示形式を半角に変換し、英字の表現形式を小文字に変換した一又は複数の単語の各々を暗号化した結果とが登録された文書のインデックスと文書のスキャンデータとを関連付けて記憶部109に記憶させる。
(Step S10-1)
In the document management server 100, the first encryption unit 105-1 converts the display format of the user ID and alphanumeric characters from the first division unit 104-1 to half-width characters, and converts the representation format of alphabetic characters to lower case. word and get. First encryption unit 105-1 acquires one or more predetermined items such as a title included in a document corresponding to the user ID acquired from first reception unit 103-1. The first encryption unit 105-1 converts the display format of the acquired alphanumeric characters to half-size characters, converts the expression format of the alphabetic characters to lower case letters, and converts each of one or more words to one or more predetermined items. Encrypt.
(Step S11-1)
In the document management server 100, the registration unit 106 converts the user ID from the first encryption unit 105-1, the result of encrypting one or more predetermined items, and the alphanumeric display format into single-byte characters, and the result of encrypting each of the one or more words obtained by converting the expression form of to lower case. The registration unit 106 encrypts each of the results obtained by encrypting one or more predetermined items and one or more words obtained by converting the display format of alphanumeric characters into half-width characters and converting the representation format of alphabetic characters into lowercase letters. index the document in order of appearance.
The processing unit 107 receives the user ID from the registration unit 106, the result of encrypting one or more predetermined items, and one or more items obtained by converting the display format of alphanumeric characters into single-byte characters and converting the representation format of alphabetic characters into lowercase characters. Get the index of the document that registered the results of encrypting each of the words in . The processing unit 107 acquires document identification information corresponding to the acquired user ID and scan data of the document. The processing unit 107 converts the obtained user ID, the document identification information, the result of encrypting one or a plurality of predetermined items, and the alphanumeric display format to half-size, and converts the alphabetic representation format to lowercase. The index of the document in which the result of encrypting each of one or more words and the scanned data of the document are associated with each other and stored in the storage unit 109 .
 (ステップS12-1)
 文書管理サーバ100において、作成部110は、文書の管理を開始したことを示す情報を含む、端末装置200を宛先とする文書管理応答を作成する。
 (ステップS13-1)
 文書管理サーバ100において、作成部110は、作成した文書管理応答を通信部102へ出力する。通信部102は、作成部110が出力した文書管理応答を取得し、取得した文書管理応答を端末装置200へ送信する。
(Step S12-1)
In the document management server 100, the creation unit 110 creates a document management response addressed to the terminal device 200, including information indicating that document management has started.
(Step S13-1)
In the document management server 100 , the creation unit 110 outputs the created document management response to the communication unit 102 . The communication unit 102 acquires the document management response output by the creation unit 110 and transmits the acquired document management response to the terminal device 200 .
 (ステップS14-1)
 端末装置200において、通信部202は、文書管理サーバ100が送信した文書管理応答を受信する。表示処理部208は、通信部202が受信した文書管理応答に含まれる文書の管理を開始したことを示す情報を取得する。表示処理部208は、取得した文書の管理を開始したことを示す情報を表示部201に表示させる。
 図8は、本実施形態に係る端末装置に表示される情報の例1を示す図である。図8に示すように、端末装置200には、文書の管理を開始したことを示す情報の一例として「契約書のアップロードとテキストの解析に成功しました」と表示され、「契約内容を確認」と表示されたボタンが表示される。ここで、「契約内容を確認」と表示されたボタンが押された場合には、図4から図6を参照して説明した契約内容が表示される。
 図9は、本実施形態に係る端末装置に表示される情報の例2を示す図である。図9に示すように、端末装置200には、文書の管理を開始したことを示す情報の一例として「暗号化済でクラウドに保管されました」と表示される。
(Step S14-1)
In the terminal device 200 , the communication section 202 receives the document management response transmitted by the document management server 100 . The display processing unit 208 acquires information indicating that management of the document contained in the document management response received by the communication unit 202 has started. The display processing unit 208 causes the display unit 201 to display information indicating that management of the acquired document has started.
FIG. 8 is a diagram showing example 1 of information displayed on the terminal device according to the present embodiment. As shown in FIG. 8, the terminal device 200 displays, as an example of information indicating that document management has started, that the contract has been successfully uploaded and the text has been analyzed, and that the content of the contract has been confirmed. A button labeled . Here, when the button labeled "confirm contract details" is pressed, the contract details described with reference to FIGS. 4 to 6 are displayed.
FIG. 9 is a diagram showing Example 2 of information displayed on the terminal device according to the present embodiment. As shown in FIG. 9, the terminal device 200 displays "encrypted and stored in the cloud" as an example of information indicating that document management has started.
 図10は、本実施形態に係る文書管理システムの動作の例2を示す図である。図10を参照して、文書管理サーバ100に保管された文書から任意の文書を参照する場合の動作について説明する。
 (ステップS1-2)
 端末装置200と文書管理サーバ100との間で、端末装置200が文書管理システム1にログインする処理が行われる。ここでは、端末装置200が文書管理システム1にログインする処理が成功した場合について説明を続ける。端末装置200が文書管理システム1にログインする処理が失敗した場合には所定のエラー処理が行われる。
FIG. 10 is a diagram showing Example 2 of the operation of the document management system according to this embodiment. Referring to FIG. 10, the operation of referring to an arbitrary document from the documents stored in document management server 100 will be described.
(Step S1-2)
Between the terminal device 200 and the document management server 100, processing is performed for the terminal device 200 to log in to the document management system 1. FIG. Here, the explanation will be continued for the case where the terminal device 200 has successfully logged into the document management system 1 . When the terminal device 200 fails to log in to the document management system 1, predetermined error processing is performed.
 (ステップS2-2)
 端末装置200において、表示処理部208は、端末装置200が文書管理システム1にログインした場合に表示部201に初期画面を表示させる。表示処理部208は、操作検出部207から文書検索依頼ボタンを押す操作の検出結果が入力された場合に、文書検索依頼画面を表示部201に表示させる。
 図11は、本実施形態に係る端末装置に表示される情報の例1を示す図である。図11は、文書検索依頼画面の一例を示す。図11に示すように文書検索依頼画面には、検索用文字列を入力する欄が含まれる。具体的には、文書検索依頼画面は、検索用文字列を含む文書を検索するときにその検索用文字列を入力する欄を含む。文書検索依頼画面は、検索用文字列を含まない文書を検索するときにその検索用文字列を入力する欄を含む。図11には、文書検索依頼画面の一例として、検索用文字列を含む文書を検索するときにその検索用文字列を入力する欄と、検索用文字列を含まない文書を検索するときにその検索用文字列を入力する欄とが含まれる場合について示す。図10に戻り、説明を続ける。
(Step S2-2)
In the terminal device 200 , the display processing section 208 causes the display section 201 to display an initial screen when the terminal device 200 logs in to the document management system 1 . The display processing unit 208 causes the display unit 201 to display a document search request screen when the detection result of the operation of pressing the document search request button is input from the operation detection unit 207 .
FIG. 11 is a diagram showing Example 1 of information displayed on the terminal device according to the present embodiment. FIG. 11 shows an example of a document search request screen. As shown in FIG. 11, the document search request screen includes a field for entering a search character string. Specifically, the document search request screen includes a field for entering a search character string when searching for documents containing the search character string. The document search request screen includes a field for entering a search character string when searching for documents that do not contain the search character string. FIG. 11 shows, as an example of the document search request screen, fields for entering search character strings when searching for documents containing search character strings, and fields for entering search character strings when searching for documents that do not contain search character strings. A case where a field for entering a character string for search is included is shown. Returning to FIG. 10, the description is continued.
 作成部206は、操作検出部207から文書検索依頼ボタンを押す操作の検出結果が入力され、検索用文字列を入力する操作の検出結果が入力され、OKボタンを押す操作の検出結果が入力された場合に、ユーザーIDと検索用文字列を特定する情報とを含む、文書管理サーバ100を宛先とする、文書検索要求を作成する。以下、一例として、検索用文字列を含む文書を検索するときにその検索用文字列を入力する欄に検索用文字列が入力された場合について説明を続ける。 The creation unit 206 receives from the operation detection unit 207 the detection result of the operation of pressing the document search request button, the detection result of the operation of entering a search character string, and the detection result of the operation of pressing the OK button. In this case, a document search request addressed to the document management server 100 is created, which includes the user ID and information specifying the search character string. As an example, the case where a search character string is entered in a field for entering a search character string when searching for a document containing the search character string will be described below.
 (ステップS3-2)
 端末装置200において、作成部206は、作成した文書検索要求を通信部102に出力する。通信部102は、作成部206が作成した文書検索要求を取得し、取得した文書検索要求を文書管理サーバ100へ送信する。
 (ステップS4-2)
 文書管理サーバ100において、通信部102は、端末装置200が送信した文書検索要求を受信する。第2受付部103-2は、通信部102が受信した文書検索要求を取得する。第2受付部103-2は、取得した文書検索要求に含まれるユーザーIDと検索用文字列を特定する情報とを受け付ける。
 第2分割部104-2は、第2受付部103-2が受け付けたユーザーIDと検索用文字列のテキストデータとを取得する。第2分割部104-2は、取得した検索用文字列のテキストデータに基づいて、検索用文字列に含まれる文字列を分割することによって一又は複数の単語を取得する。
(Step S3-2)
In terminal device 200 , creation unit 206 outputs the created document search request to communication unit 102 . The communication unit 102 acquires the document search request created by the creation unit 206 and transmits the acquired document search request to the document management server 100 .
(Step S4-2)
In the document management server 100 , the communication section 102 receives the document search request transmitted by the terminal device 200 . The second reception unit 103-2 acquires the document search request received by the communication unit 102. FIG. The second reception unit 103-2 receives information specifying the user ID and search character string included in the acquired document search request.
Second dividing unit 104-2 acquires the user ID and the text data of the search character string accepted by second accepting unit 103-2. The second dividing unit 104-2 obtains one or more words by dividing the character string included in the search character string based on the obtained text data of the search character string.
 (ステップS5-2)
 文書管理サーバ100において、第2分割部104-2は、取得した一又は複数の単語の各々に含まれる英数字の表示形式を半角に変換し、英字の表現形式を小文字に変換する。
 (ステップS6-2)
 文書管理サーバ100において、第2暗号化部105-2は、第2分割部104-2からユーザーIDと検索用文字列に含まれ、英数字の表示形式を半角に変換し、英字の表現形式を小文字に変換した一又は複数の単語とを取得する。第2暗号化部105-2は、取得した検索用文字列に含まれ、英数字の表示形式を半角に変換し、英字の表現形式を小文字に変換した一又は複数の単語の各々を暗号化する。
 (ステップS7-2)
 文書管理サーバ100において、検索部108は、第2暗号化部105-2からユーザーIDと一又は複数の単語の各々を暗号化した結果とを取得する。検索部108は、取得したユーザーIDに関連付けて記憶部109に記憶されている一又は複数の文書のインデックスのうち、取得した一又は複数の単語を暗号化した結果のいずれかを含む文書のインデックスを検索する。検索部108は、検索することによって一又は複数の単語を暗号化した結果のいずれかを含む文書のインデックスを一又は複数取得する。検索部108は、一又は複数の単語を暗号化した結果の一致率が高い順に、取得した一又は複数の文書のインデックスを並べ変える。検索部108は、並べ替えた一又は複数の文書のインデックスの各々に関連付けられている文書特定情報を取得する。
(Step S5-2)
In the document management server 100, the second dividing unit 104-2 converts the display format of alphanumeric characters contained in each of the acquired one or more words into half-width characters, and converts the representation format of alphabetic characters into lowercase letters.
(Step S6-2)
In the document management server 100, the second encryption unit 105-2 converts the display format of the alphanumeric characters included in the user ID and search character string from the second dividing unit 104-2 into half-width characters, Gets one or more words converted to lowercase. The second encryption unit 105-2 encrypts each of one or a plurality of words included in the obtained search character string, converted from alphanumeric display format to half-size, and from alphabetic representation format to lowercase. do.
(Step S7-2)
In the document management server 100, the search unit 108 acquires the user ID and the result of encrypting each of one or more words from the second encryption unit 105-2. The search unit 108 retrieves an index of a document that includes one or more of the results of encrypting one or more words obtained from among one or more document indexes stored in the storage unit 109 in association with the obtained user ID. Search for The search unit 108 obtains one or more indexes of documents including any of the results of encrypting one or more words by searching. The search unit 108 rearranges the obtained indexes of the one or more documents in descending order of the matching rate of the encrypted one or more words. The search unit 108 acquires document identification information associated with each of the rearranged indexes of one or more documents.
 (ステップS8-2)
 文書管理サーバ100において、作成部110は、検索部108が取得した一又は複数の文書特定情報を含む、端末装置200を宛先とする文書検索応答を作成する。文書管理サーバ100は、作成した文書検索応答を通信部102へ出力する。
 (ステップS9-2)
 文書管理サーバ100において、通信部102は、作成部110が出力した文書検索応答を取得し、取得した文書検索応答を、端末装置200へ送信する。
 (ステップS10-2)
 端末装置200において、通信部202は、文書管理サーバ100が送信した文書検索応答を受信する。表示処理部208は、通信部202が受信した文書検索応答に含まれる一又は複数の文書特定情報を取得する。表示処理部208は、表示部201に、取得した一又は複数の文書特定情報を表示させる。
(Step S8-2)
In the document management server 100 , the creating unit 110 creates a document search response addressed to the terminal device 200 , which includes one or more pieces of document specifying information acquired by the searching unit 108 . The document management server 100 outputs the created document search response to the communication unit 102 .
(Step S9-2)
In the document management server 100 , the communication unit 102 acquires the document search response output by the creation unit 110 and transmits the acquired document search response to the terminal device 200 .
(Step S10-2)
In the terminal device 200 , the communication section 202 receives the document search response transmitted by the document management server 100 . The display processing unit 208 acquires one or more pieces of document identification information included in the document search response received by the communication unit 202 . The display processing unit 208 causes the display unit 201 to display one or more pieces of acquired document specifying information.
 (ステップS11-2)
 端末装置200において、作成部206は、操作検出部207から閲覧する文書特定情報に該当する部分を押す操作の検出結果が入力された場合に、ユーザーUによって押された文書特定情報を含む、文書管理サーバ100を宛先とする文書要求を作成する。
 (ステップS12-2)
 端末装置200において、作成部206は、作成した文書要求を通信部102に出力する。通信部102は、作成部206が出力した文書要求を取得し、取得した文書要求を文書管理サーバ100へ送信する。
(Step S11-2)
In the terminal device 200, the creation unit 206 receives from the operation detection unit 207 a detection result of an operation of pressing a portion corresponding to the document specifying information to be browsed. A document request addressed to the management server 100 is created.
(Step S12-2)
In terminal device 200 , creation unit 206 outputs the created document request to communication unit 102 . The communication unit 102 acquires the document request output by the creation unit 206 and transmits the acquired document request to the document management server 100 .
 (ステップS13-2)
 文書管理サーバ100において、通信部102は、端末装置200が送信した文書要求を受信する。処理部107は、通信部102が受信した文書要求を取得する。処理部107は、取得した文書要求に含まれる文書特定情報を取得する。処理部107は、取得した文書特定情報に関連付けて記憶されている文書のスキャンデータと文書のテキストデータとを記憶部109から取得する。
 (ステップS14-2)
 文書管理サーバ100において、処理部107は、取得した文書のテキストデータに基づいて、文書に含まれる文字列を分割することによって一又は複数の単語を取得する。
(Step S13-2)
In the document management server 100 , the communication section 102 receives the document request transmitted by the terminal device 200 . The processing unit 107 acquires the document request received by the communication unit 102 . The processing unit 107 acquires document specifying information included in the acquired document request. The processing unit 107 acquires from the storage unit 109 the scan data of the document and the text data of the document stored in association with the acquired document specifying information.
(Step S14-2)
In the document management server 100, the processing unit 107 obtains one or more words by dividing the character string included in the document based on the text data of the obtained document.
 (ステップS15-2)
 文書管理サーバ100において、処理部107は、取得した一又は複数の単語の各々の書式、品詞の並びなどの所定の規則に基づいて、文書関連事項を抽出する。
 (ステップS16-2)
 文書管理サーバ100において、作成部110は、処理部107が取得した文書のスキャンデータと文書のテキストデータと、抽出した文書関連事項とを取得する。作成部110は、取得した文書のスキャンデータと文書のテキストデータと、抽出した文書関連事項とを含む端末装置200を宛先とする文書応答を作成する。
 (ステップS17-2)
 文書管理サーバ100において、作成部110は、作成した文書応答を、通信部102へ出力する。通信部102は、作成部110が出力した文書応答を取得し、取得した文書応答を、端末装置200へ送信する。
(Step S15-2)
In the document management server 100, the processing unit 107 extracts document-related matters based on predetermined rules such as the format of each of the acquired one or more words and the arrangement of parts of speech.
(Step S16-2)
In the document management server 100, the creation unit 110 acquires the scan data of the document acquired by the processing unit 107, the text data of the document, and the extracted document-related items. The preparation unit 110 prepares a document response destined for the terminal device 200 including the acquired scan data of the document, the text data of the document, and the extracted document-related items.
(Step S17-2)
In the document management server 100 , the creation unit 110 outputs the created document response to the communication unit 102 . The communication unit 102 acquires the document response output by the creation unit 110 and transmits the acquired document response to the terminal device 200 .
 (ステップS18-2)
 端末装置200において、通信部202は、文書管理サーバ100が送信した文書応答を受信する。表示処理部208は、通信部202が受信した文書応答に含まれる文書のスキャンデータと文書のテキストデータと文書関連事項とを取得する。表示処理部208は、表示部201に、取得した文書のスキャンデータと文書のテキストデータと文書関連事項とのうち選択されたものを表示させる。
 図12は、本実施形態に係る端末装置に表示される情報の例2を示す図である。図12は、表示の一例として、秘密保持契約書を示す。図12に示すように、表示の一例には、契約書情報(文書関連事項)と、読取済テキスト(文書のテキストデータ)と、原本(文書のスキャンデータ)とを選択的に表示可能である。例えば、契約書情報が選択された場合には、図4から図6に示したように、文書のタイトル、契約種別、締結日、発行日、満了日などの文書関連事項が表示される。
(Step S18-2)
In the terminal device 200 , the communication section 202 receives the document response transmitted by the document management server 100 . The display processing unit 208 acquires the document scan data, the document text data, and the document-related items included in the document response received by the communication unit 202 . The display processing unit 208 causes the display unit 201 to display the selected one of the acquired scan data of the document, text data of the document, and document-related items.
FIG. 12 is a diagram showing example 2 of information displayed on the terminal device according to the present embodiment. FIG. 12 shows a confidentiality agreement as an example of display. As shown in FIG. 12, as an example of the display, it is possible to selectively display contract information (document-related items), read text (document text data), and original (document scan data). . For example, when contract information is selected, document-related items such as the document title, contract type, conclusion date, issue date, and expiration date are displayed as shown in FIGS.
 前述した実施形態では、端末装置200において、ユーザーUが文書関連事項を入力する場合について説明したがこの例に限られない。例えば、端末装置200は、文書のスキャンデータを文書のテキストデータに変換し、文書テキストデータから文書関連事項に該当するものを抽出してもよい。端末装置200は、抽出した文書関連事項に該当するものを、文書管理依頼画面に表示してもよい。具体的には、端末装置200に、変換部と、処理部とを備える。変換部は、文書のスキャンデータを文書のテキストデータに変換する。処理部は、文書テキストデータから文書関連事項に該当するものを抽出する。表示処理部208は、処理部が抽出した文書関連事項に該当するものを、文書管理依頼画面に表示させる。このように構成することによって、ユーザーUが文書関連事項を入力する手間を低減できる。 In the above-described embodiment, the case where the user U inputs document-related items on the terminal device 200 has been described, but the present invention is not limited to this example. For example, the terminal device 200 may convert document scan data into document text data, and extract document-related items from the document text data. The terminal device 200 may display the extracted document-related items on the document management request screen. Specifically, the terminal device 200 includes a conversion unit and a processing unit. The conversion unit converts the document scan data into document text data. The processing unit extracts document-related items from the document text data. The display processing unit 208 displays items corresponding to the document-related items extracted by the processing unit on the document management request screen. By configuring in this way, it is possible to reduce the trouble of the user U inputting document-related matters.
 前述した実施形態では、文書のインデックスに、一又は複数の所定の項目を暗号化した結果と、英数字の表示形式を半角に変換し、英字の表現形式を小文字に変換した一又は複数の単語の各々を暗号化した結果とが登録される場合について説明したが、この例に限られない。例えば、文書のインデックスに、一又は複数の所定の項目を暗号化した結果と、英数字の表示形式を半角に変換し、英字の表現形式を小文字に変換した一又は複数の単語の各々を暗号化した結果とのいずれか一方が登録されてもよい。 In the above-described embodiment, the document index contains the result of encrypting one or more predetermined items and one or more words in which the alphanumeric representation format is converted to half-width and the alphabetic representation format is converted to lowercase. Although the case where the result of encrypting each of is registered has been described, the present invention is not limited to this example. For example, in a document index, the result of encrypting one or more predetermined items and one or more words converted from alphanumeric display formats to half-width characters and from alphabetic representation formats to lowercase are encrypted. Either one of the converted result and the converted result may be registered.
 前述した実施形態では、文書管理サーバ100が、文書に含まれる文字列を分割することによって取得した一又は複数の単語の各々を暗号化する場合について説明したが、この例に限られない。例えば、第1暗号化部105-1は、一又は複数の単語のうち、名詞、動詞などの所定の品詞に該当する単語を暗号化してもよい。このように構成することによって、一又は複数の全ての単語を暗号化する場合と比較して、文書管理サーバ100の処理負荷を低減できる。 In the above-described embodiment, the case where the document management server 100 encrypts each of one or more words obtained by dividing the character string included in the document has been described, but the present invention is not limited to this example. For example, the first encryption unit 105-1 may encrypt one or more words that correspond to a predetermined part of speech such as a noun or a verb. By configuring in this way, the processing load on the document management server 100 can be reduced compared to the case where all one or more words are encrypted.
 前述した実施形態では、文書管理サーバ100が、検索用文字列に含まれる文字列を分割することによって取得した一又は複数の単語の各々を暗号化する場合について説明したが、この例に限られない。例えば、第2暗号化部105-2は、一又は複数の単語のうち、名詞、動詞などの所定の品詞に該当する単語を暗号化してもよい。このように構成することによって、一又は複数の全ての単語を暗号化する場合と比較して、文書管理サーバ100の処理負荷を低減できる。 In the above-described embodiment, the case where the document management server 100 encrypts each of one or more words obtained by dividing the character string included in the search character string has been described, but the present invention is limited to this example. Absent. For example, the second encryption unit 105-2 may encrypt words corresponding to a predetermined part of speech such as nouns and verbs among one or more words. By configuring in this way, the processing load on the document management server 100 can be reduced compared to the case where all one or more words are encrypted.
 前述した実施形態に係る文書管理システム1によれば、文書管理サーバ100は、第1受付部103-1と、第1分割部104-1と、第1暗号化部105-1と、登録部106と、処理部107と、第2受付部103-2と、第2分割部104-2と、第2暗号化部105-2と、検索部108とを備える。
 第1受付部103-1は、文書のテキストデータを受け付ける。第1分割部104-1は、第1受付部103-1が受け付けたテキストデータを分割し、複数の単語を取得する。第1暗号化部105-1は、第1分割部104-1が取得した複数の単語の各々を暗号化する。登録部106は、第1暗号化部105-1によって暗号化された複数の単語の各々を文書のインデックスに登録する。処理部107は、登録部106によって暗号化された複数の単語の各々が登録された文書のインデックスと文書とを関連付けて文書データとして記憶部109に記憶させる。
According to the document management system 1 according to the above-described embodiment, the document management server 100 includes a first reception section 103-1, a first division section 104-1, a first encryption section 105-1, a registration section 106, a processing unit 107, a second reception unit 103-2, a second division unit 104-2, a second encryption unit 105-2, and a search unit .
First reception unit 103-1 receives text data of a document. First dividing unit 104-1 divides the text data received by first receiving unit 103-1 to obtain a plurality of words. First encryption unit 105-1 encrypts each of the plurality of words acquired by first division unit 104-1. Registration unit 106 registers each of the plurality of words encrypted by first encryption unit 105-1 in the document index. The processing unit 107 associates the index of the document in which each of the plurality of words encrypted by the registration unit 106 is registered with the document, and stores them in the storage unit 109 as document data.
 第2受付部103-2は、検索用文字列を受け付ける。第2分割部104-2は、第2受付部103-2が受け付けた検索用文字列を分割し、一又は複数の単語を取得する。第2暗号化部105-2は、第2分割部104-2が取得した一又は複数の単語の各々を暗号化する。検索部108は、第2暗号化部105-2によって暗号化された一又は複数の単語の各々に基づいて複数の文書データを検索する。
 このように構成することによって、文書のテキストデータを分割することによって取得される複数の単語の各々を暗号化し、暗号化された複数の単語の各々を文書のインデックスに登録できる。検索用文字列を分割することによって取得される一又は複数の単語の各々を暗号化し、暗号化された複数の単語のいずれかを含む文書のインデックスを検索することによって、複数の文書から所望の文書を取得できる。このため、文書管理の利便性を向上できる。
Second accepting unit 103-2 accepts a search character string. Second division unit 104-2 divides the character string for search received by second reception unit 103-2 to acquire one or more words. Second encryption unit 105-2 encrypts each of the one or more words acquired by second division unit 104-2. Search unit 108 searches for a plurality of document data based on each of the one or more words encrypted by second encryption unit 105-2.
By configuring in this way, each of the plurality of words obtained by dividing the text data of the document can be encrypted, and each of the plurality of encrypted words can be registered in the index of the document. By encrypting each of one or more words obtained by dividing the search string and searching the index of documents containing any of the encrypted words, I can get the document. Therefore, the convenience of document management can be improved.
 第1分割部104-1は、複数の単語の各々に含まれる文字列を所定の形式に変換し、第1暗号化部105-1は、第1分割部104-1が所定の形式に変換した文字列を暗号化する。このように構成することによって、複数の単語の各々に含まれる文字列を所定の形式に揃えることができる。このため、その所定の形式で検索用文字列を入力させることによって、検索を容易にできる。 First dividing unit 104-1 converts a character string included in each of a plurality of words into a predetermined format, and first encryption unit 105-1 converts first dividing unit 104-1 into a predetermined format. encrypted string. By configuring in this way, the character strings included in each of the plurality of words can be arranged in a predetermined format. Therefore, the retrieval can be facilitated by inputting the retrieval character string in the predetermined format.
 第1受付部103-1は、文書のテキストデータに含まれる所定の項目を一又は複数抽出し、第1暗号化部105-1は、第1受付部103-1が抽出した一又は複数の所定の項目の各々を暗号化し、登録部106は、一又は複数の所定の項目を暗号化した結果を文書のインデックスに登録し、処理部107は、登録部106によって一又は複数の所定の項目を暗号化した結果がさらに登録された文書の前記インデックスと文書とを関連付けて文書データとして記憶部109に記憶させる。
 このように構成することによって、文書のテキストデータに含まれる一又は複数の所定の項目の各々を暗号化し、一又は複数の所定の項目を暗号化した結果を文書のインデックスにさらに登録できる。検索用文字列を分割することによって取得される一又は複数の単語の各々を暗号化し、暗号化された複数の単語のいずれかを含む文書のインデックスを検索することによって、複数の文書から所望の文書を取得できる。このため、文書管理の利便性をさらに向上できる。
First receiving unit 103-1 extracts one or more predetermined items included in the text data of the document, and first encryption unit 105-1 encrypts the one or more items extracted by first receiving unit 103-1. Each of the predetermined items is encrypted, the registration unit 106 registers the encryption result of the one or more predetermined items in the index of the document, and the processing unit 107 causes the registration unit 106 to encrypt the one or more predetermined items. is further stored in the storage unit 109 as document data in association with the index of the document in which the encrypted result is further registered.
By configuring in this way, each of the one or more predetermined items included in the text data of the document can be encrypted, and the result of encrypting the one or more predetermined items can be further registered in the index of the document. By encrypting each of one or more words obtained by dividing the search string and searching the index of documents containing any of the encrypted words, I can get the document. Therefore, the convenience of document management can be further improved.
 登録部106は、テキストデータに含まれる複数の単語の順序に基づいて、第1暗号化部105-1によって暗号化された複数の単語の各々を文書のインデックスに登録する。契約書などの文書において、前段の方が後段よりも検索用文字列に該当する単語が多いと想定される。このように構成することによって、テキストデータに含まれる複数の単語の順序に基づいて、第1暗号化部105-1によって暗号化された複数の単語の各々を文書のインデックスに登録するできるため、暗号化された複数の単語のいずれかを含む文書のインデックスを検索する際に、ヒットするまでの時間を短縮できる。 The registration unit 106 registers each of the multiple words encrypted by the first encryption unit 105-1 in the document index based on the order of the multiple words included in the text data. In a document such as a contract, it is assumed that there are more words corresponding to the search character string in the first part than in the second part. With this configuration, each of the plurality of words encrypted by the first encryption unit 105-1 can be registered in the document index based on the order of the plurality of words included in the text data. You can reduce the time to find a hit when searching the index for documents that contain one of multiple encrypted words.
 第1暗号化部105-1は、第1分割部104-1が取得した複数の単語のうち、所定の品詞に該当する単語を暗号化する。このように構成することによって、一又は複数の全ての単語を暗号化する場合と比較して、文書管理サーバ100の処理負荷を低減できる。助詞などは暗号化しても文書の検索には使用できないと想定される。 The first encryption unit 105-1 encrypts words corresponding to a predetermined part of speech among the plurality of words acquired by the first division unit 104-1. By configuring in this way, the processing load on the document management server 100 can be reduced compared to the case where all one or more words are encrypted. It is assumed that particles such as particles cannot be used for document retrieval even if they are encrypted.
 検索部108は、第2暗号化部105-2によって暗号化された一又は複数の単語の各々に、複数の文書のインデックスの各々に含まれる暗号化された複数の単語が一致するものを検索する。このように構成することによって、検索用文字列を含む文書を検索できる。 The search unit 108 searches for a plurality of encrypted words included in each index of a plurality of documents that match each of the one or more words encrypted by the second encryption unit 105-2. do. By configuring in this way, it is possible to search for documents containing search character strings.
 検索部108は、第2暗号化部105-2によって暗号化された一又は複数の単語の各々と、複数の文書のインデックスの各々に含まれる暗号化された複数の単語との一致率を導出する。例えば、一致率は、検索用文字列に含まれる一又は複数の単語を暗号化したものが、インデックスに含まれる暗号化された複数の単語と何語(何%)一致しているかを導出したものであってもよい。 The search unit 108 derives the match rate between each of the one or more words encrypted by the second encryption unit 105-2 and the plurality of encrypted words included in each of the indexes of the plurality of documents. do. For example, the match rate derives how many words (percentage) the encrypted word or words in the search string match the encrypted words in the index. can be anything.
 以上、本発明の実施形態について図面を参照して詳述してきたが、具体的な構成はこの実施形態に限られるものではなく、本発明の要旨を逸脱しない範囲の設計変更等も含まれる。
 例えば、上述した各装置の機能を実現するためのコンピュータプログラムをコンピュータ読み取り可能な記録媒体に記録して、この記録媒体に記録されたコンピュータプログラムをコンピュータシステムに読み込ませ、実行するようにしてもよい。なお、ここでいう「コンピュータシステム」とは、OSや周辺機器等のハードウェアを含むものであってもよい。
Although the embodiment of the present invention has been described in detail above with reference to the drawings, the specific configuration is not limited to this embodiment, and design changes and the like are included within the scope of the present invention.
For example, a computer program for realizing the functions of the devices described above may be recorded on a computer-readable recording medium, and the computer program recorded on this recording medium may be read and executed by a computer system. . Note that the “computer system” referred to here may include hardware such as an OS and peripheral devices.
 また、「コンピュータ読み取り可能な記録媒体」とは、フレキシブルディスク、光磁気ディスク、ROM、フラッシュメモリ等の書き込み可能な不揮発性メモリ、DVD(Digital Versatile Disc)等の可搬媒体、コンピュータシステムに内蔵されるハードディスク等の記憶装置のことをいう。
 さらに「コンピュータ読み取り可能な記録媒体」とは、インターネット等のネットワークや電話回線等の通信回線を介してコンピュータプログラムが送信された場合のサーバやクライアントとなるコンピュータシステム内部の揮発性メモリ(例えばDRAM(Dynamic Random Access Memory))のように、一定時間プログラムを保持しているものも含むものとする。
In addition, "computer-readable recording medium" includes writable nonvolatile memories such as flexible discs, magneto-optical discs, ROMs and flash memories, portable media such as DVDs (Digital Versatile Discs), and computer system built-in media. A storage device such as a hard disk that
Furthermore, "computer-readable recording medium" means a volatile memory (e.g., DRAM ( It also includes those that hold programs for a certain period of time, such as Dynamic Random Access Memory)).
 また、上記プログラムは、このプログラムを記憶装置等に格納したコンピュータシステムから、伝送媒体を介して、あるいは、伝送媒体中の伝送波により他のコンピュータシステムに伝送されてもよい。ここで、プログラムを伝送する「伝送媒体」は、インターネット等のネットワーク(通信網)や電話回線等の通信回線(通信線)のように情報を伝送する機能を有する媒体のことをいう。
 また、上記プログラムは、前述した機能の一部を実現するためのものであっても良い。
さらに、前述した機能をコンピュータシステムにすでに記録されているプログラムとの組み合わせで実現できるもの、いわゆる差分ファイル(差分プログラム)であっても良い。
Further, the above program may be transmitted from a computer system storing this program in a storage device or the like to another computer system via a transmission medium or by a transmission wave in a transmission medium. Here, the "transmission medium" for transmitting the program refers to a medium having a function of transmitting information, such as a network (communication network) such as the Internet or a communication line (communication line) such as a telephone line.
Further, the program may be for realizing part of the functions described above.
Further, it may be a so-called difference file (difference program) that can realize the above-described functions in combination with a program already recorded in the computer system.
1…文書管理システム、100…文書管理サーバ、101…変換部、102…通信部、103-1…第1受付部、103-2…第2受付部、104-1…第1分割部、104-2…第2分割部、105-1…第1暗号化部、105-2…第2暗号化部、106…登録部、107…処理部、108…検索部、109…記憶部、110…作成部、200…端末装置、201…表示部、202…通信部、203…撮像部、206…作成部、207…操作検出部、208…表示処理部、209…記憶部 REFERENCE SIGNS LIST 1 document management system 100 document management server 101 conversion unit 102 communication unit 103-1 first reception unit 103-2 second reception unit 104-1 first division unit 104 -2... Second dividing unit 105-1... First encryption unit 105-2... Second encryption unit 106... Registration unit 107... Processing unit 108... Searching unit 109... Storage unit 110... Creation unit 200 Terminal device 201 Display unit 202 Communication unit 203 Imaging unit 206 Creation unit 207 Operation detection unit 208 Display processing unit 209 Storage unit

Claims (9)

  1.  文書のテキストデータを受け付ける第1受付部と、
     前記第1受付部が受け付けた前記テキストデータを分割し、複数の単語を取得する第1分割部と、
     前記第1分割部が取得した複数の前記単語の各々を暗号化する第1暗号化部と、
     前記第1暗号化部によって暗号化された複数の前記単語の各々を前記文書のインデックスに登録する登録部と、
     前記登録部によって暗号化された複数の前記単語の各々が登録された前記文書の前記インデックスと前記文書とを関連付けて文書データとして記憶部に記憶させる処理部と、
     検索用文字列を受け付ける第2受付部と、
     前記第2受付部が受け付けた前記検索用文字列を分割し、一又は複数の単語を取得する第2分割部と、
     前記第2分割部が取得した一又は複数の前記単語の各々を暗号化する第2暗号化部と、
     前記第2暗号化部によって暗号化された一又は複数の前記単語の各々に基づいて複数の前記文書データを検索する検索部と
     を備える、文書管理サーバ。
    a first reception unit that receives text data of a document;
    a first dividing unit that divides the text data received by the first receiving unit and acquires a plurality of words;
    a first encryption unit that encrypts each of the plurality of words acquired by the first division unit;
    a registration unit for registering each of the plurality of words encrypted by the first encryption unit in an index of the document;
    a processing unit that associates the index of the document in which each of the plurality of words encrypted by the registration unit is registered with the document and stores the document as document data in a storage unit;
    a second reception unit that receives a character string for search;
    a second dividing unit that divides the search character string received by the second receiving unit to obtain one or more words;
    a second encryption unit that encrypts each of the one or more words acquired by the second division unit;
    A document management server, comprising: a search unit that searches for the plurality of document data based on each of the one or more words encrypted by the second encryption unit.
  2.  前記第1分割部は、複数の前記単語の各々に含まれる文字列を所定の形式に変換し、
     前記第1暗号化部は、前記第1分割部が所定の前記形式に変換した前記文字列を暗号化する、請求項1に記載の文書管理サーバ。
    The first dividing unit converts a character string included in each of the plurality of words into a predetermined format,
    2. The document management server according to claim 1, wherein said first encryption unit encrypts said character string converted into said predetermined format by said first division unit.
  3.  前記第1受付部は、前記文書の前記テキストデータに含まれる所定の項目を一又は複数抽出し、
     前記第1暗号化部は、前記第1受付部が抽出した一又は複数の所定の前記項目の各々を暗号化し、
     前記登録部は、一又は複数の所定の前記項目を暗号化した結果を前記文書のインデックスに登録し、
     前記処理部は、前記登録部によって一又は複数の所定の前記項目を暗号化した結果がさらに登録された前記文書の前記インデックスと前記文書とを関連付けて文書データとして記憶部に記憶させる、請求項1又は請求項2に記載の文書管理サーバ。
    The first reception unit extracts one or more predetermined items included in the text data of the document,
    The first encryption unit encrypts each of the one or more predetermined items extracted by the first reception unit,
    The registration unit registers a result of encrypting one or more predetermined items in the index of the document;
    3. The processing unit associates the document with the index of the document in which a result of encrypting one or more of the predetermined items is further registered by the registration unit, and causes the storage unit to store the index and the document as document data. 3. The document management server according to claim 1 or 2.
  4.  前記登録部は、前記テキストデータに含まれる複数の前記単語の順序に基づいて、前記第1暗号化部によって暗号化された複数の前記単語の各々を前記文書の前記インデックスに登録する、請求項1から請求項3のいずれか一項に記載の文書管理サーバ。 3. The registration unit registers each of the plurality of words encrypted by the first encryption unit in the index of the document based on the order of the plurality of words included in the text data. The document management server according to any one of claims 1 to 3.
  5.  前記第1暗号化部は、前記第1分割部が取得した複数の前記単語のうち、所定の品詞に該当する単語を暗号化する、請求項1から請求項4のいずれか一項に記載の文書管理サーバ。 5. The first encryption unit according to any one of claims 1 to 4, wherein the first encryption unit encrypts words corresponding to a predetermined part of speech among the plurality of words acquired by the first division unit. Document management server.
  6.  前記検索部は、前記第2暗号化部によって暗号化された一又は複数の前記単語の各々に、複数の文書のインデックスの各々に含まれる暗号化された複数の前記単語が一致するものを検索する、請求項1から請求項5のいずれか一項に記載の文書管理サーバ。 The search unit searches for a plurality of encrypted words included in each index of a plurality of documents that match each of the one or more words encrypted by the second encryption unit. 6. The document management server according to any one of claims 1 to 5, wherein the document management server
  7.  前記検索部は、前記第2暗号化部によって暗号化された一又は複数の前記単語の各々と、複数の文書のインデックスの各々に含まれる暗号化された複数の前記単語との一致率を導出する、請求項1から請求項6のいずれか一項に記載の文書管理サーバ。 The search unit derives a match rate between each of the one or more words encrypted by the second encryption unit and the plurality of encrypted words included in each of the indexes of the plurality of documents. The document management server according to any one of claims 1 to 6, wherein the document management server
  8.  文書管理サーバが実行する文書管理方法であって、
     文書のテキストデータを受け付けるステップと、
     前記テキストデータを分割し、複数の単語を取得するステップと、
     複数の前記単語の各々を暗号化するステップと、
     暗号化する前記ステップで暗号化された複数の前記単語の各々を前記文書のインデックスに登録するステップと、
     暗号化された複数の前記単語の各々が登録された前記文書の前記インデックスと前記文書とを関連付けて文書データとして記憶部に記憶させるステップと、
     検索用文字列を受け付けるステップと、
     前記検索用文字列を分割し、一又は複数の単語を取得するステップと、
     前記検索用文字列を分割した一又は複数の単語の各々を暗号化するステップと、
     暗号化された前記検索用文字列を分割した一又は複数の前記単語の各々に基づいて複数の前記文書データを検索するステップと
     を有する、文書管理方法。
    A document management method executed by a document management server, comprising:
    accepting text data of a document;
    splitting the text data to obtain multiple words;
    encrypting each of a plurality of said words;
    indexing each of the plurality of words encrypted in the encrypting step into an index of the document;
    a step of associating the index of the document in which each of the plurality of encrypted words is registered with the document and storing the document as document data in a storage unit;
    accepting a search string;
    dividing the search string to obtain one or more words;
    encrypting each of the one or more words obtained by dividing the search string;
    and searching for a plurality of the document data based on each of the one or more words obtained by dividing the encrypted character string for search.
  9.  コンピュータに、
     文書のテキストデータを受け付けるステップと、
     前記テキストデータを分割し、複数の単語を取得するステップと、
     複数の前記単語の各々を暗号化するステップと、
     暗号化する前記ステップで暗号化された複数の前記単語の各々を前記文書のインデックスに登録するステップと、
     暗号化された複数の前記単語の各々が登録された前記文書の前記インデックスと前記文書とを関連付けて文書データとして記憶部に記憶させるステップと、
     検索用文字列を受け付けるステップと、
     前記検索用文字列を分割し、一又は複数の単語を取得するステップと、
     前記検索用文字列を分割した一又は複数の単語の各々を暗号化するステップと、
     暗号化された前記検索用文字列を分割した一又は複数の前記単語の各々に基づいて複数の前記文書データを検索するステップと
     を実行させる、コンピュータプログラム。
    to the computer,
    accepting text data of a document;
    splitting the text data to obtain multiple words;
    encrypting each of a plurality of said words;
    indexing each of the plurality of words encrypted in the encrypting step into an index of the document;
    a step of associating the index of the document in which each of the plurality of encrypted words is registered with the document and storing the document as document data in a storage unit;
    accepting a search string;
    dividing the search string to obtain one or more words;
    encrypting each of the one or more words obtained by dividing the search string;
    and searching for a plurality of the document data based on each of the one or more words obtained by dividing the encrypted search character string.
PCT/JP2022/021282 2021-07-21 2022-05-24 Document management server, document management method, and computer program WO2023002748A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
JP2021120640A JP2023016373A (en) 2021-07-21 2021-07-21 Document management server, document management method and computer program
JP2021-120640 2021-07-21

Publications (1)

Publication Number Publication Date
WO2023002748A1 true WO2023002748A1 (en) 2023-01-26

Family

ID=84979130

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/JP2022/021282 WO2023002748A1 (en) 2021-07-21 2022-05-24 Document management server, document management method, and computer program

Country Status (2)

Country Link
JP (1) JP2023016373A (en)
WO (1) WO2023002748A1 (en)

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2009015407A (en) * 2007-07-02 2009-01-22 Sony Corp Information processor, reputation retrieval method for content, and reputation retrieval system for content
JP2021073788A (en) * 2021-01-21 2021-05-13 日本電気株式会社 Collation system, method, and program

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2009015407A (en) * 2007-07-02 2009-01-22 Sony Corp Information processor, reputation retrieval method for content, and reputation retrieval system for content
JP2021073788A (en) * 2021-01-21 2021-05-13 日本電気株式会社 Collation system, method, and program

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
TAKUYA NOJIMA; YOHEI WATANABE; MITSUGU IWAMOTO; KAZUO OHTA: "3C2-2 The Modified CGKO-SSE protocol with an Improved Method of Inserting Dummy Entries", PROCEEDINGS OF 2018 SYMPOSIUM ON CRYPTOGRAPHY AND INFORMATION SECURITY (SCIS2018); JANUARY 23-26, 2018, IEICE, JP, 28 January 2018 (2018-01-28) - 23 January 2018 (2018-01-23), JP, pages 1 - 8, XP009543071 *

Also Published As

Publication number Publication date
JP2023016373A (en) 2023-02-02

Similar Documents

Publication Publication Date Title
EP1450267B1 (en) Methods and systems for language translation
US8489583B2 (en) Techniques for retrieving documents using an image capture device
JP4398992B2 (en) Information search apparatus, information search method, and information search program
US20090052804A1 (en) Method process and apparatus for automated document scanning and management system
US6938037B2 (en) Method and apparatus for data storage and retrieval
US20080162603A1 (en) Document archiving system
US20100333194A1 (en) System, Method, and Apparatus for Capturing, Securing, Sharing, Retrieving, and Searching Data
JP5020979B2 (en) Organizing digital content on the Internet through digital content reviews
US20230196002A1 (en) Document processing method, and information processing device
US20070214177A1 (en) Document management system, program and method
WO2023002748A1 (en) Document management server, document management method, and computer program
CN105740317A (en) Method and system for objectifying non-textual content and finding document
JP2009075637A (en) Data management system, data providing system, and program
JP2021144565A (en) Information processing apparatus and information processing program
JP2009075849A (en) Information processor, information processing method, program thereof, and storage medium
US11108922B2 (en) Image processing apparatus, system, server, control method, and storage medium to perform encryption processes on image data and attribute data using first and second keys and further perform character recognition process
JP2002073399A (en) Method for delivering analyzed data in analyzing service
JP2000020549A (en) Device for assisting input to document database system
WO2024089910A1 (en) Information processing method, information processing program, information processing system
US20230177859A1 (en) Document Processing Method, and Information Processing Device
KR101133564B1 (en) The system which manages the electrical document data
JP2023079244A (en) Character string processing program
JP2011170393A (en) Image search system, image search method and image search program
EA036253B1 (en) Computer implemented system and method for collating and presenting multi-format information
JP2023075965A (en) Character string processing program

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 22845685

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE