WO2016186137A1 - Accounting assistance system - Google Patents

Accounting assistance system Download PDF

Info

Publication number
WO2016186137A1
WO2016186137A1 PCT/JP2016/064758 JP2016064758W WO2016186137A1 WO 2016186137 A1 WO2016186137 A1 WO 2016186137A1 JP 2016064758 W JP2016064758 W JP 2016064758W WO 2016186137 A1 WO2016186137 A1 WO 2016186137A1
Authority
WO
WIPO (PCT)
Prior art keywords
journal
voucher
unit
data
entry
Prior art date
Application number
PCT/JP2016/064758
Other languages
French (fr)
Japanese (ja)
Inventor
上野裕史
高島研也
Original Assignee
株式会社スキャる
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 株式会社スキャる filed Critical 株式会社スキャる
Priority to JP2017519380A priority Critical patent/JP6835713B2/en
Publication of WO2016186137A1 publication Critical patent/WO2016186137A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q40/00Finance; Insurance; Tax strategies; Processing of corporate or income taxes

Definitions

  • the present invention relates to a system that supports accounting operations.
  • Japanese Patent Application Laid-Open No. 2014-235484 provides a cloud-type system in which a user can obtain a journal entry result of a transaction indicated in a voucher in real time simply by transmitting the voucher data from a Web terminal.
  • This system includes a server that executes processing of a journal analysis service, a first master that stores a product name and a product group in association with each other, and a journal by a journal pattern of the product group and account item as a pair.
  • a database storing a master shared by all users including a second master that records the number of persons to be processed, and the server analyzes the voucher data transmitted from the Web terminal that requests the journal analysis service.
  • Means for extracting journal element information, and a product group corresponding to the product name included in the element information is obtained from the first master, and among all account items in the second master corresponding to the product group, Means for generating a journal by selecting an account item having the largest number of journal processing persons and presenting the journal as a recommended journal.
  • a journal unit that outputs a journal entry of a voucher that is evidence of a user's transaction, and divided data obtained by dividing a plurality of character information included in the journal to be journalized according to a notation position in the voucher, Along with the identification information indicating the voucher to be journalized, a distributed unit that is distributed and transmitted to different workers for digitization, and character information that has been digitized (divided and digitized) by different workers. And an aggregation unit that generates journal data to be journalized by the journal unit based on the identification information from the divided digitized character information.
  • the voucher data is distributed and transmitted in a highly confidential state in the distributed unit, it can be transmitted and received safely via a computer network (cloud), typically the Internet. For this reason, a person who can connect to the network (cloud) can be used for digitizing the voucher, and voucher data can be digitized safely at low cost.
  • cloud computer network
  • the distribution unit may include a unit that classifies the divided data according to the notation position in the voucher, adds a category based on the display position, and transmits the divided data to different workers for each type. Since the worker digitizes the divided data of the same category (type, group, type), the efficiency of the digitization work can be improved.
  • the distribution unit is configured to generate a plurality of voucher divided images obtained by dividing an image of a journal to be journalized according to a character information notation position and a divided data including the plurality of voucher divided images through different operations via a computer network. And a unit for transmitting to a person.
  • the transmitting unit may transmit the divided data to different workers via the Internet. By separating and digitizing the data contained in the voucher independently by different workers distributed in places via the Internet (cloud), the voucher data can be digitized more safely.
  • the journal unit that obtains the journal data including the word sequence extracted from the transaction date, amount, and other text information of the journal voucher will determine the distance between the journal data and multiple journal reference entries, the transaction date, An analogy determination unit that calculates the amount of money and the similarity of each word included in the word array as a parameter may be included.
  • Each of the plurality of journal entry entries includes a transaction date, an amount, and a word array extracted from other character information for each entry in the book including information on past vouchers that have been journalized by the user as entries.
  • the journal unit may further include a first journal output unit that outputs the account item of the journal reference entry having the shortest distance from the journal data as a journal.
  • journal reference entry that is the past journal result and the journal object data to be journaled are acquired as metadata including the transaction date, the amount, and the word array.
  • the similarity determination unit cuts out the elements included in the metadata, in particular, each word included in the word array into words that can be determined by similarity, and converts a plurality of journal reference entries and journal target data into transaction dates, A distance between a journal reference entry and journal data is calculated by mapping the amount and a plurality of words that can be judged to be similar to a multidimensional space.
  • the first journal entry output unit outputs the account item of the journal reference entry having the shortest distance from the journal entry data as the journal entry.
  • journal output unit When the distance between the journal unit and the shortest journal reference entry selected by the first journal output unit is larger than the first threshold, the difference from the amount of the journal data is the largest within the second threshold.
  • a second journal output unit that outputs the account item of the journal reference entry having a close transaction date as the journal may be included. The amount and date are included in the most effective analogy judgment items that determine the journal entry for the voucher. Accordingly, when the distance between the journal entry data and the journal reference entry is too far away due to other factors, there are cases in which the optimal account item can be output by removing the other factors and making a comparison.
  • Account items can be divided into several categories.
  • the accounts of the plurality of journal entry entries are divided into a plurality of categories, and the categories include at least one account.
  • Each of the plurality of journal reference entries may include category information.
  • the similarity determination unit is a category-based similarity determination unit that calculates the distance between the journal reference entry of the same category as the category determined based on at least the title and the word included in the word array of the journal object data and the journal object data. May be included.
  • the category may distinguish a difference in transaction direction, or may distinguish a difference in titles of vouchers.
  • the information may be deleted from the metadata.
  • the journal is determined by majority vote in the cloud, the user's voucher Can be prevented from spreading on the cloud.
  • the information “category” may be obtained from the title of the voucher when extracting the journal entry data from the voucher and included in the journal entry data in advance, and the journal entry unit may include a category determination unit.
  • the category determination unit determines which category the journal entry data belongs to based on at least a word indicating the title and destination included in the word array of the journal entry data.
  • the journalizing unit A title / address extracting unit for extracting a title and a destination from a word array based on the position information of the word sequence may be included. There is a tendency for the title to be displayed at the center of the voucher and the destination to be displayed on the voucher, such as the top left of the voucher. By referring to the location information, the title and destination of the voucher are automatically determined. Accuracy can be improved.
  • One of the other aspects of the present invention is a program (program product) that causes a computer to operate as a system having a journal unit that outputs a journal entry for evidence that is evidence of a user's transaction.
  • the program may include a unit (functional unit) that causes the computer to further operate as means for generating a journalized database including a plurality of journal reference entries from the user's past book data input to the computer.
  • the program (program product) can be recorded on an appropriate recording medium and provided.
  • One further different aspect of the present invention is a method that includes outputting a voucher journal entry that is evidence of a user's transaction by a computer.
  • the system includes a transmission / reception unit in which a computer exchanges data with a plurality of workers via the Internet, and the method includes the following steps. 1.
  • the computer transmits the divided data obtained by dividing a plurality of pieces of character information included in the voucher to be journalized according to the notation position in the voucher together with identification information indicating the voucher to be journaled to different workers for digitization. To be distributed and transmitted via 2.
  • the step of distributing and transmitting may include the step of classifying the divided data according to the notation position in the voucher and distributing and transmitting to different workers for each type. Further, the step of transmitting in a distributed manner may include the step of transmitting divided data including a plurality of voucher divided images obtained by dividing the journal to be journalized according to the notation position of the character information into different workers.
  • the computer includes a plurality of journal references each including a word sequence extracted from the transaction date, amount, and other character information for each entry in the book that includes the user's past journalized voucher information as an entry in the memory.
  • You may have a journalized database that includes entries, and the step of outputting the journal may include the following steps.
  • the computer obtains the journal data including the word date extracted from the transaction date, amount, and other character information of the voucher to be journalized.
  • Calculating the distance includes calculating the distance between the journal reference entry and the journal target data in the same category as the category determined based on at least the title and destination word included in the word array of the journal target data. But you can.
  • Acquiring is to acquire information (division and digitized character information) digitized by different workers after a plurality of character information included in the voucher to be journalized is divided according to the notation position in the voucher.
  • the method may include generating journal entry data based on the identification information indicating the voucher of the journal entry from the character information converted into the divided data.
  • summary of an accounting assistance system The block diagram which shows the outline
  • Fig. 1 shows an example of an accounting support system.
  • This accounting support system (accounting support apparatus) 1 is a system for organizing and journalizing a plurality of users 3 vouchers (certificates), for example, expense settlement vouchers 5.
  • the user 3 may be, for example, a personal accountant, company, or other organization such as an accounting office or a tax accountant office that uses the accounting support system 1.
  • the accounting support system 1 provides services including digitization of the voucher original 5, original management, and journaling work.
  • the accounting support system 1 uses a plurality of remote workers 8 connected via the Internet (cloud) 9 in order to process enormous digitization work at low cost.
  • digitization means information such as handwritten character information, printed character information, etc., which is written in a voucher (voucher) or software that operates on a computer. This indicates that the data is converted into processable data, that is, electronic data, digital data, or the like.
  • a part of the voucher document 5 is converted into image data and distributed to the terminal of the remote worker 8 connected to the accounting support system 1 via the Internet 9.
  • the remote worker 8 performs work for digitizing a part of the voucher document 5 or verifies it.
  • the accounting support system 1 aggregates the work results of the remote worker 8 via the Internet 9 and then determines the journal entry.
  • the accounting support system 1 can improve the efficiency of the work by separating the data conversion work that does not require specialization but requires man-hours from the journalizing work that requires accounting specialization. It is possible to improve the accounting power.
  • the voucher 5 is also called voucher or voucher document (may be described in this specification), and provides evidence of transactions such as receipts, invoices, invoices, purchase orders, invoices, payment certificates, etc.
  • An accounting book is prepared based on the contents of the transaction described in the voucher 5.
  • the voucher 5 is obliged to be organized and stored for a predetermined period by each user 3.
  • the accounting support system 1 adopts a new method that radically changes the method.
  • the accounting support system 1 roughly includes an electronic data conversion process and a journalizing process.
  • the main process of the digitization process is to extract a character string from the voucher 5 and digitize it. It is pure string digitization and does not require any accounting expertise.
  • the journalizing process is a task that performs journaling work based on digitized data and determines account items, and requires accounting expertise.
  • the journalizing work can be automated using information technology. It is also possible to set up a process to check the final results by accounting professionals with accounting expertise such as accountants and tax accountants who are familiar with the work of each user 3, verifying the results of automatic sorting work, journaling accuracy Can be guaranteed.
  • the accounting support system 1 includes a factory department (factory) 11 that accepts the voucher 5 provided by the user 3, organizes, confirms, and images it, and a warehouse department (warehouse) 12 that stores the voucher 5 in the state of the original 6.
  • a data processing department (server) 13 that digitizes the voucher 5 and makes accounting entries based on the imaged data (voucher image) 7 of the voucher 5 is included.
  • the original 6 of the voucher 5 is stored in the warehouse 12, and the user 3 can access the electronic voucher 5 data of the server 13 via the Internet 9. Furthermore, the user 3 can obtain the original 6 of the voucher 5 stored in the warehouse 12 based on the electronic data of the server 13 or refer to the warehouse 12 as necessary.
  • the server 13 has computer resources such as a memory and a CPU, and realizes a function provided by a program (program product).
  • the server 13 includes a unit 14 that transmits and receives data to and from the terminal of the remote worker 8 via a computer network (Internet, cloud) 9.
  • the server 13 further uses a data input processing function (data input processing unit) 20 for digitizing the voucher image 7 using the remote worker 8 and the digitized information or accounts for the information.
  • a data mining function (data mining unit) 30 for adding accounting information such as, a database 50 for storing digitized data, and a data display providing function for providing the digitized data to the user 3 via the network 9 (Data display unit) 40.
  • FIG. 2 shows an outline of processing in the factory 11.
  • Each employee 103 of each user 3 puts the voucher 5 and the expense settlement label 105 in a suitable container, for example, a plastic bag (chuck poly) 107 with a chuck.
  • the voucher 5 journalized for each employee by the chuck poly 107 is put in a dedicated envelope 108 and delivered to the factory 11 by delivery means such as mail or consignment.
  • the identification information (ID) attached to the expense settlement label 105 by a barcode, a two-dimensional code or the like becomes the main ID of the voucher 5 included in the chuck poly 107, and each voucher 5 included in the chuck poly 107 has a main ID. In addition to the branch ID, each voucher 5 is completely identified.
  • pre-processing 111 In the factory 11, pre-processing 111, organizing data conversion processing 114, and storage processing 116 are performed.
  • the voucher 5 included in the incoming chuck poly 107 is confirmed, and identification information (ID) having the expense settlement label as the main ID is associated with each voucher 5 on a one-to-one basis.
  • identification information ID
  • each voucher 5 associated with the identification information on a one-to-one basis is organized, confirmed, and imaged.
  • Data obtained by imaging the voucher 5 is fed back to the user 3 as a receipt confirmation image 118.
  • Data obtained by imaging the voucher 5 is supplied to the server 13 as image data (voucher image) 7 for digitization.
  • the voucher 5 is stored again in the chuck poly 107 and stored as the original 6 in the warehouse 12. Since the same identification information as the voucher image 7 is attached to the original 6, the original 6 stored in the warehouse 12 can be easily reached from the information obtained by digitizing the voucher image 7 if necessary.
  • FIG. 3 is a block diagram showing functions of the server 13.
  • the server 13 uses a data input processing unit (data input processing function) 20 for digitizing the voucher image 7 and a journalizing unit (journaling apparatus, journalizing system) for journalizing vouchers using the digitized information (data to be journalized) 60.
  • Data mining function data mining function
  • the journal unit 30 can also be used as a data mining function (data mining unit) for adding accounting information such as account items to the input information.
  • the data display providing function 40 can also be used as a data display unit 40 that provides digitized data stored in the database 50 to the user 3 via the network 9.
  • the data mining function 30 includes an automatic journal unit 80 that automatically journals and determines account items.
  • the server 13 includes a functional unit (process) 90 for confirming the result of automatic sorting, and the journalizing result, that is, the automatically output account item is confirmed manually by the accounting specialist 91.
  • the electronic journal data 60 is stored in the database 50 as transaction information 95. Therefore, the database 50 includes a function as the updated journal diary 52 of the user 3.
  • the data input processing unit (data input processing device) 20 includes an image reading unit (image reading device) 21 that acquires the voucher image 7 to be journalized in units of vouchers, and a plurality of voucher divisions obtained by dividing the voucher image 7 corresponding to the voucher.
  • an aggregating unit (aggregating apparatus, aggregating function) 23 for acquiring character information 28 that has been digitized (divided and digitized) by a plurality of workers 8.
  • the distribution unit 22 divides the divided data 29 obtained by dividing a plurality of character information included in the voucher to be journalized according to the notation position in the voucher together with identification information 27 indicating the voucher to be journalized into different workers for digitization 8 is distributed and transmitted. For this reason, the distribution unit 22 divides the voucher image 7 according to the notation position of the character information included in the voucher image 7 and generates a plurality of voucher divided images (divided data) 29 (image dividing apparatus, image). Division function) 24.
  • the voucher divided image 29 includes a divided image 29a, OCR information 29b obtained by converting the divided image 29a into character data using OCR, and identification information (ID that associates the voucher divided image 29 with a specific voucher image 7, that is, the voucher 5. 27).
  • the image dividing unit 24 includes a function of classifying divided data by adding a category according to the notation position in the voucher and transmitting the divided data to different workers for each category (for each type and for each group).
  • the aggregation unit 23 acquires character information 28 divided and digitized by different workers 8, and generates journal object data 60 to be journalized by the journal unit 30 based on the identification information 27 from the character information 28 divided and digitized. . Therefore, the aggregation unit 23 acquires character information (divided digitized character information) 28 in which the character information included in the voucher divided image 29 is digitized by the worker 8 from each worker 8 and stores it in the identification information 27.
  • a generation unit (generation device, generation function) 25 that generates the journalizing target data 60 by aggregating the divided data character information 28 based on the generated data is included.
  • the process of digitizing information contained in the voucher 5 tends to increase man-hours.
  • a mechanism for reducing time and cost by adopting parallel work by a plurality of workers is adopted. Yes.
  • the voucher base paper 5 is converted into an electronic image by an image reading device or the like at the factory 11.
  • it is image-recognized, a rectangular shape is detected, and divided into rectangular units to form divided images 29a. Since the rectangle includes a collection of character string information, it can be converted into OCR information 29b if it can be read by OCR.
  • Existing image recognition software can be used for OCR.
  • FIG. 4 is a flowchart showing an outline of processing provided by the server 13 of the accounting support system 1.
  • the image reading unit 21 acquires the voucher image 7.
  • the distribution unit 22 implemented in the server 13 obtains the divided data (voucher divided image) 29 obtained by dividing the plurality of character information included in the journal voucher (voucher image) 7 according to the notation position in the voucher.
  • the information is distributed and transmitted to different workers 8 via the transmission / reception unit 14 for digitization.
  • the worker 8 on the cloud who has received the voucher divided image 29 independently performs digitized work of limited character information within the range included in the divided image 29.
  • step 153 the aggregation unit 23 mounted on the server 13 acquires (receives) the segmented digitized character information 28 digitized by different workers 8 on the cloud via the transmission / reception unit 14, Based on the identification information 27, the journal data 60 is generated from the divided electronic character information 28.
  • step 154 the automatic journal unit 80 performs journal processing for the journal target data 60.
  • FIG. 5 is a flowchart showing the process (step 152) in the image dividing unit 24 in more detail.
  • step 201 the orientation of the voucher base paper 5 converted into image data from the voucher image 7 is detected, and the rotation is corrected so that the orientation of the image 7 matches the orientation of the base paper 5.
  • A4 size paper may be used vertically or horizontally.
  • the character information described in the voucher 5 is in the horizontal direction, and it is necessary to cut out the divided images along the line of characters. Therefore, the orientation of the voucher image 7 is corrected so that the direction of the character information matches from the arrangement of characters.
  • step 202 the size of the voucher base paper 5 included in the voucher image 7 is automatically detected.
  • a library preset in accordance with the size is selected.
  • the library includes information about image division such as the size and position of a rectangle to be recognized for each size of the base paper 5.
  • step 203 the voucher image 7 is recognized by being divided into rectangles having a size set for each size of the base paper 5 by image processing.
  • the information recognized in units of rectangles is stored in a table, which is used as a rectangle list.
  • preprocessing such as an image filter may be added. This step is important for image segmentation, and a process for manual segmentation may be inserted when it is determined that automatic image segmentation is insufficient.
  • the rectangle list includes the result of dividing the voucher image 7 into small rectangles suitable for the size and dividing all areas.
  • a rectangle including character information is found from the rectangle list, and each image is cut out to generate a plurality of divided images 29a from one voucher image 7.
  • the divided image 29a is basically an accumulation of rectangular images registered in the rectangular list, and one rectangular image may be included in the plurality of divided images 29a, and such a state includes character information. Can be adjusted by the distribution map of the rectangular divided image 29a.
  • the divided image 29a may be classified (grouped) by adding a character information category based on information included in the library for each position, order, and size of the rectangle including the found character information.
  • a character information category based on information included in the library for each position, order, and size of the rectangle including the found character information.
  • the divided image 29a divided by the rectangle including the character information first appearing on the upper side and the left side of the voucher image 7 is likely to include character information related to the title of the voucher, and “title” is included in the divided image 29a.
  • the character information category may indicate a specific type of content such as a title, date, amount, etc., and is mapping information indicating the position, size, etc., of the divided image 29a cut out from the voucher image 7. Also good.
  • FIG. 6 shows an example of the divided image 29 a generated from the voucher image 7. It is also possible to assign a character information category to the divided image 29a such that the upper right divided image 29a is a voucher number, the upper left divided image 29a is a title, and the next left divided image 29a is a destination.
  • the character information category may not be attached to the divided image 29a, but the character information category may be automatically attached according to the recognition result by OCR described later or the input result by the worker 8, and the worker 8 manually converts the data into divided data.
  • a character information category may be added to the character information 28.
  • the divided image 29a may be a character string unit, and the notation collected in a table or the like may be a unit such as a table or a column.
  • identification information (ID) 27 is attached to each divided image 29a in step 205. It is desirable that the identification information 27 is a combination of identification information indicating the voucher 5 that has become a division source and identification information indicating individual divided images 29a.
  • the divided image 29a is character-recognized using character recognition software (OCR) to generate OCR data 29b.
  • OCR character recognition software
  • the voucher divided image 29 including the identification information 27, the divided image 29 a and the OCR data 29 b is distributed to the worker 8 via the Internet 9.
  • Each worker 8 looks at one of the divided images 29a shown in FIG. 6 and confirms the OCR data 29b. Therefore, each worker 8 only grasps the information of the fragment of the voucher 5, the contents of the voucher 5 do not leak to the worker 8, and the information regarding the accounting of the user 3 does not leak to the worker 8. Absent.
  • the worker 8 may perform the digitization work by distributing the divided image 29a of the voucher 5 indiscriminately, that is, regardless of the character information category.
  • the voucher divided image 29 having the same character information category to the worker 8, it is possible to perform the digitization work more efficiently. For example, if a worker 8 specializes in the work of digitizing the voucher divided image 29 related to the voucher title, the range of character information for interpreting the divided image 29a is limited, and the efficiency and accuracy are improved. If the voucher division image 29 relating to the amount is digitized, the character information can be limited to being a number, and the operation can be performed repeatedly, so that the operation efficiency and accuracy are easily improved.
  • a plurality of workers 8 are delivered with a rectangular divided image 29a and a character string (OCR data) 29b. However, without performing OCR, only the rectangular divided image 29a is delivered and the worker 8 converts the data into data. Also good.
  • the operator 8 visually checks the rectangular image (divided image) 29a and the character string 29b to confirm whether they are correct. If it is wrong, the character string 29b is corrected. When the character string 29b is not passed, the character string is manually input.
  • a plurality of divided images 29a are not delivered to the worker 8. This is to prevent the business information of the company to which the voucher belongs from being revealed. In the range where the single divided image 29a is handled, it is impossible to determine the business information of the company, and the confidentiality of the company information is ensured.
  • Fragmented character strings (divided data character information) 28 confirmed by the worker 8 are collected by the server 13 and collected into one piece of information.
  • the aggregation unit 23 of the server 13 collects character information (character information that has been digitized and divided data) that has been digitized by each worker 8 via the Internet 9 and is generated by the generation unit 25 as a journal entry target.
  • Data 60 is generated.
  • the journal entry data 60 includes a word sequence extracted from the transaction date, amount, and other character information of the voucher 5 to be entered.
  • the unit 80 that performs the journaling of the data mining unit 30 calculates the distance L in the multidimensional space between the journal object data 60 and the plurality of journal reference entries 70, and refers to the journal entry with the shortest distance L from the journal object data 60.
  • the account item of entry 70 is output as the journal.
  • the journal reference entry 70 is information obtained by converting the entry of a voucher that has been previously journalized by the user 3 as an entry, for example, a journal diary entry.
  • the data mining unit 30 automatically generates a journal reference entry 70 from the journal diaries 51 and 52, and generates a reference library 53.
  • Journalizing unit (automatic journaling function, automatic journalizing device) 80 that performs automatically.
  • the journal diary 51 to be converted may be the journal diary 51 used by each user 3 in the past accounting process, or the diary 52 including information journalized by the accounting support system 1. Good.
  • the conversion unit 31 generates a reference library 53 from the journal diaries 51 and 52 (which will be described below with reference to the journal diary 51), which is the past journal data of the user 3.
  • the reference library 53 is a database for finding the most similar past journal entry for a new voucher.
  • FIG. 7 shows the function of the conversion unit 31.
  • Each entry (diary entry) 51a in the journal diary 51 is managed by an ID 51b.
  • Each entry (journal reference entry) 70 in the reference library 53 is managed by a category 71 described later, and an ID 51b for tracking the entry 51a in the journal diary 51 is included as content.
  • Each entry 51a of the journal diary 51 includes a transaction date 51c, an amount 51d, a debit item 51e, a credit item 51f, a debit tax code 51g, a credit tax code 51h, a debit auxiliary item 51i, a credit auxiliary item 51j, and a summary 51k. included.
  • the conversion unit 31 includes a word extraction function 32, generates a word array 73 by dividing the information included in the summary 51k into word units, and outputs it as one of the keys of the journal entry entry 70.
  • the word extraction function 32 has a general Japanese syntax analysis function.
  • the journal entry entry 70 is linked to the entry 51a of the journal diary 51 through an ID 51b included as content (value).
  • the conversion unit 31 further includes a category generation unit (category generation function, category generation means) 33.
  • the category generation unit 33 refers to the item / category conversion table 34 and determines a category (journal category) 71 from the debit item 51e and the credit item 51f of the diary entry 51a.
  • Other information of the diary entry 51a for example, the debit auxiliary subject 51i, the credit auxiliary subject 51j, and the summary k are converted into a word array 73 in which the information is divided into words by the word extraction function 32.
  • the debit item 51e and the credit item 51f are included in the key information of the journal entry reference 70 as the category 71. Therefore, it is not included in the word array 73.
  • the debit subject 51e and the credit subject 51f may be included in the word array 73.
  • FIG. 8 shows an example of the subject / category conversion table 34.
  • This category (journal category) 71 is a parameter newly defined (independently) in the accounting support system 1.
  • the category 71 may be any information that is clearly and relatively easy to distinguish in accounting and can be easily distinguished without duplication.
  • the category 71 includes “income recording”, “expenditure recording”, “payment application”, and “withdrawal application”. These categories 71 are suitable for classifying the vouchers 5 according to their properties, and the corresponding category 71 of the voucher 5 can be easily and accurately determined from the title of the voucher 5 and the destination.
  • FIG. 9 is a block diagram showing the configuration of the journalizing unit 80.
  • the journal unit 80 includes an acquisition unit (acquisition function, acquisition means) 81 for acquiring journal object data 60 that is digitized information of the journal voucher 5, a plurality of journal reference entries 70, and journal object data 60.
  • a first journal that outputs the account items 51e and 51f of the journal reference entry 70 with the shortest distance L between the similarity determination unit (analysis determination function, similarity determination means) 82 for calculating the distance and the journal entry target data 60 as the journal entry.
  • a destination output unit (first journal destination output function, first journal destination output means) 86.
  • the journal entry data 60 includes a transaction date 64, an amount 65, and a word array 63 extracted from other character information.
  • the acquisition unit 81 extracts a word 63 a included in the word array 63 of the journal entry data 60.
  • the acquisition unit 81 automatically identifies an appropriate number of words, collocations, compound words, and the like from the word sequence even if the words included in the word sequence that is the metadata of the voucher 5 are not defined in advance. (Including compound words).
  • the similarity determination unit 82 multi-dimensionalizes the journal target data 60 and the journal reference entry 70 by using not only the transaction dates 64 and 51c and the amounts 65 and 51d but also the words included in the word arrays 63 and 73 as similar parameters.
  • the distance L between the journal object data 60 and the plurality of journal reference entries 70 is calculated. Therefore, in the analogy determination unit 82, the transaction dates 64 and 51c and the amounts 65 and 51d are used as essential parameters, but other information is appropriately extracted from the word array even if not defined in advance.
  • the extracted (extracted) words are used as similar judgment criteria based on the words, the meaning suggested by the words, and the order of the words. Therefore, the analogy between the journal reference entry 70 and the journal object data 60 can be automatically determined from various fields. Therefore, the journal unit 80 can automatically output the account item for journalizing the voucher 5 from which the journal object data 60 is extracted based on the past journal results with high accuracy.
  • This journal unit 80 constructs a metadata database based on the past journal diary or data corresponding thereto, and uses it as data (journal reference entry) 70 for finding the journal of the voucher 5.
  • the journal entry 70 has a plurality of keys, and the new voucher 5 can be converted into metadata. Therefore, the distance L between the journal object data 60 obtained by converting the voucher 5 into metadata and the journal reference entry 70 is evaluated, and the account item of the journal reference entry 70 having the closest distance is determined as the account item of the new voucher 5.
  • Neither the metadata contained in the journal reference entry 70 nor the metadata contained in the journal target data 60 need to be pre-assigned a common key in the journal unit 80, except for the date and amount required for the voucher 5.
  • the similarity of an arbitrary word extracted from an arbitrary word sequence is calculated as a distance L and evaluated.
  • the function for extracting words preferably has a general Japanese parsing function.
  • journal unit 80 the reference library 53 including the journal reference entry 70 is generated from the journal diary 51 which is the user's past journal data. Therefore, the journal unit 80 searches for the most recent past journal diary entry 51 a for the new voucher 5.
  • Each entry 51a in the journal diary 51 is managed by an ID 51b. By including the ID 51 b in the journal entry 70, a new voucher 5 can be journalized based on the past journal diary 51.
  • the journal reference entry 70 may be based on the past journal result of another user or the logic of majority vote on the Internet, instead of the past journal diary entry of the user 3. However, the information of the user 3 is opened. For this reason, it is desirable to determine the analogy by replacing the information that leads to the identification of the economic activity of the user 3 with general-purpose information using the category 71 described above.
  • the journal unit 80 has the difference between the amount 65 of the journal object data 60 and the amount 51d of the journal reference entry 70 within the range of the second threshold value Vt2, and the journal whose transaction date 51c is closest to the transaction date 64 of the journal object data 60. It includes a second journal output unit 87 that finds the reference entry 70 and outputs the account items 51e and 51f of the journal reference entry 70 as a journal. Specifically, the journal entry 70 that has an amount difference within ⁇ Vt2 is selected and sorted in ascending order. Among them, the one having a date difference within D days is selected, and the journal reference entry 70 having the closest date is found by sorting in the order of close date.
  • This second journal output unit (second journal output function, second journal output means) 87 is the distance L in the shortest journal reference entry 70 selected by the first journal output unit 86. Is larger than the first threshold value Vt1, and it is determined that it is determined that an accountingally significant journal cannot be selected. Transactions with little difference in transaction dates and similar amounts are likely to be the same or similar transactions, and the proof 5 that is the title of the transaction is likely to be able to be journalized to the same account item .
  • the similarity determination unit 82 of the journal unit 80 further includes a category to which the journal data 60 belongs (a journal category, a journal target category) based on at least a word indicating the title 63b and the destination 63c included in the word array 63 of the journal data 60. )
  • journal unit 80 extracts a title 63b and a destination 63c from the word array 63 based on position information (mapping information) of the word 63a of the word array 63 of the journal data 60, and extracts a title / address extraction unit (title / address extraction). Function) 83.
  • the title / address extraction unit 83 extracts the title 63b and the destination 63c from the journal data 60, the category determination unit 84 determines the transaction direction from the title 63b and the address 63c, and the category 61 from the transaction direction and the title 63b. Judging.
  • FIG. 10 shows a more detailed configuration of the title / address extraction unit 83 and the category determination unit 84 in a block diagram.
  • the title / address extraction unit 83 includes a function of detecting a sender in order to determine the address more clearly. That is, the title / address extraction unit 83 includes a title detection function 83a, a destination detection function 83b, a transmission source detection function 83c, and a company name / customer name determination function 83d.
  • the title detection function 83a searches for the title 63b from the journalizing target data 60 that is digitized based on the distribution map 55a including the position information of the title candidates prepared in the library 55 and the title dictionary 55b.
  • the title dictionary 55b includes words that are widely used as the title of the voucher 5 such as “invoice”, “purchase order”, “delivery note”, and “description”.
  • the address detection function 83b searches for the address 63c from the journalizing target data 60 that is digitized based on the distribution map 55a and the address prefix dictionary 55c based on the position information of the address candidates prepared in the library 55.
  • the prefix dictionary 55c includes words that are widely used to indicate destinations such as “Gion”, “To”, “Sama”, and “Line”.
  • the transmission source detection function 83c searches for the transmission source from the journalizing target data 60 that is digitized based on the distribution map 55a and the supplier dictionary 55d based on the location information of the transmission source candidates prepared in the library 55.
  • the supplier dictionary 55d includes the names of past suppliers of the user 3.
  • the journal entry data 60 Since the journal entry data 60 is input or reviewed by the operator 8, it includes character information that is much more accurate than character information obtained simply by OCR. Therefore, the title detection function 83a, the destination detection function 83b, and the source detection function 83c refer to the respective dictionaries 55b to 55d without using the position information on the voucher 5, and based on the character information, the title and destination Further, the sender may be determined.
  • the company name / customer name determination function 83d determines whether the user 3 is a transmission source or a destination. A match search may be performed in character units, a longest match search may be performed, and a matching value may be determined.
  • the sender If the sender has the company name and the destination does not have the company name, the sender is determined to be the company and the destination is determined to be the business partner. If the sender does not have the company name and the destination has the company name, the sender is determined to be the business partner and the destination is determined to be the company. If there is a company name at the source and destination, or there is no company name at the source and destination, automatic determination is impossible and manual determination is performed.
  • the category determination function 84 includes a transaction direction determination unit 84a and a category selection unit 84b.
  • the transaction direction determination unit 84a determines the direction of the transaction by referring to the transaction direction determination table 56 based on whether the destination 63c and the transmission source 63d are the company name or the customer name and the title 63b. If the title 63b is an invoice and the destination 63c is a supplier name, the transaction direction 69 is outward (OUT). If the title 63b is an invoice and the destination 63c is the company name, the transaction direction 69 is in (IN). When the transaction direction 69 is OUT, the issuer of the voucher 5 is defined as the company, and IN is defined as the issuer of the voucher 5 is the customer.
  • the category selection unit 84 b determines the category 61 based on the title / category conversion table 57 from the title 63 b and the transaction direction 69. For example, if the title 63b is an invoice and the transaction direction 69 is OUT, the category 61 is determined to be income.
  • the category 61 of the journal object data 60 is compared with the category 71 of the journal reference entry 70. As shown in FIG. 8, if the categories 61 and 71 are accounted for as revenue, the account item is determined to be one.
  • the category 61 is determined to be expenditure recording.
  • the category 61 of the journal object data 60 is compared with the category 71 of the journal reference entry 70. As shown in FIG. 8, if the categories 61 and 71 are recorded as expenses, there are several candidates for the account item. . Accordingly, the distance calculation unit 85 shown in FIG. 9 selects the journal reference entry 70 in which the category 71 is recorded as an expense from the journal reference entries 70 in the reference database (reference library) 53, and calculates the distance L. To do.
  • categories 61 and 71 are meaningful in accounting processing such as income recording and expenditure recording, and a plurality of account items can be grouped into any category. Therefore, by determining the category 61 first, it is possible to reduce the number of journal entry entries 70 that are referred to in order to determine the account item. When judging only by the distance L, the judgment accuracy when the distance L is small is very high. On the other hand, the error when the distance L is large is also large. By determining the category 61 and limiting the number of related entries 70, the possibility of misjudgment of the account item is reduced, and even if it is misjudged, the mistake is reduced. Furthermore, by determining category 61 by determining category 61, it is ensured that the balance in the meaning of accounting processing is correct.
  • This journal unit 80 employs a mechanism that automatically determines the transaction direction 69 from the title 63b and the destination 63c of the voucher 5.
  • the transaction direction 69 refers to information indicating whether the source of the voucher 5 is its own company or a business partner when the company to which the voucher 5 belongs is mainly used.
  • FIG. 11 is a flowchart showing an outline of processing (automatic journaling, step 154) in the automatic journalizing unit 80.
  • These processes are realized by executing a program (program product) for automatic sorting in an apparatus having computer resources including a CPU and a memory such as the server 13.
  • the program (program product) for automatic sorting can be supplied via the Internet 9 or can be provided by being recorded on an appropriate recording medium such as a DVD or a flash memory.
  • a database (journal diary) 50 (52) storing electronic information (data to be journalized) 60 of the voucher 5, a reference database 53 including journal reference entries 70, and other libraries are a single computer. It may be stored in a (server) memory, or may be distributed and stored in a plurality of servers connected via a computer network or the Internet.
  • the journal unit 80 acquires the journal data 60 by the acquisition unit 81 in step 211.
  • the journal object data 60 is digitized data of the voucher 5 and includes a transaction date 64, an amount 65, and a word array 63 extracted from other character information of the voucher 5 to be journalized.
  • the step 211 of acquiring data may include a process of generating the journalizing target data 60 by aggregating the divided data character information 28 digitized by different workers based on the identification information 27.
  • step 212 the acquisition unit 81 further extracts the word 63a from the word array 63 using the Japanese parsing function.
  • step 213 the title / address extracting unit 83 extracts the title 63b and address 63c of the voucher 5 from the words extracted from the word array 63.
  • the title / address extracting unit 83 further extracts the sender 63d.
  • the transaction direction determination unit 84a of the category determination unit 84 refers to the transaction direction determination table 56 to determine the transaction direction 69 from the title 63b, the address 63c, and the transmission source 63d.
  • the category selection unit 84 b refers to the title / category conversion table 57, selects the category 61 of the journal entry data 60 from the title 63 b and the transaction direction 69, and adds it to the journal entry data 60.
  • the distance calculation unit 85 selects the journal entry 70 having the same category 71 as the category 61 of the journal data 60 as a key, and calculates the distance L between them. If the distance L is smaller than the first threshold value Vt1 in step 217, in step 218, the account item of the shortest journal entry entry 70 is output as the account item of the journal object data 60, and automatic verification of the voucher 5 of the journal object data 60 is performed. Sorting is complete. On the other hand, if the distance L is equal to or greater than the first threshold value Vt1, the accuracy of the journal entry by the distance calculation is low. Therefore, in step 219, among the journal reference entries 70 in the same category, the journal reference entry 70 having a difference in amount from the journal data 60 smaller than the second threshold value Vt2 and the latest transaction date is selected. , The account item is output.
  • this accounting support system 1 converts the data of the voucher by a third party via the cloud (computer network), acquires the character information divided into divided data, and collects it from the collected character information. Generate target data.
  • the character information converted into divided data is information obtained by dividing a plurality of character information included in a voucher to be journalized according to a notation position in the voucher, and the information included in one voucher is electronically converted by a plurality of different workers. Turn into. For this reason, one worker only looks at a piece of data contained in the voucher and uses the network (cloud) personnel to digitize the voucher while ensuring the confidentiality of the information contained in the voucher. It is possible to digitalize voucher data safely at low cost.
  • the accounting support system 1 includes a journal unit 80 that automatically performs journal entries.
  • the account item determination process is automated, the work of the accounting specialist converges only to confirm the final result. Therefore, accounting processing including journal entries can be performed by a minimum number of accounting specialists.
  • the account item is determined by the account item determination unit of the character string group that has been transferred from the digitization process. You can select either automatic processing using information technology or manual judgment by an accounting professional. This system optimizes the journal entry process and can greatly improve the accounting processing capacity. As a result, the operation cost can be greatly reduced.

Abstract

Provided is an accounting assistance system (1) having a classification unit (80), a distribution unit (22), and an aggregation unit (23). The classification unit outputs a classification destination for a voucher serving as evidence of a user transaction. The distribution unit distributes and transmits divided data (29), together with identification information indicating the voucher to be classified, to different workers (8) for the purpose of digitization, said divided data having been divided on the basis of the position, within the voucher to be classified, of multiple items of character information in the voucher. The aggregation unit obtains the divided and digitized character information (28) that has been digitized by the different workers (8) and, on the basis of the identification information, generates, from the divided and digitized character information, classification subject data (60) to be classified by the classification unit.

Description

会計支援システムAccounting support system
 本発明は、会計業務を支援するシステムに関するものである。 The present invention relates to a system that supports accounting operations.
 日本国特開2014-235484号には、証憑のデータをWeb端末から送信するだけでその証憑に示される取引の仕訳結果をユーザがリアルタイムに得ることが可能なクラウド型のシステムを提供することが記載されている。このシステムは、仕訳解析サービスの処理を実行するサーバと、商品名と商品グループとを対応付けて記憶する第1マスタと、商品グループと勘定科目とを1対としてその対での仕訳パターンによる仕訳処理人数を記録する第2マスタと、を含む全ユーザ共用のマスタが格納されたデータベースと、を備え、前記サーバは、仕訳解析サービスを要求するWeb端末から送信された証憑のデータを解析して仕訳要素情報を抽出する手段と、その要素情報に含まれる商品名に対応する商品グループを第1マスタから得て、その商品グループに対応する前記第2マスタ内の勘定科目の中で全ユーザの仕訳処理人数が一番多い勘定科目を選択して仕訳を生成し、その仕訳を推奨仕訳として提示する手段と、を備えた構成としている。 Japanese Patent Application Laid-Open No. 2014-235484 provides a cloud-type system in which a user can obtain a journal entry result of a transaction indicated in a voucher in real time simply by transmitting the voucher data from a Web terminal. Are listed. This system includes a server that executes processing of a journal analysis service, a first master that stores a product name and a product group in association with each other, and a journal by a journal pattern of the product group and account item as a pair. A database storing a master shared by all users including a second master that records the number of persons to be processed, and the server analyzes the voucher data transmitted from the Web terminal that requests the journal analysis service. Means for extracting journal element information, and a product group corresponding to the product name included in the element information is obtained from the first master, and among all account items in the second master corresponding to the product group, Means for generating a journal by selecting an account item having the largest number of journal processing persons and presenting the journal as a recommended journal.
 仕訳をさらに効率よく行うことができる会計支援システムが求められている。 There is a need for an accounting support system that can make journaling more efficient.
 本発明の一態様は、ユーザの取引の証拠となる証憑の仕訳先を出力する仕訳ユニットと、仕訳対象の証憑に含まれる複数の文字情報が証憑中の表記位置により分割された分割データを、仕訳対象の証憑を示す識別情報とともに、電子化のために異なる作業者に分散して送信する分散ユニットと、異なる作業者により電子化された、分割電子化(分割および電子化)された文字情報を取得し、分割電子化された文字情報から識別情報に基づき仕訳ユニットにより仕訳される仕訳対象データを生成する集約ユニットとを有するシステムである。 In one aspect of the present invention, a journal unit that outputs a journal entry of a voucher that is evidence of a user's transaction, and divided data obtained by dividing a plurality of character information included in the journal to be journalized according to a notation position in the voucher, Along with the identification information indicating the voucher to be journalized, a distributed unit that is distributed and transmitted to different workers for digitization, and character information that has been digitized (divided and digitized) by different workers. And an aggregation unit that generates journal data to be journalized by the journal unit based on the identification information from the divided digitized character information.
 このシステムにおいては、1つの証憑に含まれる情報を複数の異なる作業者により電子化する。このため、一人の作業者は証憑に含まれるデータの断片を見るだけとなる。したがって、秘匿性を担保しながら、複数の作業者により、効率よく証憑の電子化が可能となる。 In this system, information contained in one voucher is digitized by a plurality of different workers. For this reason, one worker only sees a piece of data contained in the voucher. Therefore, the voucher can be efficiently digitized by a plurality of workers while ensuring confidentiality.
 さらに、分散ユニットにおいては、証憑のデータを分散し、秘匿性の高い状態で送信するので、コンピュータネットワーク(クラウド)、典型的にはインターネットを介して安全に送受信することが可能となる。このため、ネットワーク(クラウド)に接続可能な人員を証憑の電子化に利用でき、低コストで、安全に証憑データを電子化できる。 Furthermore, since the voucher data is distributed and transmitted in a highly confidential state in the distributed unit, it can be transmitted and received safely via a computer network (cloud), typically the Internet. For this reason, a person who can connect to the network (cloud) can be used for digitizing the voucher, and voucher data can be digitized safely at low cost.
 分散ユニットは、分割データを証憑中の表記位置により分類し、たとえば表示位置によるカテゴリを加え、種類毎に異なる作業者に分散して送信するユニットを含んでもよい。作業者が同一のカテゴリ(種類、グループ、タイプ)の分割データを電子化することになるので、電子化作業の効率を向上できる。 The distribution unit may include a unit that classifies the divided data according to the notation position in the voucher, adds a category based on the display position, and transmits the divided data to different workers for each type. Since the worker digitizes the divided data of the same category (type, group, type), the efficiency of the digitization work can be improved.
 分散ユニットは、仕訳対象の証憑を文字情報の表記位置により分割して画像化した複数の証憑分割画像を生成するユニットと、複数の証憑分割画像を含む分割データを、コンピュータネットワークを介して異なる作業者に送信するユニットとを含んでいてもよい。送信するユニットは、インターネットを介して異なる作業者へ分割データを送信してもよい。インターネット(クラウド)を介して場所的にも分散した異なる作業者により、独立し
て、証憑に含まれるデータを分散して電子化することにより、いっそう安全に、証憑データを電子化できる。
The distribution unit is configured to generate a plurality of voucher divided images obtained by dividing an image of a journal to be journalized according to a character information notation position and a divided data including the plurality of voucher divided images through different operations via a computer network. And a unit for transmitting to a person. The transmitting unit may transmit the divided data to different workers via the Internet. By separating and digitizing the data contained in the voucher independently by different workers distributed in places via the Internet (cloud), the voucher data can be digitized more safely.
 仕訳対象の証憑の取引日、金額、および他の文字情報から抽出された単語配列を含む仕訳対象データを取得する仕訳ユニットは、仕訳対象データと複数の仕訳参照エントリとの距離を、取引日、金額、単語配列に含まれる各単語の相似をパラメータとして計算する類比判断ユニットとを含んでもよい。複数の仕訳参照エントリは、ユーザの過去の仕訳済みの証憑の情報をエントリとして含む帳簿のエントリ毎の取引日、金額、および他の文字情報から抽出された単語配列をそれぞれ含む。仕訳ユニットは、さらに、仕訳対象データとの距離が最短の仕訳参照エントリの勘定科目を仕訳先として出力する第1の仕訳先出力ユニットを含んでもよい。 The journal unit that obtains the journal data including the word sequence extracted from the transaction date, amount, and other text information of the journal voucher will determine the distance between the journal data and multiple journal reference entries, the transaction date, An analogy determination unit that calculates the amount of money and the similarity of each word included in the word array as a parameter may be included. Each of the plurality of journal entry entries includes a transaction date, an amount, and a word array extracted from other character information for each entry in the book including information on past vouchers that have been journalized by the user as entries. The journal unit may further include a first journal output unit that outputs the account item of the journal reference entry having the shortest distance from the journal data as a journal.
 この仕訳ユニットにおいては、過去の仕訳結果である仕訳参照エントリ、および仕訳対象の仕訳対象データを、取引日、金額、および単語配列を含むメタデータとして取得する。さらに、類比判断ユニットは、それらのメタデータに含まれる要素、特に、単語配列に含まれる各単語を相似が判断できる単語に切りだし、複数の仕訳参照エントリと仕訳対象データとを、取引日、金額および複数の相似判断できる単語をパラメータとする多次元空間にマッピングし、仕訳参照エントリと仕訳対象データとの距離を計算する。そして、第1の仕訳先出力ユニットは、仕訳対象データとの距離が最短の仕訳参照エントリの勘定科目を仕訳先として出力する。このため、メタデータの単語配列に含まれる単語が予め定義されていなくても、単語配列から適当な語数、連語、複合語などを自動的に識別して単語(連語、複合語なども含む)を抽出し、それらの単語同士や、単語が示唆する意味などを相似の判断基準として多方面から仕訳参照エントリと仕訳対象データとの類比を自動的に判断できる。したがって、過去の仕訳結果に基づき、仕訳対象データが抽出された証憑を仕訳する勘定科目を高い精度で自動的に出力できる。 In this journal unit, the journal reference entry that is the past journal result and the journal object data to be journaled are acquired as metadata including the transaction date, the amount, and the word array. Furthermore, the similarity determination unit cuts out the elements included in the metadata, in particular, each word included in the word array into words that can be determined by similarity, and converts a plurality of journal reference entries and journal target data into transaction dates, A distance between a journal reference entry and journal data is calculated by mapping the amount and a plurality of words that can be judged to be similar to a multidimensional space. Then, the first journal entry output unit outputs the account item of the journal reference entry having the shortest distance from the journal entry data as the journal entry. For this reason, even if the word included in the word array of the metadata is not defined in advance, an appropriate number of words, collocations, compound words, etc. are automatically identified from the word array (including collocations, compound words, etc.). , And the analogy between the journal reference entry and the journal data can be automatically determined from various directions using similarities between the words and the meaning suggested by the words. Therefore, it is possible to automatically output the account item for journalizing the voucher from which the journal entry data is extracted based on the past journal result with high accuracy.
 仕訳ユニットは、第1の仕訳先出力ユニットにより選択された最短の仕訳参照エントリとの距離が第1の閾値よりも大きいときは、仕訳対象データの金額との差が第2の閾値内で最も取引日が近い仕訳参照エントリの勘定科目を仕訳先として出力する第2の仕訳先出力ユニットを含んでいてもよい。金額と日付とは証憑の仕訳先を決定する最も有効な類比判断項目に含まれる。したがって、他の要素により、仕訳対象データと仕訳参照エントリとの距離が離れてしまいすぎる場合は、他の要素を外して類比判断することにより最適な勘定科目を出力できるケースがある。 When the distance between the journal unit and the shortest journal reference entry selected by the first journal output unit is larger than the first threshold, the difference from the amount of the journal data is the largest within the second threshold. A second journal output unit that outputs the account item of the journal reference entry having a close transaction date as the journal may be included. The amount and date are included in the most effective analogy judgment items that determine the journal entry for the voucher. Accordingly, when the distance between the journal entry data and the journal reference entry is too far away due to other factors, there are cases in which the optimal account item can be output by removing the other factors and making a comparison.
 勘定科目はいくつかのカテゴリに分けることが可能である。複数の仕訳参照エントリの勘定科目は複数のカテゴリに分けられ、カテゴリは少なくとも1つの勘定科目を含む。複数の仕訳参照エントリのそれぞれがカテゴリの情報を含んでもよい。類比判断ユニットは、仕訳対象データの単語配列に含まれるタイトルおよび宛先を少なくとも示す単語に基づき決定されたカテゴリと同一のカテゴリの仕訳参照エントリと仕訳対象データとの距離を計算するカテゴリ別類比判断ユニットを含んでもよい。距離計算する仕訳参照エントリの数を共通するカテゴリを用いて限定することにより計算時間を短縮でき、類比判断の精度も向上できる。カテゴリは、取引方向の相違を区別するものであってもよく、証憑のタイトルの相違を区別するものであってもよい。 Account items can be divided into several categories. The accounts of the plurality of journal entry entries are divided into a plurality of categories, and the categories include at least one account. Each of the plurality of journal reference entries may include category information. The similarity determination unit is a category-based similarity determination unit that calculates the distance between the journal reference entry of the same category as the category determined based on at least the title and the word included in the word array of the journal object data and the journal object data. May be included. By limiting the number of journal entry entries for distance calculation using a common category, the calculation time can be shortened, and the accuracy of similarity determination can be improved. The category may distinguish a difference in transaction direction, or may distinguish a difference in titles of vouchers.
 証憑類の宛先またはタイトルをカテゴリという情報に置き換え、類比判断するときはメタデータからこれらの情報を削除してもよく、クラウドで仕訳先を多数決で決定するようなシステムにおいては、ユーザの証憑類の情報がクラウド上に拡散することを抑制できる。 If the recipient or title of the voucher is replaced with information called a category and the comparison is judged, the information may be deleted from the metadata. In a system where the journal is determined by majority vote in the cloud, the user's voucher Can be prevented from spreading on the cloud.
 カテゴリという情報は、証憑から仕訳対象データを抽出する際に証憑のタイトルなどか
ら求めて予め仕訳対象データに含められていてもよく、仕訳ユニットがカテゴリ判定ユニットを含んでいてもよい。カテゴリ判定ユニットは、仕訳対象データの単語配列に含まれるタイトルおよび宛先を少なくとも示す単語に基づき、仕訳対象データがいずれかのカテゴリに属するかを決定する。
The information “category” may be obtained from the title of the voucher when extracting the journal entry data from the voucher and included in the journal entry data in advance, and the journal entry unit may include a category determination unit. The category determination unit determines which category the journal entry data belongs to based on at least a word indicating the title and destination included in the word array of the journal entry data.
 仕訳対象データの単語配列に含まれる単語の少なくとも一部がその単語の証憑中に表記された位置情報、たとえば、右上、中央上、左上、右下などを含んでいれば、仕訳ユニットは、単語の位置情報に基づき、単語配列からタイトルおよび宛先を抽出するタイトル・宛名抽出ユニットを含んでいてもよい。タイトルは証憑の中央上、宛先は証憑の左上など、証憑上のいずれの位置に表示されるかは傾向があり、位置情報を参照することにより、証憑類のタイトル、宛先を自動的に判断する精度を向上できる。 If at least a part of the words included in the word array of the journal entry data includes position information described in the proof of the word, for example, upper right, upper center, upper left, lower right, etc., the journalizing unit A title / address extracting unit for extracting a title and a destination from a word array based on the position information of the word sequence may be included. There is a tendency for the title to be displayed at the center of the voucher and the destination to be displayed on the voucher, such as the top left of the voucher. By referring to the location information, the title and destination of the voucher are automatically determined. Accuracy can be improved.
 本発明の他の態様の一つは、コンピュータを、ユーザの取引の証拠となる証憑の仕訳先を出力する仕訳ユニットを有するシステムとして動作させるプログラム(プログラム製品)である。プログラムは、コンピュータを、コンピュータに入力されたユーザの過去の帳簿のデータから、複数の仕訳参照エントリを含む仕訳済みデータベースを生成する手段として、さらに動作させるユニット(機能ユニット)を含んでいてもよい。プログラム(プログラム製品)は適当な記録媒体に記録して提供することが可能である。 One of the other aspects of the present invention is a program (program product) that causes a computer to operate as a system having a journal unit that outputs a journal entry for evidence that is evidence of a user's transaction. The program may include a unit (functional unit) that causes the computer to further operate as means for generating a journalized database including a plurality of journal reference entries from the user's past book data input to the computer. . The program (program product) can be recorded on an appropriate recording medium and provided.
 本発明の、さらに異なる他の態様の1つは、コンピュータによりユーザの取引の証拠となる証憑の仕訳先を出力することを含む方法である。システムは、コンピュータがインターネットを介して複数の作業者とデータを交換する送受信ユニットを含み、当該方法は以下のステップを有する。
1.コンピュータが、仕訳対象の証憑に含まれる複数の文字情報が証憑中の表記位置により分割された分割データを、仕訳対象の証憑を示す識別情報とともに、電子化のために異なる作業者に、送受信ユニットを介して分散して送信すること。
2.異なる作業者により電子化された、分割電子化された文字情報を、送受信ユニットを介して取得し、分割電子化された文字情報から識別情報に基づき、仕訳先を出力するステップにおいて処理される仕訳対象データを生成すること。
One further different aspect of the present invention is a method that includes outputting a voucher journal entry that is evidence of a user's transaction by a computer. The system includes a transmission / reception unit in which a computer exchanges data with a plurality of workers via the Internet, and the method includes the following steps.
1. The computer transmits the divided data obtained by dividing a plurality of pieces of character information included in the voucher to be journalized according to the notation position in the voucher together with identification information indicating the voucher to be journaled to different workers for digitization. To be distributed and transmitted via
2. Journal information that is digitized by different workers, acquired through the transmission / reception unit, and processed in the step of outputting a journal based on identification information from the segmented digitized character information Generate target data.
 分散して送信するステップは、分割データを証憑中の表記位置により分類し、種類毎に異なる作業者に分散して送信するステップを含んでもよい。また、分散して送信するステップは、仕訳対象の証憑を文字情報の表記位置により分割して画像化した複数の証憑分割画像を含む分割データを異なる作業者に送信するステップを含んでもよい。 The step of distributing and transmitting may include the step of classifying the divided data according to the notation position in the voucher and distributing and transmitting to different workers for each type. Further, the step of transmitting in a distributed manner may include the step of transmitting divided data including a plurality of voucher divided images obtained by dividing the journal to be journalized according to the notation position of the character information into different workers.
 コンピュータは、メモリ上に、ユーザの過去の仕訳済みの証憑の情報をエントリとして含む帳簿のエントリ毎の取引日、金額、および他の文字情報から抽出された単語配列をそれぞれ含む、複数の仕訳参照エントリを含む仕訳済みデータベースを有してもよく、さらに、仕訳先を出力するステップは以下のステップを含んでもよい。
・コンピュータが、仕訳対象の証憑の取引日、金額、および他の文字情報から抽出された単語配列を含む仕訳対象データを取得すること。
・取引日、金額、単語配列に含まれる各単語の相似をパラメータとして、仕訳済みデータベースの複数の仕訳参照エントリと仕訳対象データとの距離を計算すること。
・仕訳対象データとの距離が最短の仕訳参照エントリの勘定科目を仕訳先として出力すること。
The computer includes a plurality of journal references each including a word sequence extracted from the transaction date, amount, and other character information for each entry in the book that includes the user's past journalized voucher information as an entry in the memory. You may have a journalized database that includes entries, and the step of outputting the journal may include the following steps.
-The computer obtains the journal data including the word date extracted from the transaction date, amount, and other character information of the voucher to be journalized.
-Calculate the distance between multiple journal reference entries in the journalized database and journal entry data using the transaction date, amount, and similarity of each word included in the word array as parameters.
-Output the account of the journal entry with the shortest distance from the journal entry data as the journal.
 距離を計算することは、仕訳対象データの単語配列に含まれるタイトルおよび宛先を少なくとも示す単語に基づき決定されたカテゴリと同一のカテゴリの仕訳参照エントリと仕訳対象データとの距離を計算することを含んでもよい。 Calculating the distance includes calculating the distance between the journal reference entry and the journal target data in the same category as the category determined based on at least the title and destination word included in the word array of the journal target data. But you can.
 取得することは、仕訳対象の証憑に含まれる複数の文字情報が、証憑中の表記位置により分割された後に異なる作業者により電子化された情報(分割および電子化された文字情報)を取得し、分割データ化された文字情報から仕訳対象の証憑を示す識別情報に基づき仕訳対象データを生成することを含んでもよい。 Acquiring is to acquire information (division and digitized character information) digitized by different workers after a plurality of character information included in the voucher to be journalized is divided according to the notation position in the voucher. The method may include generating journal entry data based on the identification information indicating the voucher of the journal entry from the character information converted into the divided data.
会計支援システムの概要を示すブロック図。The block diagram which shows the outline | summary of an accounting assistance system. 証憑を受け入れる工場の概要を示すブロック図。The block diagram which shows the outline | summary of the factory which accepts a voucher. サーバの概要を示すブロック図。The block diagram which shows the outline | summary of a server. サーバの処理の概要を示すフローチャート。The flowchart which shows the outline | summary of a process of a server. 証憑画像を分割して作業者に配布するプロセスを示すフローチャート。The flowchart which shows the process which divides | segments a voucher image and distributes it to an operator. 証憑が分割される例を示す図。The figure which shows the example where a voucher is divided | segmented. 過去の仕訳帳から仕訳参照エントリを生成する変換ユニットの機能を示すブロック図。The block diagram which shows the function of the conversion unit which produces a journal reference entry from the past journal. 科目/カテゴリ変換テーブルの一例。An example of a subject / category conversion table. 仕訳ユニットの概要を示すブロック図。The block diagram which shows the outline | summary of a journal entry unit. タイトル・宛名抽出ユニットおよびカテゴリ判定ユニットの概要を示すブロック図。The block diagram which shows the outline | summary of a title and address extraction unit, and a category determination unit. 仕訳ユニットの処理の概要を示すフローチャート。The flowchart which shows the outline | summary of the process of a journal entry unit.
 図1に、会計支援システムの一例を示している。この会計支援システム(会計支援装置)1は、複数のユーザ3の証憑(証憑類)、たとえば経費精算の証憑類5の整理と仕訳作業とを行うシステムである。ユーザ3は、たとえば、この会計支援システム1を利用する会計事務所、税理士事務所などの顧問先の個人、会社、その他の組織であってもよい。この会計支援システム1は、証憑類原本5の電子化と、原本管理と、さらに、仕訳作業とを含むサービスを提供する。また、会計支援システム1は、膨大な電子化の作業を低コストで処理するためにインターネット(クラウド)9を経由して接続する複数の遠隔作業者8を利用する。なお、本明細書において電子化とは、手書きの文字情報、印刷された文字情報などの情報であって、証憑(証憑類)に記載された、あるいは記載されうる情報をコンピュータで稼働するソフトウェアにおいて処理可能なデータ、すなわち、電子データ、デジタルデータなどに変換することを示す。 Fig. 1 shows an example of an accounting support system. This accounting support system (accounting support apparatus) 1 is a system for organizing and journalizing a plurality of users 3 vouchers (certificates), for example, expense settlement vouchers 5. The user 3 may be, for example, a personal accountant, company, or other organization such as an accounting office or a tax accountant office that uses the accounting support system 1. The accounting support system 1 provides services including digitization of the voucher original 5, original management, and journaling work. In addition, the accounting support system 1 uses a plurality of remote workers 8 connected via the Internet (cloud) 9 in order to process enormous digitization work at low cost. In this specification, digitization means information such as handwritten character information, printed character information, etc., which is written in a voucher (voucher) or software that operates on a computer. This indicates that the data is converted into processable data, that is, electronic data, digital data, or the like.
 このシステム1においては、インターネット9を介して会計支援システム1に接続した遠隔作業者8の端末に、証憑書類5の一部分を画像データ化して分散して受け渡す。遠隔作業者8は、証憑書類5の一部分を電子化する作業、またはその検証を行う。会計支援システム1は、遠隔作業者8の作業結果を、インターネット9を介して集約したあと、仕訳判定を行う。会計支援システム1は、専門性は必要ないが工数のかかるデータ化作業と、会計専門性を必要とする仕訳作業との分離をすることで、作業の効率化を図ることができ、全体として大幅に会計処理力を向上させることができる。 In this system 1, a part of the voucher document 5 is converted into image data and distributed to the terminal of the remote worker 8 connected to the accounting support system 1 via the Internet 9. The remote worker 8 performs work for digitizing a part of the voucher document 5 or verifies it. The accounting support system 1 aggregates the work results of the remote worker 8 via the Internet 9 and then determines the journal entry. The accounting support system 1 can improve the efficiency of the work by separating the data conversion work that does not require specialization but requires man-hours from the journalizing work that requires accounting specialization. It is possible to improve the accounting power.
 証憑5は、証憑類、証憑書類とも呼ばれ(本明細書においても記載されることがある)、領収書、請求書、納品書、注文書、送り状、支払証明書などの取引の証拠となる書面であり、証憑5に記載された取引の内容をもとに会計帳簿を作成する。また、証憑5は、各ユーザ3において整理および所定の期間の保管が義務付けられている。 The voucher 5 is also called voucher or voucher document (may be described in this specification), and provides evidence of transactions such as receipts, invoices, invoices, purchase orders, invoices, payment certificates, etc. An accounting book is prepared based on the contents of the transaction described in the voucher 5. The voucher 5 is obliged to be organized and stored for a predetermined period by each user 3.
 現在、企業会計の世界では、会計知識を持った人材が、証憑5を見て直接仕訳を行っている。会計支援システム1では、抜本的にその方法を変えた新しい方式を採用する。会計支援システム1は、大きく分けて電子データ化のプロセスと、仕訳プロセスとを有する。電子化プロセスは、証憑5から文字列を抽出し、電子化することが主な作業である。純粋
な文字列の電子化作業であり、会計の専門知識を全く必要としない。一方、仕訳プロセスは、電子化されたデータを元に、仕訳作業を行い、勘定科目を決定する作業であり、会計の専門知識が要求される作業である。この会計支援システム1においては、電子化されたデータを用いることにより、情報技術を使い、仕訳作業は自動化することを可能とする。最終結果を、各ユーザ3の業務に精通した会計士、税理士などの会計の専門知識を有した会計専門家が確認するプロセスを設けることも可能であり、自動仕分け作業の結果を検証し、仕訳精度を保証することができる。
Currently, in the world of corporate accounting, human resources with accounting knowledge look at the voucher 5 and make direct journal entries. The accounting support system 1 adopts a new method that radically changes the method. The accounting support system 1 roughly includes an electronic data conversion process and a journalizing process. The main process of the digitization process is to extract a character string from the voucher 5 and digitize it. It is pure string digitization and does not require any accounting expertise. On the other hand, the journalizing process is a task that performs journaling work based on digitized data and determines account items, and requires accounting expertise. In this accounting support system 1, by using the digitized data, the journalizing work can be automated using information technology. It is also possible to set up a process to check the final results by accounting professionals with accounting expertise such as accountants and tax accountants who are familiar with the work of each user 3, verifying the results of automatic sorting work, journaling accuracy Can be guaranteed.
 会計支援システム1は、ユーザ3から提供される証憑5を受け入れて整理・確認および画像化する工場部門(工場)11と、証憑5を原本6の状態で保管する倉庫部門(倉庫)12と、証憑5の画像化されたデータ(証憑画像)7をもとに、証憑5を電子化し会計仕訳を行うデータ処理部門(サーバ)13とを含む。会計支援システム1においては、証憑5の原本6が倉庫12に保管されるとともに、ユーザ3は、サーバ13の電子化された証憑5のデータにインターネット9を介してアクセスできる。さらに、ユーザ3は、必要に応じ、サーバ13の電子データに基づいて倉庫12に保管されている証憑5の原本6を取り寄せたり、倉庫12において参照したりすることができる。 The accounting support system 1 includes a factory department (factory) 11 that accepts the voucher 5 provided by the user 3, organizes, confirms, and images it, and a warehouse department (warehouse) 12 that stores the voucher 5 in the state of the original 6. A data processing department (server) 13 that digitizes the voucher 5 and makes accounting entries based on the imaged data (voucher image) 7 of the voucher 5 is included. In the accounting support system 1, the original 6 of the voucher 5 is stored in the warehouse 12, and the user 3 can access the electronic voucher 5 data of the server 13 via the Internet 9. Furthermore, the user 3 can obtain the original 6 of the voucher 5 stored in the warehouse 12 based on the electronic data of the server 13 or refer to the warehouse 12 as necessary.
 サーバ13は、メモリ、CPUなどのコンピュータ資源を有し、プログラム(プログラム製品)により提供された機能を実現する。このサーバ13は、コンピュータネットワーク(インターネット、クラウド)9を介して遠隔作業者8の端末との間でデータを送受信するユニット14を含む。サーバ13は、さらに、証憑画像7を、遠隔作業者8を利用して電子化するデータ入力処理機能(データ入力処理ユニット)20と、電子化された情報を用いて、あるいは情報に対し勘定科目などの会計情報を追加するデータマイニング機能(データマイニングユニット)30と、電子化されたデータを格納するデータベース50と、電子化されたデータをユーザ3にネットワーク9を介して提供するデータ表示提供機能(データ表示ユニット)40とを含む。 The server 13 has computer resources such as a memory and a CPU, and realizes a function provided by a program (program product). The server 13 includes a unit 14 that transmits and receives data to and from the terminal of the remote worker 8 via a computer network (Internet, cloud) 9. The server 13 further uses a data input processing function (data input processing unit) 20 for digitizing the voucher image 7 using the remote worker 8 and the digitized information or accounts for the information. A data mining function (data mining unit) 30 for adding accounting information such as, a database 50 for storing digitized data, and a data display providing function for providing the digitized data to the user 3 via the network 9 (Data display unit) 40.
 図2に、工場11の処理の概要を示している。各ユーザ3の各社員103は、証憑5と経費精算ラベル105とを仕訳できる適当な容器、例えば、チャック付のポリ袋(チャックポリ)107に入れる。チャックポリ107により社員ごとに仕訳された証憑5は専用封筒108に入れられ、郵送、託送などの配送手段により工場11に届けられる。経費精算ラベル105にバーコード、二次元コードなどにより付された識別情報(ID)は、チャックポリ107に含まれる証憑5の主IDとなり、チャックポリ107に含まれる各証憑5には、主IDに加えて枝IDが付され、各証憑5が完全に識別されるようになる。 FIG. 2 shows an outline of processing in the factory 11. Each employee 103 of each user 3 puts the voucher 5 and the expense settlement label 105 in a suitable container, for example, a plastic bag (chuck poly) 107 with a chuck. The voucher 5 journalized for each employee by the chuck poly 107 is put in a dedicated envelope 108 and delivered to the factory 11 by delivery means such as mail or consignment. The identification information (ID) attached to the expense settlement label 105 by a barcode, a two-dimensional code or the like becomes the main ID of the voucher 5 included in the chuck poly 107, and each voucher 5 included in the chuck poly 107 has a main ID. In addition to the branch ID, each voucher 5 is completely identified.
 工場11においては、前処理111と、整理データ化処理114と、保管処理116とが行われる。前処理111においては、到来したチャックポリ107に含まれる証憑5が確認され、経費精算ラベルを主IDとする識別情報(ID)が各証憑5と一対一に関連づけられる。整理データ化処理114においては、識別情報と一対一に関連付けられた各証憑5が整理され、確認され、画像化される。証憑5を画像化したデータは、受領確認画像118としてユーザ3にフィードバックされる。また、証憑5を画像化したデータは、電子化用の画像データ(証憑画像)7としてサーバ13に供給される。これらの処理が終了すると、証憑5は、再びチャックポリ107に収納され、倉庫12に原本6として保管される。原本6には、証憑画像7と同じ識別情報が付されるので、証憑画像7を電子化した情報から、必要により、倉庫12に保管されている原本6に容易に到達できる。 In the factory 11, pre-processing 111, organizing data conversion processing 114, and storage processing 116 are performed. In the pre-processing 111, the voucher 5 included in the incoming chuck poly 107 is confirmed, and identification information (ID) having the expense settlement label as the main ID is associated with each voucher 5 on a one-to-one basis. In the organized data conversion process 114, each voucher 5 associated with the identification information on a one-to-one basis is organized, confirmed, and imaged. Data obtained by imaging the voucher 5 is fed back to the user 3 as a receipt confirmation image 118. Data obtained by imaging the voucher 5 is supplied to the server 13 as image data (voucher image) 7 for digitization. When these processes are completed, the voucher 5 is stored again in the chuck poly 107 and stored as the original 6 in the warehouse 12. Since the same identification information as the voucher image 7 is attached to the original 6, the original 6 stored in the warehouse 12 can be easily reached from the information obtained by digitizing the voucher image 7 if necessary.
 図3に、サーバ13の機能をブロック図により示している。サーバ13は証憑画像7を電子化するデータ入力処理ユニット(データ入力処理機能)20と、電子化された情報(仕訳対象データ)60を用いて証憑の仕訳を行う仕訳ユニット(仕訳装置、仕訳システム、データマイニング機能)30と、電子化された会計データなどを格納するデータベース
50と、仕訳された証憑のデータをユーザ3に供給するデータ表示提供機能40とを含む。仕訳ユニット30は、入力された情報に対し勘定科目などの会計情報を追加するデータマイニング機能(データマイニングユニット)として使用することも可能である。また、データ表示提供機能40は、データベース50に蓄積された電子化されたデータをユーザ3にネットワーク9を介して提供するデータ表示ユニット40として使用することも可能である。
FIG. 3 is a block diagram showing functions of the server 13. The server 13 uses a data input processing unit (data input processing function) 20 for digitizing the voucher image 7 and a journalizing unit (journaling apparatus, journalizing system) for journalizing vouchers using the digitized information (data to be journalized) 60. , Data mining function) 30, a database 50 for storing digitized accounting data and the like, and a data display providing function 40 for supplying journalized voucher data to the user 3. The journal unit 30 can also be used as a data mining function (data mining unit) for adding accounting information such as account items to the input information. The data display providing function 40 can also be used as a data display unit 40 that provides digitized data stored in the database 50 to the user 3 via the network 9.
 データマイニング機能30は、自動的に仕訳を行い、勘定科目を判定する自動仕訳ユニット80を含む。サーバ13は、自動仕分けの結果を確認する機能ユニット(工程)90を含んでおり、会計専門家91により手動で仕訳結果、すなわち、自動出力された勘定科目が確認される。電子化された仕訳対象データ60は、会計情報95としてデータベース50に格納される。したがって、データベース50は、ユーザ3のアップデートされた仕訳日記帳52としての機能を含む。 The data mining function 30 includes an automatic journal unit 80 that automatically journals and determines account items. The server 13 includes a functional unit (process) 90 for confirming the result of automatic sorting, and the journalizing result, that is, the automatically output account item is confirmed manually by the accounting specialist 91. The electronic journal data 60 is stored in the database 50 as transaction information 95. Therefore, the database 50 includes a function as the updated journal diary 52 of the user 3.
 データ入力処理ユニット(データ入力処理装置)20は、仕訳対象の証憑画像7を証憑単位で取得する画像読取ユニット(画像読取装置)21と、証憑に該当する証憑画像7を分割した複数の証憑分割画像(分割データ)29を、ネットワーク9を経由して、電子化のために、複数の作業者(遠隔作業者)8に供給する分散ユニット(分散装置、分散機能)22と、分割データ29が複数の作業者8により電子化された、分割電子化(分割および電子化)された文字情報28を取得する集約ユニット(集約装置、集約機能)23とを含む。 The data input processing unit (data input processing device) 20 includes an image reading unit (image reading device) 21 that acquires the voucher image 7 to be journalized in units of vouchers, and a plurality of voucher divisions obtained by dividing the voucher image 7 corresponding to the voucher. A distributed unit (distributed device, distributed function) 22 for supplying an image (divided data) 29 to a plurality of workers (remote workers) 8 for digitization via the network 9 and divided data 29 And an aggregating unit (aggregating apparatus, aggregating function) 23 for acquiring character information 28 that has been digitized (divided and digitized) by a plurality of workers 8.
 分散ユニット22は、仕訳対象の証憑に含まれる複数の文字情報が証憑中の表記位置により分割された分割データ29を、仕訳対象の証憑を示す識別情報27とともに、電子化のために異なる作業者8に分散して送信する。このため、分散ユニット22は、証憑画像7を、それに含まれる文字情報の表記位置により分割して画像化した複数の証憑分割画像(分割データ)29を生成する画像分割ユニット(画像分割装置、画像分割機能)24を含む。証憑分割画像29は、分割画像29aと、分割画像29aを、OCRを用いて文字データ化したOCR情報29bと、証憑分割画像29を特定の証憑画像7、すなわち証憑5と関連付けする識別情報(ID)27とを含む。この画像分割ユニット24は、分割データを前記証憑中の表記位置によるカテゴリを加えて分類し、カテゴリ毎(種類毎、グループ毎)に異なる作業者に分散して送信する機能を含む。 The distribution unit 22 divides the divided data 29 obtained by dividing a plurality of character information included in the voucher to be journalized according to the notation position in the voucher together with identification information 27 indicating the voucher to be journalized into different workers for digitization 8 is distributed and transmitted. For this reason, the distribution unit 22 divides the voucher image 7 according to the notation position of the character information included in the voucher image 7 and generates a plurality of voucher divided images (divided data) 29 (image dividing apparatus, image). Division function) 24. The voucher divided image 29 includes a divided image 29a, OCR information 29b obtained by converting the divided image 29a into character data using OCR, and identification information (ID that associates the voucher divided image 29 with a specific voucher image 7, that is, the voucher 5. 27). The image dividing unit 24 includes a function of classifying divided data by adding a category according to the notation position in the voucher and transmitting the divided data to different workers for each category (for each type and for each group).
 集約ユニット23は、異なる作業者8により分割電子化された文字情報28を取得し、分割電子化された文字情報28から識別情報27に基づき仕訳ユニット30により仕訳される仕訳対象データ60を生成する。このため、集約ユニット23は、証憑分割画像29に含まれる文字情報が作業者8により電子化された文字情報(分割電子化文字情報)28をそれぞれの作業者8から取得し、識別情報27に基づいて分割データ化文字情報28を集約して仕訳対象データ60を生成する生成ユニット(生成装置、生成機能)25を含む。 The aggregation unit 23 acquires character information 28 divided and digitized by different workers 8, and generates journal object data 60 to be journalized by the journal unit 30 based on the identification information 27 from the character information 28 divided and digitized. . Therefore, the aggregation unit 23 acquires character information (divided digitized character information) 28 in which the character information included in the voucher divided image 29 is digitized by the worker 8 from each worker 8 and stores it in the identification information 27. A generation unit (generation device, generation function) 25 that generates the journalizing target data 60 by aggregating the divided data character information 28 based on the generated data is included.
 証憑5に含まれる情報を電子化する工程は、工数が大きくなる傾向があり、このシステム1においては、複数の作業者により並列作業を行うことにより時間とコストとを低減する仕組みが採用されている。まず、証憑類原紙5は工場11で画像読み取り装置等で電子画像化する。次に、それを画像認識し、矩形の形状検出を行い、矩形単位に分離して分割画像29aにする。矩形には、まとまった文字列情報が含まれるのでOCRで読み取れればOCR情報29bに変換できる。OCRには既存の画像認識ソフトウェアを利用できる。しかしながら、手書きの証憑であったり、文字がかすれていたり、判読し難かったり、漢字が読み違えやすかったりなど、OCRで文字情報に変換できなかったり、誤変換するケースは多い。ネットワーク9を介して分散作業を行う作業者8は、証憑分割画像29に
含まれる分割画像29aを自分の目で見て確認し、OCR情報29bが正しいか否かを判断するとともに、正しくない場合は、手動で入力したり、手動で訂正することにより正しい分割データ化文字情報28を生成する。
The process of digitizing information contained in the voucher 5 tends to increase man-hours. In this system 1, a mechanism for reducing time and cost by adopting parallel work by a plurality of workers is adopted. Yes. First, the voucher base paper 5 is converted into an electronic image by an image reading device or the like at the factory 11. Next, it is image-recognized, a rectangular shape is detected, and divided into rectangular units to form divided images 29a. Since the rectangle includes a collection of character string information, it can be converted into OCR information 29b if it can be read by OCR. Existing image recognition software can be used for OCR. However, there are many cases where it cannot be converted into character information by OCR, such as handwritten vouchers, characters are faint, difficult to read, or Kanji characters are easily misread, or misconverted. The worker 8 who performs the distributed work via the network 9 visually checks the divided image 29a included in the voucher divided image 29 to determine whether or not the OCR information 29b is correct. The correct divided data character information 28 is generated by manual input or manual correction.
 図4に、この会計支援システム1のサーバ13により提供される処理の概要をフローチャートにより示している。ステップ151において、画像読取ユニット21が証憑画像7を取得する。ステップ152において、サーバ13に実装される分散ユニット22が、仕訳対象の証憑(証憑画像)7に含まれる複数の文字情報が証憑中の表記位置により分割された分割データ(証憑分割画像)29を、仕訳対象の証憑を示す識別情報27とともに、電子化のために異なる作業者8に、送受信ユニット14を介して分散して送信する。ステップ155において、証憑分割画像29を受信したクラウド上の作業者8は、分割画像29に含まれた範囲の、限定された文字情報の電子化作業を、独立して行う。 FIG. 4 is a flowchart showing an outline of processing provided by the server 13 of the accounting support system 1. In step 151, the image reading unit 21 acquires the voucher image 7. In step 152, the distribution unit 22 implemented in the server 13 obtains the divided data (voucher divided image) 29 obtained by dividing the plurality of character information included in the journal voucher (voucher image) 7 according to the notation position in the voucher. In addition to the identification information 27 indicating the voucher to be journalized, the information is distributed and transmitted to different workers 8 via the transmission / reception unit 14 for digitization. In step 155, the worker 8 on the cloud who has received the voucher divided image 29 independently performs digitized work of limited character information within the range included in the divided image 29.
 ステップ153において、サーバ13に実装される集約ユニット23が、クラウド上の異なる作業者8により電子化された、分割電子化された文字情報28を、送受信ユニット14を介して取得(受信)し、分割電子化された文字情報28から識別情報27に基づき仕訳対象データ60を生成する。ステップ154において、自動仕訳ユニット80が、仕訳対象データ60の仕訳処理を行う。 In step 153, the aggregation unit 23 mounted on the server 13 acquires (receives) the segmented digitized character information 28 digitized by different workers 8 on the cloud via the transmission / reception unit 14, Based on the identification information 27, the journal data 60 is generated from the divided electronic character information 28. In step 154, the automatic journal unit 80 performs journal processing for the journal target data 60.
 図5に、画像分割ユニット24における処理(ステップ152)をさらに詳しくフローチャートにより示している。まず、ステップ201において、証憑画像7から画像データ化された証憑原紙5の向きを検出し、画像7の向きが原紙5の向きと一致するように回転補正する。証憑原紙5は、たとえばA4サイズの用紙を縦に使用したり横に使用したりすることがある。証憑5に記載されている文字情報はほとんどのケースが横方向であり、分割画像も文字の並びに沿って切り出す必要がある。したがって、文字の並びなどから、文字情報の方向が一致するように証憑画像7の向きを補正する。 FIG. 5 is a flowchart showing the process (step 152) in the image dividing unit 24 in more detail. First, in step 201, the orientation of the voucher base paper 5 converted into image data from the voucher image 7 is detected, and the rotation is corrected so that the orientation of the image 7 matches the orientation of the base paper 5. As the voucher base paper 5, for example, A4 size paper may be used vertically or horizontally. In most cases, the character information described in the voucher 5 is in the horizontal direction, and it is necessary to cut out the divided images along the line of characters. Therefore, the orientation of the voucher image 7 is corrected so that the direction of the character information matches from the arrangement of characters.
 次に、ステップ202において、証憑画像7に含まれている証憑原紙5のサイズを自動検出する。証憑原紙5のサイズを決定することにより、そのサイズで予め設定されているライブラリが選択される。ライブラリには、原紙5のサイズごとに、認識する矩形のサイズ、位置などの画像分割に関する情報が含まれる。ステップ203において、証憑画像7を画像処理により、原紙5のサイズごとに設定されたサイズの矩形に分割して認識する。矩形単位で認識した情報をテーブルにし、それを矩形リストとする。精度を上げるために画像フィルタ等の前処理を入れてもよい。この段階が画像分割するために重要であり、自動的な画像分割が不十分であると判断されると手動で分割するようなプロセスを挿入してもよい。 Next, in step 202, the size of the voucher base paper 5 included in the voucher image 7 is automatically detected. By determining the size of the voucher base paper 5, a library preset in accordance with the size is selected. The library includes information about image division such as the size and position of a rectangle to be recognized for each size of the base paper 5. In step 203, the voucher image 7 is recognized by being divided into rectangles having a size set for each size of the base paper 5 by image processing. The information recognized in units of rectangles is stored in a table, which is used as a rectangle list. In order to increase the accuracy, preprocessing such as an image filter may be added. This step is important for image segmentation, and a process for manual segmentation may be inserted when it is determined that automatic image segmentation is insufficient.
 矩形リストは証憑画像7を、サイズに適した微小な矩形に分けて全エリアを分割した結果を含む。ステップ204において、矩形リストから文字情報を含む矩形を見つけ、それぞれの画像を切り出し、1つの証憑画像7から複数の分割画像29aを生成する。分割画像29aは基本は矩形リストに登録された矩形の画像の集積であり、1つの矩形の画像が、複数の分割画像29aに含まれていてもよく、そのような状態は、文字情報が含まれる矩形の分割画像29aの分布マップにより調整できる。 The rectangle list includes the result of dividing the voucher image 7 into small rectangles suitable for the size and dividing all areas. In step 204, a rectangle including character information is found from the rectangle list, and each image is cut out to generate a plurality of divided images 29a from one voucher image 7. The divided image 29a is basically an accumulation of rectangular images registered in the rectangular list, and one rectangular image may be included in the plurality of divided images 29a, and such a state includes character information. Can be adjusted by the distribution map of the rectangular divided image 29a.
 ステップ204において、発見された文字情報を含む矩形の位置、順番、サイズ毎のライブラリに含まれる情報などから分割画像29aに文字情報カテゴリを付して分類(グループ分け)してもよい。たとえば、証憑画像7の相対的な上側および左側に最初に表れる文字情報を含む矩形により分割された分割画像29aは証憑のタイトルに関する文字情報を含む可能性が高く、その分割画像29aに「タイトル」という文字情報カテゴリを付すことが可能である。文字情報カテゴリは、タイトル、日付、金額などのコンテンツの具体
的な種類を示すものであってもよく、証憑画像7から分割画像29aが切り出された位置、大きさなどを示すマッピング情報であってもよい。
In step 204, the divided image 29a may be classified (grouped) by adding a character information category based on information included in the library for each position, order, and size of the rectangle including the found character information. For example, the divided image 29a divided by the rectangle including the character information first appearing on the upper side and the left side of the voucher image 7 is likely to include character information related to the title of the voucher, and “title” is included in the divided image 29a. It is possible to attach a character information category. The character information category may indicate a specific type of content such as a title, date, amount, etc., and is mapping information indicating the position, size, etc., of the divided image 29a cut out from the voucher image 7. Also good.
 図6に、証憑画像7から生成される分割画像29aの例を示している。右上の分割画像29aは証憑番号、左上の分割画像29aはタイトル、左側の次の分割画像29aはあて先というように分割画像29aに文字情報カテゴリを付すことも可能である。分割画像29aに文字情報カテゴリを付さず、後述するOCRによる認識結果や、作業者8による入力結果により自動的に文字情報カテゴリを付してもよく、作業者8が手動で分割データ化した文字情報28に文字情報カテゴリを付してもよい。分割画像29aは、文字列単位であってもよく、表などにまとめられた表記は、その表または欄などの単位であってもよい。 FIG. 6 shows an example of the divided image 29 a generated from the voucher image 7. It is also possible to assign a character information category to the divided image 29a such that the upper right divided image 29a is a voucher number, the upper left divided image 29a is a title, and the next left divided image 29a is a destination. The character information category may not be attached to the divided image 29a, but the character information category may be automatically attached according to the recognition result by OCR described later or the input result by the worker 8, and the worker 8 manually converts the data into divided data. A character information category may be added to the character information 28. The divided image 29a may be a character string unit, and the notation collected in a table or the like may be a unit such as a table or a column.
 分割画像29aが生成されると、ステップ205において、各分割画像29aに識別情報(ID)27が付される。識別情報27は、分割元となった証憑5を示す識別情報と個々の分割画像29aを示す識別情報との組み合わせであることが望ましい。 When the divided images 29a are generated, identification information (ID) 27 is attached to each divided image 29a in step 205. It is desirable that the identification information 27 is a combination of identification information indicating the voucher 5 that has become a division source and identification information indicating individual divided images 29a.
 ステップ206において、分割画像29aを文字認識ソフト(OCR)を用いて文字認識し、OCRデータ29bを生成する。ステップ207において、識別情報27、分割画像29aおよびOCRデータ29bを含む証憑分割画像29を、インターネット9を介して作業者8に配布する。各作業者8は、図6に示した分割画像29aの1つを見て、そのOCRデータ29bを確認する。したがって、各作業者8は、証憑5の断片の情報のみを把握するだけであり、証憑5の内容が作業者8に漏れることはなく、ユーザ3の会計に関する情報が作業者8に漏れることはない。 In step 206, the divided image 29a is character-recognized using character recognition software (OCR) to generate OCR data 29b. In step 207, the voucher divided image 29 including the identification information 27, the divided image 29 a and the OCR data 29 b is distributed to the worker 8 via the Internet 9. Each worker 8 looks at one of the divided images 29a shown in FIG. 6 and confirms the OCR data 29b. Therefore, each worker 8 only grasps the information of the fragment of the voucher 5, the contents of the voucher 5 do not leak to the worker 8, and the information regarding the accounting of the user 3 does not leak to the worker 8. Absent.
 作業者8には、証憑5の分割画像29aを無差別に、すなわち、文字情報カテゴリとは無関係に証憑分割画像29を配布して電子化の作業を行わせてもよい。作業者8に、文字情報カテゴリが共通する証憑分割画像29を配布することにより、電子化の作業をさらに効率よく行わせることも可能である。たとえば、ある作業者8が証憑のタイトルに関する証憑分割画像29を電子化する作業に特化して行うのであれば、分割画像29aを解釈する文字情報の範囲は限定され、効率と精度が向上する。金額に関する証憑分割画像29を電子化するのであれば、文字情報は数字であることに限定して作業を行うことが可能となり、単純作業の繰り返しになるので、作業効率と精度が向上しやすい。 The worker 8 may perform the digitization work by distributing the divided image 29a of the voucher 5 indiscriminately, that is, regardless of the character information category. By distributing the voucher divided image 29 having the same character information category to the worker 8, it is possible to perform the digitization work more efficiently. For example, if a worker 8 specializes in the work of digitizing the voucher divided image 29 related to the voucher title, the range of character information for interpreting the divided image 29a is limited, and the efficiency and accuracy are improved. If the voucher division image 29 relating to the amount is digitized, the character information can be limited to being a number, and the operation can be performed repeatedly, so that the operation efficiency and accuracy are easily improved.
 複数の作業者8には、矩形の分割画像29aと文字列(OCRデータ)29bとを受け渡すが、OCRを実施せず、矩形の分割画像29aのみを受け渡して作業者8が自らデータ化してもよい。作業者8は、矩形画像(分割画像)29aと文字列29bとを目視し、正しいかを確認する。間違っている場合は文字列29bを修正する。文字列29bが渡されない場合は、文字列を手動で入力する。作業者8には、複数の分割画像29aは受け渡さない。これは、証憑類が帰属する企業の事業情報が判明することを防ぐためである。単一の分割画像29aを扱う範囲においては、企業の事業情報を判別することは不可能であり、企業情報の秘匿性は担保される。作業者8により確認された、フラグメント化された文字列(分割データ化文字情報)28は、サーバ13に集められて、ひとつの情報に集約される。 A plurality of workers 8 are delivered with a rectangular divided image 29a and a character string (OCR data) 29b. However, without performing OCR, only the rectangular divided image 29a is delivered and the worker 8 converts the data into data. Also good. The operator 8 visually checks the rectangular image (divided image) 29a and the character string 29b to confirm whether they are correct. If it is wrong, the character string 29b is corrected. When the character string 29b is not passed, the character string is manually input. A plurality of divided images 29a are not delivered to the worker 8. This is to prevent the business information of the company to which the voucher belongs from being revealed. In the range where the single divided image 29a is handled, it is impossible to determine the business information of the company, and the confidentiality of the company information is ensured. Fragmented character strings (divided data character information) 28 confirmed by the worker 8 are collected by the server 13 and collected into one piece of information.
 サーバ13の集約ユニット23は、各作業者8が電子化した文字情報(分割電子化された文字情報、分割データ化文字情報)28を、インターネット9を介して収集し、生成ユニット25において仕訳対象データ60を生成する。仕訳対象データ60は、仕訳対象の証憑5の取引日、金額、および他の文字情報から抽出された単語配列を含む。データマイニングユニット30の仕訳を行うユニット80は、仕訳対象データ60と、複数の仕訳参照エントリ70との多次元空間内の距離Lを計算し、仕訳対象データ60との距離Lが最
短の仕訳参照エントリ70の勘定科目を仕訳先として出力する。仕訳参照エントリ70は、ユーザ3の過去の仕訳済みの証憑の情報をエントリとして含む帳簿、例えば仕訳日記帳のエントリをメタデータ化した情報である。
The aggregation unit 23 of the server 13 collects character information (character information that has been digitized and divided data) that has been digitized by each worker 8 via the Internet 9 and is generated by the generation unit 25 as a journal entry target. Data 60 is generated. The journal entry data 60 includes a word sequence extracted from the transaction date, amount, and other character information of the voucher 5 to be entered. The unit 80 that performs the journaling of the data mining unit 30 calculates the distance L in the multidimensional space between the journal object data 60 and the plurality of journal reference entries 70, and refers to the journal entry with the shortest distance L from the journal object data 60. The account item of entry 70 is output as the journal. The journal reference entry 70 is information obtained by converting the entry of a voucher that has been previously journalized by the user 3 as an entry, for example, a journal diary entry.
 図3に示すように、データマイニングユニット30は、仕訳日記帳51および52から仕訳参照エントリ70を生成し、参照ライブラリ53を生成する変換機能(変換装置、変換ユニット)31と、仕訳作業を自動的に行う仕訳ユニット(自動仕訳機能、自動仕訳装置)80とを含む。変換の対象となる仕訳日記帳51は、各ユーザ3が過去の会計処理に用いた仕訳日記帳51であってもよく、この会計支援システム1で仕訳した情報を含む日記帳52であってもよい。変換ユニット31は、ユーザ3の過去の仕訳データである仕訳日記帳51および52(以降においては仕訳日記帳51を参照して説明する)から、参照ライブラリ53を生成する。参照ライブラリ53は、新たな証憑類に対して、最も類似する過去の仕訳日記帳エントリを探しだすためのデータベースである。 As shown in FIG. 3, the data mining unit 30 automatically generates a journal reference entry 70 from the journal diaries 51 and 52, and generates a reference library 53. Journalizing unit (automatic journaling function, automatic journalizing device) 80 that performs automatically. The journal diary 51 to be converted may be the journal diary 51 used by each user 3 in the past accounting process, or the diary 52 including information journalized by the accounting support system 1. Good. The conversion unit 31 generates a reference library 53 from the journal diaries 51 and 52 (which will be described below with reference to the journal diary 51), which is the past journal data of the user 3. The reference library 53 is a database for finding the most similar past journal entry for a new voucher.
 図7に、変換ユニット31の機能を示している。仕訳日記帳51の各エントリ(日記帳エントリ)51aは、ID51bにより管理されている。参照ライブラリ53の各エントリ(仕訳参照エントリ)70は、後述するカテゴリ71により管理され、仕訳日記帳51のエントリ51aを追跡するID51bはコンテンツとして含まれる。仕訳日記帳51の各エントリ51aはコンテンツとして、取引日51c、金額51d、借方科目51e、貸方科目51f、借方税コード51g、貸方税コード51h、借方補助科目51i、貸方補助科目51j、摘要51kが含まれる。変換ユニット31は、単語抽出機能32を含み、摘要51kに含まれる情報を単語単位に区切って単語配列73を生成し、仕訳参照エントリ70のキーの1つとして出力する。単語抽出機能32は、一般的な日本語構文解析機能を有する。仕訳参照エントリ70は、コンテンツ(バリュー)として含むID51bを通して、仕訳日記帳51のエントリ51aに紐づけられる。 FIG. 7 shows the function of the conversion unit 31. Each entry (diary entry) 51a in the journal diary 51 is managed by an ID 51b. Each entry (journal reference entry) 70 in the reference library 53 is managed by a category 71 described later, and an ID 51b for tracking the entry 51a in the journal diary 51 is included as content. Each entry 51a of the journal diary 51 includes a transaction date 51c, an amount 51d, a debit item 51e, a credit item 51f, a debit tax code 51g, a credit tax code 51h, a debit auxiliary item 51i, a credit auxiliary item 51j, and a summary 51k. included. The conversion unit 31 includes a word extraction function 32, generates a word array 73 by dividing the information included in the summary 51k into word units, and outputs it as one of the keys of the journal entry entry 70. The word extraction function 32 has a general Japanese syntax analysis function. The journal entry entry 70 is linked to the entry 51a of the journal diary 51 through an ID 51b included as content (value).
 変換ユニット31は、さらに、カテゴリ生成ユニット(カテゴリ生成機能、カテゴリ生成手段)33を含む。カテゴリ生成ユニット33は、科目/カテゴリ変換テーブル34を参照し、日記帳エントリ51aの借方科目51eと貸方科目51fからカテゴリ(仕訳カテゴリ)71を決定する。日記帳エントリ51aのその他の情報、例えば、借方補助科目51i、貸方補助科目51j、摘要kは、単語抽出機能32により情報が単語単位に区切られた単語配列73に変換される。この例では、借方科目51e、貸方科目51fは、カテゴリ71として仕訳参照エントリ70のキーとなる情報に含まれる。したがって、単語配列73に含まれない。しかしながら、借方科目51e、貸方科目51fを単語配列73に含めてもよい。 The conversion unit 31 further includes a category generation unit (category generation function, category generation means) 33. The category generation unit 33 refers to the item / category conversion table 34 and determines a category (journal category) 71 from the debit item 51e and the credit item 51f of the diary entry 51a. Other information of the diary entry 51a, for example, the debit auxiliary subject 51i, the credit auxiliary subject 51j, and the summary k are converted into a word array 73 in which the information is divided into words by the word extraction function 32. In this example, the debit item 51e and the credit item 51f are included in the key information of the journal entry reference 70 as the category 71. Therefore, it is not included in the word array 73. However, the debit subject 51e and the credit subject 51f may be included in the word array 73.
 図8に、科目/カテゴリ変換テーブル34の一例を示している。このカテゴリ(仕訳カテゴリ)71は、この会計支援システム1において新たに(独自に)定義するパラメータである。カテゴリ71は会計上で明確に、比較的簡単に、重複なく区別しやすい情報であれば良い。会計支援システム1においては、カテゴリ71として、取引の方向と、計上および消込との組み合わせにより4つのパラメータを設定している。カテゴリ71は、「収入計上」、「支出計上」、「入金消込」、「出金消込」を含む。これらのカテゴリ71は、証憑5を性質別に分類するために適しており、証憑5のタイトルと、宛先とから、証憑5の該当するカテゴリ71を容易に、そして精度よく決めることができる。 FIG. 8 shows an example of the subject / category conversion table 34. This category (journal category) 71 is a parameter newly defined (independently) in the accounting support system 1. The category 71 may be any information that is clearly and relatively easy to distinguish in accounting and can be easily distinguished without duplication. In the accounting support system 1, as the category 71, four parameters are set according to the combination of the direction of transaction, accounting, and application. The category 71 includes “income recording”, “expenditure recording”, “payment application”, and “withdrawal application”. These categories 71 are suitable for classifying the vouchers 5 according to their properties, and the corresponding category 71 of the voucher 5 can be easily and accurately determined from the title of the voucher 5 and the destination.
 図9に仕訳ユニット80の構成をブロック図により示している。仕訳ユニット80は、仕訳対象の証憑5の電子化された情報である仕訳対象データ60を取得する取得ユニット(取得機能、取得手段)81と、複数の仕訳参照エントリ70と仕訳対象データ60との距離を計算する類比判断ユニット(類比判断機能、類比判断手段)82と、仕訳対象データ60との距離Lが最短の仕訳参照エントリ70の勘定科目51eおよび51fを仕訳先
として出力する第1の仕訳先出力ユニット(第1の仕訳先出力機能、第1の仕訳先出力手段)86とを含む。仕訳対象データ60は、取引日64、金額65、および他の文字情報から抽出された単語配列63を含む。取得ユニット81は、仕訳対象データ60の単語配列63に含まれる単語63aを抽出する。取得ユニット81は、証憑5のメタデータである単語配列に含まれる単語が予め定義されていなくても、単語配列から適当な語数、連語、複合語などを自動的に識別して単語(連語、複合語なども含む)を抽出する。
FIG. 9 is a block diagram showing the configuration of the journalizing unit 80. The journal unit 80 includes an acquisition unit (acquisition function, acquisition means) 81 for acquiring journal object data 60 that is digitized information of the journal voucher 5, a plurality of journal reference entries 70, and journal object data 60. A first journal that outputs the account items 51e and 51f of the journal reference entry 70 with the shortest distance L between the similarity determination unit (analysis determination function, similarity determination means) 82 for calculating the distance and the journal entry target data 60 as the journal entry. And a destination output unit (first journal destination output function, first journal destination output means) 86. The journal entry data 60 includes a transaction date 64, an amount 65, and a word array 63 extracted from other character information. The acquisition unit 81 extracts a word 63 a included in the word array 63 of the journal entry data 60. The acquisition unit 81 automatically identifies an appropriate number of words, collocations, compound words, and the like from the word sequence even if the words included in the word sequence that is the metadata of the voucher 5 are not defined in advance. (Including compound words).
 類比判断ユニット82は、仕訳対象データ60と仕訳参照エントリ70とを、取引日64および51c、金額65および51d、のみならず、単語配列63および73に含まれる単語を相似のパラメータとして多次元で仕訳対象データ60と複数の仕訳参照エントリ70との距離Lをそれぞれ計算する。したがって、類比判断ユニット82においては、取引日64および51c、金額65および51dは必須のパラメータとして使用されるが、他の情報については、事前に定義されていなくても、単語配列から適当に切り出された(抽出された)単語を、単語同士や、単語が示唆する意味、単語の並び順などを相似の判断基準として使用する。したがって、多方面から仕訳参照エントリ70と仕訳対象データ60との類比を自動的に判断できる。このため、仕訳ユニット80は、過去の仕訳結果に基づき、仕訳対象データ60が抽出された証憑5を仕訳する勘定科目を高い精度で自動的に出力できる。 The similarity determination unit 82 multi-dimensionalizes the journal target data 60 and the journal reference entry 70 by using not only the transaction dates 64 and 51c and the amounts 65 and 51d but also the words included in the word arrays 63 and 73 as similar parameters. The distance L between the journal object data 60 and the plurality of journal reference entries 70 is calculated. Therefore, in the analogy determination unit 82, the transaction dates 64 and 51c and the amounts 65 and 51d are used as essential parameters, but other information is appropriately extracted from the word array even if not defined in advance. The extracted (extracted) words are used as similar judgment criteria based on the words, the meaning suggested by the words, and the order of the words. Therefore, the analogy between the journal reference entry 70 and the journal object data 60 can be automatically determined from various fields. Therefore, the journal unit 80 can automatically output the account item for journalizing the voucher 5 from which the journal object data 60 is extracted based on the past journal results with high accuracy.
 この仕訳ユニット80は、過去の仕訳日記帳またはそれに準ずるデータを元にメタデータ・データベースを構築し、それを、証憑5の仕訳先を見つけるデータ(仕訳参照エントリ)70としている。仕訳参照エントリ70は、複数キーを持ち、新しい証憑5もメタデータ化することが可能である。したがって、証憑5をメタデータ化した仕訳対象データ60と仕訳参照エントリ70との距離Lを評価し、最も距離の近い仕訳参照エントリ70の持つ勘定科目を、新しい証憑5の勘定科目と判定する。仕訳参照エントリ70に含まれるメタデータも、仕訳対象データ60に含まれるメタデータも、仕訳ユニット80においては、予め共通なキーを付す必要はなく、証憑5として必須の日付と金額とを除けば、任意の単語配列から抽出される任意の単語の類似性を距離Lとして計算し評価する。単語を抽出する機能は、一般的な日本語構文解析機能を有することが望ましい。 This journal unit 80 constructs a metadata database based on the past journal diary or data corresponding thereto, and uses it as data (journal reference entry) 70 for finding the journal of the voucher 5. The journal entry 70 has a plurality of keys, and the new voucher 5 can be converted into metadata. Therefore, the distance L between the journal object data 60 obtained by converting the voucher 5 into metadata and the journal reference entry 70 is evaluated, and the account item of the journal reference entry 70 having the closest distance is determined as the account item of the new voucher 5. Neither the metadata contained in the journal reference entry 70 nor the metadata contained in the journal target data 60 need to be pre-assigned a common key in the journal unit 80, except for the date and amount required for the voucher 5. The similarity of an arbitrary word extracted from an arbitrary word sequence is calculated as a distance L and evaluated. The function for extracting words preferably has a general Japanese parsing function.
 この仕訳ユニット80においては、ユーザの過去の仕訳データである仕訳日記帳51から仕訳参照エントリ70を含む参照ライブラリ53を生成する。したがって、仕訳ユニット80は、新たな証憑類5に対して、最も類似する過去の仕訳日記帳エントリ51aを探しだす。仕訳日記帳51の各エントリ51aは、ID51bにより管理されている。仕訳参照エントリ70にID51bを含めておくことにより、新たな証憑5を過去の仕訳日記帳51に基づいて仕訳できる。 In this journal unit 80, the reference library 53 including the journal reference entry 70 is generated from the journal diary 51 which is the user's past journal data. Therefore, the journal unit 80 searches for the most recent past journal diary entry 51 a for the new voucher 5. Each entry 51a in the journal diary 51 is managed by an ID 51b. By including the ID 51 b in the journal entry 70, a new voucher 5 can be journalized based on the past journal diary 51.
 仕訳参照エントリ70は、ユーザ3の過去の仕訳日記帳のエントリの代わりに、他のユーザの過去の仕訳結果や、インターネット上での多数決の論理によるものであってもよい。ただし、ユーザ3の情報をオープンすることになる。このため、上述したカテゴリ71を用いてユーザ3の経済活動の特定につながる情報を汎用的な情報に置き換えて類比を判断することが望ましい。 The journal reference entry 70 may be based on the past journal result of another user or the logic of majority vote on the Internet, instead of the past journal diary entry of the user 3. However, the information of the user 3 is opened. For this reason, it is desirable to determine the analogy by replacing the information that leads to the identification of the economic activity of the user 3 with general-purpose information using the category 71 described above.
 仕訳ユニット80は、仕訳対象データ60の金額65と仕訳参照エントリ70の金額51dとの差が第2の閾値Vt2の範囲内で、仕訳対象データ60の取引日64に最も取引日51cが近い仕訳参照エントリ70を見つけ、その仕訳参照エントリ70の勘定科目51eおよび51fを仕訳先として出力する第2の仕訳先出力ユニット87を含む。具体的には金額差が ±Vt2以内の仕訳参照エントリ70を選択して金額が近い順にソートする。それらの中で、日付差がD日以内のものを選択し、日付が近い順にソートして最も日付が近い仕訳参照エントリ70を発見する。 The journal unit 80 has the difference between the amount 65 of the journal object data 60 and the amount 51d of the journal reference entry 70 within the range of the second threshold value Vt2, and the journal whose transaction date 51c is closest to the transaction date 64 of the journal object data 60. It includes a second journal output unit 87 that finds the reference entry 70 and outputs the account items 51e and 51f of the journal reference entry 70 as a journal. Specifically, the journal entry 70 that has an amount difference within ± Vt2 is selected and sorted in ascending order. Among them, the one having a date difference within D days is selected, and the journal reference entry 70 having the closest date is found by sorting in the order of close date.
 この第2の仕訳先出力ユニット(第2の仕訳先出力機能、第2の仕訳先出力手段)87は、第1の仕訳先出力ユニット86により選択された最短の仕訳参照エントリ70では、距離Lが第1の閾値Vt1よりも大きく、会計的に有意な仕訳先を選択できないと判断されたときに動作する。取引日に差がほとんどなく、金額が類似している取引は、同一または類似の取引である可能性が高く、その取引の称呼となる証憑5は、同一の勘定科目に仕訳できる可能性が高い。 This second journal output unit (second journal output function, second journal output means) 87 is the distance L in the shortest journal reference entry 70 selected by the first journal output unit 86. Is larger than the first threshold value Vt1, and it is determined that it is determined that an accountingally significant journal cannot be selected. Transactions with little difference in transaction dates and similar amounts are likely to be the same or similar transactions, and the proof 5 that is the title of the transaction is likely to be able to be journalized to the same account item .
 仕訳ユニット80の類比判断ユニット82は、さらに、仕訳対象データ60の単語配列63に含まれるタイトル63bおよび宛先63cを少なくとも示す単語に基づき、仕訳対象データ60が属するカテゴリ(仕訳カテゴリ、仕訳対象のカテゴリ)61を決定するカテゴリ判定ユニット(カテゴリ判定機能)84と、この仕訳対象のカテゴリ61と同一の仕訳参照のカテゴリ71の仕訳参照エントリ70と仕訳対象データ60との距離を計算するカテゴリ別類比判断ユニット(距離計算機能、距離計算ユニット)85とを含む。さらに、仕訳ユニット80は、仕訳対象データ60の単語配列63の単語63aの位置情報(マッピング情報)に基づき、単語配列63からタイトル63bおよび宛先63cを抽出するタイトル・宛名抽出ユニット(タイトル・宛名抽出機能)83を含む。 The similarity determination unit 82 of the journal unit 80 further includes a category to which the journal data 60 belongs (a journal category, a journal target category) based on at least a word indicating the title 63b and the destination 63c included in the word array 63 of the journal data 60. ) Category determination unit (category determination function) 84 for determining 61, and category-specific ratio determination for calculating the distance between the journal reference entry 70 of the same journal entry category 71 as the journal entry category 61 and the journal entry data 60 Unit (distance calculation function, distance calculation unit) 85. Further, the journal unit 80 extracts a title 63b and a destination 63c from the word array 63 based on position information (mapping information) of the word 63a of the word array 63 of the journal data 60, and extracts a title / address extraction unit (title / address extraction). Function) 83.
 タイトル・宛名抽出ユニット83が仕訳対象データ60からタイトル63bおよび宛先63cを抽出し、カテゴリ判定ユニット84がタイトル63bおよび宛名63cから取引の方向を判断するとともに、取引の方向とタイトル63bとからカテゴリ61を判断する。 The title / address extraction unit 83 extracts the title 63b and the destination 63c from the journal data 60, the category determination unit 84 determines the transaction direction from the title 63b and the address 63c, and the category 61 from the transaction direction and the title 63b. Judging.
 図10に、タイトル・宛名抽出ユニット83と、カテゴリ判定ユニット84とのさらに詳しい構成をブロック図により示している。このタイトル・宛名抽出ユニット83は、宛名をさらに明確に判断するために発信元を検出する機能を含む。すなわち、タイトル・宛名抽出ユニット83は、タイトル検出機能83aと、宛先検出機能83bと、発信元検出機能83cと、自社名・取引先名判定機能83dとを含む。タイトル検出機能83aは、ライブラリ55に用意されているタイトル候補の位置情報を含む分布マップ55aと、タイトル辞書55bとに基づき電子化された仕訳対象データ60の中からタイトル63bを探し出す。タイトル辞書55bは、「請求書」、「発注書」、「納品書」、「品書き」などの証憑5のタイトルとして広く使われる単語が含まれている。 FIG. 10 shows a more detailed configuration of the title / address extraction unit 83 and the category determination unit 84 in a block diagram. The title / address extraction unit 83 includes a function of detecting a sender in order to determine the address more clearly. That is, the title / address extraction unit 83 includes a title detection function 83a, a destination detection function 83b, a transmission source detection function 83c, and a company name / customer name determination function 83d. The title detection function 83a searches for the title 63b from the journalizing target data 60 that is digitized based on the distribution map 55a including the position information of the title candidates prepared in the library 55 and the title dictionary 55b. The title dictionary 55b includes words that are widely used as the title of the voucher 5 such as “invoice”, “purchase order”, “delivery note”, and “description”.
 宛先検出機能83bは、ライブラリ55に用意されている宛名候補の位置情報を分布マップ55aと、宛名プレフィックス辞書55cとに基づき電子化された仕訳対象データ60の中から宛名63cを探し出す。プレフィクス辞書55cは、「御中」、「宛」、「様」、「行」などの宛先を示すために広く使われる単語が含まれている。 The address detection function 83b searches for the address 63c from the journalizing target data 60 that is digitized based on the distribution map 55a and the address prefix dictionary 55c based on the position information of the address candidates prepared in the library 55. The prefix dictionary 55c includes words that are widely used to indicate destinations such as “Gion”, “To”, “Sama”, and “Line”.
 発信元検出機能83cは、ライブラリ55に用意されている発信元候補の位置情報を分布マップ55aと、取引先辞書55dとに基づき電子化された仕訳対象データ60の中から発信元を探し出す。取引先辞書55dは、ユーザ3の過去の取引先の名称が含まれる。 The transmission source detection function 83c searches for the transmission source from the journalizing target data 60 that is digitized based on the distribution map 55a and the supplier dictionary 55d based on the location information of the transmission source candidates prepared in the library 55. The supplier dictionary 55d includes the names of past suppliers of the user 3.
 仕訳対象データ60は、作業者8が入力またはレビューしているので、単にOCRにより取得された文字情報よりもはるかに精度の高い文字情報が含まれる。したがって、タイトル検出機能83a、宛先検出機能83bおよび発信元検出機能83cは、証憑5の上の位置情報を使わずに、それぞれの辞書55b~55dを参照して、文字情報に基づいてタイトル、宛先、さらに発信元を判断するようにしてもよい。自社名・取引先名判定機能83dは、ユーザ3が発信元なのか宛先なのかを判断する。文字単位で一致検索してもよく、最長一致検索をしてもよく、マッチングの値を判断してもよい。発信元に自社名があり、宛先に自社名がなければ発信元が自社、宛先を取引先と判断する。発信元に自社名がな
く、宛先に自社名があれば発信元が取引先、宛先が自社と判断する。発信元および宛先に自社名があったり、発信元および宛先に自社名がなければ自動判定が不可能であり、手動判定を行う。
Since the journal entry data 60 is input or reviewed by the operator 8, it includes character information that is much more accurate than character information obtained simply by OCR. Therefore, the title detection function 83a, the destination detection function 83b, and the source detection function 83c refer to the respective dictionaries 55b to 55d without using the position information on the voucher 5, and based on the character information, the title and destination Further, the sender may be determined. The company name / customer name determination function 83d determines whether the user 3 is a transmission source or a destination. A match search may be performed in character units, a longest match search may be performed, and a matching value may be determined. If the sender has the company name and the destination does not have the company name, the sender is determined to be the company and the destination is determined to be the business partner. If the sender does not have the company name and the destination has the company name, the sender is determined to be the business partner and the destination is determined to be the company. If there is a company name at the source and destination, or there is no company name at the source and destination, automatic determination is impossible and manual determination is performed.
 カテゴリ判定機能84は、取引方向判定ユニット84aと、カテゴリ選択ユニット84bとを含む。取引方向判定ユニット84aは、宛先63cおよび発信元63dが自社名か取引先名かと、タイトル63bとにより、取引方向判定テーブル56を参照して取引の方向を判断する。タイトル63bが請求書で、宛先63cが取引先名であれば、取引の方向69は外(OUT)になる。タイトル63bが請求書で、宛先63cが自社名であれば、取引の方向69は内(IN)になる。取引方向69がOUTは証憑類5の発行元が自社でありINは証憑類5の発行元が取引先であると定義される。 The category determination function 84 includes a transaction direction determination unit 84a and a category selection unit 84b. The transaction direction determination unit 84a determines the direction of the transaction by referring to the transaction direction determination table 56 based on whether the destination 63c and the transmission source 63d are the company name or the customer name and the title 63b. If the title 63b is an invoice and the destination 63c is a supplier name, the transaction direction 69 is outward (OUT). If the title 63b is an invoice and the destination 63c is the company name, the transaction direction 69 is in (IN). When the transaction direction 69 is OUT, the issuer of the voucher 5 is defined as the company, and IN is defined as the issuer of the voucher 5 is the customer.
 カテゴリ選択ユニット84bは、タイトル63bと取引の方向69とから、タイトル/カテゴリ変換テーブル57に基づいてカテゴリ61を判断する。たとえば、タイトル63bが請求書で取引方向69がOUTであればカテゴリ61は収入計上と判断される。この仕訳対象データ60のカテゴリ61は、仕訳参照エントリ70のカテゴリ71と対比され、図8に示すように、カテゴリ61および71が収入計上であれば、勘定科目は1つに決まる。 The category selection unit 84 b determines the category 61 based on the title / category conversion table 57 from the title 63 b and the transaction direction 69. For example, if the title 63b is an invoice and the transaction direction 69 is OUT, the category 61 is determined to be income. The category 61 of the journal object data 60 is compared with the category 71 of the journal reference entry 70. As shown in FIG. 8, if the categories 61 and 71 are accounted for as revenue, the account item is determined to be one.
 タイトル63bが請求書で取引方向69がINであればカテゴリ61は支出計上と判断される。この仕訳対象データ60のカテゴリ61は、仕訳参照エントリ70のカテゴリ71と対比され、図8に示すように、カテゴリ61および71が支出計上であれば、勘定科目にはいくつかの候補が存在する。したがって、図9に示す距離計算ユニット85は、参照データベース(参照ライブラリ)53にある仕訳参照エントリ70の中から、カテゴリ71が支出計上となっている仕訳参照エントリ70を選択し、距離Lを計算する。 If the title 63b is an invoice and the transaction direction 69 is IN, the category 61 is determined to be expenditure recording. The category 61 of the journal object data 60 is compared with the category 71 of the journal reference entry 70. As shown in FIG. 8, if the categories 61 and 71 are recorded as expenses, there are several candidates for the account item. . Accordingly, the distance calculation unit 85 shown in FIG. 9 selects the journal reference entry 70 in which the category 71 is recorded as an expense from the journal reference entries 70 in the reference database (reference library) 53, and calculates the distance L. To do.
 このように、パラメータとして「カテゴリ」を導入することにより、勘定科目の自動判定精度を向上できる。カテゴリ61および71の具体的な例は、収入計上、支出計上などの会計処理において意味のあるものであり、複数の勘定科目はいずれかのカテゴリにグループ分けできる。したがって、先にカテゴリ61を判断することにより、勘定科目を判断するために参照する仕訳参照エントリ70の数を少なくできる。距離Lのみで判断する場合、距離Lが小さいときの判断精度は非常に高い。一方、距離Lが大きいときの誤差も大きい。カテゴリ61を判断して関連するエントリ70の数を制限することにより、勘定科目を誤判定する可能性が小さくなり、さらに、誤判定したとしても間違い方が小さくなる。さらに、カテゴリ61を判断して勘定科目を決めることにより、会計処理の意味における収支は正しいことが保証される。 In this way, by introducing “category” as a parameter, automatic determination accuracy of account items can be improved. Specific examples of the categories 61 and 71 are meaningful in accounting processing such as income recording and expenditure recording, and a plurality of account items can be grouped into any category. Therefore, by determining the category 61 first, it is possible to reduce the number of journal entry entries 70 that are referred to in order to determine the account item. When judging only by the distance L, the judgment accuracy when the distance L is small is very high. On the other hand, the error when the distance L is large is also large. By determining the category 61 and limiting the number of related entries 70, the possibility of misjudgment of the account item is reduced, and even if it is misjudged, the mistake is reduced. Furthermore, by determining category 61 by determining category 61, it is ensured that the balance in the meaning of accounting processing is correct.
 仕訳対象データ60のカテゴリ61の決定において重要な役割を果たすのが、取引の方向69を判定することである。この仕訳ユニット80においては、取引の方向69を証憑5のタイトル63bと宛先63cとから自動的に決定する仕組みを採用している。この明細書において取引の方向69とは、証憑5が帰属する企業を主体にしたときに、該証憑5の発信元が自社であるか、取引先であるかを示す情報をいう。 An important role in determining the category 61 of the journal data 60 is to determine the transaction direction 69. This journal unit 80 employs a mechanism that automatically determines the transaction direction 69 from the title 63b and the destination 63c of the voucher 5. In this specification, the transaction direction 69 refers to information indicating whether the source of the voucher 5 is its own company or a business partner when the company to which the voucher 5 belongs is mainly used.
 図11に、自動仕訳ユニット80における処理(自動仕訳、ステップ154)の概要をフローチャートにより示している。これらの処理は、サーバ13などのCPUおよびメモリを含むコンピュータ資源を備えた装置において、自動仕分け用のプログラム(プログラム製品)を実行することにより実現される。自動仕分け用のプログラム(プログラム製品)は、インターネット9を経由して供給されたり、DVDあるいはフラッシュメモリなどの適当な記録媒体に記録して提供することも可能である。 FIG. 11 is a flowchart showing an outline of processing (automatic journaling, step 154) in the automatic journalizing unit 80. These processes are realized by executing a program (program product) for automatic sorting in an apparatus having computer resources including a CPU and a memory such as the server 13. The program (program product) for automatic sorting can be supplied via the Internet 9 or can be provided by being recorded on an appropriate recording medium such as a DVD or a flash memory.
 また、証憑5の電子化された情報(仕訳対象データ)60を蓄積したデータベース(仕訳日記帳)50(52)、仕訳参照エントリ70を含む参照データベース53、さらにその他のライブラリは、単一のコンピュータ(サーバ)のメモリに格納されていてもよく、コンピュータネットワークあるいはインターネットで接続された複数のサーバに分散して格納されていてもよい。 In addition, a database (journal diary) 50 (52) storing electronic information (data to be journalized) 60 of the voucher 5, a reference database 53 including journal reference entries 70, and other libraries are a single computer. It may be stored in a (server) memory, or may be distributed and stored in a plurality of servers connected via a computer network or the Internet.
 仕訳ユニット80は、ステップ211において、取得ユニット81により仕訳対象データ60を取得する。仕訳対象データ60は、電子化された証憑5のデータであり、仕訳対象の証憑5の取引日64、金額65、および他の文字情報から抽出された単語配列63を含む。データを取得するステップ211は、異なる作業者により電子化された分割データ化文字情報28を識別情報27に基づいて集約して仕訳対象データ60を生成するプロセスを含んでいてもよい。 The journal unit 80 acquires the journal data 60 by the acquisition unit 81 in step 211. The journal object data 60 is digitized data of the voucher 5 and includes a transaction date 64, an amount 65, and a word array 63 extracted from other character information of the voucher 5 to be journalized. The step 211 of acquiring data may include a process of generating the journalizing target data 60 by aggregating the divided data character information 28 digitized by different workers based on the identification information 27.
 ステップ212において、取得ユニット81は、さらに、日本語構文解析機能を用いて単語配列63から単語63aを抽出する。 In step 212, the acquisition unit 81 further extracts the word 63a from the word array 63 using the Japanese parsing function.
 ステップ213において、タイトル・宛名抽出ユニット83が、単語配列63から抽出された単語の中から証憑5のタイトル63bと宛名63cとを抽出する。この仕訳ユニット80においては、タイトル・宛名抽出ユニット83は、さらに、発信元63dも抽出する。 In step 213, the title / address extracting unit 83 extracts the title 63b and address 63c of the voucher 5 from the words extracted from the word array 63. In this journal unit 80, the title / address extracting unit 83 further extracts the sender 63d.
 ステップ214において、カテゴリ判定ユニット84の取引方向判定ユニット84aが取引方向判定テーブル56を参照して、タイトル63b、宛名63cおよび発信元63dから取引の方向69を判定する。ステップ215において、カテゴリ選択ユニット84bは、タイトル/カテゴリ変換テーブル57を参照し、タイトル63bおよび取引の方向69から、仕訳対象データ60のカテゴリ61を選択し、仕訳対象データ60に追加する。 In step 214, the transaction direction determination unit 84a of the category determination unit 84 refers to the transaction direction determination table 56 to determine the transaction direction 69 from the title 63b, the address 63c, and the transmission source 63d. In step 215, the category selection unit 84 b refers to the title / category conversion table 57, selects the category 61 of the journal entry data 60 from the title 63 b and the transaction direction 69, and adds it to the journal entry data 60.
 ステップ216において、距離計算ユニット85は、仕訳対象データ60のカテゴリ61と同一のカテゴリ71をキーとして持つ仕訳参照エントリ70を選択し、それらの間の距離Lを計算する。ステップ217において距離Lが第1の閾値Vt1よりも小さければ、ステップ218において、最短の仕訳参照エントリ70の勘定科目を仕訳対象データ60の勘定科目として出力し、仕訳対象データ60の証憑5の自動仕分けが完了する。一方、距離Lが第1の閾値Vt1以上であれば、距離計算による仕訳の精度が低い。このため、ステップ219において、同一カテゴリの仕訳参照エントリ70の中で、仕訳対象データ60と金額差が第2の閾値Vt2よりも小さく、さらに、直近の取引日である仕訳参照エントリ70が選択され、その勘定科目が出力される。 In step 216, the distance calculation unit 85 selects the journal entry 70 having the same category 71 as the category 61 of the journal data 60 as a key, and calculates the distance L between them. If the distance L is smaller than the first threshold value Vt1 in step 217, in step 218, the account item of the shortest journal entry entry 70 is output as the account item of the journal object data 60, and automatic verification of the voucher 5 of the journal object data 60 is performed. Sorting is complete. On the other hand, if the distance L is equal to or greater than the first threshold value Vt1, the accuracy of the journal entry by the distance calculation is low. Therefore, in step 219, among the journal reference entries 70 in the same category, the journal reference entry 70 having a difference in amount from the journal data 60 smaller than the second threshold value Vt2 and the latest transaction date is selected. , The account item is output.
 以上に説明したように、この会計支援システム1は、クラウド(コンピュータネットワーク)を介して第三者により証憑のデータを電子化し、分割データ化された文字情報を取得して集約した文字情報から仕訳対象データを生成する。分割データ化された文字情報は、仕訳対象の証憑に含まれる複数の文字情報が、証憑中の表記位置により分割された情報であり、1つの証憑に含まれる情報を複数の異なる作業者により電子化する。このため、一人の作業者は証憑に含まれるデータの断片を見るだけであり、証憑に含まれる情報の秘匿性を担保しながら、ネットワーク(クラウド)に接続可能な人員を証憑の電子化に利用でき、低コストで、安全に証憑のデータを電子化できる。 As described above, this accounting support system 1 converts the data of the voucher by a third party via the cloud (computer network), acquires the character information divided into divided data, and collects it from the collected character information. Generate target data. The character information converted into divided data is information obtained by dividing a plurality of character information included in a voucher to be journalized according to a notation position in the voucher, and the information included in one voucher is electronically converted by a plurality of different workers. Turn into. For this reason, one worker only looks at a piece of data contained in the voucher and uses the network (cloud) personnel to digitize the voucher while ensuring the confidentiality of the information contained in the voucher. It is possible to digitalize voucher data safely at low cost.
 さらに、この会計支援システム1は、自動的に仕訳を行う仕訳ユニット80を含む。勘定科目判定処理の自動化が進むことにより、会計専門家の作業は、最終結果確認のみに収束する。したがって、最小人数の会計専門家によって仕訳を含む会計処理を実施できる。この段階では、まず、電子化工程から渡ってきた文字列群は、勘定科目判定部にて、勘定
科目が判定される。情報技術を使い、自動で処理する方法と、会計専門家が手動で判断する方法、のどちらも選択できる。このシステムにより仕訳作業工程は最適化され、大幅な経理処理能力の向上を得ることができ、結果として、大幅に作業コストを下げることができる。
Furthermore, the accounting support system 1 includes a journal unit 80 that automatically performs journal entries. As the account item determination process is automated, the work of the accounting specialist converges only to confirm the final result. Therefore, accounting processing including journal entries can be performed by a minimum number of accounting specialists. At this stage, first, the account item is determined by the account item determination unit of the character string group that has been transferred from the digitization process. You can select either automatic processing using information technology or manual judgment by an accounting professional. This system optimizes the journal entry process and can greatly improve the accounting processing capacity. As a result, the operation cost can be greatly reduced.

Claims (15)

  1.  ユーザの取引の証拠となる証憑の仕訳先を出力する仕訳ユニットと、
     仕訳対象の証憑に含まれる複数の文字情報が前記証憑中の表記位置により分割された分割データを、前記仕訳対象の証憑を示す識別情報とともに、電子化のために異なる作業者に分散して送信する分散ユニットと、
     前記異なる作業者により電子化された、分割電子化された文字情報を取得し、前記分割電子化された文字情報から前記識別情報に基づき前記仕訳ユニットにより仕訳される仕訳対象データを生成する集約ユニットとを有するシステム。
    A journal unit that outputs the journal entry for voucher that is evidence of the user's transaction;
    A plurality of pieces of character information included in a journal to be journalized are divided and transmitted to different workers for digitization together with identification information indicating the journal to be journalized, along with identification information indicating the journal to be journalized. A decentralized unit to
    An aggregation unit that obtains characterized information that has been digitized by the different workers and that is generated by the journalizing unit based on the identification information from the characterized information that has been digitized. And a system having.
  2.  請求項1において、
     前記分散ユニットは、前記分割データを前記証憑中の表記位置により分類し、種類毎に前記異なる作業者に分散して送信するユニットを含む、システム。
    In claim 1,
    The distributed unit includes a unit that classifies the divided data according to a notation position in the voucher and distributes the divided data to the different workers for each type.
  3.  請求項1または2において、
     前記分散ユニットは、前記仕訳対象の証憑を文字情報の表記位置により分割して画像化した複数の証憑分割画像を生成するユニットと、
     前記複数の証憑分割画像を含む前記分割データを、コンピュータネットワークを介して前記異なる作業者に送信するユニットとを含む、システム。
    In claim 1 or 2,
    The distribution unit is a unit that generates a plurality of voucher-divided images obtained by dividing the journal of the journal to be journalized according to character information notation positions;
    A unit that transmits the divided data including the plurality of voucher divided images to the different workers via a computer network.
  4.  請求項1ないし3のいずれかにおいて、
     前記仕訳対象データは、前記仕訳対象の証憑の取引日、金額、および他の文字情報から抽出された単語配列を含み、
     前記仕訳ユニットは、前記仕訳対象データと、前記ユーザの過去の仕訳済みの証憑の情報をエントリとして含む帳簿のエントリ毎の取引日、金額、および他の文字情報から抽出された単語配列を含む複数の仕訳参照エントリとの距離を、取引日、金額、単語配列に含まれる各単語の相似をパラメータとして計算する類比判断ユニットと、
     前記仕訳対象データとの距離が最短の仕訳参照エントリの勘定科目を仕訳先として出力する第1の仕訳先出力ユニットとを含む、システム。
    In any of claims 1 to 3,
    The journal entry data includes a word sequence extracted from the transaction date, amount, and other character information of the journal entry voucher,
    The journal unit includes a plurality of word sequences extracted from the transaction date, the amount, and other character information for each entry in the book including the journal object data and the voucher information of the user's past journal as entries. A similarity determination unit that calculates the distance from the journal reference entry of the transaction date, the amount, and the similarity of each word included in the word array as a parameter;
    And a first journal output unit that outputs the account item of the journal reference entry having the shortest distance from the journal object data as a journal.
  5.  請求項4において、
     前記仕訳ユニットは、前記第1の仕訳先出力ユニットにより選択された前記最短の仕訳参照エントリとの距離が第1の閾値よりも大きいときは、前記仕訳対象データの金額との差が第2の閾値内で最も取引日が近い仕訳参照エントリの勘定科目を仕訳先として出力する第2の仕訳先出力ユニットを含む、システム。
    In claim 4,
    When the distance between the journal unit and the shortest journal reference entry selected by the first journal destination output unit is greater than a first threshold value, a difference from the amount of the journal data is a second value. A system including a second journal output unit that outputs the account of the journal reference entry having the closest transaction date within the threshold as the journal.
  6.  請求項4または5において、
     前記複数の仕訳参照エントリの前記勘定科目は複数のカテゴリに分けられ、前記複数の仕訳参照エントリのそれぞれはカテゴリの情報を含み、
     前記類比判断ユニットは、前記仕訳対象データの前記単語配列に含まれるタイトルおよび宛先を少なくとも示す単語に基づき決定されたカテゴリと同一のカテゴリの前記仕訳参照エントリと前記仕訳対象データとの距離を計算するカテゴリ別類比判断ユニットを含む、システム。
    In claim 4 or 5,
    The account items of the plurality of journal reference entries are divided into a plurality of categories, each of the plurality of journal reference entries includes category information;
    The similarity determination unit calculates a distance between the journal reference entry of the same category as the category determined based on at least a title and a word included in the word array of the journal target data and the journal target data. A system that includes a categorical comparison unit.
  7.  請求項6において、
     前記仕訳ユニットは、前記仕訳対象データの前記単語配列に含まれるタイトルおよび宛先を少なくとも示す単語に基づき、前記仕訳対象データが前記複数のカテゴリのいずれかのカテゴリに属するかを決定するカテゴリ判定ユニットを含む、システム。
    In claim 6,
    The journal unit includes a category determination unit that determines whether the journal data belongs to any one of the plurality of categories based on at least a word indicating a title and a destination included in the word array of the journal data. Including the system.
  8.  請求項7において、
     前記仕訳対象データの前記単語配列に含まれる単語の少なくとも一部は、前記証憑中に表記された前記単語の位置情報を含み、
     前記仕訳ユニットは、前記単語の位置情報に基づき、前記単語配列から前記タイトルおよび前記宛先を抽出するタイトル・宛名抽出ユニットを含む、システム。
    In claim 7,
    At least a part of words included in the word array of the journalizing target data includes positional information of the words written in the voucher,
    The journal unit includes a title / address extracting unit that extracts the title and the destination from the word array based on the position information of the word.
  9.  コンピュータを含むシステムによりユーザの取引の証拠となる証憑の仕訳先を出力することを含む方法であって、
     前記システムは、前記コンピュータがインターネットを介して複数の作業者とデータを交換する送受信ユニットを含み、
     当該方法は、
     前記コンピュータが、仕訳対象の証憑に含まれる複数の文字情報が前記証憑中の表記位置により分割された分割データを、前記仕訳対象の証憑を示す識別情報とともに、電子化のために異なる作業者に、前記送受信ユニットを介して分散して送信することと、
     前記異なる作業者により電子化された、分割電子化された文字情報を、前記送受信ユニットを介して取得し、前記分割電子化された文字情報から前記識別情報に基づき、前記仕訳先を出力することにおいて処理される仕訳対象データを生成することとを有する、方法。
    Outputting a voucher journal that is evidence of a user's transaction by a system including a computer, comprising:
    The system includes a transmission / reception unit in which the computer exchanges data with a plurality of workers via the Internet,
    The method is
    The computer converts the divided data obtained by dividing the plurality of character information included in the voucher to be journalized according to the notation position in the voucher together with identification information indicating the voucher to be journalized to different workers for digitization. , Transmitting in a distributed manner via the transceiver unit;
    Obtaining the digitized character information digitized by the different workers via the transmission / reception unit and outputting the journal from the divided digitized character information based on the identification information Generating journal object data to be processed at.
  10.  請求項9において、
     前記分散して送信することは、前記分割データを前記証憑中の表記位置により分類し、種類毎に前記異なる作業者に分散して送信することを含む、方法。
    In claim 9,
    The method of transmitting in a distributed manner includes classifying the divided data according to a notation position in the voucher and transmitting the divided data to the different workers for each type.
  11.  請求項9または10において、
     前記分散して送信することは、前記仕訳対象の証憑を文字情報の表記位置により分割して画像化した複数の証憑分割画像を含む前記分割データを前記異なる作業者に送信することを含む、方法。
    In claim 9 or 10,
    The method of transmitting in a distributed manner includes transmitting the divided data including a plurality of voucher divided images obtained by dividing the journal to be journalized according to the notation position of character information to the different workers. .
  12.  請求項9ないし11のいずれかにおいて、
     前記コンピュータは、メモリ上に、前記ユーザの過去の仕訳済みの証憑の情報をエントリとして含む帳簿のエントリ毎の取引日、金額、および他の文字情報から抽出された単語配列をそれぞれ含む、複数の仕訳参照エントリを含む仕訳済みデータベースを有し、
     前記仕訳対象データは、前記仕訳対象の証憑の取引日、金額、および他の文字情報から抽出された単語配列を含み、
     前記仕訳先を出力することは、
     前記コンピュータが、取引日、金額、単語配列に含まれる各単語の相似をパラメータとして、前記仕訳済みデータベースの前記複数の仕訳参照エントリと前記仕訳対象データとの距離を計算することと、
     前記仕訳対象データとの距離が最短の仕訳参照エントリの勘定科目を仕訳先として出力することとを含む、方法。
    In any of claims 9 to 11,
    The computer includes a plurality of word sequences extracted from a transaction date, an amount, and other character information for each entry in the book including, as an entry, information on the user's past journalized vouchers as entries. Have a journalized database with journal reference entries,
    The journal entry data includes a word sequence extracted from the transaction date, amount, and other character information of the journal entry voucher,
    Outputting the journal is as follows:
    The computer calculates a distance between the plurality of journal reference entries in the journalized database and the journal target data using a transaction date, an amount of money, and similarity of each word included in the word array as a parameter;
    Outputting the account item of the journal reference entry having the shortest distance from the journal object data as the journal.
  13.  コンピュータを、ユーザの取引の証拠となる証憑の仕訳先を出力する仕訳ユニットを有するシステムとして動作させるプログラムであって、
     前記システムは、さらに、仕訳対象の証憑に含まれる複数の文字情報が前記証憑中の表記位置により分割された分割データを前記仕訳対象の証憑を示す識別情報とともに、電子化のために異なる作業者に分散して送信する分散ユニットと、
     前記異なる作業者により電子化された、分割電子化された文字情報を取得し、前記分割電子化された文字情報から前記識別情報に基づき前記仕訳ユニットにより仕訳される仕訳対象データを生成する集約ユニットとを含む、プログラム。
    A program that causes a computer to operate as a system having a journal unit that outputs a journal entry of a voucher that is evidence of a user's transaction,
    The system further includes different workers for digitization of divided data obtained by dividing a plurality of character information included in a voucher to be journalized according to a notation position in the voucher together with identification information indicating the voucher to be journalized. A distributed unit that distributes and transmits to
    An aggregation unit that obtains characterized information that has been digitized by the different workers and that is generated by the journalizing unit based on the identification information from the characterized information that has been digitized. Including the program.
  14.  請求項13において、
     前記仕訳対象データは、前記仕訳対象の証憑の取引日、金額、および他の文字情報から抽出された単語配列を含み、
     前記仕訳ユニットは、前記仕訳対象データと、前記ユーザの過去の仕訳済みの証憑の情報をエントリとして含む帳簿のエントリ毎の取引日、金額、および他の文字情報から抽出された単語配列を含む複数の仕訳参照エントリとの距離を、取引日、金額、単語配列に含まれる各単語の相似をパラメータとして計算する類比判断ユニットと、
     前記仕訳対象データとの距離が最短の仕訳参照エントリの勘定科目を仕訳先として出力する第1の仕訳先出力ユニットとを含む、プログラム。
    In claim 13,
    The journal entry data includes a word sequence extracted from the transaction date, amount, and other character information of the journal entry voucher,
    The journal unit includes a plurality of word sequences extracted from the transaction date, the amount, and other character information for each entry in the book including the journal object data and the voucher information of the user's past journal as entries. A similarity determination unit that calculates the distance from the journal reference entry of the transaction date, the amount, and the similarity of each word included in the word array as a parameter;
    A first journal output unit that outputs the account item of the journal reference entry having the shortest distance from the journal object data as a journal.
  15.  請求項14において、
     前記コンピュータを、前記コンピュータに入力された前記ユーザの過去の前記帳簿のデータから、前記複数の仕訳参照エントリを含む仕訳済みデータベースを生成する手段として、さらに動作させるプログラム。
    In claim 14,
    A program for causing the computer to further operate as means for generating a journalized database including the plurality of journal reference entries from the user's past book data input to the computer.
PCT/JP2016/064758 2015-05-18 2016-05-18 Accounting assistance system WO2016186137A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP2017519380A JP6835713B2 (en) 2015-05-18 2016-05-18 Accounting support system

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
JP2015-101000 2015-05-18
JP2015101000 2015-05-18

Publications (1)

Publication Number Publication Date
WO2016186137A1 true WO2016186137A1 (en) 2016-11-24

Family

ID=57320027

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/JP2016/064758 WO2016186137A1 (en) 2015-05-18 2016-05-18 Accounting assistance system

Country Status (2)

Country Link
JP (2) JP6835713B2 (en)
WO (1) WO2016186137A1 (en)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP6165957B1 (en) * 2016-12-16 2017-07-19 ファーストアカウンティング株式会社 Accounting processing apparatus, accounting processing system, accounting processing method, and accounting processing program
JP2019023793A (en) * 2017-07-24 2019-02-14 株式会社オービックビジネスコンサルタント Journalizing information processing apparatus, journalizing information processing method, and program
JP2020017140A (en) * 2018-07-26 2020-01-30 富士ゼロックス株式会社 Information processing apparatus, program, and information processing system
JP2020035351A (en) * 2018-08-31 2020-03-05 富士ゼロックス株式会社 Information processing device, information processing system, and information processing program
WO2020050391A1 (en) * 2018-09-06 2020-03-12 株式会社スキャる System and method for displaying journalization

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH11143973A (en) * 1997-11-10 1999-05-28 Sg Engineering Kk Document information processing system and input work assisting system
JP2005085112A (en) * 2003-09-10 2005-03-31 Toshiba Corp Information classification system and program
WO2014148045A1 (en) * 2013-03-18 2014-09-25 フリー株式会社 Accounting apparatus, accounting method, and accounting program
JP2014235484A (en) * 2013-05-31 2014-12-15 弥生株式会社 Journalizing center system providing journalizing analysis service by cloud computing

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP4047090B2 (en) * 2002-07-31 2008-02-13 キヤノン株式会社 Image processing method and image processing apparatus
JP3707738B2 (en) * 2003-05-29 2005-10-19 株式会社日立製作所 Medical remuneration statement examination support apparatus and method and program thereof
JP4783200B2 (en) * 2006-05-08 2011-09-28 弘明 小野 Accounting journal system, accounting journal program, and recording medium recording the program
JP2009251914A (en) * 2008-04-04 2009-10-29 Hiroaki Matsuda Thin client network system
JP6100532B2 (en) * 2013-01-15 2017-03-22 株式会社富士通マーケティング Receipt definition data creation device and program thereof

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH11143973A (en) * 1997-11-10 1999-05-28 Sg Engineering Kk Document information processing system and input work assisting system
JP2005085112A (en) * 2003-09-10 2005-03-31 Toshiba Corp Information classification system and program
WO2014148045A1 (en) * 2013-03-18 2014-09-25 フリー株式会社 Accounting apparatus, accounting method, and accounting program
JP2014235484A (en) * 2013-05-31 2014-12-15 弥生株式会社 Journalizing center system providing journalizing analysis service by cloud computing

Cited By (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP6165957B1 (en) * 2016-12-16 2017-07-19 ファーストアカウンティング株式会社 Accounting processing apparatus, accounting processing system, accounting processing method, and accounting processing program
WO2018110522A1 (en) * 2016-12-16 2018-06-21 ファーストアカウンティング株式会社 Accounts processing device, accounts processing system, accounts processing method, and accounts processing program
JP2018097813A (en) * 2016-12-16 2018-06-21 ファーストアカウンティング株式会社 Accounting processing device, accounting processing system, accounting processing method, and accounting processing program
JP2019023793A (en) * 2017-07-24 2019-02-14 株式会社オービックビジネスコンサルタント Journalizing information processing apparatus, journalizing information processing method, and program
JP2020017140A (en) * 2018-07-26 2020-01-30 富士ゼロックス株式会社 Information processing apparatus, program, and information processing system
JP7192287B2 (en) 2018-07-26 2022-12-20 富士フイルムビジネスイノベーション株式会社 Information processing device, program, and information processing system
JP2020035351A (en) * 2018-08-31 2020-03-05 富士ゼロックス株式会社 Information processing device, information processing system, and information processing program
JP7247492B2 (en) 2018-08-31 2023-03-29 富士フイルムビジネスイノベーション株式会社 Information processing device, information processing system, and information processing program
WO2020050391A1 (en) * 2018-09-06 2020-03-12 株式会社スキャる System and method for displaying journalization
JP6741326B1 (en) * 2018-09-06 2020-08-19 株式会社スキャる System and method for displaying journals

Also Published As

Publication number Publication date
JP2019204535A (en) 2019-11-28
JP6835713B2 (en) 2021-02-24
JPWO2016186137A1 (en) 2018-03-08

Similar Documents

Publication Publication Date Title
JP2019204535A (en) Accounting support system
CN109034727B (en) Self-service electronic government affair processing method
CN109858453B (en) Universal multi-engine bill identification system and method
US6954729B2 (en) Address learning system and method for using same
US7668363B2 (en) Lockbox imaging system
US20230206347A1 (en) Electronic receipt-linking database system
US9002838B2 (en) Distributed capture system for use with a legacy enterprise content management system
US20060167877A1 (en) Inbound link incoming mail stream
US20100161460A1 (en) Method and system for source document data entry and form association
US20080103790A1 (en) System and method for duplicate detection
US10528626B2 (en) Document processing
US9390089B2 (en) Distributed capture system for use with a legacy enterprise content management system
CA2745712C (en) Systems and methods for intelligent paperless document management
JP2017054469A (en) Data management system, method and program
CN107844960B (en) Investment analysis tool for automatically and intelligently analyzing business plan
CN112785404A (en) Invoice issuing management system
WO2006105607A1 (en) A method and system for managing information
JP2018190064A (en) Accounting processing system
CA2914591C (en) Systems and methods for intelligent paperless document management
JP6741326B1 (en) System and method for displaying journals
CN116824604B (en) Financial data management method and system based on image processing
KR102283191B1 (en) Method and server for inspecting food advertising images
CN110175853B (en) Social group customer complaint information sorting method and social group customer complaint information sorting system
CN116851287A (en) Financial bill sorting method and system
JP2023182865A (en) Accounting assistance system

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 16796530

Country of ref document: EP

Kind code of ref document: A1

ENP Entry into the national phase

Ref document number: 2017519380

Country of ref document: JP

Kind code of ref document: A

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 16796530

Country of ref document: EP

Kind code of ref document: A1