JPWO2014010068A1

JPWO2014010068A1 - Program, document conversion apparatus, and document conversion method

Info

Publication number: JPWO2014010068A1
Application number: JP2014524560A
Authority: JP
Inventors: 承剛大山
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 2012-07-12
Filing date: 2012-07-12
Publication date: 2016-06-20
Anticipated expiration: 2032-07-12
Also published as: EP2874070A4; US20150120769A1; EP2874070A1; JP5991376B2; AU2012385075A1; US10127208B2; AU2012385075B2; WO2014010068A1

Abstract

文書変換装置（１）は、ＸＢＲＬ文書（４１）の項目間の階層関係を含む表示ツリー（２２１）に基づいて、財務報告書（２１）に含まれる項目のうちＸＢＲＬ文書（４１）において子を有する項目に対応する項目を検出し、検出した項目である検出項目の内容を分割し、分割された内容を、検出項目に対応するＸＢＲＬ文書（４１）における項目の子項目に対応付け、対応付けられた内容と子項目とを用いて、財務報告書（２１）をＸＢＲＬ文書（４１）に変換する。これにより、文書変換装置（１）は、財務報告書（２１）の科目を表示ツリー（２２１）の要素に適切にマッピングできる。Based on the display tree (221) including the hierarchical relationship between items of the XBRL document (41), the document conversion device (1) generates a child in the XBRL document (41) among the items included in the financial report (21). The item corresponding to the item that is included is detected, the content of the detected item that is the detected item is divided, and the divided content is associated with the child item of the item in the XBRL document (41) corresponding to the detected item. The financial report (21) is converted into an XBRL document (41) using the content and child items. Thereby, the document conversion apparatus (1) can appropriately map the subjects of the financial report (21) to the elements of the display tree (221).

Description

本発明は、プログラム、文書変換装置および文書変換方法に関する。 The present invention relates to a program, a document conversion apparatus, and a document conversion method.

ＸＢＲＬ（eXtensible Business Reporting Language）文書による財務諸表を金融庁へ提出することが義務付けられている。ＸＢＲＬ文書とは、例えば財務会計報告に関して、ＸＭＬ（eXtensible Markup Language）をベースにした報告書記述言語であるＸＢＲＬで記述された文書である。ＸＢＲＬ文書で用いられる要素（ＸＢＲＬ要素）には階層関係があり、階層関係は階層化文書によって示される。ＸＢＲＬ文書による財務諸表を提出すべき企業は、提出前に、ワープロ文書の財務諸表に含まれる各科目を、ＸＢＲＬ文書で用いられる各要素に対応付けて、ＸＢＲＬ文書を作成する。 It is obliged to submit financial statements in XBRL (eXtensible Business Reporting Language) documents to the Financial Services Agency. An XBRL document is a document described in XBRL, which is a report description language based on XML (eXtensible Markup Language), for example, for financial accounting reports. Elements used in the XBRL document (XBRL elements) have a hierarchical relationship, and the hierarchical relationship is indicated by the hierarchical document. A company that is to submit a financial statement based on an XBRL document creates an XBRL document by associating each subject included in the financial statement of the word processor document with each element used in the XBRL document before submission.

また、テキスト文書の各科目を階層化文書の各要素に対応付けるマッピングエンジンの技術が開示されている。かかる技術では、マッピングエンジンが、テキスト文書の各科目について、予め定められた関係に従った順序での入力を受け付け、入力を受け付けた科目に対応する階層化文書の要素を検索する。 In addition, a mapping engine technology that associates each subject of a text document with each element of a hierarchical document is disclosed. In this technique, the mapping engine accepts input in an order according to a predetermined relationship for each subject of the text document, and searches for an element of the hierarchical document corresponding to the subject that accepted the input.

特開２００３−３１６７６５号公報JP 2003-316765 A

しかしながら、上述した技術では、テキスト文書の財務諸表からＸＢＲＬ文書を作成する際、テキスト文書の科目を階層化文書の要素に適切にマッピングすることができない場合があるという問題があった。すなわち、テキスト文書の科目と階層化文書の要素とが１対１に対応していることを前提としているので、１対Ｎの場合に対応できない。つまり、テキスト文書の１個の科目が階層化文書のＮ個の要素に対応している場合、マッピングエンジンは、当該科目を階層化文書の該当する要素に適切にマッピングすることができない。 However, the above-described technique has a problem that when an XBRL document is created from a financial statement of a text document, the subject of the text document cannot be properly mapped to the elements of the hierarchical document. That is, since it is assumed that the text document subject and the hierarchical document element have a one-to-one correspondence, the one-to-N case cannot be handled. That is, when one subject of the text document corresponds to N elements of the hierarchical document, the mapping engine cannot appropriately map the subject to the corresponding element of the hierarchical document.

図１２〜図１４は、テキスト文書の科目を階層化文書の要素に適切にマッピングすることができない例を示す図である。図１２は、財務報告書の各科目と科目値の一例を示す図である。図１３および図１４は、階層化文書の各ＸＢＲＬ要素の関係の一例を示す図である。マッピングエンジンは、図１２に示す１個の科目である「報告期間」を、図１３に示す階層化文書の「報告期間」にマッピングすることができる。しかしながら、図１４に示す階層化文書の「報告期間」には、１つ下の階層に「期」と「開始日」と「終了日」とが存在している。すなわち、１個の科目が３個の要素に対応しているので、マッピングエンジンは、図１２に示す１個の科目である「報告期間」を、図１４に示す階層化文書の「報告期間」に適切にマッピングすることができない。 12 to 14 are diagrams illustrating examples in which subjects of a text document cannot be appropriately mapped to elements of a hierarchical document. FIG. 12 is a diagram showing an example of each subject and subject value of the financial report. 13 and 14 are diagrams showing an example of the relationship between the XBRL elements of the hierarchical document. The mapping engine can map “reporting period” which is one subject shown in FIG. 12 to “reporting period” of the hierarchical document shown in FIG. However, in the “reporting period” of the hierarchized document shown in FIG. 14, “period”, “start date”, and “end date” exist in the next lower layer. That is, since one subject corresponds to three elements, the mapping engine converts the “reporting period” that is one subject shown in FIG. 12 into the “reporting period” of the hierarchical document shown in FIG. Cannot be mapped properly.

１つの側面では、本発明は、テキスト文書の財務情報からＸＢＲＬ文書を作成する際、テキスト文書の科目を階層化文書の要素に適切にマッピングすることを目的とする。 In one aspect, an object of the present invention is to appropriately map a subject of a text document to an element of a hierarchical document when an XBRL document is created from financial information of the text document.

一態様のプログラムは、第１の文書から第２の文書に変換するプログラムにおいて、前記第２の文書の項目間の階層関係を含む階層化文書に基づいて、前記第１の文書に含まれる項目のうち前記第２の文書において子を有する項目に対応する項目を検出し、前記検出した項目である検出項目の内容を分割し、前記分割された内容を、前記検出項目に対応する前記第２の文書における項目の子項目に対応付け、前記対応付けられた内容と前記子項目とを用いて、前記第１の文書を前記第２の文書に変換する処理をコンピュータに実行させる。 The program according to one aspect is an item included in the first document based on a hierarchical document including a hierarchical relationship between items of the second document in the program for converting the first document into the second document. In the second document, an item corresponding to an item having a child is detected, the content of the detected item that is the detected item is divided, and the divided content is divided into the second item corresponding to the detected item. The computer is caused to execute a process of converting the first document into the second document using the associated contents and the child items in association with the child items of the items in the document.

一つの態様によれば、テキスト文書の財務情報からＸＢＲＬ文書を作成する際、テキスト文書の科目を階層化文書の要素に適切にマッピングすることができる。 According to one aspect, when an XBRL document is created from financial information of a text document, the text document subjects can be appropriately mapped to the elements of the hierarchical document.

図１は、実施例に係る文書変換装置の構成を示す機能ブロック図である。FIG. 1 is a functional block diagram illustrating the configuration of the document conversion apparatus according to the embodiment. 図２は、財務報告書の表紙のレイアウトの一例を示す図である。FIG. 2 is a diagram illustrating an example of a cover layout of a financial report. 図３は、表示ツリーの構成の一例を示す図である。FIG. 3 is a diagram illustrating an example of a configuration of a display tree. 図４は、要素宣言のデータ構造の一例を示す図である。FIG. 4 is a diagram illustrating an example of the data structure of the element declaration. 図５Ａは、マッピング補助情報のデータ構造の一例を示す図（１）である。FIG. 5A is a diagram (1) illustrating an example of a data structure of mapping auxiliary information. 図５Ｂは、マッピング補助情報のデータ構造の一例を示す図（２）である。FIG. 5B is a diagram (2) illustrating an example of the data structure of the mapping auxiliary information. 図６は、実施例に係る文書変換処理の主処理の手順を示すフローチャートである。FIG. 6 is a flowchart illustrating the procedure of the main process of the document conversion process according to the embodiment. 図７は、実施例に係るマッピング処理の手順を示すフローチャート（１）である。FIG. 7 is a flowchart (1) illustrating the procedure of the mapping process according to the embodiment. 図８は、実施例に係るマッピング処理の手順を示すフローチャート（２）である。FIG. 8 is a flowchart (2) illustrating the procedure of the mapping process according to the embodiment. 図９は、実施例に係るマッピング処理の手順を示すフローチャート（３）である。FIG. 9 is a flowchart (3) illustrating the procedure of the mapping process according to the embodiment. 図１０は、実施例に係るマッピング処理の手順を示すフローチャート（４）である。FIG. 10 is a flowchart (4) illustrating the procedure of the mapping process according to the embodiment. 図１１は、文書変換プログラムを実行するコンピュータの一例を示す図である。FIG. 11 is a diagram illustrating an example of a computer that executes a document conversion program. 図１２は、財務報告書の各科目と科目値を示す図である。FIG. 12 is a diagram showing each subject and subject value of the financial report. 図１３は、階層化文書の各ＸＢＲＬ要素の関係を示す図（１）である。FIG. 13 is a diagram (1) showing the relationship between the XBRL elements of the hierarchical document. 図１４は、階層化文書の各ＸＢＲＬ要素の関係を示す図（２）である。FIG. 14 is a diagram (2) showing the relationship between the XBRL elements of the hierarchical document.

以下に、本願の開示するプログラム、文書変換装置および文書変換方法の実施例を図面に基づいて詳細に説明する。なお、実施例によりこの発明が限定されるものではない。 Embodiments of a program, a document conversion apparatus, and a document conversion method disclosed in the present application will be described below in detail with reference to the drawings. The present invention is not limited to the embodiments.

［実施例に係る文書変換装置の構成］
図１は、実施例に係る文書変換装置の構成を示す機能ブロック図である。図１に示すように、文書変換装置１は、財務報告書２１とＸＢＲＬ要素の定義体２２とマッピング補助情報２３とを入力し、ＸＢＲＬ要素の定義体２２とマッピング補助情報２３とを用いて、財務報告書２１をＸＢＲＬ文書４１に変換する。[Configuration of Document Conversion Apparatus According to Embodiment]
FIG. 1 is a functional block diagram illustrating the configuration of the document conversion apparatus according to the embodiment. As shown in FIG. 1, the document conversion apparatus 1 inputs a financial report 21, an XBRL element definition body 22 and mapping auxiliary information 23, and uses the XBRL element definition body 22 and mapping auxiliary information 23. The financial report 21 is converted into an XBRL document 41.

財務報告書２１は、テキスト形式（ワード形式を含む）で作成された財務諸表であり、科目名と科目値とから表される。なお、財務報告書２１のレイアウトの一例については、後述する。 The financial report 21 is a financial statement prepared in a text format (including a word format), and is represented by a subject name and a subject value. An example of the layout of the financial report 21 will be described later.

ＸＢＲＬ要素の定義体２２は、ＸＢＲＬ文書４１で用いられる要素（ＸＢＲＬ要素）を定義する定義体であり、表示ツリー２２１および要素宣言２２２を有する。ここで、ＸＢＲＬ文書４１とは、財務報告に関して、ＸＭＬをベースにした報告書記述言語であるＸＢＲＬで記述された文書である。ＸＢＲＬ文書４１は、タクソノミおよびインスタンスからなる。タクソノミとは、ＸＢＲＬ要素の体系を定義したものであり、スキーマとリンクベースとからなる。スキーマとは、ＸＢＲＬ要素の名前やデータ型等の属性情報を記憶する辞書であり、実施例では要素宣言２２２に対応する。リンクベースとは、例えばＸＢＲＬ要素間の親子関係や表示順や表示名等を記述する文書である。そして、インスタンス（ＸＢＲＬインスタンスともいう）とは、ＸＢＲＬ要素の具体的な値を記述した報告文書である。なお、表示ツリー２２１および要素宣言２２２の構成例については、後述する。 The XBRL element definition body 22 is a definition body that defines an element (XBRL element) used in the XBRL document 41, and includes a display tree 221 and an element declaration 222. Here, the XBRL document 41 is a document described in XBRL, which is a report description language based on XML, regarding financial reports. The XBRL document 41 includes a taxonomy and an instance. A taxonomy defines a system of XBRL elements and includes a schema and a link base. The schema is a dictionary that stores attribute information such as the name and data type of the XBRL element, and corresponds to the element declaration 222 in the embodiment. The link base is a document that describes, for example, a parent-child relationship between XBRL elements, a display order, a display name, and the like. An instance (also referred to as an XBRL instance) is a report document describing a specific value of an XBRL element. A configuration example of the display tree 221 and the element declaration 222 will be described later.

マッピング補助情報２３は、財務報告書２１の科目値をＸＢＲＬ要素にマッピングする際に用いられる補助情報である。なお、マッピング補助情報２３の内容については、後述する。 The mapping auxiliary information 23 is auxiliary information used when mapping the subject value of the financial report 21 to the XBRL element. The contents of the mapping auxiliary information 23 will be described later.

また、文書変換装置１は、記憶部２と、制御部３とを有する。 In addition, the document conversion apparatus 1 includes a storage unit 2 and a control unit 3.

記憶部２は、例えばフラッシュメモリ（Flash Memory）やＦＲＡＭ（登録商標）（Ferroelectric Random Access Memory）等の不揮発性の半導体メモリ素子等の記憶装置に対応する。そして、記憶部２は、財務報告書２１、ＸＢＲＬ要素の定義体２２およびマッピング補助情報２３を有する。 The memory | storage part 2 respond | corresponds to memory | storage devices, such as non-volatile semiconductor memory elements, such as flash memory (Flash Memory) and FRAM (trademark) (Ferroelectric Random Access Memory), for example. The storage unit 2 includes a financial report 21, an XBRL element definition body 22, and mapping auxiliary information 23.

財務報告書２１は、後述する入力部３０によって記憶部２に格納される。ＸＢＲＬ要素の定義体２２は、後述する入力部３０によって記憶部２に格納される。マッピング補助情報２３は、後述する入力部３０によって記憶部２に格納される。 The financial report 21 is stored in the storage unit 2 by the input unit 30 described later. The definition body 22 of the XBRL element is stored in the storage unit 2 by the input unit 30 described later. The mapping auxiliary information 23 is stored in the storage unit 2 by the input unit 30 described later.

制御部３は、各種の処理手順を規定したプログラムや制御データを格納するための内部メモリを有し、これらによって種々の処理を実行する。そして、制御部３は、例えば、ＡＳＩＣ（Application Specific Integrated Circuit）やＦＰＧＡ（Field Programmable Gate Array）等の集積回路またはＣＰＵ（Central Processing Unit）やＭＰＵ（Micro Processing Unit）等の電子回路に対応する。さらに、制御部３は、入力部３０と、レイアウト解析部３１と、定義体解析部３２と、マッピング部３３と、出力部３４とを有する。 The control unit 3 has an internal memory for storing programs defining various processing procedures and control data, and executes various processes using these. And the control part 3 respond | corresponds to electronic circuits, such as integrated circuits, such as ASIC (Application Specific Integrated Circuit) and FPGA (Field Programmable Gate Array), or CPU (Central Processing Unit) and MPU (Micro Processing Unit), for example. Further, the control unit 3 includes an input unit 30, a layout analysis unit 31, a definition body analysis unit 32, a mapping unit 33, and an output unit 34.

入力部３０は、財務報告書２１を入力し、入力した財務報告書２１を記憶部２に格納する。また、入力部３０は、ＸＢＲＬ要素の定義体２２を入力し、入力したＸＢＲＬ要素の定義体２２を記憶部２に格納する。また、入力部３０は、マッピング補助情報２３を入力し、入力したマッピング補助情報２３を記憶部２に格納する。例えば、入力部３０は、財務報告書２１、ＸＢＲＬ要素の定義体２２、マッピング補助情報２３をファイルにより入力する。 The input unit 30 inputs the financial report 21 and stores the input financial report 21 in the storage unit 2. The input unit 30 also inputs the XBRL element definition body 22 and stores the input XBRL element definition body 22 in the storage unit 2. The input unit 30 inputs the mapping auxiliary information 23 and stores the input mapping auxiliary information 23 in the storage unit 2. For example, the input unit 30 inputs the financial report 21, the XBRL element definition body 22, and the mapping auxiliary information 23 by a file.

レイアウト解析部３１は、財務報告書２１のレイアウトを解析する。例えば、レイアウト解析部３１は、財務報告書２１を記憶部２から読み出し、読み出した財務報告書２１より、ＸＢＲＬ要素にマッピングする科目の名前と値とを取得する。ここで、財務報告書２１のレイアウトについて、図２を参照して説明する。図２は、財務報告書の表紙のレイアウトの一例を示す図である。図２に示すように、財務報告書２１の表紙には、科目名２１ａと科目値２１ｂが記述されている。科目名２１ａは、科目の名前を示す。科目値２１ｂは、科目名に対する値（文字列）を示す。一例として、科目名２１ａが「提出書類」である場合、科目値２１ｂとして「有価証券報告書」が記述されている。科目名２１ａが「報告期間」である場合、科目値２１ｂとして「第１００期（自平成２２年４月１日至平成２３年３月３１日」が記述されている。 The layout analysis unit 31 analyzes the layout of the financial report 21. For example, the layout analysis unit 31 reads the financial report 21 from the storage unit 2, and acquires the name and value of the subject to be mapped to the XBRL element from the read financial report 21. Here, the layout of the financial report 21 will be described with reference to FIG. FIG. 2 is a diagram illustrating an example of a cover layout of a financial report. As shown in FIG. 2, on the cover of the financial report 21, a subject name 21a and a subject value 21b are described. The course name 21a indicates the name of the course. The subject value 21b indicates a value (character string) for the subject name. As an example, when the subject name 21a is “submission document”, “securities report” is described as the subject value 21b. When the subject name 21a is “reporting period”, “100th term (from April 1, 2010 to March 31, 2011)” is described as the subject value 21b.

定義体解析部３２は、ＸＢＲＬ要素の定義体２２を解析する。例えば、定義体解析部３２は、ＸＢＲＬ要素の定義体２２を記憶部２から読み出す。そして、定義体解析部３２は、ＸＢＲＬ要素の定義体２２の表示ツリー２２１より、いずれかのＸＢＲＬ要素にマッピングしようとしている財務報告書２１の科目の名前と一致または類似する表示名か要素名を持つＸＢＲＬ要素を取得する。そして、定義体解析部３２は、ＸＢＲＬ要素の定義体２２の要素宣言２２２より、取得したＸＢＲＬ要素の要素宣言を取得する。ここで、表示ツリー２２１および要素宣言２２２の構成について、図３および図４を参照して説明する。 The definition body analysis unit 32 analyzes the definition body 22 of the XBRL element. For example, the definition body analysis unit 32 reads the definition body 22 of the XBRL element from the storage unit 2. Then, the definition body analysis unit 32 displays a display name or element name that matches or is similar to the name of the subject of the financial report 21 that is to be mapped to any XBRL element from the display tree 221 of the definition body 22 of the XBRL element. Get the XBRL element you have. Then, the definition body analysis unit 32 acquires the element declaration of the acquired XBRL element from the element declaration 222 of the definition body 22 of the XBRL element. Here, the configuration of the display tree 221 and the element declaration 222 will be described with reference to FIGS.

図３は、表示ツリーの構成の一例を示す図である。図３に示すように、表示ツリー２２１には、ＸＢＲＬ要素の表示上の要素間の関係がツリー構造で表されている。ＸＢＲＬ要素は、表示名と要素名を使って表される。一例として、表示名が「提出書類」である場合、要素名として「ReportName」が記述されている。このＸＢＲＬ要素には、子孫関係の要素が存在しない。また、別の例として、表示名が「報告期間」である場合、要素名として「ReportingPeriod」が記述されている。このＸＢＲＬ要素には、表示名がそれぞれ「期」、「開始日」、「終了日」の子孫関係のＸＢＲＬ要素が存在している。表示名が「期」である場合、要素名として「StageOfReportingPeriod」が記述されている。表示名が「開始日」である場合、要素名として「StartDateOfReportingPeriod」が記述されている。表示名が「終了日」である場合、要素名として「EndDateOfReportingPeriod」が記述されている。 FIG. 3 is a diagram illustrating an example of a configuration of a display tree. As shown in FIG. 3, the display tree 221 shows the relationship between the elements on the display of the XBRL element in a tree structure. The XBRL element is represented using a display name and an element name. As an example, when the display name is “submission document”, “ReportName” is described as the element name. This XBRL element has no descendant relationship element. As another example, when the display name is “reporting period”, “ReportingPeriod” is described as the element name. In this XBRL element, there are XBRL elements having descendant relationships whose display names are “period”, “start date”, and “end date”, respectively. When the display name is “period”, “StageOfReportingPeriod” is described as the element name. When the display name is “start date”, “StartDateOfReportingPeriod” is described as the element name. When the display name is “end date”, “EndDateOfReportingPeriod” is described as the element name.

図４は、要素宣言のデータ構造の一例を示す図である。図４に示すように、要素宣言２２２には、要素名２２２ｂと、抽象要素フラグ２２２ｃと、ｎｉｌ値許可フラグ２２２ｄと、データ型２２２ｅと、制約２２２ｆとが対応付けて記憶される。なお、説明の便宜上、要素宣言２２２には、ＸＢＲＬ要素の表示名であって要素名２２２ｂに対応する表示名を表示名２２２ａとして記載するものとする。要素名２２２ｂは、ＸＢＲＬ要素の要素名を示す。抽象要素フラグ２２２ｃは、具体的な値がマッピングされない抽象要素（見出し）であるか否かを示すフラグである。抽象要素である場合には、例えば「○」が設定され、抽象要素でない場合には、例えば「×」が設定される。ｎｉｌ値許可フラグ２２２ｄは、ｎｉｌ値を許可する要素であるか否かを示すフラグである。ｎｉｌ値を許可する要素である場合には、例えば「○」が設定され、ｎｉｌ値を許可しない要素である場合には、例えば「×」が設定される。なお、ｎｉｌ値とは、ＮＵＬＬ値（空値）と同義であり、値をもたないことを意味する。データ型２２２ｅは、マッピングされる値のデータ型を示す。例えば、文字列の場合には「文字列型」、日付の場合には「日付型」、正の整数の場合には「正の整数型」が設定される。制約２２２ｆは、要素にマッピングされる値の書式の制約を示す。例えば、数字３桁と数字４桁をハイフンでつなぐ書式の制約がある場合には、その旨が設定される。 FIG. 4 is a diagram illustrating an example of the data structure of the element declaration. As shown in FIG. 4, in the element declaration 222, an element name 222b, an abstract element flag 222c, a nil value permission flag 222d, a data type 222e, and a constraint 222f are stored in association with each other. For convenience of description, the element declaration 222 includes a display name corresponding to the element name 222b, which is the display name of the XBRL element, as the display name 222a. The element name 222b indicates the element name of the XBRL element. The abstract element flag 222c is a flag indicating whether or not an abstract element (heading) to which a specific value is not mapped. If it is an abstract element, for example, “◯” is set, and if it is not an abstract element, for example, “x” is set. The nil value permission flag 222d is a flag indicating whether or not the element is a nil value permission element. For example, “◯” is set for an element that allows a nil value, and “X” is set for an element that does not allow a nil value. Note that the nil value is synonymous with a NULL value (null value) and means having no value. The data type 222e indicates the data type of the value to be mapped. For example, “character string type” is set for a character string, “date type” for a date, and “positive integer type” for a positive integer. The constraint 222f indicates a format restriction of a value mapped to the element. For example, if there is a format restriction that connects 3 digits and 4 digits with a hyphen, this is set.

一例として、表示名２２２ａが「報告期間」である場合、抽象要素フラグ２２２ｃとして「○」、ｎｉｌ値フラグ２２２ｄとして「×」、データ型２２２ｅとして「文字列型」が設定される。表示名２２２ａが「期」である場合、抽象要素フラグ２２２ｃとして「×」、ｎｉｌ値フラグ２２２ｄとして「×」、データ型２２２ｅとして「正の整数型」が設定される。表示名２２２ａが「開始日」である場合、抽象要素フラグ２２２ｃとして「×」、ｎｉｌ値フラグ２２２ｄとして「×」、データ型２２２ｅとして「日付型」が設定される。 As an example, when the display name 222a is “reporting period”, “◯” is set as the abstract element flag 222c, “x” is set as the nil value flag 222d, and “character string type” is set as the data type 222e. When the display name 222a is “period”, “x” is set as the abstract element flag 222c, “x” is set as the nil value flag 222d, and “positive integer type” is set as the data type 222e. When the display name 222a is “start date”, “x” is set as the abstract element flag 222c, “x” is set as the nil value flag 222d, and “date type” is set as the data type 222e.

図１に戻って、マッピング部３３は、ＸＢＲＬ要素の定義体２２に基づいて、財務報告書２１に含まれる科目のうちＸＢＲＬ文書４１において子孫を有する要素に対応する科目を検出する。例えば、マッピング部３３は、定義体解析部３２によって取得されたＸＢＲＬ要素の要素宣言に基づいて、ＸＢＲＬ要素が抽象要素であるか否かを判定する。そして、マッピング部３３は、ＸＢＲＬ要素が抽象要素であると判定した場合、表示ツリー２２１のＸＢＲＬ要素間の親子関係に基づいて、当該ＸＢＲＬ要素が表示ツリー２２１上で階層化されている、すなわち子孫の要素を有しているか否かを判定する。そして、マッピング部３３は、ＸＢＲＬ要素が表示ツリー２２１上で階層化されていると判定した場合、当該ＸＢＲＬ要素を幾つかの要素から構成された要素と判断する。すなわち、マッピング部３３は、当該ＸＢＲＬ要素にマッピングする科目を、その科目値を分割すべき科目として検出する。 Returning to FIG. 1, the mapping unit 33 detects subjects corresponding to elements having descendants in the XBRL document 41 among subjects included in the financial report 21 based on the definition body 22 of the XBRL element. For example, the mapping unit 33 determines whether the XBRL element is an abstract element based on the element declaration of the XBRL element acquired by the definition body analysis unit 32. When the mapping unit 33 determines that the XBRL element is an abstract element, the XBRL element is hierarchized on the display tree 221 based on the parent-child relationship between the XBRL elements of the display tree 221. It is determined whether or not it has any element. When the mapping unit 33 determines that the XBRL element is hierarchized on the display tree 221, the mapping unit 33 determines that the XBRL element is an element composed of several elements. That is, the mapping unit 33 detects a subject to be mapped to the XBRL element as a subject whose subject value should be divided.

マッピング部３３は、ＸＢＲＬ要素の定義体２２に基づいて、検出した科目の科目値を分割し、分割した科目値の各値を、対応するＸＢＲＬ要素の子孫の要素にマッピングする。例えば、マッピング部３３は、表示ツリー２２１から、検出した科目に対応するＸＢＲＬ要素の子孫要素を、出現順に選択する。そして、マッピング部３３は、選択した子孫要素に対応する要素宣言を要素宣言２２２から取得する。そして、マッピング部３３は、取得した要素宣言内のデータ型２２２ｅに合致した部分を、検出した科目の科目値から抽出し、抽出した部分を、選択中の子孫要素にマッピングする。ここで、マッピング部３３は、データ型２２２ｅに合致した部分を科目値から抽出できない場合、選択中の子孫要素がｎｉｌ値を許可する要素であれば科目値の入力が省略されたと判断する。そして、マッピング部３３は、選択中の子孫要素に何もマッピングしない。そして、マッピング部は、代わりに、ｎｉｌ値に関する属性であるｎｉｌ属性にｎｉｌ値であることを示す「ｔｒｕｅ」を設定する。なお、ｎｉｌ値を許可する要素であるか否かは、子孫要素の要素宣言内のｎｉｌ値許可フラグ２２２ｄによって判定される。 The mapping unit 33 divides the subject value of the detected subject based on the definition body 22 of the XBRL element, and maps each value of the divided subject value to the descendant element of the corresponding XBRL element. For example, the mapping unit 33 selects descendant elements of the XBRL element corresponding to the detected subject from the display tree 221 in the order of appearance. Then, the mapping unit 33 acquires an element declaration corresponding to the selected descendant element from the element declaration 222. Then, the mapping unit 33 extracts a portion that matches the data type 222e in the acquired element declaration from the subject value of the detected subject, and maps the extracted portion to the selected descendant element. Here, if the portion matching the data type 222e cannot be extracted from the course value, the mapping unit 33 determines that the entry of the course value is omitted if the selected descendant element permits the nil value. The mapping unit 33 then maps nothing to the selected descendant element. Instead, the mapping unit sets “true” indicating the nil value to the nil attribute that is an attribute related to the nil value. Whether or not an element permits a nil value is determined by the nil value permission flag 222d in the element declaration of the descendant element.

具体例として、図２で示す財務報告書２１内の科目「報告期間」を図３で示す表示ツリー２２１上のＸＢＲＬ要素「報告期間」へマッピングする場合について説明する。なお、ＸＢＲＬ要素の要素宣言として、図４で示す要素宣言２２２が用いられるものとする。マッピング部３３は、ＸＢＲＬ要素「報告期間」の要素宣言に基づいて、ＸＢＲＬ要素が抽象要素であるか否かを判定する。ここでは、マッピング部３３は、図４で示す要素宣言２２２のうちＸＢＲＬ要素「報告期間」の要素宣言の抽象要素フラグ２２２ｃが「○」であるので、ＸＢＲＬ要素「報告期間」は抽象要素であると判定する。 As a specific example, the case where the subject “reporting period” in the financial report 21 shown in FIG. 2 is mapped to the XBRL element “reporting period” on the display tree 221 shown in FIG. 3 will be described. Note that the element declaration 222 shown in FIG. 4 is used as the element declaration of the XBRL element. The mapping unit 33 determines whether the XBRL element is an abstract element based on the element declaration of the XBRL element “report period”. Here, since the abstract element flag 222c of the element declaration of the XBRL element “reporting period” in the element declaration 222 shown in FIG. 4 is “◯”, the mapping unit 33 is an XBRL element “reporting period”. Is determined.

そして、マッピング部３３は、ＸＢＲＬ要素「報告期間」が抽象要素であると判定したので、表示ツリー２２１のＸＢＲＬ要素間の親子関係に基づいて、ＸＢＲＬ要素「報告期間」が表示ツリー２２１上で子孫の要素を持つか否かを判定する。ここでは、マッピング部３３は、図３で示す表示ツリー２２１に基づいて、ＸＢＲＬ要素「報告期間」が３つの子孫の要素「期」、「開始日」、「終了日」を持つと判定する。したがって、マッピング部３３は、ＸＢＲＬ要素「報告期間」を３つの要素から構成されていると判断する。すなわち、マッピング部３３は、ＸＢＲＬ要素「報告期間」にマッピングする科目「報告期間」を、その科目値を分割すべき科目として検出する。 Then, since the mapping unit 33 determines that the XBRL element “report period” is an abstract element, the XBRL element “report period” is a descendant on the display tree 221 based on the parent-child relationship between the XBRL elements in the display tree 221. It is determined whether or not it has any element. Here, the mapping unit 33 determines that the XBRL element “report period” has three descendant elements “period”, “start date”, and “end date” based on the display tree 221 shown in FIG. Therefore, the mapping unit 33 determines that the XBRL element “report period” is composed of three elements. In other words, the mapping unit 33 detects a subject “reporting period” to be mapped to the XBRL element “reporting period” as a subject whose subject value should be divided.

そして、マッピング部３３は、表示ツリー２２１から、検出した科目「報告期間」に対応するＸＢＲＬ要素「報告期間」の子孫要素を順番に選択する。ここでは、マッピング部３３は、図３で示す表示ツリー２２１からＸＢＲＬ要素「報告期間」の子孫要素「期」、「開始日」、「終了日」を順番に選択する。そして、マッピング部３３は、検出した科目「報告期間」の科目値から、選択した子孫要素に対応する要素宣言内のデータ型２２２ｅに合致した部分を抽出する。そして、マッピング部３３は、抽出した部分を、選択した子孫要素にマッピングする。 Then, the mapping unit 33 sequentially selects descendant elements of the XBRL element “reporting period” corresponding to the detected subject “reporting period” from the display tree 221. Here, the mapping unit 33 selects descendant elements “period”, “start date”, and “end date” of the XBRL element “report period” from the display tree 221 shown in FIG. Then, the mapping unit 33 extracts a part that matches the data type 222e in the element declaration corresponding to the selected descendant element from the subject value of the detected subject “reporting period”. Then, the mapping unit 33 maps the extracted part to the selected descendant element.

ここでは、図２に示すように、科目名２１ａ「報告期間」の科目値２１ｂが「第１００期（自平成２２年４月１日至平成２３年３月３１日）」である。そこで、マッピング部３３は、科目値２１ｂから、子孫要素「期」のデータ型２２２ｅ「正の整数型」に合致した部分「１００」を抽出する。そして、マッピング部３３は、抽出した部分「１００」を、子孫要素「期」にマッピングする。以下のタグは、ＸＢＲＬ文書４１を出力したときの結果である。
<StageOfReportingPeriod>100</StageOfReportingPeriod>Here, as shown in FIG. 2, the subject value 21b of the subject name 21a “reporting period” is “100th term (from April 1, 2010 to March 31, 2011)”. Therefore, the mapping unit 33 extracts a portion “100” that matches the data type 222e “positive integer type” of the descendant element “period” from the subject value 21b. Then, the mapping unit 33 maps the extracted part “100” to the descendant element “period”. The following tags are the results when the XBRL document 41 is output.
<StageOfReportingPeriod> 100 </ StageOfReportingPeriod>

また、マッピング部３３は、科目値２１ｂから、子孫要素「開始日」のデータ型２２２ｅ「日付型」に合致した部分「平成２２年４月１日」を抽出する。そして、マッピング部３３は、抽出した部分を、子孫要素「開始日」にマッピングする。以下のタグは、ＸＢＲＬ文書４１を出力したときの結果である。なお、以下の例では、抽出した部分は、和暦から西暦に変換された後マッピングされているものとする。
<StartDateOfReportingPeriod>2010-04-01</StartDateOfReportingPeriod>Further, the mapping unit 33 extracts a part “April 1, 2010” that matches the data type 222e “date type” of the descendant element “start date” from the subject value 21b. The mapping unit 33 maps the extracted part to the descendant element “start date”. The following tags are the results when the XBRL document 41 is output. In the following example, it is assumed that the extracted portion is mapped after being converted from the Japanese calendar to the Western calendar.
<StartDateOfReportingPeriod> 2010-04-01 </ StartDateOfReportingPeriod>

さらに、マッピング部３３は、科目値２１ｂから、子孫要素「終了日」のデータ型２２２ｅ「日付型」に合致した部分「平成２３年３月３１日」を抽出する。そして、マッピング部３３は、抽出した部分を、子孫要素「終了日」にマッピングする。以下のタグは、ＸＢＲＬ文書４１を出力したときの結果である。なお、以下の例では、抽出した部分は、和暦から西暦に変換された後マッピングされているものとする。
<EndDateOfReportingPeriod>2011-03-31</EndDateOfReportingPeriod>Further, the mapping unit 33 extracts a part “March 31, 2011” that matches the data type 222e “date type” of the descendant element “end date” from the subject value 21b. The mapping unit 33 maps the extracted part to the descendant element “end date”. The following tags are the results when the XBRL document 41 is output. In the following example, it is assumed that the extracted portion is mapped after being converted from the Japanese calendar to the Western calendar.
<EndDateOfReportingPeriod> 2011-03-31 </ EndDateOfReportingPeriod>

別の具体例として、図２で示す財務報告書２１内の科目の科目名２１ａ「本店の所在の場所」を図３で示す表示ツリー２２１上のＸＢＲＬ要素「本店の所在の場所」へマッピングする場合について説明する。なお、ＸＢＲＬ要素の要素宣言として、図４で示す要素宣言２２２が用いられるものとする。マッピング部３３は、ＸＢＲＬ要素「本店の所在の場所」の要素宣言に基づいて、ＸＢＲＬ要素が抽象要素であるか否かを判定する。ここでは、マッピング部３３は、図４で示す要素宣言２２２のうちＸＢＲＬ要素「本店の所在の場所」の要素宣言の抽象要素フラグ２２２ｃが「○」であるので、ＸＢＲＬ要素「本店の所在の場所」は抽象要素であると判定する。 As another specific example, the subject name 21a “location of the head office” of the subject in the financial report 21 shown in FIG. 2 is mapped to the XBRL element “location of the head office” on the display tree 221 shown in FIG. The case will be described. Note that the element declaration 222 shown in FIG. 4 is used as the element declaration of the XBRL element. The mapping unit 33 determines whether the XBRL element is an abstract element based on the element declaration of the XBRL element “location of the head office”. Here, since the abstract element flag 222c of the element declaration of the XBRL element “location of the head office” in the element declaration 222 shown in FIG. 4 is “◯”, the mapping unit 33 sets the XBRL element “location of the head office” "Is determined to be an abstract element.

そして、マッピング部３３は、ＸＢＲＬ要素「本店の所在の場所」が抽象要素であると判定したので、表示ツリー２２１のＸＢＲＬ要素間の親子関係に基づいて、ＸＢＲＬ要素「本店の所在の場所」が表示ツリー２２１上で子孫の要素を持つか否かを判定する。ここでは、マッピング部３３は、図３で示す表示ツリー２２１に基づいて、ＸＢＲＬ要素「本店の所在の場所」が２つの子孫の要素「郵便番号」、「住所」を持つと判定する。したがって、マッピング部３３は、ＸＢＲＬ要素「本店の所在の場所」を２つの要素から構成されていると判断する。すなわち、マッピング部３３は、ＸＢＲＬ要素「本店の所在の場所」にマッピングする科目「本店の所在の場所」を、その科目値を分割すべき科目として検出する。 Then, since the mapping unit 33 determines that the XBRL element “location of the head office” is an abstract element, the XBRL element “location of the head office” is determined based on the parent-child relationship between the XBRL elements of the display tree 221. It is determined whether or not there are descendant elements on the display tree 221. Here, the mapping unit 33 determines that the XBRL element “location of the head office” has two descendant elements “zip code” and “address” based on the display tree 221 shown in FIG. 3. Therefore, the mapping unit 33 determines that the XBRL element “location of the head office” is composed of two elements. That is, the mapping unit 33 detects the subject “location of the head office” that is mapped to the XBRL element “location of the head office” as a subject to which the subject value should be divided.

そして、マッピング部３３は、表示ツリー２２１から、検出した科目の科目名２１ａ「本店の所在の場所」に対応するＸＢＲＬ要素「本店の所在の場所」の子孫要素を順番に選択する。ここでは、マッピング部３３は、図３で示す表示ツリー２２１からＸＢＲＬ要素「本店の所在の場所」の子孫要素「郵便番号」、「住所」を順番に選択する。そして、マッピング部３３は、検出した科目の科目名２１ａ「本店の所在の場所」の科目値２１ｂから、選択した子孫要素に対応する要素宣言内のデータ型２２２ｅおよび制約２２２ｆに合致した部分を抽出する。そして、マッピング部３３は、抽出した部分を、選択した子孫要素にマッピングする。 Then, the mapping unit 33 sequentially selects descendant elements of the XBRL element “location of the head office” corresponding to the subject name 21 a “location of the head office” of the detected subject from the display tree 221. Here, the mapping unit 33 sequentially selects descendant elements “zip code” and “address” of the XBRL element “location of the head office” from the display tree 221 shown in FIG. Then, the mapping unit 33 extracts a part that matches the data type 222e and the constraint 222f in the element declaration corresponding to the selected descendant element from the subject value 21b of the subject name 21a “location of the head office” of the detected subject. To do. Then, the mapping unit 33 maps the extracted part to the selected descendant element.

ここでは、図２に示すように、科目の科目名２１ａ「本店の所在の場所」の科目値２１ｂは、「東京都新宿区新宿１丁目１番１号」である。そこで、マッピング部３３は、この科目値２１ｂから、子孫要素「郵便番号」のデータ型２２２ｅ「文字列型」と制約２２２ｆ「数字３桁“−”数字４桁」に合致した部分の抽出を試みる。ところが、マッピング部３３は、科目値から、データ型２２２ｅと制約２２２ｆに合致した部分を抽出できない。そこで、マッピング部３３は、選択した子孫要素「郵便番号」のｎｉｌ値許可フラグ２２２ｄが「○」であるので、子孫要素「郵便番号」がｎｉｌ値を許可する要素であると判定する。そこで、マッピング部３３は、科目値の入力が省略されたと判断して、選択した子孫要素「郵便番号」に何もマッピングしないで、代わりに、ｎｉｌ属性に「ｔｒｕｅ」を設定する。以下のタグは、ＸＢＲＬ文書４１を出力したときの、子孫要素「郵便番号」に関する結果である。
<ZipcodeOfCompanyLocation nil=”true”/>Here, as shown in FIG. 2, the subject value 21b of the subject name 21a “location of the head office” is “1-1, Shinjuku 1-1, Shinjuku-ku, Tokyo”. Therefore, the mapping unit 33 tries to extract a portion that matches the data type 222e “character string type” of the descendant element “zip code” and the constraint 222f “three digits“ − ”four digits” from the subject value 21b. . However, the mapping unit 33 cannot extract a portion that matches the data type 222e and the constraint 222f from the subject value. Therefore, since the nil value permission flag 222d of the selected descendant element “zip code” is “◯”, the mapping unit 33 determines that the descendant element “zip code” is an element that permits the nil value. Therefore, the mapping unit 33 determines that the entry of the subject value has been omitted, does not map anything to the selected descendant element “zip code”, and instead sets “true” to the nil attribute. The following tag is a result regarding the descendant element “zip code” when the XBRL document 41 is output.
<ZipcodeOfCompanyLocation nil = ”true” />

また、マッピング部３３は、科目値から、子孫要素「住所」のデータ型２２２ｅ「文字列型」に合致した部分「東京都新宿区新宿１丁目１番１号」を抽出する。そして、マッピング部３３は、抽出した部分を、子孫要素「住所」にマッピングする。以下のタグは、ＸＢＲＬ文書４１を出力したときの結果である。
<AddressOfCompanyLocation>東京都新宿区新宿１丁目１番１号</AddressOfCompanyLocation>Further, the mapping unit 33 extracts a portion “1-1-1 Shinjuku 1-chome Shinjuku-ku, Tokyo” that matches the data type 222e “character string type” of the descendant element “address” from the subject value. The mapping unit 33 maps the extracted part to the descendant element “address”. The following tags are the results when the XBRL document 41 is output.
<AddressOfCompanyLocation> 1-1 1-1 Shinjuku Shinjuku-ku, Tokyo </ AddressOfCompanyLocation>

このようにして、マッピング部３３は、財務報告書２１からＸＢＲＬ文書４１を作成する際、財務報告書２１の科目をＸＢＲＬ要素間の階層関係を含む表示ツリー２２１の要素に適切にマッピングすることができる。 In this way, when creating the XBRL document 41 from the financial report 21, the mapping unit 33 can appropriately map the subjects of the financial report 21 to the elements of the display tree 221 including the hierarchical relationship between the XBRL elements. it can.

出力部３４は、マッピング部３３によってマッピングされた結果を示すＸＢＲＬインスタンスを出力する。実施例では、このＸＢＲＬインスタンスがＸＢＲＬ文書４１となる。例えば、出力部３４は、マッピング結果であるＸＢＲＬ文書４１をモニタに出力しても良いし、記憶部２に記憶するようにしても良い。 The output unit 34 outputs an XBRL instance indicating the result mapped by the mapping unit 33. In the embodiment, this XBRL instance becomes the XBRL document 41. For example, the output unit 34 may output the XBRL document 41 that is the mapping result to a monitor or may store it in the storage unit 2.

ところで、要素宣言２２２に、ＸＢＲＬ要素のデータ型２２２ｅとして制約２２２ｆのない文字列型が指定される場合がある。かかる場合に、マッピング部３３は、該当する科目値から期待値だけを抽出することができないことがある。一例として、図２で示す財務報告書２１内の科目の科目名２１ａ「代表者の役職氏名」の科目値２１ｂを図３で示す表示ツリー２２１上のＸＢＲＬ要素「代表者の役職氏名」へマッピングする場合について説明する。マッピング部３３は、図３で示す表示ツリー２２１に基づいて、ＸＢＲＬ要素「代表者の役職氏名」が２つの子孫の要素「役職」と「氏名」を持つと判定する。そして、マッピング部３３は、科目名２１ａ「代表者の役職氏名」の科目値２１ｂ「代表取締役日本太郎」から、子孫要素「役職」のデータ型２２２ｅ「文字列型」に合致した部分を抽出する。ここでは、マッピング部３３は、子孫要素「役職」に応じた期待値として、科目値２１ｂから「代表取締役」を抽出したいところ、科目値２１ｂ全体の「代表取締役日本太郎」を抽出してしまう。すなわち、マッピング部３３は、科目値２１ｂから期待値だけを抽出することができない。 By the way, in the element declaration 222, a character string type without the constraint 222f may be specified as the data type 222e of the XBRL element. In such a case, the mapping unit 33 may not be able to extract only the expected value from the corresponding subject value. As an example, the subject value 21b of the subject name 21a of the subject in the financial report 21 shown in FIG. 2 is mapped to the XBRL element “representative title” on the display tree 221 shown in FIG. The case where it does is demonstrated. Based on the display tree 221 shown in FIG. 3, the mapping unit 33 determines that the XBRL element “representative title name” has two descendant elements “title” and “name”. Then, the mapping unit 33 extracts a portion that matches the data type 222e “character string type” of the descendant element “title” from the subject value 21b “representative director Nihon Taro” of the subject name 21a “representative title”. . Here, when the mapping unit 33 wants to extract “representative director” from the subject value 21b as the expected value according to the descendant element “post”, it extracts “representative director Nihon Taro” of the subject value 21b as a whole. That is, the mapping unit 33 cannot extract only the expected value from the subject value 21b.

そこで、文書変換装置１は、科目値２１ｂから抽出したい、ＸＢＲＬ要素に応じた期待値を、マッピング用の補助情報としてマッピング補助情報２３−１に予め記憶しておくようにすれば良い。図５Ａは、マッピング補助情報のデータ構造の一例を示す図である。図５Ａに示すように、マッピング補助情報２３−１には、表示名２３１ａと、要素名２３１ｂと、期待値の候補２３１ｃとが対応付けて記憶される。表示名２３１ａは、ＸＢＲＬ要素の表示名である。要素名２３１ｂは、ＸＢＲＬ要素の要素名である。期待値の候補２３１ｃは、ＸＢＲＬ要素に応じた期待値の候補を示す。一例として、表示名２３１ａが「役職」である場合、期待値の候補２３１ｃとして「代表取締役」、「部長」、「部長代理」、「課長」と記憶している。表示名２３１ａが「氏名」である場合、期待値の候補２３１ｃとして「日本太郎」、「日本次郎」と記憶している。 Therefore, the document conversion apparatus 1 may store the expected value corresponding to the XBRL element to be extracted from the subject value 21b in advance in the mapping auxiliary information 23-1 as auxiliary information for mapping. FIG. 5A is a diagram illustrating an example of a data structure of mapping auxiliary information. As illustrated in FIG. 5A, the mapping auxiliary information 23-1 stores a display name 231 a, an element name 231 b, and an expected value candidate 231 c in association with each other. The display name 231a is a display name of the XBRL element. The element name 231b is an element name of the XBRL element. An expected value candidate 231c indicates an expected value candidate corresponding to the XBRL element. As an example, when the display name 231a is “position”, “representative director”, “department manager”, “deputy manager”, and “section manager” are stored as expected value candidates 231c. When the display name 231a is "name", "Nippon Taro" and "Nihon Jiro" are stored as expected value candidates 231c.

このようなマッピング補助情報２３−１を利用して、マッピング部３３は、科目値２１ｂからＸＢＲＬ要素のデータ型２２２ｅ「文字列型」に合致した部分を抽出する。すなわち、マッピング部３３は、マッピング補助情報２３−１を利用して、ＸＢＲＬ要素に応じた期待値の候補２３１ｃと合致する値を科目値２１ｂから抽出する。ここでは、マッピング部３３は、科目値２１ｂ「代表取締役日本太郎」から、子孫要素「役職」のデータ型２２２ｅ「文字列型」に合致した部分として、マッピング補助情報２３−１を利用して「代表取締役」を抽出できる。また、マッピング部３３は、科目値２１ｂ「代表取締役日本太郎」から、子孫要素「氏名」のデータ型２２２ｅ「文字列型」に合致した部分として、マッピング補助情報２３−１を利用して「日本太郎」を抽出できる。 Using such mapping auxiliary information 23-1, the mapping unit 33 extracts a portion that matches the data type 222e “character string type” of the XBRL element from the subject value 21b. That is, the mapping unit 33 uses the mapping auxiliary information 23-1 to extract a value that matches the expected value candidate 231c corresponding to the XBRL element from the subject value 21b. In this case, the mapping unit 33 uses the mapping auxiliary information 23-1 as a part matching the data type 222e “character string type” of the descendant element “title” from the subject value 21b “representative director Nihon Taro”. "Representative director" can be extracted. Further, the mapping unit 33 uses the mapping auxiliary information 23-1 as the part matching the data type 222 e “character string type” of the descendant element “name” from the subject value 21 b “representative director Nihon Taro”. Taro "can be extracted.

また、マッピング部３３が、該当する科目値２１ｂから期待値だけを抽出することができない別の例を以下に示す。図２で示す財務報告書２１内の科目の科目名２１ａ「電話番号」の科目値２１ｂを図３で示す表示ツリー２２１上のＸＢＲＬ要素「電話番号」へマッピングする場合について説明する。マッピング部３３は、科目「電話番号」の科目値２１ｂ「１１１（２２２）３３３３（代表）」からデータ型２２２ｅ「文字列型」に合致した部分を抽出する。ここでは、マッピング部３３は、ＸＢＲＬ要素「電話番号」に応じた期待値として、科目値から「１１１（２２２）３３３３」を抽出したいところ、末尾文字の“（代表）”を含んだ「１１１（２２２）３３３３（代表）」を抽出してしまう。すなわち、マッピング部３３は、科目値から期待値だけを抽出することができない。ここで、マッピング部３３が、科目値２１ｂから「１１１（２２２）３３３３」を抽出するために、要素宣言２２２の制約２２２ｆに抽出したい書式を指定する手段が考えられる。しかしながら、電話番号には、市外局番と市内局番の区切り文字として、様々な区切り文字（例えば、“（”、“）”、“−”、“ ”）がある。したがって、ＸＢＲＬ要素「電話番号」のデータ型２２２ｅは、制約２２２ｆの指定がない単純な文字列型とせざるを得ない。 Another example in which the mapping unit 33 cannot extract only the expected value from the corresponding subject value 21b is shown below. The case where the subject value 21b of the subject name 21a “telephone number” of the subject in the financial report 21 shown in FIG. 2 is mapped to the XBRL element “telephone number” on the display tree 221 shown in FIG. The mapping unit 33 extracts a part that matches the data type 222e “character string type” from the subject value 21b “111 (222) 3333 (representative)” of the subject “phone number”. Here, the mapping unit 33 wants to extract “111 (222) 3333” from the subject value as an expected value according to the XBRL element “phone number”, and therefore, “111 ( 222) 3333 (representative) ". That is, the mapping unit 33 cannot extract only the expected value from the subject value. Here, in order for the mapping unit 33 to extract “111 (222) 3333” from the subject value 21b, a means for designating a format to be extracted in the constraint 222f of the element declaration 222 can be considered. However, the telephone number includes various delimiters (for example, “(”, “)”, “−”, “”) as delimiters between the area code and the city code. Therefore, the data type 222e of the XBRL element “telephone number” must be a simple character string type that is not specified by the constraint 222f.

そこで、文書変換装置１は、科目値２１ｂに入る可能性のある不要な文字列を、マッピング用の補助情報としてマッピング補助情報２３−２に予め記憶しておくようにすれば良い。言い換えれば、文書変換装置１は、科目値２１ｂから削除する可能性のある文字列を、マッピング補助情報２３−２に予め記憶しておく。図５Ｂは、マッピング補助情報のデータ構造の一例を示す図である。図５Ｂに示すように、マッピング補助情報２３−２には、削除項目２３２ａと、削除候補２３２ｂとが対応付けて記憶される。削除項目２３２ａは、科目値２１ｂから削除する可能性のある文字列の項目を示す。削除候補２３２ｂは、削除項目に応じた削除する可能性のある文字列を示す。一例として、削除項目２３２ａが「先頭文字列」である場合、削除候補２３２ｂとして「〒」、「Ｔｅｌ」と記憶している。削除項目２３２ａが「区切り文字」である場合、削除候補２３２ｂとして「．」、「／」と記憶している。削除項目２３２ａが「末尾文字列」である場合、削除候補２３２ｂとして「（代表）」、「（直通）」と記憶している。 Therefore, the document conversion apparatus 1 may store an unnecessary character string that may be included in the subject value 21b in the mapping auxiliary information 23-2 in advance as auxiliary information for mapping. In other words, the document conversion apparatus 1 stores a character string that may be deleted from the subject value 21b in the mapping auxiliary information 23-2 in advance. FIG. 5B is a diagram illustrating an example of a data structure of mapping auxiliary information. As illustrated in FIG. 5B, the mapping auxiliary information 23-2 stores a deletion item 232a and a deletion candidate 232b in association with each other. The deletion item 232a indicates a character string item that may be deleted from the subject value 21b. The deletion candidate 232b indicates a character string that may be deleted according to the deletion item. As an example, when the deletion item 232a is “first character string”, “〒” and “Tel” are stored as deletion candidates 232b. When the deletion item 232a is “delimiter”, “.” And “/” are stored as deletion candidates 232b. When the deletion item 232a is “tail character string”, “(representative)” and “(direct communication)” are stored as deletion candidates 232b.

このようなマッピング補助情報２３−２を利用して、マッピング部３３は、科目値２１ｂに削除項目２３２ａに応じた削除候補２３２ｂがあれば、科目値２１ｂからその部分を削除する。ここでは、マッピング部３３は、科目値２１ｂ「１１１（２２２）３３３３（代表）」に削除項目２３２ａ「末尾文字列」に応じた「（代表）」があるので、科目値２１ｂからその部分を削除する。この結果、マッピング部３３は、マッピング補助情報２３−２を利用して、科目値２１ｂから「１１１（２２２）３３３３」を抽出できる。 Using such mapping auxiliary information 23-2, if there is a deletion candidate 232b corresponding to the deletion item 232a in the subject value 21b, the mapping unit 33 deletes that portion from the subject value 21b. Here, the mapping unit 33 deletes the part from the subject value 21b because the subject value 21b “111 (222) 3333 (representative)” has “(representative)” corresponding to the deletion item 232a “tail character string”. To do. As a result, the mapping unit 33 can extract “111 (222) 3333” from the subject value 21b using the mapping auxiliary information 23-2.

［文書変換処理の主処理］
次に、実施例に係る文書変換処理の主処理の手順について、図６を参照して説明する。図６は、実施例に係る文書変換処理の主処理の手順を示すフローチャートである。なお、図６では、ワード形式の財務報告書２１に対して、ワード画面上でタグ付けしたい科目の値をＸＢＲＬ要素にマッピングし、ＸＢＲＬ文書４１に変換する場合を説明する。[Main process of document conversion process]
Next, the procedure of the main process of the document conversion process according to the embodiment will be described with reference to FIG. FIG. 6 is a flowchart illustrating the procedure of the main process of the document conversion process according to the embodiment. FIG. 6 illustrates a case where the value of a subject to be tagged on the word screen is mapped to the XBRL element and converted into the XBRL document 41 for the financial report 21 in the word format.

まず、制御部３は、ＸＢＲＬ文書４１への文書変換の要求がされたか否かを判定する（ステップＳ１０）。例えば、制御部３は、ワード画面上でタグ付けしたい科目が選択されたか否かを判定することで、ＸＢＲＬ文書４１への文書変換の要求がされたか否かを判定する。文書変換の要求がされなかったと判定した場合（ステップＳ１０；Ｎｏ）、制御部３は、文書変換の要求がされるまで、判定処理を繰り返す。 First, the control unit 3 determines whether or not a document conversion request for the XBRL document 41 has been made (step S10). For example, the control unit 3 determines whether or not a document conversion request to the XBRL document 41 is requested by determining whether or not a subject to be tagged is selected on the word screen. If it is determined that no document conversion request has been made (step S10; No), the control unit 3 repeats the determination process until a document conversion request is made.

一方、文書変換の要求がされたと判定した場合（ステップＳ１０；Ｙｅｓ）、入力部３０は、財務報告書２１を入力し（ステップＳ１１）、記憶部２に格納する。そして、入力部３０は、ＸＢＲＬ要素の定義体２２を入力し（ステップＳ１２）、記憶部２に格納する。 On the other hand, if it is determined that a document conversion request has been made (step S10; Yes), the input unit 30 inputs the financial report 21 (step S11) and stores it in the storage unit 2. Then, the input unit 30 inputs the definition body 22 of the XBRL element (step S12) and stores it in the storage unit 2.

続いて、レイアウト解析部３１は、記憶部２から財務報告書２１を読み出し、読み出した財務報告書２１より、選択された科目の名前（科目名）と値（科目値）を取得する（ステップＳ１３）。 Subsequently, the layout analysis unit 31 reads the financial report 21 from the storage unit 2, and acquires the name (subject name) and value (subject value) of the selected subject from the read financial report 21 (step S13). ).

そして、定義体解析部３２は、記憶部２からＸＢＲＬ要素の定義体２２を読み出し、読み出したＸＢＲＬ要素の定義体２２より、科目の名前に等しいか近い表示名か要素名を持つＸＢＲＬ要素の定義を取得する（ステップＳ１４）。例えば、定義体解析部３２は、表示ツリー２２１より、ＸＢＲＬ要素にマッピングする科目の名前と一致または類似する表示名か要素名を持つＸＢＲＬ要素を取得する。そして、定義体解析部３２は、要素宣言２２２より、取得したＸＢＲＬ要素の要素宣言を取得する。 Then, the definition body analysis unit 32 reads the definition body 22 of the XBRL element from the storage unit 2 and defines the XBRL element having a display name or an element name that is equal to or close to the name of the subject from the read definition body 22 of the XBRL element. Is acquired (step S14). For example, the definition body analysis unit 32 acquires an XBRL element having a display name or element name that matches or is similar to the name of the subject to be mapped to the XBRL element from the display tree 221. Then, the definition body analysis unit 32 acquires the element declaration of the acquired XBRL element from the element declaration 222.

続いて、マッピング部３３は、取得したＸＢＲＬ要素の定義である要素宣言を解析し、表示情報およびマッピング補助情報２３に応じて、取得した科目値を当該ＸＢＲＬ要素にマッピングする（ステップＳ１５）。なお、表示情報には、表示ツリー２２１および要素宣言２２２が含まれる。 Subsequently, the mapping unit 33 analyzes the element declaration which is the definition of the acquired XBRL element, and maps the acquired subject value to the XBRL element according to the display information and the mapping auxiliary information 23 (step S15). The display information includes a display tree 221 and an element declaration 222.

そして、出力部３４は、マッピング部３３によってマッピングされた結果、ＸＢＲＬインスタンス（ＸＢＲＬ文書４１）として出力する（ステップＳ１６）。すなわち、出力部３４は、ＸＢＲＬインスタンスを例えば記憶部２に出力する。これにより、文書変換装置１は、財務報告書２１をＸＢＲＬ文書４１に変換する文書変換処理を終了する。 Then, the output unit 34 outputs the result of mapping by the mapping unit 33 as an XBRL instance (XBRL document 41) (step S16). That is, the output unit 34 outputs the XBRL instance to, for example, the storage unit 2. Thereby, the document conversion apparatus 1 ends the document conversion process for converting the financial report 21 into the XBRL document 41.

［マッピング処理の手順］
次に、図６に示すＳ１５におけるマッピング処理の手順について、図７〜図１０を参照して説明する。図７〜図１０は、実施例に係るマッピング処理の手順を示すフローチャートである。[Mapping procedure]
Next, the procedure of the mapping process in S15 shown in FIG. 6 will be described with reference to FIGS. 7 to 10 are flowcharts illustrating the procedure of the mapping process according to the embodiment.

まず、マッピング部３３は、科目値に、マッピング補助情報２３に定義された任意の先頭文字列または末尾文字列を含むか否かを判定する（ステップＳ２１）。これは、科目値から不要な文字列を削除するためである。例えば、科目値が「１１１（２２２）３３３３（代表）」である場合に、末尾文字列である「（代表）」を削除する場合である。 First, the mapping unit 33 determines whether or not the subject value includes any leading character string or trailing character string defined in the mapping auxiliary information 23 (step S21). This is because an unnecessary character string is deleted from the subject value. For example, when the subject value is “111 (222) 3333 (representative)”, the last character string “(representative)” is deleted.

そして、マッピング部３３は、科目値に任意の先頭文字列または末尾文字列を含むと判定した場合（ステップＳ２１；Ｙｅｓ）、科目値から、マッピング補助情報２３に定義された任意の先頭文字列または末尾文字列を削除する（ステップＳ２２）。すなわち、科目値には、含むと判定された文字列を削除した後の文字列が設定される。そして、マッピング部３３は、ステップＳ２３に移行する。一方、マッピング部３３は、科目値に任意の先頭文字列および末尾文字列を含まないと判定した場合（ステップＳ２１；Ｎｏ）、ステップＳ２３に移行する。 When the mapping unit 33 determines that the subject value includes an arbitrary first character string or an end character string (step S21; Yes), the mapping value 33 determines from the subject value an arbitrary first character string defined in the mapping auxiliary information 23 or The last character string is deleted (step S22). That is, the character string after the character string determined to be included is set as the subject value. Then, the mapping unit 33 proceeds to step S23. On the other hand, when the mapping unit 33 determines that the subject value does not include any initial character string and end character string (step S21; No), the mapping unit 33 proceeds to step S23.

ステップＳ２３では、マッピング部３３は、科目の名前に対応したＸＢＲＬ要素の要素宣言を解析した結果、当該ＸＢＲＬ要素が抽象要素か否かを判定する（ステップＳ２３）。当該ＸＢＲＬ要素が抽象要素でないと判定した場合（ステップＳ２３；Ｎｏ）、マッピング部３３は、科目値を当該ＸＢＲＬ要素に割り当てる（ステップＳ２４）。そして、マッピング部３３は、マッピング処理を終了する。 In step S23, the mapping unit 33 determines whether the XBRL element is an abstract element as a result of analyzing the element declaration of the XBRL element corresponding to the subject name (step S23). When it determines with the said XBRL element not being an abstract element (step S23; No), the mapping part 33 allocates a subject value to the said XBRL element (step S24). Then, the mapping unit 33 ends the mapping process.

一方、当該ＸＢＲＬ要素が抽象要素であると判定した場合（ステップＳ２３；Ｙｅｓ）、マッピング部３３は、表示ツリー２２１をトラバースし、当該ＸＢＲＬ要素における末端の子孫要素の表示名および要素名をスタックする（ステップＳ２５）。そして、マッピング部３３は、当該ＸＢＲＬ要素が子孫を持つか否かを判定する（ステップＳ２６）。ここでは、マッピング部３３は、当該ＸＢＲＬ要素におけるスタックがされたか否かによって、当該ＸＢＲＬ要素が子孫を持つか否かを判定する。そして、当該ＸＢＲＬ要素が子孫を持つと判定した場合（ステップＳ２６；Ｙｅｓ）、マッピング部３３は、子孫要素へのマッピング処理を行うべく、ステップＳ３１に移行する。 On the other hand, when it is determined that the XBRL element is an abstract element (step S23; Yes), the mapping unit 33 traverses the display tree 221 and stacks the display name and the element name of the terminal descendant element in the XBRL element. (Step S25). Then, the mapping unit 33 determines whether or not the XBRL element has a descendant (step S26). Here, the mapping unit 33 determines whether or not the XBRL element has a descendant depending on whether or not the XBRL element is stacked. If it is determined that the XBRL element has a descendant (step S26; Yes), the mapping unit 33 proceeds to step S31 to perform the mapping process to the descendant element.

一方、当該ＸＢＲＬ要素が子孫を持たないと判定した場合（ステップＳ２６；Ｎｏ）、マッピング部３３は、当該ＸＢＲＬ要素が抽象要素でありながら子孫を持たないので、マッピングエラーを文書変換の要求元へ通知する（ステップＳ２７）。そして、マッピング部３３は、マッピング処理を終了する。 On the other hand, if it is determined that the XBRL element does not have a descendant (step S26; No), the mapping unit 33 does not have a descendant although the XBRL element is an abstract element, so that a mapping error is sent to the document conversion request source. Notification is made (step S27). Then, the mapping unit 33 ends the mapping process.

ステップＳ３１では、マッピング部３３は、スタックから子孫要素の要素名を取得し（ステップＳ３１）、取得できたか否かを判定する（ステップＳ３２）。ここで、マッピング部３３は、取得できなかったと判定した場合（ステップＳ３２；Ｎｏ）、さらに、科目値が空文字列であるか否かを判定する（ステップＳ３３）。科目値が空文字列であると判定した場合（ステップＳ３３；Ｙｅｓ）、マッピング部３３は、全てのマッピングを終了したと判断し、マッピング処理を終了する。 In step S31, the mapping unit 33 acquires the element name of the descendant element from the stack (step S31), and determines whether it has been acquired (step S32). Here, when it is determined that the mapping unit 33 could not be acquired (step S32; No), the mapping unit 33 further determines whether or not the subject value is an empty character string (step S33). If it is determined that the subject value is an empty character string (step S33; Yes), the mapping unit 33 determines that all mapping has been completed and ends the mapping process.

一方、科目値が空文字列でないと判定した場合（ステップＳ３３；Ｎｏ）、マッピング部３３は、科目値にまだデータがありながら子孫要素を取得できなかったので、マッピングエラーを文書変換の要求元へ通知する（ステップＳ３４）。そして、マッピング部３３は、マッピング処理を終了する。 On the other hand, if it is determined that the subject value is not an empty character string (step S33; No), the mapping unit 33 cannot acquire a descendant element while the subject value still has data, and therefore a mapping error is sent to the request source of document conversion. Notification is made (step S34). Then, the mapping unit 33 ends the mapping process.

一方、マッピング部３３は、スタックから子孫要素の要素名を取得できたと判定した場合（ステップＳ３２；Ｙｅｓ）、ＸＢＲＬ要素の定義体２２より、取得できた子孫要素の定義である要素宣言を取得する（ステップＳ３５）。マッピング部３３は、取得した要素宣言を解析した結果、処理中の子孫要素が抽象要素か否かを判定する（ステップＳ３６）。処理中の子孫要素が抽象要素であると判定した場合（ステップＳ３６；Ｙｅｓ）、マッピング部３３は、処理中の子孫要素の下位の階層にある子孫要素をスタックすべく、ステップＳ２５に移行する。 On the other hand, when the mapping unit 33 determines that the element name of the descendant element has been acquired from the stack (step S32; Yes), the mapping unit 33 acquires an element declaration that is the definition of the acquired descendant element from the XBRL element definition body 22. (Step S35). The mapping unit 33 determines whether the descendant element being processed is an abstract element as a result of analyzing the acquired element declaration (step S36). If it is determined that the descendant element being processed is an abstract element (step S36; Yes), the mapping unit 33 proceeds to step S25 in order to stack the descendant elements in the lower hierarchy of the descendant element being processed.

一方、処理中の子孫要素が抽象要素でないと判定した場合（ステップＳ３６；Ｎｏ）、マッピング部３３は、取得した要素宣言より、データ型２２２ｅと制約２２２ｆとｎｉｌ値許可フラグ２２２ｄを取得する（ステップＳ３７）。そして、マッピング部３３は、科目値が空文字列でないか否かを判定する（ステップＳ３８）。科目値が空文字列であると判定した場合（ステップＳ３８；Ｎｏ）、マッピング部３３は、処理中の子孫要素に科目値をマッピングできないので、マッピングエラーを文書変換の要求元へ通知する（ステップＳ３９）。そして、マッピング部３３は、マッピング処理を終了する。 On the other hand, if it is determined that the descendant element being processed is not an abstract element (step S36; No), the mapping unit 33 acquires the data type 222e, the constraint 222f, and the nil value permission flag 222d from the acquired element declaration (step S36). S37). Then, the mapping unit 33 determines whether or not the subject value is not an empty character string (step S38). If it is determined that the subject value is an empty string (step S38; No), the mapping unit 33 notifies the mapping error to the document conversion request source because the subject value cannot be mapped to the descendant element being processed (step S39). ). Then, the mapping unit 33 ends the mapping process.

一方、科目値が空文字列でないと判定した場合（ステップＳ３８；Ｙｅｓ）、マッピング部３３は、さらにマッピング処理を進めるべく、ステップＳ４１に移行する。ステップＳ４１では、マッピング部３３は、処理中の子孫要素の要素宣言のデータ型２２２ｅが文字列型であるか否かを判定する（ステップＳ４１）。 On the other hand, when it is determined that the subject value is not an empty character string (step S38; Yes), the mapping unit 33 proceeds to step S41 in order to further perform the mapping process. In step S41, the mapping unit 33 determines whether the data type 222e of the element declaration of the descendant element being processed is a character string type (step S41).

データ型２２２ｅが文字列型であると判定した場合（ステップＳ４１；Ｙｅｓ）、マッピング部３３は、科目値に、マッピング補助情報２３に定義された任意の文字列に合致する文字列を含むか否かを判定する（ステップＳ４２）。ここで言う任意の文字列とは、ＸＢＲＬ要素に応じて、科目値から抽出したい期待値の候補を意味する。これは、科目値からＸＢＲＬ要素に応じた期待値を抽出するためである。例えば、科目値が「代表取締役日本太郎」である場合に、ＸＢＲＬ要素「役職」に応じて「代表取締役」、ＸＢＲＬ要素「氏名」に応じて「日本太郎」を抽出する場合である。 If it is determined that the data type 222e is a character string type (step S41; Yes), the mapping unit 33 includes a character string that matches an arbitrary character string defined in the mapping auxiliary information 23 in the subject value. Is determined (step S42). The arbitrary character string referred to here means a candidate for an expected value to be extracted from the subject value according to the XBRL element. This is because an expected value corresponding to the XBRL element is extracted from the subject value. For example, when the subject value is “Representative Director Nihon Taro”, “Representative Director” is extracted according to the XBRL element “Position” and “Nippon Taro” is extracted according to the XBRL element “Name”.

科目値に、マッピング補助情報２３に定義された任意の文字列に合致する文字列を含むと判定した場合（ステップＳ４２；Ｙｅｓ）、マッピング部３３は、科目値より、任意の文字列に合致する文字列を抽出する（ステップＳ４３）。そして、マッピング部３３は、さらにマッピング処理を進めるべく、ステップＳ５１に移行する。 When it is determined that the subject value includes a character string that matches the arbitrary character string defined in the mapping auxiliary information 23 (step S42; Yes), the mapping unit 33 matches the arbitrary character string from the subject value. A character string is extracted (step S43). Then, the mapping unit 33 proceeds to step S51 in order to proceed with the mapping process.

一方、科目値に、マッピング補助情報２３に定義された文字列を含まないと判定した場合（ステップＳ４２；Ｎｏ）、マッピング部３３は、さらに、次の判定処理を行う。すなわち、マッピング部３３は、科目値に、マッピング補助情報２３に定義された任意の区切り文字または空白文字を含むか否かを判定する（ステップＳ４４）。これは、科目値の中で区切りとなる文字を探索するためである。 On the other hand, when it is determined that the subject value does not include the character string defined in the mapping auxiliary information 23 (step S42; No), the mapping unit 33 further performs the following determination process. That is, the mapping unit 33 determines whether or not the subject value includes any delimiter or blank character defined in the mapping auxiliary information 23 (step S44). This is for searching for a delimiter character in the subject value.

科目値に、マッピング補助情報２３に定義された任意の区切り文字または空白文字を含むと判定した場合（ステップＳ４４；Ｙｅｓ）、マッピング部３３は、区切り文字もしくは空白文字で科目値を分割し、先頭の文字列を抽出する（ステップＳ４５）。一例として、科目値が「ＡＡＡＢＢＢ」の場合、マッピング部３３は、空白文字で科目値を分割し、空白文字の前までの文字列「ＡＡＡ」を抽出する。そして、マッピング部３３は、さらにマッピング処理を進めるべく、ステップＳ５１に移行する。 If it is determined that the subject value includes any delimiter or blank character defined in the mapping auxiliary information 23 (step S44; Yes), the mapping unit 33 divides the subject value by the delimiter or blank character, Is extracted (step S45). As an example, when the subject value is “AAA BBB”, the mapping unit 33 divides the subject value by a blank character and extracts the character string “AAA” up to the front of the blank character. Then, the mapping unit 33 proceeds to step S51 in order to proceed with the mapping process.

一方、科目値に、マッピング補助情報２３に定義された区切り文字または空白文字を含まないと判定した場合（ステップＳ４４；Ｎｏ）、マッピング部３３は、科目値を抽出する（ステップＳ４６）。そして、マッピング部３３は、さらにマッピング処理を進めるべく、ステップＳ５１に移行する。 On the other hand, when it is determined that the subject value does not include the delimiter or blank character defined in the mapping auxiliary information 23 (step S44; No), the mapping unit 33 extracts the subject value (step S46). Then, the mapping unit 33 proceeds to step S51 in order to proceed with the mapping process.

ステップＳ４１では、データ型２２２ｅが文字列型でないと判定した場合（ステップＳ４１；Ｎｏ）、マッピング部３３は、科目値より、データ型に合致する文字列を抽出する（ステップＳ４７）。そして、マッピング部３３は、さらにマッピング処理を進めるべく、ステップＳ５１に移行する。 If it is determined in step S41 that the data type 222e is not a character string type (step S41; No), the mapping unit 33 extracts a character string that matches the data type from the subject value (step S47). Then, the mapping unit 33 proceeds to step S51 in order to proceed with the mapping process.

ステップＳ５１では、マッピング部３３は、科目値より文字列を抽出できたか否かを判定する（ステップＳ５１）。科目値より文字列を抽出できたと判定した場合（ステップＳ５１；Ｙｅｓ）、マッピング部３３は、抽出した文字列を処理中の子孫要素に割り当てる（ステップＳ５２）。そして、マッピング部３３は、ステップＳ５６に移行する。 In step S51, the mapping unit 33 determines whether or not a character string has been extracted from the subject value (step S51). If it is determined that the character string can be extracted from the subject value (step S51; Yes), the mapping unit 33 assigns the extracted character string to the descendant element being processed (step S52). Then, the mapping unit 33 proceeds to step S56.

一方、科目値より文字列を抽出できなかったと判定した場合（ステップＳ５１；Ｎｏ）、マッピング部３３は、処理中の子孫要素がｎｉｌ値を許可する要素か否かを判定する（ステップＳ５３）。ｎｉｌ値を許可する要素か否かは、要素宣言より取得されたｎｉｌ値許可フラグ２２２ｄを用いて判定される。 On the other hand, when it is determined that the character string cannot be extracted from the subject value (step S51; No), the mapping unit 33 determines whether the descendant element being processed is an element that permits the nil value (step S53). Whether or not the element allows the nil value is determined using the nil value permission flag 222d acquired from the element declaration.

ｎｉｌ値を許可する要素であると判定した場合（ステップＳ５３；Ｙｅｓ）、マッピング部３３は、処理中の子孫要素に何も割り当てず、ｎｉｌ属性に「ｔｒｕｅ」を設定する（ステップＳ５４）。そして、マッピング部３３は、スタックされた次の子孫要素を処理すべく、ステップＳ３１に移行する。 If it is determined that the element is an element that allows the nil value (step S53; Yes), the mapping unit 33 assigns nothing to the child element being processed and sets “true” in the nil attribute (step S54). Then, the mapping unit 33 proceeds to step S31 in order to process the next descendant element stacked.

一方、ｎｉｌ値を許可する要素でないと判定した場合（ステップＳ５３；Ｎｏ）、マッピング部３３は、科目値よりデータを抽出できなかったので、マッピングエラーを文書変換の要求元へ通知する（ステップＳ５５）。そして、マッピング部３３は、マッピング処理を終了する。 On the other hand, if it is determined that the element is not an element that permits the nil value (step S53; No), the mapping unit 33 notifies the mapping conversion requester of the mapping error because data could not be extracted from the subject value (step S55). ). Then, the mapping unit 33 ends the mapping process.

ステップＳ５６では、マッピング部３３は、科目値について、抽出した文字列の前に文字列が存在するか否かを判定する（ステップＳ５６）。抽出した文字列の前に文字列が存在すると判定した場合（ステップＳ５６；Ｙｅｓ）、マッピング部３３は、科目値について、抽出した文字列の前の文字列を削除する（ステップＳ５７）。一例として、科目値が「第１００期（自平成２２年・・・」であり、抽出した文字列が「１００」である場合について説明する。科目値について、抽出した文字列の前の文字列は「第」である。すると、マッピング部３３によって、科目値は、「第」と抽出した文字列「１００」を除外した「期（自平成２２年・・・」となる。そして、マッピング部３３は、ステップＳ５８に移行する。 In step S56, the mapping unit 33 determines whether a character string exists before the extracted character string for the subject value (step S56). When it is determined that a character string exists before the extracted character string (step S56; Yes), the mapping unit 33 deletes the character string before the extracted character string for the subject value (step S57). As an example, a case will be described in which the subject value is “100th period (own 2010 ...)” and the extracted character string is “100.” For the subject value, the character string before the extracted character string Then, the subject value is “period (excluding 2010)” excluding the character string “100” extracted as “first” by the mapping unit 33. Then, the mapping unit In step 33, the process proceeds to step S58.

一方、抽出した文字列の前に文字列が存在しないと判定した場合（ステップＳ５６；Ｎｏ）、マッピング部３３は、ステップＳ５８に移行する。ステップＳ５８では、マッピング部３３は、科目値を更新する（ステップＳ５８）。すなわち、次に処理される子孫要素に用いられる科目値は、抽出した文字列の前の文字列があった場合、抽出した文字列と抽出した文字列の前の文字列を除外した内容に更新される。また、次に処理される子孫要素に用いられる科目値は、抽出した文字列の前の文字列がなかった場合、抽出した文字列を除外した内容に更新される。そして、マッピング部３３は、スタックされた次の子孫要素を処理すべく、ステップＳ３１に移行する。 On the other hand, if it is determined that there is no character string before the extracted character string (step S56; No), the mapping unit 33 proceeds to step S58. In step S58, the mapping unit 33 updates the subject value (step S58). In other words, if there is a character string before the extracted character string, the subject value used for the descendant element to be processed next is updated to the content excluding the extracted character string and the character string before the extracted character string. Is done. Further, the subject value used for the descendant element to be processed next is updated to the content excluding the extracted character string when there is no character string before the extracted character string. Then, the mapping unit 33 proceeds to step S31 in order to process the next descendant element stacked.

［実施例の効果］
上記実施例によれば、文書変換装置１は、ＸＢＲＬ文書４１の要素間の階層関係を含む表示ツリー２２１に基づいて、財務報告書２１に含まれる科目のうちＸＢＲＬ文書４１において子孫を有する要素に対応する科目を検出する。そして、文書変換装置１は、検出した科目の値を分割する。そして、文書変換装置１は、分割された値を、検出した科目に対応するＸＢＲＬ文書４１における要素の子要素にマッピングする。さらに、文書変換装置１は、マッピングされた値と子要素とを用いて、財務報告書２１をＸＢＲＬ文書４１に変換する。かかる構成によれば、文書変換装置１は、財務報告書２１からＸＢＲＬ文書４１を作成する際、財務報告書２１の科目を表示ツリー２２１の要素に適切にマッピングすることができる。[Effect of Example]
According to the above-described embodiment, the document conversion apparatus 1 uses the display tree 221 including the hierarchical relationship between the elements of the XBRL document 41 as the elements having descendants in the XBRL document 41 among the subjects included in the financial report 21. Find the corresponding subject. Then, the document conversion apparatus 1 divides the detected subject value. Then, the document conversion apparatus 1 maps the divided values to the child elements of the elements in the XBRL document 41 corresponding to the detected subject. Further, the document conversion apparatus 1 converts the financial report 21 into the XBRL document 41 using the mapped value and the child element. According to this configuration, the document conversion apparatus 1 can appropriately map the subjects of the financial report 21 to the elements of the display tree 221 when creating the XBRL document 41 from the financial report 21.

また、上記実施例によれば、文書変換装置１は、科目の科目値を、表示ツリー２２１に含まれる要素の子孫関係となる要素のデータ型に応じて分割する。かかる構成によれば、文書変換装置１は、例えば要素のデータ型が日付型であったり整数型であったりする場合、要素のデータ型に合った文字列を科目値から抽出することが可能となる。この結果、文書変換装置１は、財務報告書２１の科目の科目値を表示ツリー２２１の要素に適切にマッピングすることができる。 Further, according to the above embodiment, the document conversion apparatus 1 divides the subject value of the subject according to the data type of the element that is the descendant relationship of the element included in the display tree 221. According to such a configuration, the document conversion apparatus 1 can extract a character string that matches the data type of the element from the subject value, for example, when the data type of the element is a date type or an integer type. Become. As a result, the document conversion apparatus 1 can appropriately map the subject values of the subjects of the financial report 21 to the elements of the display tree 221.

また、上記実施例によれば、文書変換装置１は、科目の科目値に、要素に対応付ける内容として不要な文字列がある場合には、科目の科目値から不要な文字列を削除する。かかる構成によれば、文書変換装置１は、要素へマッピングする内容の精度を高めることができる。 Further, according to the above embodiment, the document conversion apparatus 1 deletes an unnecessary character string from the subject value of the subject when the subject value of the subject includes an unnecessary character string as content to be associated with the element. According to such a configuration, the document conversion apparatus 1 can increase the accuracy of the contents mapped to the elements.

また、上記実施例によれば、文書変換装置１は、表示ツリー２２１に含まれる要素の子孫関係となる要素のデータ型が文字列型である場合、科目の科目値に予め定義された任意の文字列と一致する文字列が含まれていれば、科目値から一致する文字列を抽出する。そして、文書変換装置１は、抽出した文字列を分割された内容とする。かかる構成によれば、文書変換装置１は、要素のデータ型が文字列型である場合に、科目値から期待する文字列を抽出することができる。すなわち、文書変換装置１は、要素へマッピングする内容の精度をさらに高めることができる。 Further, according to the above-described embodiment, the document conversion apparatus 1 is configured so that when the data type of an element that is a descendant relationship of an element included in the display tree 221 is a character string type, the document conversion apparatus 1 is arbitrarily defined as a subject value of a subject If a character string that matches the character string is included, the character string that matches the subject value is extracted. Then, the document conversion apparatus 1 sets the extracted character string as divided contents. According to this configuration, the document conversion apparatus 1 can extract an expected character string from the subject value when the data type of the element is a character string type. In other words, the document conversion apparatus 1 can further increase the accuracy of the contents mapped to the elements.

また、上記実施例によれば、文書変換装置１は、科目の科目値に予め定義された任意の文字列と一致する文字列が含まれていなければ、科目の科目値に区切り文字または空白文字がある場合には、以下の処理を行う。すなわち、文書変換装置１は、科目の科目値の先頭から区切り文字または空白文字の前までの文字列を分割された内容とする。かかる構成によれば、文書変換装置１は、要素のデータ型が文字列型である場合であって、区切り文字または空白文字を用いて文字列を分割できるので、科目値を子孫関係の要素に適切にマッピングをすることが可能となる。 Further, according to the above embodiment, the document conversion apparatus 1 determines that the subject value of the subject does not include a character string that matches an arbitrary character string defined in advance, the delimiter or the blank character is included in the subject value of the subject. If there is, the following processing is performed. In other words, the document conversion apparatus 1 sets the content of the character string from the beginning of the subject value of the subject to the character before the delimiter or blank character. According to such a configuration, the document conversion apparatus 1 is a case where the data type of the element is a character string type, and the character string can be divided using a delimiter or a blank character. It becomes possible to map appropriately.

［プログラム等］
なお、実施例によれば、文書変換装置１は、財務報告書２１に関し、財務報告書２１に含まれる科目の値を、ＸＢＲＬ要素にマッピングする場合について説明した。しかしながら、文書変換装置１は、財務報告書２１のみならず財務会計報告全般に関する報告書であってテキスト文書の報告書に含まれる科目の値を、ＸＢＲＬ要素にマッピングする場合であっても良い。これにより、文書変換装置１は、テキスト文書（ワープロ文書を含む）からＸＢＲＬ文書への変換を汎用的に行うことができる。[Programs]
In addition, according to the Example, the document conversion apparatus 1 demonstrated the case where the value of the subject included in the financial report 21 was mapped to the XBRL element regarding the financial report 21. However, the document conversion apparatus 1 may be a case where not only the financial report 21 but also a general financial accounting report and a subject value included in the text document report are mapped to the XBRL element. Thereby, the document conversion apparatus 1 can perform conversion from a text document (including a word processor document) to an XBRL document for general use.

また、文書変換装置１は、既知のパーソナルコンピュータ、ワークステーション等の情報処理装置に、上記した制御部３と、記憶部２等の各機能を搭載することによって実現することができる。 Further, the document conversion apparatus 1 can be realized by mounting each function such as the control unit 3 and the storage unit 2 in an information processing apparatus such as a known personal computer or workstation.

また、図示した文書変換装置１の各構成要素は、必ずしも物理的に図示の如く構成されていることを要しない。すなわち、文書変換装置１の分散・統合の具体的態様は図示のものに限られず、その全部または一部を、各種の負荷や使用状況等に応じて、任意の単位で機能的または物理的に分散・統合して構成することができる。例えば、レイアウト解析部３１と定義体解析部３２とを１個の部として統合しても良い。一方、マッピング部３３を、財務報告書２１の中で科目値を分割する科目を検出する検出部と、検出した科目の科目値を分割し、分割した科目値の各値を各ＸＢＲＬ要素にマッピングするマッピング部とに分散しても良い。また、財務報告書２１やＸＢＲＬ要素の定義体２２等の記憶部２を文書変換装置１の外部装置としてネットワーク経由で接続するようにしても良い。 Further, each component of the illustrated document conversion apparatus 1 does not necessarily need to be physically configured as illustrated. That is, the specific mode of distribution / integration of the document conversion apparatus 1 is not limited to the illustrated one, and all or a part thereof is functionally or physically in arbitrary units according to various loads, usage conditions, and the like. It can be configured to be distributed and integrated. For example, the layout analysis unit 31 and the definition body analysis unit 32 may be integrated as one unit. On the other hand, the mapping unit 33 detects a subject that divides the subject value in the financial report 21, and divides the subject value of the detected subject, and maps each value of the divided subject value to each XBRL element May be distributed to the mapping unit. Further, the storage unit 2 such as the financial report 21 and the XBRL element definition body 22 may be connected as an external device of the document conversion device 1 via a network.

また、上記実施例で説明した各種の処理は、あらかじめ用意されたプログラムをパーソナルコンピュータやワークステーション等のコンピュータで実行することによって実現することができる。そこで、以下では、図１に示した文書変換装置１と同様の機能を実現する文書変換プログラムを実行するコンピュータの一例を説明する。図１１は、文書変換プログラムを実行するコンピュータの一例を示す図である。 The various processes described in the above embodiments can be realized by executing a program prepared in advance on a computer such as a personal computer or a workstation. In the following, an example of a computer that executes a document conversion program that realizes the same function as the document conversion apparatus 1 shown in FIG. 1 will be described. FIG. 11 is a diagram illustrating an example of a computer that executes a document conversion program.

図１１に示すように、コンピュータ２００は、各種演算処理を実行するＣＰＵ２０３と、ユーザからのデータの入力を受け付ける入力装置２１５と、表示装置２０９を制御する表示制御部２０７を有する。また、コンピュータ２００は、記憶媒体からプログラム等を読取るドライブ装置２１３と、ネットワークを介して他のコンピュータとの間でデータの授受を行う通信制御部２１７とを有する。また、コンピュータ２００は、各種情報を一時記憶するメモリ２０１と、ＨＤＤ２０５を有する。そして、メモリ２０１、ＣＰＵ２０３、ＨＤＤ２０５、表示制御部２０７、ドライブ装置２１３、入力装置２１５、通信制御部２１７は、バス２１９で接続されている。 As illustrated in FIG. 11, the computer 200 includes a CPU 203 that executes various arithmetic processes, an input device 215 that receives input of data from the user, and a display control unit 207 that controls the display device 209. The computer 200 also includes a drive device 213 that reads a program or the like from a storage medium, and a communication control unit 217 that exchanges data with another computer via a network. The computer 200 also includes a memory 201 that temporarily stores various types of information and an HDD 205. The memory 201, CPU 203, HDD 205, display control unit 207, drive device 213, input device 215, and communication control unit 217 are connected by a bus 219.

ドライブ装置２１３は、例えばリムーバブルディスク２１１用の装置である。ＨＤＤ２０５は、文書変換プログラム２０５ａおよび文書変換関連情報２０５ｂを記憶する。 The drive device 213 is a device for the removable disk 211, for example. The HDD 205 stores a document conversion program 205a and document conversion related information 205b.

ＣＰＵ２０３は、文書変換プログラム２０５ａを読み出して、メモリ２０１に展開する。文書変換プログラム２０５ａは、文書変換プロセス２０１ａとして機能する。 The CPU 203 reads the document conversion program 205a and develops it in the memory 201. The document conversion program 205a functions as a document conversion process 201a.

例えば、文書変換プロセス２０１ａは、制御部３の各機能部に対応する。文書変換関連情報２０５ｂは、財務報告書２１、ＸＢＲＬ要素の定義体２２およびマッピング補助情報２３に対応する。 For example, the document conversion process 201 a corresponds to each functional unit of the control unit 3. The document conversion related information 205 b corresponds to the financial report 21, the XBRL element definition body 22, and the mapping auxiliary information 23.

なお、文書変換プログラム２０５ａについては、必ずしも最初からＨＤＤ２０５に記憶させておかなくても良い。例えば、コンピュータ２００に挿入されるフレキシブルディスク（ＦＤ）、ＣＤ−ＲＯＭ、ＤＶＤディスク、光磁気ディスク、ＩＣカード等の「可搬用の物理媒体」に当該プログラムを記憶させておく。そして、コンピュータ２００がこれらから文書変換プログラム２０５ａを読み出して実行するようにしても良い。 Note that the document conversion program 205a is not necessarily stored in the HDD 205 from the beginning. For example, the program is stored in a “portable physical medium” such as a flexible disk (FD), a CD-ROM, a DVD disk, a magneto-optical disk, or an IC card inserted into the computer 200. Then, the computer 200 may read out and execute the document conversion program 205a from these.

１文書変換装置
２記憶部
３制御部
２１財務報告書
２２ＸＢＲＬ要素の定義体
２２１表示ツリー
２２２要素宣言
２３マッピング補助情報
３１レイアウト解析部
３２定義体解析部
３３マッピング部
３４出力部
４１ＸＢＲＬ文書DESCRIPTION OF SYMBOLS 1 Document converter 2 Memory | storage part 3 Control part 21 Financial report 22 XBRL element definition body 221 Display tree 222 Element declaration 23 Mapping auxiliary information 31 Layout analysis part 32 Definition body analysis part 33 Mapping part 34 Output part 41 XBRL document

Claims

In a program for converting from a first document to a second document,
Detecting an item corresponding to an item having a child in the second document among items included in the first document based on a hierarchical document including a hierarchical relationship between items of the second document; Divide the content of the detected item that is the detected item,
Associating the divided contents with child items of items in the second document corresponding to the detection items;
A program that causes a computer to execute a process of converting the first document into the second document using the associated contents and the child items.

The division process causes the computer to execute a process of dividing the content of the detection item according to a data type of a child item of the item in the second document corresponding to the detection item. The program according to 1.

In the case where the content of the detection item includes an unnecessary character string as the content to be associated with the child item, the dividing process causes the computer to execute a process of deleting the unnecessary character string from the content of the detection item. The program according to claim 1 or 2, wherein the program is characterized by the following.

When the data type of the child item of the item in the second document corresponding to the detection item is a character string type, the dividing process is associated with the child item defined in advance in the content of the detection item. If a character string that matches a specific character string is included, a character string that matches is extracted from the content of the detection item, and the computer is caused to execute a process of making the extracted character string into divided content The program according to claim 2.

If the content of the detection item does not include a predefined character string that matches a specific character string associated with the child item, the process of dividing the content of the detection item is a delimiter or a blank character. 5. The program according to claim 4, wherein if there is, the computer is caused to execute a process of dividing the character string from the beginning of the content of the detection item to before the delimiter or the blank character. .

In a document conversion apparatus for converting a first document into a second document,
Detecting an item corresponding to an item having a child in the second document among items included in the first document based on a hierarchical document including a hierarchical relationship between items of the second document; A dividing unit that divides the content of the detected item that is the detected item;
An associating unit that associates the content divided by the dividing unit with a child item of an item in the second document corresponding to the detection item;
A document conversion apparatus comprising: a conversion unit that converts the first document into the second document using the contents associated with the association unit and the child items.

A computer for converting from a first document to a second document;
Detecting an item corresponding to an item having a child in the second document among items included in the first document based on a hierarchical document including a hierarchical relationship between items of the second document; Divide the content of the detected item that is the detected item,
Associating the divided contents with child items of items in the second document corresponding to the detection items;
A document conversion method comprising: executing each process of converting the first document into the second document using the associated contents and the child items.