JP5030985B2

JP5030985B2 - Character string processing apparatus and image forming apparatus

Info

Publication number: JP5030985B2
Application number: JP2009061613A
Authority: JP
Inventors: 透松田
Original assignee: Ricoh Co Ltd
Current assignee: Ricoh Co Ltd
Priority date: 2003-01-24
Filing date: 2009-03-13
Publication date: 2012-09-19
Anticipated expiration: 2024-01-09
Also published as: JP2009146439A; CN101079023A; CN101079023B

Description

本発明は、文字列処理装置および画像形成装置に係り、特に一の符号化方式（文字コード・セット）で符号化された文字列を他の符号化方式に変換する文字列処理装置および画像形成装置に関する。 The present invention relates to a character string processing equipment Contact and image forming apparatus, a character string processing equipment for converting the encoded character string, especially in one of the encoding (character code set) to other encodings you and relates to an image forming apparatus.

符号化方式は、文字や記号など（以下、単に文字という）をコンピュータ上で扱うため、文字に割り当てられている文字コードに変換するものである。この符号化方式は、言語やコンピュータシステムにより異なるのが普通である。例えばインターネットでは、世界の主要言語に対応するためＵｎｉｃｏｄｅを利用したＵＴＦ−８，ＵＴＦ−１６などが標準的な符号化方式として利用されている。また、文字列処理装置および画像形成装置などでは、シフトＪＩＳやＬａｔｉｎ１等が符号化方式として利用されている。 The encoding method converts characters and symbols (hereinafter simply referred to as characters) into character codes assigned to the characters in order to handle them on a computer. This encoding method usually differs depending on the language and the computer system. For example, on the Internet, UTF-8, UTF-16, etc. using Unicode are used as standard encoding methods in order to support major languages in the world. In character string processing apparatuses and image forming apparatuses, Shift JIS, Latin 1 and the like are used as encoding methods.

複数の符号化方式を利用可能な文字列処理装置は、必要に応じて一の符号化方式で符号化された文字列を他の符号化方式に変換する。なお、画像形成装置はフォントＲＯＭの容量を節約するため、ユーザインターフェースに必要十分な少数の符号化方式を販売先の言語に応じて用意している。 A character string processing apparatus that can use a plurality of encoding methods converts a character string encoded by one encoding method to another encoding method as necessary. In order to save the capacity of the font ROM, the image forming apparatus prepares a small number of encoding methods necessary and sufficient for the user interface according to the language of the sales destination.

従来、インターネットなどのネットワークに接続した文字列処理装置および画像形成装置は、ネットワーク側から例えばＵｎｉｃｏｄｅで表現された文字列を含む要求（例えば、文書名の変更要求など）を受信すると、その文字列を内部処理で利用する符号化方式に変換していた。特許文献１には、一の符号化方式で符号化された文字列を他の符号化方式に変換する内容が記載されている。 Conventionally, when a character string processing apparatus and an image forming apparatus connected to a network such as the Internet receive a request (for example, a request for changing a document name) including a character string expressed in Unicode, for example, from the network side, the character string Was converted to an encoding method used in internal processing. Patent Document 1 describes contents for converting a character string encoded by one encoding method into another encoding method.

通常、文字列処理装置および画像形成装置は複数の符号化方式が利用可能であるため、利用する符号化方式を選択する必要があった。しかしながら、符号化方式で表現可能な文字の集合は符号化方式ごとに異なっており、変換前の符号化方式で表現可能な文字が変換後の符号化方式で表現可能な文字であるとは限らない。したがって、変換前の符号化方式と変換後の符号化方式との組み合わせによっては、全ての文字列を完全に変換できない場合があるという問題があった。 Usually, since a plurality of encoding methods can be used for the character string processing device and the image forming apparatus, it is necessary to select the encoding method to be used. However, the set of characters that can be expressed by the encoding method differs for each encoding method, and characters that can be expressed by the encoding method before conversion are not necessarily characters that can be expressed by the encoding method after conversion. Absent. Therefore, depending on the combination of the encoding method before conversion and the encoding method after conversion, there is a problem that all character strings may not be completely converted.

例えばＵｎｉｃｏｄｅで表現可能な文字の集合は、文字列処理装置および画像形成装置が内部処理で利用するシフトＪＩＳやＬａｔｉｎ１等の符号化方式で表現可能な文字の集合に全て包含されているとは限らない。したがって、Ｕｎｉｃｏｄｅで表現可能な文字であっても、内部処理で利用する符号化方式で変換可能であるとは限らないという問願があった。 For example, a set of characters that can be expressed in Unicode is not always included in a set of characters that can be expressed by an encoding method such as Shift JIS or Latin 1 that is used by the character string processing apparatus and the image forming apparatus for internal processing. Absent. Therefore, there has been a request that even characters that can be expressed in Unicode cannot always be converted by an encoding method used in internal processing.

以上のように、従来の文字列処理装置および画像形成装置は、一の符号化方式で符号化された文字列をどの符号化方式に変換するかで、変換できない文字の数が異なるため、符号化方式の選択が重要であった。また、従来の文字列処理装置および画像形成装置は、変換後の文字列を照合するとき等に、変換できなかった文字の扱いが重要であった。さらに、いくつかの符号化方式では、同一の文字に異なる文字コードを割り当てており、このような例外的な文字の扱いも重要であった。 As described above, the conventional character string processing device and the image forming apparatus have different numbers of characters that cannot be converted depending on which encoding method the character string encoded by one encoding method is converted. The selection of the conversion method was important. Further, in the conventional character string processing device and image forming apparatus, it is important to handle characters that could not be converted when collating the converted character strings. Furthermore, in some encoding methods, different character codes are assigned to the same character, and handling of such exceptional characters is also important.

本発明は、上記の点に鑑みなされたもので、一の符号化方式で符号化された文字列を他の符号化方式に変換するときに、変換後の符号化方式で表現された文字列を適切に扱うことが可能な文字列処理装置および画像形成装置を提供することを目的とする。 The present invention has been made in view of the above points. When a character string encoded by one encoding method is converted to another encoding method, the character string expressed by the converted encoding method the and to provide a character string processing equipment Contact and image forming apparatus capable of properly dealing.

そこで、上記課題を解決するため、本発明は、一の符号化方式で符号化された文字列を用いて、内部処理で利用する内部符号化方式で符号化された文字列を照合する文字列処理装置において、前記内部符号化方式で符号化された文字列を前記一の符号化方式に変換し、変換した文字列を前記一の符号化方式で符号化された文字列を用いて照合するときに、前記変換した文字列に含まれる代用符号を、全ての文字を表す役割をもった文字として扱う文字列照合手段と、前記内部符号化方式で符号化される前の前記一の符号化方式で符号化された文字列を格納している格納手段とを有し、前記文字列照合手段は、前記内部符号化方式で符号化された文字列を前記一の符号化方式で符号化された文字列を用いて照合するときに、前記格納手段に格納している前記内部符号化方式で符号化される前の前記一の符号化方式で符号化された文字列と、前記内部符号化方式で符号化された文字列から変換された前記一の符号化方式で符号化された文字列とが一致とみなせなければ、前記前記内部符号化方式で符号化された文字列から変換された前記一の符号化方式で符号化された文字列を前記格納手段に格納することを有することを特徴とする。 Therefore, in order to solve the above-mentioned problem, the present invention uses a character string encoded by one encoding method, and a character string that collates a character string encoded by an internal encoding method used in internal processing. In the processing device, the character string encoded by the internal encoding method is converted to the one encoding method, and the converted character string is collated using the character string encoded by the one encoding method. Sometimes, the character string matching means that handles the substitute code included in the converted character string as a character having the role of representing all characters, and the one encoding before being encoded by the internal encoding method Storage means for storing a character string encoded by the method, wherein the character string collating means encodes the character string encoded by the internal encoding method by the one encoding method. Stored in the storage means when collating using a character string The character string encoded by the one encoding method before being encoded by the inner encoding method and the one encoding converted from the character string encoded by the inner encoding method If the character string encoded by the method cannot be regarded as a match, the storage means stores the character string encoded by the one encoding method converted from the character string encoded by the inner encoding method. It is characterized by having storing in .

本発明では、一の符号化方式で符号化された文字列を他の符号化方式の文字列に変換するときに、代用符号の数情報および位置情報の少なくとも一方に基づいて符号化方式を決定する。また、本発明では他の符号化方式に変換した文字列に含まれる代用符号を、全ての文字を表す役割ともった文字（ワイルドカード）として扱う。 In the present invention, when a character string encoded by one encoding method is converted to a character string of another encoding method, the encoding method is determined based on at least one of the number information and the position information of the substitute code. To do. In the present invention, a substitute code included in a character string converted into another encoding method is treated as a character (wild card) having a role of representing all characters.

上述の如く、本発明によれば、一の符号化方式で符号化された文字列を他の符号化方式に変換するときに、変換後の符号化方式で表現された文字列を適切に扱うことが可能な文字列処理装置および画像形成装置を提供することができる。 As described above, according to the present invention, when a character string encoded by one encoding method is converted to another encoding method, the character string expressed by the converted encoding method is appropriately handled. it is possible to provide a character string processing equipment Contact and image forming apparatus capable.

符号化方式で表現可能な文字の集合が符号化方式ごとに異なっていることを表す図である。It is a figure showing that the set of the character which can be expressed with an encoding system differs for every encoding system. 符号化方式で表現可能な文字の集合が符号化方式ごとに異なっていることを表す図である。It is a figure showing that the set of the character which can be expressed with an encoding system differs for every encoding system. 符号化方式で表現可能な文字の集合が符号化方式ごとに異なっていることを表す図である。It is a figure showing that the set of the character which can be expressed with an encoding system differs for every encoding system. 符号化方式で表現可能な文字の集合が符号化方式ごとに異なっていることを表す図である。It is a figure showing that the set of the character which can be expressed with an encoding system differs for every encoding system. 符号化方式で表現可能な文字の集合が符号化方式ごとに異なっていることを表す図である。It is a figure showing that the set of the character which can be expressed with an encoding system differs for every encoding system. 符号化方式で表現可能な文字の集合が符号化方式ごとに異なっていることを表す図である。It is a figure showing that the set of the character which can be expressed with an encoding system differs for every encoding system. 文字に割り当てられている文字コードを符号化方式ごとに表したルックアップテーブルである。It is the look-up table which represented the character code assigned to the character for every encoding system. 本発明による融合機の一実施例の構成図である。It is a block diagram of one Example of the compound machine by this invention. 本発明による融合機の一実施例のハードウェア構成図である。It is a hardware block diagram of one Example of the compound machine by this invention. 本発明による第１実施例の文字列処理方法について説明するための融合機の構成図である。It is a block diagram of the compound machine for demonstrating the character string processing method of 1st Example by this invention. 文書名の変更要求を行うＳＯＡＰメッセージの一例の構成図である。It is a block diagram of an example of a SOAP message for making a document name change request. 符号化方式決定処理の一例のフローチャートである。It is a flowchart of an example of an encoding system determination process. ＵＴＦ−８，Ｓｈｉｆｔ＿ＪＩＳおよびＬａｔｉｎ１の文字コードの割り当てを表した一例のルックアップテーブルである。It is an example look-up table showing allocation of character codes of UTF-8, Shift_JIS, and Latin1. 書誌情報データの形式を表した図である。It is a figure showing the format of bibliographic information data. 文字列の照合について説明するための図である。It is a figure for demonstrating collation of a character string. 例外的な文字を設定したテーブルの一例の構成図である。It is a block diagram of an example of the table which set the exceptional character. 本発明による第２実施例の文字列処理方法について説明するための融合機の構成図である。It is a block diagram of the compound machine for demonstrating the character string processing method of 2nd Example by this invention. 文字列の照合処理について説明するための一例のフローチャートである。It is a flowchart of an example for demonstrating the collation process of a character string. 符号化処理の一例のフローチャートである。It is a flowchart of an example of an encoding process. 文書名の検索要求を行うＳＯＡＰメッセージの一例の構成図である。It is a block diagram of an example of a SOAP message for making a search request for a document name. 文字列の照合処理について説明するための一例のフローチャートである。It is a flowchart of an example for demonstrating the collation process of a character string. 本発明による第３実施例の文字列処理方法について説明するための融合機の構成図である。It is a block diagram of the compound machine for demonstrating the character string processing method of 3rd Example by this invention. 本発明による第３実施例の文字列処理方法について説明するためのシーケンス図である。It is a sequence diagram for demonstrating the character string processing method of 3rd Example by this invention.

次に、本発明を実施するための最良の形態を、以下の実施例に基づき図面を参照しつつ説明していく。 Next, the best mode for carrying out the present invention will be described based on the following embodiments with reference to the drawings.

まず、本発明の理解を容易とするために、本発明の原理について図１〜７を参照しつつ説明する。図１〜６は、符号化方式で表現可能な文字の集合が符号化方式ごとに異なっていることを表す図である。また、図７は文字に割り当てられている文字コードを符号化方式ごとに表したルックアップテーブルである。 First, in order to facilitate understanding of the present invention, the principle of the present invention will be described with reference to FIGS. 1 to 6 are diagrams showing that a set of characters that can be expressed by the encoding method is different for each encoding method. FIG. 7 is a look-up table showing character codes assigned to characters for each encoding method.

図１〜図６では、Ωが全ての文字の集合を表し、Ａが符号化方式ａで符号化可能な文字の集合を表し、Ｂが符号化方式ｂで符号化可能な文字の集合を表し、Ｃが符号化方式ｂで符号化可能な文字の集合を表している。 1 to 6, Ω represents a set of all characters, A represents a set of characters that can be encoded by the encoding method a, and B represents a set of characters that can be encoded by the encoding method b. , C represents a set of characters that can be encoded by the encoding method b.

図１は、符号化方式ｂ又はｃで符号化可能な文字の集合Ｂ∪Ｃが、符号化方式ａで符号化可能な文字の集合Ａに包含されていることを表している。例えば符号化方式ａがＵＴＦ−８，符号化方式ｂがＳｈｉｆｔ＿ＪＩＳ，符号化方式ｃがＬａｔｉｎ１であるのときに、図１のような関係を有する。なお、符号化方式がＵＴＦ−８，Ｓｈｉｆｔ＿ＪＩＳ，Ｌａｔｉｎ１のときのルックアップテーブルは図７のように表される。 FIG. 1 shows that the character set B∪C that can be encoded by the encoding method b or c is included in the character set A that can be encoded by the encoding method a. For example, when the encoding method a is UTF-8, the encoding method b is Shift_JIS, and the encoding method c is Latin1, the relationship shown in FIG. Note that the look-up table when the encoding method is UTF-8, Shift_JIS, and Latin1 is expressed as shown in FIG.

図２は、符号化方式ａ，ｂ及びｃで符号化可能な文字の集合Ａ∩Ｂ∩Ｃが、斜線で表されている。例えば図７のルックアップテーブルでは、文字「１」が図２の斜線部分に含まれる。図２の斜線部分に含まれる文字は、一の符号化方式から他の符号化方式に変換できる。 In FIG. 2, a set of characters A∩B∩C that can be encoded by the encoding methods a, b, and c is represented by hatching. For example, in the lookup table of FIG. 7, the character “1” is included in the hatched portion of FIG. Characters included in the hatched portion in FIG. 2 can be converted from one encoding method to another encoding method.

図３は、符号化方式ａ，ｂで符号化可能かつ符号化方式ｃで符号化できない文字の集合Ａ∩Ｂ−Ｃが、斜線で表されている。例えば図７のルックアップテーブルでは、文字「円」が図３の斜線部分に含まれる。図３の斜線部分に含まれる文字は、一の符号化方式から符号化方式ｃに変換できない。 In FIG. 3, a set of characters A−B-C that can be encoded by the encoding methods a and b but cannot be encoded by the encoding method c is indicated by hatching. For example, in the lookup table of FIG. 7, the character “circle” is included in the hatched portion of FIG. Characters included in the hatched portion in FIG. 3 cannot be converted from one encoding method to encoding method c.

本発明では、図３の斜線部分に含まれている文字を符号化方式ｃに変換する場合、その文字を代用符号に置き換える。代用符号には、文字が割り当てられていない文字コードを用いる。 In the present invention, when a character included in the hatched portion in FIG. 3 is converted to the encoding scheme c, the character is replaced with a substitute code. As the substitute code, a character code to which no character is assigned is used.

例えば代用符号は、文字を１バイトで表現可能な符号化方式を２バイトに拡張した文字コードを用いることが考えられる。また、代用符号は文字が割り当てられていない隙間の文字コードを用いることが考えられる。さらに、文字列処理装置や画像形成装置などで用いない例えば「字削除」の制御文字を用いることも考えられる。 For example, as a substitute code, it is conceivable to use a character code obtained by extending an encoding method capable of expressing a character by 1 byte to 2 bytes. Further, it is conceivable to use a character code in a gap where no character is assigned as the substitute code. Furthermore, it is also conceivable to use, for example, a “character deletion” control character that is not used in a character string processing apparatus or an image forming apparatus.

図４は、符号化方式ａ，ｃで符号化可能かつ符号化方式ｂで符号化できない文字の集合Ａ∩Ｃ−Ｂが、斜線で表されている。例えば図７のルックアップテーブルでは、文字 In FIG. 4, a set A−C-B of characters that can be encoded by the encoding methods a and c but cannot be encoded by the encoding method b is represented by hatching. For example, in the lookup table of FIG.

［外１］

が図４の斜線部分に含まれる。図４の斜線部分に含まれる文字は、一の符号化方式から符号化方式ｂに変換できない。本発明では、図４の斜線部分に含まれている文字を符号化方式ｂに変換する場合、その文字を代用符号に置き換える。 [Outside 1]

Is included in the hatched portion of FIG. Characters included in the shaded area in FIG. 4 cannot be converted from one encoding method to encoding method b. In the present invention, when a character included in the shaded portion in FIG. 4 is converted into the encoding method b, the character is replaced with a substitute code.

図５は、符号化方式ａで符号化可能かつ符号化方式ｂ，ｃで符号化できない文字の集合Ａ−（Ｂ∪Ｃ）が、斜線で表されている。例えば図７のルックアップテーブルでは、文字 In FIG. 5, a set A- (B∪C) of characters that can be encoded by the encoding method a and cannot be encoded by the encoding methods b and c is represented by hatching. For example, in the lookup table of FIG.

［外２］

が図５の斜線部分に含まれる。図５の斜線部分に含まれる文字は、一の符号化方式から符号化方式ｂ，ｃに変換できない。本発明では、図５の斜線部分に含まれている文字を符号化方式ｂ，ｃに変換する場合、その文字を代用符号に置き換える。 [Outside 2]

Is included in the hatched portion of FIG. Characters included in the hatched portion in FIG. 5 cannot be converted from one encoding method to encoding methods b and c. In the present invention, when a character included in the shaded portion in FIG. 5 is converted into the encoding methods b and c, the character is replaced with a substitute code.

図６は、符号化方式ａ，ｂ，ｃで符号化できない文字の集合 FIG. 6 shows a set of characters that cannot be encoded by the encoding methods a, b, and c.

［外３］

が、斜線で表されている。図６の斜線部分に含まれる文字は、一の符号化方式から符号化方式ａ，ｂ，ｃに変換できない。もっとも、符号化方式ａに変換できない文字が文字コードで表現されているなら、最初から代用符号になっているものと考えられる。したがって、本発明では図６の斜線部分に含まれている文字を符号化方式ｂ，ｃに変換する場合、符号化方式ａの代用符号を符号化方式ｂ，ｃの代用符号に置き換える。代用符号に置き換えられた文字は、例えばオペレーションパネルなどの操作画面に不明文字として「□」で表示される。 [Outside 3]

Is represented by diagonal lines. Characters included in the hatched portion in FIG. 6 cannot be converted from one encoding method to encoding methods a, b, and c. However, if a character that cannot be converted into the encoding method a is expressed by a character code, it can be considered that the character is a substitute code from the beginning. Therefore, in the present invention, when the character included in the hatched portion in FIG. 6 is converted into the encoding methods b and c, the substitute code of the encoding method a is replaced with the substitute code of the encoding methods b and c. The character replaced with the substitute code is displayed as “□” as an unknown character on an operation screen such as an operation panel.

本発明の文字列処理装置は、複数の符号化方式が利用可能であるとき、代用符号に置き換えた文字の数，代用符号に置き換えた文字の位置および符号化方式の優先度により、後述するように符号化方式を選択する。 When a plurality of encoding methods are available, the character string processing apparatus of the present invention will be described later depending on the number of characters replaced with substitute codes, the positions of characters replaced with substitute codes, and the priority of the encoding method. Select the encoding method.

以下、本発明の文字列処理方法を利用する文字列処理装置の一例として画像形成装置の処理を中心に説明するが、一の符号化方式で符号化された文字列を他の符号化方式に変換する如何なる装置であってもよい。 Hereinafter, the processing of the image forming apparatus will be mainly described as an example of a character string processing apparatus using the character string processing method of the present invention. However, a character string encoded by one encoding method is changed to another encoding method. Any device that converts can be used.

なお、本実施例で説明する画像形成装置は、プリンタ，コピー，ファクシミリおよびスキャナなどの各装置の機能を１つの筐体内に収納しているため、融合機とも呼ばれる。 Note that the image forming apparatus described in this embodiment is also called a multi-function machine because the functions of each apparatus such as a printer, a copy machine, a facsimile machine, and a scanner are housed in one housing.

この融合機は、１つの筐体内に表示部，印刷部および撮像部などを設けると共に、プリンタ，コピー，ファクシミリおよびスキャナにそれぞれ対応する４種類のソフトウェアを設け、そのソフトウェアを切り替えることより、プリンタ，コピー，ファクシミリおよびスキャナとして動作させるものである。 This multi-function apparatus is provided with a display unit, a printing unit, an imaging unit, and the like in one casing, and four types of software corresponding to a printer, a copy, a facsimile, and a scanner, respectively. Operates as a copy, facsimile and scanner.

図８は、本発明による融合機の一実施例の構成図である。融合機１は、ソフトウェア群２と，融合機起動部３と，ハードウェア資源４とを含むように構成される。 FIG. 8 is a block diagram of an embodiment of the compound machine according to the present invention. The multi-function machine 1 is configured to include a software group 2, a multi-function machine starting unit 3, and hardware resources 4.

融合機起動部３は融合機１の電源投入時に最初に実行され、アプリケーション層５およびプラットフォーム６を起動する。例えば融合機起動部３は、アプリケーション層５およびプラットフォーム６のプログラムをハードディスク装置（以下、ＨＤＤという）などから読み出し、読み出した各プログラムをメモリ領域に転送して起動する。 The compound machine starting unit 3 is executed first when the power of the compound machine 1 is turned on, and activates the application layer 5 and the platform 6. For example, the multi-function apparatus activation unit 3 reads the programs of the application layer 5 and the platform 6 from a hard disk device (hereinafter referred to as HDD) and transfers each read program to a memory area and activates it.

ハードウェア資源４は、白黒レーザプリンタ（Ｂ＆ＷＬＰ）１１と，カラーレーザプリンタ（ＣｏｌｏｒＬＰ）１２と，スキャナやファクシミリなどのその他のハードウェアリソース１３とを含む。 The hardware resources 4 include a monochrome laser printer (B & W LP) 11, a color laser printer (Color LP) 12, and other hardware resources 13 such as a scanner and a facsimile.

また、ソフトウェア群２は、ＵＮＩＸ（登録商標）などのオペレーティングシステム（以下、ＯＳという）上に起動されているアプリケーション層５とプラットフォーム６とを含む。アプリケーション層５は、プリンタ，コピー，ファックスおよびスキャナなどの画像形成にかかるユーザサービスにそれぞれ固有の処理を行うプログラムを含む。 The software group 2 includes an application layer 5 and a platform 6 activated on an operating system (hereinafter referred to as OS) such as UNIX (registered trademark). The application layer 5 includes programs that perform processing unique to user services related to image formation such as printers, copies, faxes, and scanners.

アプリケーション層５は、プリンタアプリ２１と，コピーアプリ２２と，ファックスアプリ２３と，スキャナアプリ２４と，ＷＥＢサービスインターフェース（以下、ＷＥＢサービスＩ／Ｆという）２６とを含む。 The application layer 5 includes a printer application 21, a copy application 22, a fax application 23, a scanner application 24, and a WEB service interface (hereinafter referred to as WEB service I / F) 26.

プラットフォーム６は、アプリケーション層５からの処理要求を解釈してハードウェア資源４の獲得要求を発生するコントロールサービス層９と、１つ以上のハードウェア資源４の管理を行ってコントロールサービス層９からの獲得要求を調停するシステムリソースマネージャ（以下、ＳＲＭという）３９と、ＳＲＭ３９からの獲得要求に応じてハードウェア資源４の管理を行うハンドラ層１０とを含む。 The platform 6 interprets a processing request from the application layer 5 and generates a hardware resource 4 acquisition request, and manages one or more hardware resources 4 to control the control service layer 9 A system resource manager (hereinafter referred to as SRM) 39 that arbitrates acquisition requests and a handler layer 10 that manages hardware resources 4 in response to acquisition requests from the SRM 39 are included.

コントロールサービス層９は、ネットワークコントロールサービス（以下、ＮＣＳという）３１，デリバリーコントロールサービス（以下、ＤＣＳという）３２，オペレーションパネルコントロールサービス（以下、ＯＣＳという）３３，ファックスコントロールサービス（以下、ＦＣＳという）３４，エンジンコントロールサービス（以下、ＥＣＳという）３５，メモリコントロールサービス（以下、ＭＣＳという）３６，ユーザ情報コントロールサービス（以下、ＵＣＳという）３７，システムコントロールサービス（以下、ＳＣＳという）３８など、一つ以上のサービスモジュールを含むように構成されている。 The control service layer 9 includes a network control service (hereinafter referred to as NCS) 31, a delivery control service (hereinafter referred to as DCS) 32, an operation panel control service (hereinafter referred to as OCS) 33, a fax control service (hereinafter referred to as FCS) 34. , Engine control service (hereinafter referred to as ECS) 35, memory control service (hereinafter referred to as MCS) 36, user information control service (hereinafter referred to as UCS) 37, system control service (hereinafter referred to as SCS) 38, etc. The service module is configured to be included.

なお、プラットフォーム６は予め定義されている関数により、アプリケーション層５からの処理要求を受信するアプリケーションプログラムインターフェース（以下、ＡＰＩという）５１を含むように構成されている。ＯＳは、アプリケーション層５およびプラットフォーム６の各ソフトウェアをプロセスとして並列実行する。 The platform 6 is configured to include an application program interface (hereinafter referred to as API) 51 that receives a processing request from the application layer 5 using a predefined function. The OS executes the software of the application layer 5 and the platform 6 in parallel as processes.

ＮＣＳ３１のプロセスは、ネットワーク側から各プロトコルによって受信したデータを各アプリケーションに振り分けたり、各アプリケーションからのデータをネットワーク側に送信する際の仲介を行う。 The process of the NCS 31 distributes data received from the network side according to each protocol to each application, and mediates when data from each application is transmitted to the network side.

例えばＮＣＳ３１のプロセスは、ネットワークを介して接続されるクライアントとのデータ通信を例えばｈｔｔｐｄ（HyperText Transfer Protocol Daemon）により、ＨＴＴＰ（HyperText Transfer Protocol）で制御する。 For example, the process of the NCS 31 controls data communication with a client connected via a network by, for example, HTTP (HyperText Transfer Protocol Daemon) by HTTP (HyperText Transfer Protocol).

ＤＣＳ３２のプロセスは、蓄積文書の配送などの制御を行う。ＯＣＳ３３のプロセスは、オペレーションパネルの制御を行う。ＦＣＳ３４のプロセスは、アプリケーション層５からＰＳＴＮ，ＩＳＤＮ網を利用したファックス送受信，バックアップ用のメモリで管理されている各種ファックスデータの登録／引用，ファックス読み取り，ファックス受信印刷などを行うためのＡＰＩを提供する。 The process of the DCS 32 performs control such as delivery of stored documents. The process of the OCS 33 controls the operation panel. The FCS34 process provides APIs to perform fax transmission / reception using the PSTN and ISDN networks from the application layer 5, registration / quotation of various fax data managed in the backup memory, fax reading, fax reception printing, etc. To do.

ＥＣＳ３５のプロセスは、白黒レーザプリンタ１１，カラーレーザプリンタ１２，ハードウェアリソース１３などのエンジン部の制御を行う。ＭＣＳ３６のプロセスは、メモリの取得および解放，ＨＤＤの利用，画像データの圧縮および伸張などのメモリ制御を行う。ＵＣＳ３７のプロセスは、ユーザ情報の管理を行うものである。 The process of the ECS 35 controls engine units such as the monochrome laser printer 11, the color laser printer 12, and the hardware resource 13. The process of the MCS 36 performs memory control such as acquisition and release of memory, use of the HDD, and compression and expansion of image data. The process of the UCS 37 manages user information.

ＳＣＳ３８のプロセスは、アプリケーション管理，操作部制御，システム画面表示，ＬＥＤ表示，ハードウェア資源管理，割り込みアプリケーション制御などの処理を行う。 The process of the SCS 38 performs processing such as application management, operation unit control, system screen display, LED display, hardware resource management, interrupt application control.

ＳＲＭ３９のプロセスは、ＳＣＳ３８と共にシステムの制御およびハードウェア資源４の管理を行うものである。例えばＳＲＭ３９のプロセスは、白黒レーザプリンタ１１やカラーレーザプリンタ１２などのハードウェア資源４を利用する上位層からの獲得要求に従って調停を行い、実行制御する。 The SRM 39 process controls the system and manages the hardware resources 4 together with the SCS 38. For example, the process of the SRM 39 performs arbitration in accordance with an acquisition request from an upper layer using the hardware resource 4 such as the black and white laser printer 11 and the color laser printer 12 and controls execution.

具体的に、ＳＲＭ３９のプロセスは獲得要求されたハードウェア資源４が利用可能であるか（他の獲得要求により利用されていないかどうか）を判定し、利用可能であれば獲得要求されたハードウェア資源４が利用可能である旨を上位層に通知する。また、ＳＲＭ３９のプロセスは上位層からの獲得要求に対してハードウェア資源４を利用するためのスケジューリングを行い、要求内容（例えば、プリンタエンジンによる紙搬送と作像動作，メモリ確保，ファイル生成など）を直接実施している。 Specifically, the process of the SRM 39 determines whether or not the requested hardware resource 4 is available (whether it is not used by another acquisition request). The higher layer is notified that the resource 4 is available. Further, the process of the SRM 39 performs scheduling for using the hardware resource 4 in response to an acquisition request from an upper layer, and the request contents (for example, paper conveyance and image forming operation by the printer engine, memory allocation, file generation, etc.) Has been implemented directly.

また、ハンドラ層１０は後述するファックスコントロールユニット（以下、ＦＣＵという）の管理を行うファックスコントロールユニットハンドラ（以下、ＦＣＵＨという）４０と、プロセスに対するメモリの割り振り及びプロセスに割り振ったメモリの管理を行うイメージメモリハンドラ（以下、ＩＭＨという）４１とを含む。 The handler layer 10 includes a fax control unit handler (hereinafter referred to as FCUH) 40 that manages a fax control unit (hereinafter referred to as FCU), which will be described later, and an image that allocates memory to the process and manages the memory allocated to the process. A memory handler (hereinafter referred to as IMH) 41.

ＳＲＭ３９，ＦＣＵＨ４０およびＩＭＨ４１は、予め定義されている関数によりハードウェア資源４に対する処理要求を送信するエンジンＩ／Ｆ５２を利用してハードウェア資源４に対する処理要求を行う。 The SRM 39, the FCUH 40, and the IMH 41 make a processing request for the hardware resource 4 using the engine I / F 52 that transmits a processing request for the hardware resource 4 using a predefined function.

図８のような構成により、融合機１は各アプリケーションで共通的に必要な処理をプラットフォーム６で一元的に処理することができる。 With the configuration as shown in FIG. 8, the multi-function apparatus 1 can centrally process processes commonly required for each application on the platform 6.

次に、融合機１のハードウェア構成について説明する。図９は、本発明による融合機の一実施例のハードウェア構成図である。融合機１は、コントローラ６０と，オペレーションパネル７０と，ＦＣＵ８０と，ＵＳＢデバイス９０と，ＩＥＥＥ１３９４デバイス１００と，エンジン部１１０とを含む。 Next, a hardware configuration of the multifunction machine 1 will be described. FIG. 9 is a hardware configuration diagram of an embodiment of the compound machine according to the present invention. The multi-function apparatus 1 includes a controller 60, an operation panel 70, an FCU 80, a USB device 90, an IEEE 1394 device 100, and an engine unit 110.

また、コントローラ６０は、ＣＰＵ６１と，システムメモリ６２と，ノースブリッジ（以下、ＮＢという）６３と，サウスブリッジ（以下、ＳＢという）６４と，ＡＳＩＣ６６と，ローカルメモリ６７と，ＨＤＤ６８，ネットワークＩ／Ｆコントローラ６９とを含む。 The controller 60 includes a CPU 61, a system memory 62, a north bridge (hereinafter referred to as NB) 63, a south bridge (hereinafter referred to as SB) 64, an ASIC 66, a local memory 67, an HDD 68, and a network I / F. Controller 69.

オペレーションパネル７０は、コントローラ６０のＡＳＩＣ６６に接続されている。また、ＦＣＵ８０，ＵＳＢデバイス９０，ＩＥＥＥ１３９４デバイス１００およびエンジン部１１０は、コントローラ６０のＡＳＩＣ６６にＰＣＩバスで接続されている。 The operation panel 70 is connected to the ASIC 66 of the controller 60. The FCU 80, the USB device 90, the IEEE 1394 device 100, and the engine unit 110 are connected to the ASIC 66 of the controller 60 via a PCI bus.

コントローラ６０は、ＡＳＩＣ６６にローカルメモリ６７，ＨＤＤ６８，ネットワークＩ／Ｆコントローラ６９などが接続されると共に、ＣＰＵ６１とＡＳＩＣ６６とがＣＰＵチップセットのＮＢ６３を介して接続されている。ＮＢ６３を介してＣＰＵ６１とＡＳＩＣ６６とを接続すれば、ＣＰＵ６１のＩ／Ｆが公開されていない場合に対応できる。 In the controller 60, a local memory 67, an HDD 68, a network I / F controller 69, and the like are connected to the ASIC 66, and the CPU 61 and the ASIC 66 are connected via an NB 63 of a CPU chip set. If the CPU 61 and the ASIC 66 are connected via the NB 63, it is possible to cope with the case where the I / F of the CPU 61 is not disclosed.

なお、ＡＳＩＣ６６とＮＢ６３とは、ＰＣＩバスを介して接続されているのでなく、ＡＧＰ（Accelerated Graphics Port ）６５を介して接続されている。このように、図８のアプリケーション層５やプラットフォーム６を形成する一つ以上のプロセスを実行制御するため、ＡＳＩＣ６６とＮＢ６３とを低速のＰＣＩバスでなくＡＧＰ６５を介して接続し、パフォーマンスの低下を防いでいる。 The ASIC 66 and the NB 63 are not connected via a PCI bus, but are connected via an AGP (Accelerated Graphics Port) 65. Thus, in order to control the execution of one or more processes forming the application layer 5 and the platform 6 in FIG. 8, the ASIC 66 and the NB 63 are connected via the AGP 65 instead of the low-speed PCI bus to prevent performance degradation. It is out.

ＣＰＵ６１は、融合機１の全体制御を行うものである。ＣＰＵ６１は、ＮＣＳ３１，ＤＣＳ３２，ＯＣＳ３３，ＦＣＳ３４，ＥＣＳ３５，ＭＣＳ３６，ＵＣＳ３７，ＳＣＳ３８，ＳＲＭ３９，ＦＣＵＨ４０およびＩＭＨ４１をＯＳ上にそれぞれプロセスとして起動して実行させると共に、アプリケーション層１１１を形成するプリンタアプリ２１，コピーアプリ２２，ファックスアプリ２３，スキャナアプリ２４，ネットファイルアプリ２５，ＷＥＢサービスＩ／Ｆ２６を起動して実行させる。 The CPU 61 performs overall control of the compound machine 1. The CPU 61 starts and executes NCS31, DCS32, OCS33, FCS34, ECS35, MCS36, UCS37, SCS38, SRM39, FCUH40, and IMH41 as processes on the OS, and also executes the printer application 21 and the copy application that form the application layer 111 22, the fax application 23, the scanner application 24, the net file application 25, and the WEB service I / F 26 are activated and executed.

ＮＢ６３は、ＣＰＵ６１，システムメモリ６２，ＳＢ６４およびＡＳＩＣ６６を接続するためのブリッジである。システムメモリ６２は、融合機１の描画用メモリ等として用いるメモリである。ＳＢ６４は、図示していないＲＯＭ，ＰＣＩバス，周辺デバイス等とＮＢ６３とを接続するためのブリッジである。 The NB 63 is a bridge for connecting the CPU 61, the system memory 62, the SB 64, and the ASIC 66. The system memory 62 is a memory used as a drawing memory or the like of the multifunction machine 1. The SB 64 is a bridge for connecting the NB 63 to a ROM, PCI bus, peripheral device, etc. (not shown).

ローカルメモリ６７はコピー用画像バッファ，符号バッファとして用いるメモリである。ＡＳＩＣ６６は、画像処理用のハードウェア要素を有する画像処理用途向けのＩＣである。ＨＤＤ６８は、画像データの蓄積，文書データの蓄積，プログラムの蓄積，フォントデータの蓄積，フォームの蓄積などを行うためのストレージである。また、オペレーションパネル７０は、ユーザからの入力操作を受け付けると共に、ユーザに向けた表示を行う操作部である。 The local memory 67 is a memory used as a copy image buffer and a code buffer. The ASIC 66 is an IC for use in image processing having hardware elements for image processing. The HDD 68 is a storage for storing image data, document data, programs, font data, forms, and the like. The operation panel 70 is an operation unit that accepts an input operation from the user and performs display for the user.

以下、ネットワークを介して接続されたクライアントから文字列を含む要求（文書名や作者名の変更，照合）を受信したときの融合機１の処理について説明していく。 Hereinafter, processing of the multi-function apparatus 1 when a request including a character string (change of document name or author name, collation) is received from a client connected via a network will be described.

図１０は、本発明による第１実施例の文字列処理方法について説明するための融合機の構成図である。なお、図１０の構成図は説明に必要のない構成を省略している。 FIG. 10 is a block diagram of the compound machine for explaining the character string processing method according to the first embodiment of the present invention. In the configuration diagram of FIG. 10, configurations that are not necessary for description are omitted.

図１０では、融合機１にインターネット，ＬＡＮなどのネットワーク２１０を介してクライアント２００が接続されている。クライアント２００と融合機１とは、ＸＭＬ（eXtensible Markup Language）形式でデータ交換を行う。なお、クライアント２００と融合機１とは、例えばデータ・アクセス・プロトコルとしてＳＯＡＰ（Simple Object Access Protocol）と呼ばれる通信プロトコル規格を利用する。 In FIG. 10, a client 200 is connected to the multi-function apparatus 1 via a network 210 such as the Internet or a LAN. The client 200 and the MFP 1 exchange data in an XML (eXtensible Markup Language) format. The client 200 and the multi-function apparatus 1 use a communication protocol standard called SOAP (Simple Object Access Protocol) as a data access protocol, for example.

ステップＳ１では、クライアント２００が、ＸＭＬ形式で記述した図１１のようなＳＯＡＰメッセージを作成し、そのＳＯＡＰメッセージをＨＴＴＰメッセージに格納させて融合機１に送信する。図１１は、文書名の変更要求を行うＳＯＡＰメッセージの一例の構成図である。クライアント２００から送信されたＨＴＴＰメッセージは、ＮＣＳ３１のｈｔｔｐｄ２２０を介してＷｅｂサービスＩ／Ｆ２６に供給される。 In step S1, the client 200 creates a SOAP message as shown in FIG. 11 described in the XML format, stores the SOAP message in an HTTP message, and transmits it to the multi-function device 1. FIG. 11 is a configuration diagram of an example of a SOAP message for making a document name change request. The HTTP message transmitted from the client 200 is supplied to the Web service I / F 26 via the httpd 220 of the NCS 31.

ＨＴＴＰメッセージを受信すると、ＷｅｂサービスＩ／Ｆ２６はディスパッチャによりＳＯＡＰメッセージを解釈したあと、文書名の変更要求に応じたｓｅｔＴｉｔｌｅのオブジェクト２３０に処理を依頼する。 When receiving the HTTP message, the Web service I / F 26 interprets the SOAP message by the dispatcher, and then requests the setTitle object 230 in response to the request for changing the document name.

まず、ｓｅｔＴｉｔｌｅのオブジェクト２３０は、内部処理で利用する符号化方式以外の文字列（変更後の文書名）を受信すると、その文字列を内部処理で利用する符号化方式に変換する必要がある。ここでは、ＵＴＦ−８で符号化された文字列をＳｈｉｆｔ＿ＪＩＳまたはＬａｔｉｎ１で符号化された文字列に変換する例を説明する。 First, when the setTitle object 230 receives a character string (changed document name) other than the encoding method used in internal processing, the character string needs to be converted into an encoding method used in internal processing. Here, an example in which a character string encoded by UTF-8 is converted into a character string encoded by Shift_JIS or Latin1 will be described.

ＷｅｂサービスＩ／Ｆ２６から処理を依頼されると、オブジェクト２３０は文字列の符号化方式を変換する必要があるかを判定し、必要があれば図１２のようなフローチャートで表す処理を実行する。 When processing is requested from the Web service I / F 26, the object 230 determines whether or not the character string encoding method needs to be converted, and if necessary, executes processing represented by a flowchart shown in FIG.

図１２は、符号化方式決定処理の一例のフローチャートである。図１３はＵＴＦ−８，Ｓｈｉｆｔ＿ＪＩＳおよびＬａｔｉｎ１の文字コードの割り当てを表した一例のルックアップテーブルである。 FIG. 12 is a flowchart of an example of an encoding method determination process. FIG. 13 is an example of a look-up table showing allocation of character codes of UTF-8, Shift_JIS, and Latin1.

ステップＳ１１では、オブジェクト２３０は融合機１で利用可能な符号化方式のリストを例えばＨＤＤ６８などから取得し、最初の符号化方式として例えばＳｈｉｆｔ＿ＪＩＳを選択する。なお、符号化方式のリストはランダムに並んだものであってもよいし、優先度順に並んだものであってもよい。 In step S11, the object 230 obtains a list of encoding methods that can be used in the MFP 1, for example, from the HDD 68, and selects, for example, Shift_JIS as the first encoding method. Note that the list of encoding methods may be randomly arranged or may be arranged in order of priority.

ステップＳ１１に続いてステップＳ１２に進み、オブジェクト２３０は符号化方式を変更する文字列 Proceeding to step S12 following step S11, the object 230 is a character string for changing the encoding method.

［外４］

をステップＳ１１で選択したＳｈｉｆｔ＿ＪＩＳで符号化する。図１３のルックアップテーブルを参照すると、符号化方式Ｓｈｉｆｔ＿ＪＩＳでは文字 [Outside 4]

Is encoded by Shift_JIS selected in step S11. Referring to the look-up table in FIG. 13, in the encoding method Shift_JIS, characters

［外５］

が変換できない。ステップＳ１２に続いてステップＳ１３に進み、オブジェクト２３０はステップＳ１２で符号化した結果、符号化方式を変更する文字列がステップＳ１１で選択したＳｈｉｆｔ＿ＪＩＳで所定基準以上の符号化が可能であるか否かを判定する。例えば所定基準としては、完全な符号化であってもよいし、選択した符号化方式で変換できない文字の数が所定数以下であってもよい。ここでは、所定基準が完全な符号化である例を説明する。 [Outside 5]

Cannot be converted. Progressing to step S13 following step S12, the object 230 is encoded in step S12. As a result, it is determined whether or not the character string whose encoding method is changed can be encoded with the Shift_JIS selected in step S11 above a predetermined standard. Determine. For example, the predetermined standard may be complete encoding, or the number of characters that cannot be converted by the selected encoding method may be a predetermined number or less. Here, an example in which the predetermined criterion is complete encoding will be described.

この場合、Ｓｈｉｆｔ＿ＪＩＳで完全に符号化が可能ではないので（Ｓ１３においてＮＯ）、オブジェクト２３０はステップＳ１４に進む。なお、ステップＳ１１で選択した符号化方式で完全に符号化が可能であれば（Ｓ１３においてＹＥＳ）、オブジェクト２３０はステップＳ１１で選択した符号化方式に決定して処理を終了する。 In this case, since it is not possible to completely encode with Shift_JIS (NO in S13), the object 230 proceeds to step S14. If the encoding method selected in step S11 can be completely encoded (YES in S13), the object 230 is determined to be the encoding method selected in step S11, and the process ends.

ステップＳ１４では、オブジェクト２３０が、ステップＳ１２で符号化した結果から、代用文字に置き換える文字の数および文字の位置を例えばシステムメモリ６２に記憶する。この場合、オブジェクト２３０は代用文字に置き換える文字の数「１」および文字の位置「最初から２番目」を記憶する。 In step S14, the object 230 stores, for example, in the system memory 62 the number of characters to be replaced with the substitute character and the position of the character from the result of encoding in step S12. In this case, the object 230 stores the number of characters to be replaced with substitute characters “1” and the character position “second from the beginning”.

ステップＳ１４に続いてステップＳ１５に進み、オブジェクト２３０は符号化方式のリストから選択されていない符号化方式があるか否かを判定する。この場合、Ｌａｔｉｎ１が選択されていないので（Ｓ１５においてＹＥＳ）、オブジェクト２３０はステップＳ１６に進む。 Proceeding to step S15 following step S14, the object 230 determines whether there is an encoding method not selected from the list of encoding methods. In this case, since Latin 1 is not selected (YES in S15), the object 230 proceeds to step S16.

ステップＳ１５に続いてステップＳ１６に進み、オブジェクト２３０は符号化方式のリストからＳｈｉｆｔ＿ＪＩＳの次のＬａｔｉｎ１を選択してステップＳ１２に戻る。ステップＳ１２では、オブジェクト２３０が符号化方式を変更する文字列 Progressing to step S16 following step S15, the object 230 selects Latin1 next to Shift_JIS from the list of encoding methods, and returns to step S12. In step S12, the character string whose object 230 changes the encoding method

［外６］

をステップＳ１６で選択したＬａｔｉｎ１で符号化する。図１３のルックアップテーブルから、符号化方式Ｌａｔｉｎ１では文字「作」が変換できないことが分かる。ステップＳ１２に続いてステップＳ１３に進み、オブジェクト２３０はステップＳ１２で符号化した結果、符号化方式を変更する文字列がステップＳ１６で選択したＬａｔｉｎ１で完全に符号化可能であるか否かを判定する。 [Outside 6]

Is encoded with Latin1 selected in step S16. From the lookup table of FIG. 13, it can be seen that the character “saku” cannot be converted by the encoding method Latin1. Progressing to step S13 following step S12, the object 230 determines whether or not the character string whose encoding method is to be changed can be completely encoded with the Latin 1 selected in step S16 as a result of encoding in step S12. .

この場合、Ｌａｔｉｎ１で完全に符号化が可能ではないので（Ｓ１３においてＮＯ）、オブジェクト２３０はステップＳ１４に進む。ステップＳ１４では、オブジェクト２３０が、ステップＳ１２で符号化した結果から、代用文字に置き換える文字の数および文字の位置を例えばシステムメモリ６２に記憶する。この場合、オブジェクト２３０は代用文字に置き換える文字の数「１」および文字の位置「最初から５番目」を記憶する。 In this case, since encoding is not possible in Latin 1 (NO in S13), the object 230 proceeds to step S14. In step S14, the object 230 stores, for example, in the system memory 62 the number of characters to be replaced with the substitute character and the position of the character from the result of encoding in step S12. In this case, the object 230 stores the number of characters to be replaced with substitute characters “1” and the character position “fifth from the beginning”.

ステップＳ１４に続いてステップＳ１５に進み、オブジェクト２３０は符号化方式のリストから選択されていない符号化方式があるか否かを判定する。この場合、全ての符号化方式が選択されているので（Ｓ１５においてＮＯ）、オブジェクト２３０はステップＳ１８に進む。 Proceeding to step S15 following step S14, the object 230 determines whether there is an encoding method not selected from the list of encoding methods. In this case, since all the encoding methods have been selected (NO in S15), the object 230 proceeds to step S18.

ステップＳ１８では、オブジェクト２３０が、ステップＳ１４で記憶した代用文字に置き換える文字の数，文字の位置などを用いて、所定条件に基づき最適な符号化方式を決定して処理を終了する。 In step S18, the object 230 uses the number of characters to be replaced with the substitute character stored in step S14, the character position, and the like to determine an optimal encoding method based on predetermined conditions, and the process ends.

例えばステップＳ１８では、代用文字に置き換える文字の数が一番少ない符号化方式を最適な符号化方式として決定できる。また、代用文字に置き換える文字が連続して出現する回数が一番少ない符号化方式を最適な符号化方式として決定することもできる。また、代用文字に置き換える文字の最初の出現位置が一番後ろの符号化方式を最適な符号化方式として決定することもできる。 For example, in step S18, an encoding method with the least number of characters to be replaced with substitute characters can be determined as the optimal encoding method. It is also possible to determine the coding method with the smallest number of consecutive occurrences of characters to be replaced with substitute characters as the optimum coding method. It is also possible to determine the coding method with the last appearance position of the character to be replaced with the substitute character as the optimum coding method.

また、代用文字に置き換える文字の平均の出現位置が一番後ろの符号化方式を最適な符号化方式として決定することもできる。また、符号化方式のリスト中で先に現れる符号化方式を最適な符号化方式として決定することもできる。 It is also possible to determine the coding method with the last appearance position of the average character to be replaced with the substitute character as the optimum coding method. In addition, the encoding method that appears first in the list of encoding methods can be determined as the optimal encoding method.

さらに、上記のような符号化方式の決定を２つ以上組み合わせるか、ポイント制にすることにより、最適な符号化方式を決定することもできる。例えば代用文字に置き換える文字の数が一番少なく、且つ代用文字に置き換える文字の最初の出現位置が一番後ろの符号化方式を最適な符号化方式として決定する場合、 Furthermore, an optimal encoding method can be determined by combining two or more determinations of the encoding method as described above or using a point system. For example, when the number of characters to be replaced with a substitute character is the smallest and the first appearance position of the character to be replaced with a substitute character is determined as the most suitable encoding method,

［外７］

を符号化する符号化方式としてＬａｔｉｎ１が選択される。なお、代用文字に置き換える文字の数が一番少ない符号化方式を最適な符号化方式として決定すると共に、代用文字に置き換える文字の数が同じ場合に符号化方式のリスト中で先に現れる符号化方式を最適な符号化方式として決定することもできる。 [Outside 7]

Latin1 is selected as an encoding method for encoding. The coding method with the least number of characters to be replaced with substitute characters is determined as the optimal coding method, and the coding that appears first in the list of coding methods when the number of characters to be replaced with substitute characters is the same. It is also possible to determine the system as an optimal encoding system.

図１２のフローチャートにより最適な符号化方式を選択したあと、オブジェクト２３０は選択した符号化方式で文字列 After selecting the optimum encoding method according to the flowchart of FIG. 12, the object 230 is a character string in the selected encoding method.

［外８］

を符号化する。オブジェクト２３０は、符号化できない文字を代用文字に置き換える。例えば [Outside 8]

Is encoded. Object 230 replaces characters that cannot be encoded with substitute characters. For example

［外９］

をＬａｔｉｎ１で符号化した場合、最初から５番目の「作」が代用文字に置き換えられる。そして、オブジェクト２３０はステップＳ２に進み、符号化した文字列を用いてＭＣＳ３６に文書名の変更要求を行う。ＭＣＳ３６は例えば図１４のような形式の書誌情報データをＨＤＤ６８等に格納しており、オブジェクト２３０からの文書名の変更要求に応じて書誌情報データを更新する。 [Outside 9]

Is encoded with Latin 1, the fifth “saku” from the beginning is replaced with a substitute character. Then, the object 230 proceeds to step S2, and makes a request for changing the document name to the MCS 36 using the encoded character string. For example, the MCS 36 stores bibliographic information data in a format as shown in FIG. 14 in the HDD 68 or the like, and updates the bibliographic information data in response to a document name change request from the object 230.

そして、ステップＳ２に続いてステップＳ３に進み、ＭＣＳ３６は文書名の変更応答をオブジェクト２３０に送信する。オブジェクト２３０は文書名の変更応答を受信すると、ＷｅｂサービスＩ／Ｆ２６に処理の終了を通知する。 In step S3 following step S2, the MCS 36 transmits a document name change response to the object 230. When the object 230 receives the document name change response, the object 230 notifies the Web service I / F 26 of the end of the process.

ステップＳ４では、ＷｅｂサービスＩ／Ｆ２６が、文書名の変更応答を行うＳＯＡＰメッセージをＸＭＬ形式で記述し、そのＳＯＡＰメッセージをＨＴＴＰメッセージに格納させてクライアント２００に送信する。 In step S 4, the Web service I / F 26 describes a SOAP message for performing a document name change response in the XML format, stores the SOAP message in an HTTP message, and transmits it to the client 200.

以上、本発明による第１実施例の文字列処理方法によれば、一の符号化方式で符号化された文字列を他の符号化方式に変換するときに、適切な符号化方式を選択することができる。 As described above, according to the character string processing method of the first embodiment of the present invention, an appropriate encoding method is selected when a character string encoded by one encoding method is converted into another encoding method. be able to.

なお、図１０ではｓｅｔＴｉｔｌｅのオブジェクト２３０をＷｅｂサービスＩ／Ｆ２６に設けた例について説明したが、ＭＣＳ３６に設けてもよい。ＭＣＳ３６にオブジェクト２３０を設けた場合、ＷｅｂサービスＩ／Ｆ２６はディスパッチャによりＳＯＡＰメッセージを解釈したあと、文書名の変更要求をＭＣＳ３６に対して行う。そして、ＭＣＳ３６は文書名の変更要求に応じたｓｅｔＴｉｔｌｅのオブジェクト２３０に処理を依頼する。 In addition, although the example which provided the object 230 of setTitle in Web service I / F26 was demonstrated in FIG. 10, you may provide in MCS36. When the object 230 is provided in the MCS 36, the Web service I / F 26 makes a request to change the document name to the MCS 36 after interpreting the SOAP message by the dispatcher. The MCS 36 requests the setTitle object 230 in response to the document name change request.

ＷｅｂサービスＩ／Ｆ２６から処理を依頼されると、オブジェクト２３０は文字列の符号化方式を変換する必要があるかを判定し、必要があれば図１２のようなフローチャートで表す処理により最適な符号化方式を選択する。オブジェクト２３０は選択した符号化方式で文字列を符号化すると共に、符号化できない文字を代用文字に置き換える。そして、オブジェクト２３０はステップＳ２に進み、符号化した文字列を用いてＭＣＳ３６に文書名の変更要求を行う。その他の処理は、ｓｅｔＴｉｔｌｅのオブジェクト２３０をＷｅｂサービスＩ／Ｆ２６に設けた例と同様である。 When processing is requested from the Web service I / F 26, the object 230 determines whether it is necessary to convert the encoding method of the character string, and if necessary, the optimal code is obtained by the processing shown in the flowchart of FIG. Select the conversion method. The object 230 encodes the character string using the selected encoding method, and replaces characters that cannot be encoded with substitute characters. Then, the object 230 proceeds to step S2, and makes a request for changing the document name to the MCS 36 using the encoded character string. Other processes are the same as the example in which the setTitle object 230 is provided in the Web service I / F 26.

ｓｅｔＴｉｔｌｅのオブジェクト２３０をＷｅｂサービスＩ／Ｆ２６に設けた場合のメリットは、第１実施例の文字列処理方法が、ネットワーク２１０に接続しない融合機に不要な機能であるため、Ｗｅｂサービス対応のためのオプションという構成にできることである。また、ｓｅｔＴｉｔｌｅのオブジェクト２３０をＭＣＳ３６に設けた場合のメリットは、他のＩ／Ｆ（例えばプリンタＩ／Ｆなど）でも利用できることである。 The merit when the setTitle object 230 is provided in the Web service I / F 26 is that the character string processing method of the first embodiment is an unnecessary function for a multi-function peripheral that is not connected to the network 210. It can be configured as an option. An advantage of providing the setTitle object 230 in the MCS 36 is that it can also be used in other I / Fs (for example, a printer I / F).

第１実施例のように、変換できない文字を代用符号に置き換えた場合、変換後の文字列を照合するとき等、代用符号に置き換えた全ての文字が同じ文字としてみなされるという問題もあった。また、符号化方式が等しくないと文字の一致とみなさない場合、同じ文字が異なる文字とみなされるという問題があった。 When characters that cannot be converted are replaced with substitute codes as in the first embodiment, there is also a problem that all characters replaced with substitute codes are regarded as the same character when collating the converted character strings. In addition, when the encoding methods are not equal, there is a problem that the same character is regarded as a different character when it is not regarded as a character match.

そこで、第２実施例の融合機１は照合を行う文字列の符号化方式を等しい符号化方式に変換させ、置き換えられた代用符号をワイルドカード扱いとすることにより、上記の問題を解決している。 Therefore, the MFP 1 of the second embodiment solves the above problem by converting the encoding method of the character string to be collated into an equivalent encoding method and treating the replaced substitute code as a wild card. Yes.

図１５は、文字列の照合について説明するための図である。例えばＵＴＦ−８で符号化された FIG. 15 is a diagram for explaining collation of character strings. For example, encoded with UTF-8

［外１０］

をＳｈｉｆｔ＿ＪＩＳまたはＬａｔｉｎ１で符号化し、再びＵＴＦ−８に符号化した文字列は、 [Outside 10]

Is a character string encoded with Shift_JIS or Latin1 and encoded again with UTF-8.

［外１１］

に一致していない。そこで、代用符号で表される不明文字をワイルドカード扱いとすることで、 [Outside 11]

Does not match. Therefore, by treating the unknown character represented by the substitute code as a wild card,

［外１２］

に完全一致する文字列を照合できる。また、 [Outside 12]

Can be matched with the exact string. Also,

［外１３］

に前方一致する文字列も照合できる。また、Ｗｉｎｄｏｗｓ（登録商標）やＭＡＣ（登録商標）などのＯＳの違いにより、ＵＴＦ−８では同一の文字に異なる文字コードを例えば図１６のように割り当てている。このような例外的な文字を予め図１６のようなテーブルに設定しておくことで、本発明の文字列処理方法では異なる文字コードを割り当てられた文字であっても同一の文字としてみなすことができる。なお、図１６のテーブルはデータの保持形式の一例であって、例えばＸＭＬなどであってもよい。 [Outside 13]

You can also collate a character string that matches the prefix. Also, due to differences in OS such as Windows (registered trademark) and MAC (registered trademark), UTF-8 assigns different character codes to the same character as shown in FIG. By setting such exceptional characters in the table as shown in FIG. 16 in advance, the character string processing method of the present invention can regard even characters assigned different character codes as the same character. it can. The table in FIG. 16 is an example of a data holding format, and may be XML, for example.

図１６は、例外的な文字を設定したテーブルの一例の構成図である。図１６のテーブルを利用することで、本発明の文字列処理方法は例外的な文字を含む文字列であっても正確な照合が可能である。なお、本発明の文字列処理方法では大文字および小文字，平仮名および片仮名の関係を例外的な文字と同様、テーブルに設定しておくことで、異なる文字であっても同一の文字としてみなすこともできる。 FIG. 16 is a configuration diagram of an example of a table in which exceptional characters are set. By using the table of FIG. 16, the character string processing method of the present invention can accurately collate even a character string including exceptional characters. In the character string processing method according to the present invention, the relationship between uppercase and lowercase letters, hiragana and katakana is set in a table in the same manner as exceptional characters, so that even different characters can be regarded as the same character. .

図１７は、本発明による第２実施例の文字列処理方法について説明するための融合機の構成図である。なお、図１７の構成図は説明に必要のない構成を省略している。また、図１７の融合機１は図１０の融合機１と一部を除いて同様な構成であるため適宜説明を省略する。 FIG. 17 is a block diagram of a compound machine for explaining a character string processing method according to the second embodiment of the present invention. In the configuration diagram of FIG. 17, configurations that are not necessary for description are omitted. 17 has the same configuration as that of the multi-function apparatus 1 shown in FIG.

図１７の融合機１は、ＷｅｂサービスＩ／Ｆ２６に文書名の照合要求に応じたＳｅａｒｃｈのオブジェクト２４０を備え、また、ＷｅｂサービスＩ／Ｆ２６が利用するキャッシュ２５０を備えるように構成される。 The MFP 1 of FIG. 17 includes a Search object 240 corresponding to a document name collation request in the Web service I / F 26 and a cache 250 used by the Web service I / F 26.

図１８は、文字列の照合処理について説明するための一例のフローチャートである。クライアント２００から上記の文字列の符号化方式の変換を経て文書名の変更または文書の作成に伴う文書名の設定を指示するか、操作者がオペレーションパネル７０を操作して文書名の変更または文書の作成を指示すると、融合機１のＯＣＳ３３はステップＳ２１に進み、文書名の変更要求または文書の作成要求をＭＣＳ３６に送信する。 FIG. 18 is a flowchart illustrating an example of character string matching processing. The client 200 instructs the document name change or the document name setting accompanying the creation of the document through the conversion of the character string encoding method, or the operator operates the operation panel 70 to change the document name or the document. The OCS 33 of the MFP 1 proceeds to step S21, and transmits a document name change request or a document creation request to the MCS 36.

ステップＳ２１に続いてステップＳ２２に進み、ＭＣＳ３６は文書名の変更要求または文書の作成要求を受信し、その文書名の変更要求または文書の作成要求に応じて図１４のような書誌情報データを変更する。そして、ＭＣＳ３６はファイル変化通知をＷｅｂサービスＩ／Ｆ２６に送信する。 In step S22 following step S21, the MCS 36 receives a document name change request or document creation request, and changes the bibliographic information data as shown in FIG. 14 in response to the document name change request or document creation request. To do. Then, the MCS 36 transmits a file change notification to the Web service I / F 26.

ファイル変化通知を受信すると、ＷｅｂサービスＩ／Ｆ２６は文書名の照合要求に応じたｓｅａｒｃｈのオブジェクト２４０に処理を依頼する。図１９は、符号化処理の一例のフローチャートである。 When the file change notification is received, the Web service I / F 26 requests the search object 240 in response to the document name verification request. FIG. 19 is a flowchart of an example of the encoding process.

ステップＳ３１では、受信したファイル変化通知に含まれている文書名が内部処理で利用するＳｈｉｆｔ＿ＪＩＳまたはＬａｔｉｎ１等の符号化方式で符号化されているため、ＵＴＦ−８に変換される。ステップＳ３１に続いてステップＳ３２に進み、オブジェクト２４０は受信したファイル変化通知に含まれている文書ＩＤをキー情報としてキャッシュ２５０から該当する文書名を検索して照合する。 In step S31, since the document name included in the received file change notification is encoded by an encoding method such as Shift_JIS or Latin1 used in internal processing, it is converted to UTF-8. Progressing to step S32 following step S31, the object 240 searches the corresponding document name from the cache 250 using the document ID included in the received file change notification as key information, and collates it.

ステップＳ３２に続いてステップＳ３３に進み、オブジェクト２４０はステップＳ３２での照合結果から、ＵＴＦ−８に変換された文書名とキャッシュ２５０から検索された文書名とが一致とみなせるか否かを判定する。 Proceeding to step S33 following step S32, the object 240 determines whether or not the document name converted into UTF-8 and the document name retrieved from the cache 250 can be regarded as matching from the collation result in step S32. .

ＵＴＦ−８に変換された文書名とキャッシュ２５０から検索された文書名とが一致とみなせると判定すると（Ｓ３３においてＹＥＳ）、オブジェクト２４０は図１９の符号化処理を終了する。 If it is determined that the document name converted to UTF-8 matches the document name retrieved from the cache 250 (YES in S33), the object 240 ends the encoding process of FIG.

例えばＵＴＦ−８に変換された文書名が For example, the document name converted to UTF-8 is

［外１４］

であり、キャッシュ２５０から検索された文書名が [Outside 14]

The document name retrieved from the cache 250 is

［外１５］

である場合、前述したように代用符号で表される不明文字をワイルドカード扱いとすることで、ＵＴＦ−８に変換された文書名とキャッシュ２５０から検索された文書名とを一致とみなせる。クライアント２００からの文書名の設定の際には内部処理で利用する符号化方式に変換する前の文字列をキャッシュ２５０にまず格納する方式をとれば、クライアント２００からの文書名の設定の際に、このような場合が発生する。 [Outside 15]

If the unknown character represented by the substitute code is treated as a wild card as described above, the document name converted to UTF-8 and the document name retrieved from the cache 250 can be regarded as a match. When setting a document name from the client 200, a method of first storing a character string before conversion into an encoding method used in internal processing in the cache 250 is used. Such a case occurs.

一方、ＵＴＦ−８に変換された文書名とキャッシュ２５０から検索された文書名とが一致とみなせないと判定すると（Ｓ３３においてＮＯ）、オブジェクト２４０はＵＴＦ−８に変換された文書名を新しい文書名としてキャッシュ２５０に格納して図１９の符号化処理を終了する。 On the other hand, if it is determined that the document name converted to UTF-8 and the document name retrieved from the cache 250 cannot be regarded as coincident (NO in S33), the object 240 uses the document name converted to UTF-8 as a new document. The name is stored in the cache 250 and the encoding process in FIG. 19 is terminated.

［外１６］

であり、キャッシュ２５０から検索された文書名が [Outside 16]

The document name retrieved from the cache 250 is

［外１７］

である場合、ＵＴＦ−８に変換された文書名とキャッシュ２５０から検索された文書名とを一致とみなせない。そこで、オブジェクト２４０は
ＵＴＦ−８に変換された文書名 [Outside 17]

, The document name converted to UTF-8 and the document name retrieved from the cache 250 cannot be regarded as matching. Therefore, the object 240 is the document name converted to UTF-8.

［外１８］

を新しい文書名としてキャッシュ２５０に格納する。クライアント２００からの文書名の設定の際には内部処理で利用する符号化方式に変換する前の文字列をキャッシュ２５０に格納しない方式をとれば、キャッシュ２５０から検索された文書名が [Outside 18]

Is stored in the cache 250 as a new document name. When setting a document name from the client 200, if a method is adopted in which a character string before conversion to an encoding method used in internal processing is not stored in the cache 250, the document name retrieved from the cache 250 is stored.

［外１９］

ではなく [Outside 19]

not

［外２０］

であったりするが、不明文字をワイルドカード扱いとしても [Outside 20]

Although unknown characters are treated as wild cards

［外２１］

とは一致しないので、同様の結果となる。 [Outside 21]

Does not match, so the same result is obtained.

このように、図１７の融合機１では文書名の変更があったときに、文書名を内部処理で利用するＳｈｉｆｔ＿ＪＩＳまたはＬａｔｉｎ１等からＵＴＦ−８に変換している。 As described above, in the MFP 1 shown in FIG. 17, when a document name is changed, the document name is converted from Shift_JIS or Latin1 used for internal processing to UTF-8.

ステップＳ２３では、クライアント２００が、ＸＭＬ形式で記述した図２０のようなＳＯＡＰメッセージを作成し、そのＳＯＡＰメッセージをＨＴＴＰメッセージに格納させて融合機１に送信する。図２０は、文書名の検索要求を行うＳＯＡＰメッセージの一例の構成図である。クライアント２００から送信されたＨＴＴＰメッセージは、ＮＣＳ３１のｈｔｔｐｄ２２０を介してＷｅｂサービスＩ／Ｆ２６に供給される。 In step S23, the client 200 creates a SOAP message as shown in FIG. 20 described in the XML format, stores the SOAP message in the HTTP message, and transmits it to the MFP 1. FIG. 20 is a configuration diagram of an example of a SOAP message for making a document name search request. The HTTP message transmitted from the client 200 is supplied to the Web service I / F 26 via the httpd 220 of the NCS 31.

ＨＴＴＰメッセージを受信すると、ＷｅｂサービスＩ／Ｆ２６はディスパッチャによりＳＯＡＰメッセージを解釈したあと、ｓｅａｒｃｈのオブジェクト２４０に処理を依頼する。 When receiving the HTTP message, the Web service I / F 26 interprets the SOAP message by the dispatcher, and then requests the search object 240 to perform processing.

ｓｅａｒｃｈのオブジェクト２４０は、ＵＴＦ−８で符号化された文字列（検索を行う文書名）を受信すると、その文字列をキー情報としてキャッシュ２５０から該当する文書名を検索して照合する。オブジェクト２４０は文字列の照合処理が終了すると、ＷｅｂサービスＩ／Ｆ２６に処理の終了を通知する。 When the search object 240 receives a character string (document name to be searched) encoded by UTF-8, the search object 240 searches the corresponding document name from the cache 250 using the character string as key information and collates it. When the character string collation process ends, the object 240 notifies the Web service I / F 26 of the end of the process.

ステップＳ２４では、ＷｅｂサービスＩ／Ｆ２６が、文書名の検索応答を行うＳＯＡＰメッセージをＸＭＬ形式で記述し、そのＳＯＡＰメッセージをＨＴＴＰメッセージに格納させてクライアント２００に送信する。 In step S 24, the Web service I / F 26 describes a SOAP message for performing a document name search response in the XML format, stores the SOAP message in an HTTP message, and transmits it to the client 200.

以上、本発明による第２実施例の文字列処理方法によれば、変換後の文字列を照合するときに、符号化方式の違いによる変換後の文字列の差異を吸収することができる。また、キャッシュ２５０を利用することにより、変換後の文字列を照合するときに、いちいちＭＣＳ３６に問い合わせる必要がない。更に、クライアント２００からの文書名の設定の際には内部処理で利用する符号化方式に変換する前の文字列をキャッシュ２５０にまず格納する方式を取っておけば、クライアント２００からの文書名の検索の際には不明文字を含まない変換前の文字列同士の照合となるので、不明文字をワイルドカード扱いとするよりも検索の精度が高いものとなる。 As described above, according to the character string processing method of the second embodiment of the present invention, when the converted character strings are collated, the difference in the converted character strings due to the difference in the encoding method can be absorbed. Further, by using the cache 250, it is not necessary to inquire the MCS 36 every time when the converted character string is collated. Further, when a document name is set from the client 200, if a method of first storing the character string before conversion into the encoding method used in the internal processing in the cache 250, the document name from the client 200 is changed. When searching, since character strings before conversion that do not include unknown characters are collated, the accuracy of the search is higher than when unknown characters are treated as wild cards.

なお、図１７ではｓｅａｒｃｈのオブジェクト２４０をＷｅｂサービスＩ／Ｆ２６に設けた例について説明したが、ＭＣＳ３６に設けてもよい。図２１は、文字列の照合処理について説明するための一例のフローチャートである。 Note that although an example in which the search object 240 is provided in the Web service I / F 26 has been described with reference to FIG. 17, it may be provided in the MCS 36. FIG. 21 is a flowchart illustrating an example of character string matching processing.

クライアント２００から上記の文字列の符号化方式の変換を経て文書名の変更または文書の作成に伴う文書名の設定を指示するか、操作者がオペレーションパネル７０を操作して文書名の変更または文書の作成を指示すると、融合機１のＯＣＳ３３はステップＳ２１に進み、文書名の変更要求または文書の作成要求をＭＣＳ３６に送信する。 The client 200 instructs the document name change or the document name setting accompanying the creation of the document through the conversion of the character string encoding method, or the operator operates the operation panel 70 to change the document name or the document. The OCS 33 of the MFP 1 proceeds to step S21, and transmits a document name change request or a document creation request to the MCS 36.

ステップＳ２１に続いてステップＳ２２に進み、ＭＣＳ３６は文書名の変更要求または文書の作成要求を受信し、その文書名の変更要求または文書の作成要求に応じて図１４のような書誌情報データを変更する。そして、ＭＣＳ３６は文書名の照合要求に応じたｓｅａｒｃｈのオブジェクト２４０に処理を依頼する。 In step S22 following step S21, the MCS 36 receives a document name change request or document creation request, and changes the bibliographic information data as shown in FIG. 14 in response to the document name change request or document creation request. To do. Then, the MCS 36 requests the search object 240 in response to the document name collation request.

オブジェクト２４０は図１９のような符号化処理を行い、ＵＴＦ−８に変換された文書名とキャッシュ２５０から検索された文書名とが一致とみなせないと判定したときに、ＵＴＦ−８に変換された文書名を新しい文書名としてキャッシュ２５０に格納する。 The object 240 performs the encoding process as shown in FIG. 19, and when it is determined that the document name converted to UTF-8 and the document name retrieved from the cache 250 cannot be regarded as a match, the object 240 is converted to UTF-8. The stored document name is stored in the cache 250 as a new document name.

ステップＳ４２では、クライアント２００が、ＸＭＬ形式で記述した図２０のようなＳＯＡＰメッセージを作成し、そのＳＯＡＰメッセージをＨＴＴＰメッセージに格納させて融合機１に送信する。クライアント２００から送信されたＨＴＴＰメッセージは、ＮＣＳ３１のｈｔｔｐｄ２２０を介してＷｅｂサービスＩ／Ｆ２６に供給される。 In step S42, the client 200 creates a SOAP message as shown in FIG. 20 described in the XML format, stores the SOAP message in an HTTP message, and transmits it to the multi-function apparatus 1. The HTTP message transmitted from the client 200 is supplied to the Web service I / F 26 via the httpd 220 of the NCS 31.

ＨＴＴＰメッセージを受信すると、ＷｅｂサービスＩ／Ｆ２６はディスパッチャによりＳＯＡＰメッセージを解釈したあと、ステップＳ４３で文書名の検索要求をＭＣＳ３６に送信する。ＭＣＳ３６は文書名の検索要求を受信すると、ｓｅａｒｃｈのオブジェクト２４０に処理を依頼する。 When the HTTP message is received, the Web service I / F 26 interprets the SOAP message by the dispatcher, and then transmits a document name search request to the MCS 36 in step S43. Upon receiving the document name search request, the MCS 36 requests the search object 240 to perform processing.

ｓｅａｒｃｈのオブジェクト２４０は、ＵＴＦ−８で符号化された文字列（検索を行う文書名）を受信すると、その文字列をキー情報としてキャッシュ２５０から該当する文書名を検索して照合する。オブジェクト２４０は文字列の照合処理が終了すると、ＭＣＳ３６に処理の終了を通知する。 When the search object 240 receives a character string (document name to be searched) encoded by UTF-8, the search object 240 searches the corresponding document name from the cache 250 using the character string as key information and collates it. When the character string collation process ends, the object 240 notifies the MCS 36 of the end of the process.

ステップＳ４４では、ＭＣＳ３６が、文書名の検索結果をＷｅｂサービスＩ／Ｆ２６に送信する。ステップＳ４５では、ＷｅｂサービスＩ／Ｆ２６が、文書名の検索応答を行うＳＯＡＰメッセージをＸＭＬ形式で記述し、そのＳＯＡＰメッセージをＨＴＴＰメッセージに格納させてクライアント２００に送信する。 In step S44, the MCS 36 transmits the search result of the document name to the web service I / F 26. In step S 45, the Web service I / F 26 describes a SOAP message that makes a document name search response in XML format, stores the SOAP message in an HTTP message, and transmits the SOAP message to the client 200.

以上、本発明による第２実施例の文字列処理方法によれば、変換後の文字列を照合するときに、符号化方式の違いによる変換後の文字列の差異を吸収することができる。 As described above, according to the character string processing method of the second embodiment of the present invention, when the converted character strings are collated, the difference in the converted character strings due to the difference in the encoding method can be absorbed.

Ｓｅａｒｃｈのオブジェクト２４０をＷｅｂサービスＩ／Ｆ２６に設けた場合のメリットは、第２実施例の文字列処理方法が、ネットワーク２１０に接続しない融合機に不要な機能であるため、Ｗｅｂサービス対応のためのオプションという構成にできることである。また、Ｓｅａｒｃｈのオブジェクト２４０をＭＣＳ３６に設けた場合のメリットは、他のＩ／Ｆ（例えばプリンタＩ／Ｆなど）でも利用できることである。 The advantage of providing the Search object 240 in the Web service I / F 26 is that the character string processing method of the second embodiment is an unnecessary function for a multi-function peripheral that is not connected to the network 210. It can be configured as an option. An advantage of providing the Search object 240 in the MCS 36 is that it can also be used in other I / Fs (for example, a printer I / F).

前述した第１実施例の融合機１ではキャッシュ２５０を有していないが、図１７の融合機１と同様に、ＷｅｂサービスＩ／Ｆ２６またはＭＣＳ３６が利用するキャッシュ２５０を備えるようにしてもよい。 Although the multi-function apparatus 1 of the first embodiment does not have the cache 250, it may be provided with the cache 250 used by the Web service I / F 26 or the MCS 36 as in the multi-function apparatus 1 of FIG.

第１実施例の融合機１でキャッシュ２５０を備えることにより、変換前の文字列をキャッシュ２５０に格納しておくことができる。このように、変換前の文字列をキャッシュ２５０に格納しておくことにより、変換前の文字列を文字列の照合処理に利用することができ、文字列の照合処理の精度を上げることができる。 By providing the cache 250 in the compound machine 1 of the first embodiment, the character string before conversion can be stored in the cache 250. By storing the character string before conversion in the cache 250 in this way, the character string before conversion can be used for character string matching processing, and the accuracy of character string matching processing can be improved. .

上記の実施例１では、符号化方式の優先順位が固定されている例を説明したが、様々な条件に応じて符号化方式の優先順位が変化するようにしてもよい。そこで、実施例３では様々な条件に応じて符号化方式の優先順位が変化する例を説明する。 In the first embodiment, the example in which the priority order of the encoding scheme is fixed has been described. However, the priority order of the encoding scheme may be changed according to various conditions. Therefore, in the third embodiment, an example in which the priority order of the encoding scheme changes according to various conditions will be described.

図２２は、本発明による第３実施例の文字列処理方法について説明するための融合機の構成図である。図２３は、本発明による第３実施例の文字列処理方法について説明するためのシーケンス図である。なお、図２２及び図２３では説明に必要のない構成を省略している。また、図２２の構成図は図１０の構成図と一部を除いて同様であるため、適宜説明を省略する。 FIG. 22 is a block diagram of a compound machine for explaining a character string processing method according to a third embodiment of the present invention. FIG. 23 is a sequence diagram for explaining the character string processing method according to the third embodiment of the present invention. In FIG. 22 and FIG. 23, configurations that are not necessary for description are omitted. The configuration diagram of FIG. 22 is the same as the configuration diagram of FIG.

ステップＳ５１では、クライアント２００のブラウザ２０１が、ＵＲＬを指定してｈｔｔｐリクエストを行う。例えばステップＳ５１で指定されるＵＲＬは、「ｈｔｔｐ：／／１１１．２２２．３３．４４／ｊａ／ｄｏｃｕｍｅｎｔｂｏｘ／」等である。このＵＲＬには、ブラウザ２０１の表示言語が日本語である例であり、ＵＲＬ中の「ｊａ」で表示言語が日本語に指定されている。ブラウザ２０１から送信されたＨＴＴＰリクエストは、融合機１のＮＣＳ３１に含まれるｈｔｔｐｄ２２０に供給される。 In step S51, the browser 201 of the client 200 makes an http request specifying a URL. For example, the URL specified in step S51 is “http://111.222.33.44/ja/documentbox/” or the like. This URL is an example in which the display language of the browser 201 is Japanese, and “ja” in the URL designates the display language as Japanese. The HTTP request transmitted from the browser 201 is supplied to the httpd 220 included in the NCS 31 of the MFP 1.

ステップＳ５２に進み、ｈｔｔｐｄ２２０は供給されたｈｔｔｐリクエストをＷｅｂアプリ２７に供給する。ｈｔｔｐリクエストを受信すると、Ｗｅｂアプリ２７はｈｔｔｐリクエストで指定されているＵＲＬからブラウザ２０１の表示言語を特定し、その表示言語に応じた画面データを生成する。 In step S52, the httpd 220 supplies the supplied http request to the Web application 27. When receiving the http request, the Web application 27 specifies the display language of the browser 201 from the URL specified in the http request, and generates screen data corresponding to the display language.

ステップＳ５３に進み、Ｗｅｂアプリ２７は生成した画面データをｈｔｔｐｄ２２０に供給する。ステップＳ５４に進み、ｈｔｔｐｄ２２０は供給された画面データをクライアント２００のブラウザ２０１に送信する。 In step S53, the web application 27 supplies the generated screen data to the httpd 220. In step S54, the httpd 220 transmits the supplied screen data to the browser 201 of the client 200.

画面データを受信すると、ブラウザ２０１はステップＳ５５に進み、画面データに応じた画面を表示する。そして、ユーザはブラウザ２０１に表示された画面に文字列を入力する。ステップＳ５６に進み、クライアント２００のブラウザ２０１はステップＳ５５で画面に入力された文字列を含むｈｔｔｐリクエストを行う。ブラウザ２０１から送信されたｈｔｔｐリクエストは、融合機１のｈｔｔｐｄ２２０に供給される。 When the screen data is received, the browser 201 proceeds to step S55, and displays a screen corresponding to the screen data. Then, the user inputs a character string on the screen displayed on the browser 201. In step S56, the browser 201 of the client 200 makes an http request including the character string input on the screen in step S55. The http request transmitted from the browser 201 is supplied to the httpd 220 of the MFP 1.

ステップＳ５７に進み、ｈｔｔｐｄ２２０は供給されたｈｔｔｐリクエストをＷｅｂアプリ２７に供給する。ｈｔｔｐリクエストを受信すると、Ｗｅｂアプリ２７はステップＳ５８に進み、ｈｔｔｐリクエストに含まれる文字列を取得してｓｅｔＴｉｔｌｅのオブジェクト２３０に処理を依頼する。 In step S57, the httpd 220 supplies the supplied http request to the Web application 27. When receiving the http request, the web application 27 proceeds to step S58, obtains a character string included in the http request, and requests the setTitle object 230 to perform processing.

まず、ｓｅｔＴｉｔｌｅのオブジェクト２３０は、内部処理で利用する符号化方式以外の文字列を受信した場合、その文字列を内部処理で利用する符号化方式に変換する必要がある。そこで、オブジェクト２３０は文字列の符号化方式を変換する必要があるかを判定し、必要があれば前述した図１２のようなフローチャートで表す処理を実行する。 First, when the setTitle object 230 receives a character string other than the encoding method used in the internal processing, it is necessary to convert the character string into an encoding method used in the internal processing. Therefore, the object 230 determines whether or not the character string encoding method needs to be converted, and if necessary, executes the processing represented by the flowchart shown in FIG.

なお、実施例３の場合、図１２のステップＳ１１の処理が実施例１の場合と異なる。オブジェクト２３０は、符号化方式の優先順位を次のように決定し、その優先順位に従って符号化方式を選択する。優先順位の決定は、次のように行う。まず、ブラウザ２０１の表示言語に対応する符号化方式は、上位の優先順位とする。オペレーションパネル７０の表示言語に対応する符号化方式は、中位の優先順位とする。そして、それ以外の符号化方式は、下位の優先順位とする。この場合の優先順位は、ブラウザ２０１の表示言語に対応する符号化方式、オペレーションパネル７０の表示言語に対応する符号化方式、それ以外の符号化方式となる。 In the case of the third embodiment, the process in step S11 in FIG. 12 is different from that in the first embodiment. The object 230 determines the priority of the encoding method as follows, and selects the encoding method according to the priority. The priority order is determined as follows. First, the encoding method corresponding to the display language of the browser 201 is given a higher priority. The encoding method corresponding to the display language of the operation panel 70 has a medium priority. The other encoding methods have lower priority. In this case, the priority order is an encoding method corresponding to the display language of the browser 201, an encoding method corresponding to the display language of the operation panel 70, and other encoding methods.

ステップＳ１１以外の処理は、実施例１の場合と同様であるため説明を省略する。図１２のフローチャートにより最適な符号化方式を選択したあと、オブジェクト２３０は選択した符号化方式で文字列を符号化する。なお、オブジェクト２３０は、符号化できない文字を代用文字に置き換える。 Since the processes other than step S11 are the same as those in the first embodiment, description thereof is omitted. After selecting the optimum encoding method according to the flowchart of FIG. 12, the object 230 encodes the character string using the selected encoding method. The object 230 replaces characters that cannot be encoded with substitute characters.

そして、ステップＳ５９に進み、Ｗｅｂアプリ２７はステップＳ５７のｈｔｔｐリクエストに対応する画面データを作成し、その画面データをｈｔｔｐｄ２２０に供給する。ステップＳ５４に進み、ｈｔｔｐｄ２２０は供給された画面データをクライアント２００のブラウザ２０１に送信する。 In step S59, the web application 27 creates screen data corresponding to the http request in step S57, and supplies the screen data to the httpd 220. In step S54, the httpd 220 transmits the supplied screen data to the browser 201 of the client 200.

以上、本発明による第３実施例の文字列処理方法によれば、一の符号化方式で符号化された文字列を他の符号化方式に変換するときに、様々な条件に応じて符号化方式の優先順位を変化させることができるので、適切な符号化方式を選択することができる。 As described above, according to the character string processing method of the third embodiment of the present invention, when a character string encoded by one encoding method is converted into another encoding method, encoding is performed according to various conditions. Since the priority order of the methods can be changed, an appropriate encoding method can be selected.

なお、図２２ではｓｅｔＴｉｔｌｅのオブジェクト２３０をＷｅｂアプリ２７に設けた例について説明したが、ＭＣＳ３６に設けてもよい。ｓｅｔＴｉｔｌｅのオブジェクト２３０をＷｅｂアプリ２７に設けた場合のメリットは、実施例３の文字列処理方法が、ネットワーク２１０に接続しない融合機に不要な機能であるため、Ｗｅｂアプリ２７のオプションという構成にできることである。また、ｓｅｔＴｉｔｌｅのオブジェクト２３０をＭＣＳ３６に設けた場合のメリットは、他のアプリ（例えばプリンタアプリなど）でも利用できることである。 In addition, although the example which provided the object 230 of setTitle in the web application 27 was demonstrated in FIG. 22, you may provide in MCS36. The merit when the setTitle object 230 is provided in the Web application 27 is that the character string processing method of the third embodiment is a function that is unnecessary for a multi-function apparatus that is not connected to the network 210, and therefore can be configured as an option of the Web application 27. It is. An advantage of providing the setTitle object 230 in the MCS 36 is that it can be used by other applications (for example, a printer application).

本発明は、具体的に開示された実施例に限定されるものではなく、特許請求の範囲から逸脱することなく、種々の変形や変更が可能である。 The present invention is not limited to the specifically disclosed embodiments, and various modifications and changes can be made without departing from the scope of the claims.

１融合機
２ソフトウェア群
３融合機起動部
４ハードウェア資源
５アプリケーション層
６プラットフォーム
９コントロールサービス層
１０ハンドラ層
１１白黒レーザプリンタ（Ｂ＆ＷＬＰ）
１２カラーレーザプリンタ（ＣｏｌｏｒＬＰ）
１３ハードウェアリソース
２１プリンタアプリ
２２コピーアプリ
２３ファックスアプリ
２４スキャナアプリ
２５ネットファイルアプリ
２６ＷｅｂサービスＩ／Ｆ
２７Ｗｅｂアプリ
３１ネットワークコントロールサービス（ＮＣＳ）
３２デリバリーコントロールサービス（ＤＣＳ）
３３オペレーションパネルコントロールサービス（ＯＣＳ）
３４ファックスコントロールサービス（ＦＣＳ）
３５エンジンコントロールサービス（ＥＣＳ）
３６メモリコントロールサービス（ＭＣＳ）
３７ユーザインフォメーションコントロールサービス（ＵＣＳ）
３８システムコントロールサービス（ＳＣＳ）
３９システムリソースマネージャ（ＳＲＭ）
４０ファックスコントロールユニットハンドラ（ＦＣＵＨ）
４１イメージメモリハンドラ（ＩＭＨ）
５１アプリケーションプログラムインターフェース（ＡＰＩ）
５２エンジンＩ／Ｆ
６０コントローラ
６１ＣＰＵ
６２システムメモリ
６３ノースブリッジ（ＮＢ）
６４サウスブリッジ（ＳＢ）
６５ＡＧＰ（Accelerated Graphics Port）
６６ＡＳＩＣ
６７ローカルメモリ
６８ハードディスク装置（ＨＤＤ）
６９ネットワークＩ／Ｆコントローラ
７０オペレーションパネル
８０ファックスコントロールユニット（ＦＣＵ）
９０ＵＳＢデバイス
１００ＩＥＥＥ１３９４デバイス
１１０エンジン部
２００クライアント
２０１ブラウザ
２１０ネットワーク
２２０ｈｔｔｐｄ（HyperText Transfer Protocol Daemon）
２３０ｓｅｔＴｉｔｌｅのオブジェクト
２４０Ｓｅａｒｃｈのオブジェクト DESCRIPTION OF SYMBOLS 1 Fusion machine 2 Software group 3 Fusion machine starting part 4 Hardware resource 5 Application layer 6 Platform 9 Control service layer 10 Handler layer 11 Black-and-white laser printer (B & W LP)
12 Color laser printer (Color LP)
13 Hardware Resource 21 Printer Application 22 Copy Application 23 Fax Application 24 Scanner Application 25 Net File Application 26 Web Service I / F
27 Web application 31 Network control service (NCS)
32 Delivery Control Service (DCS)
33 Operation Panel Control Service (OCS)
34 Fax Control Service (FCS)
35 Engine Control Service (ECS)
36 Memory Control Service (MCS)
37 User Information Control Service (UCS)
38 System Control Service (SCS)
39 System Resource Manager (SRM)
40 Fax Control Unit Handler (FCUH)
41 Image memory handler (IMH)
51 Application Program Interface (API)
52 Engine I / F
60 controller 61 CPU
62 System memory 63 North Bridge (NB)
64 South Bridge (SB)
65 AGP (Accelerated Graphics Port)
66 ASIC
67 Local memory 68 Hard disk drive (HDD)
69 Network I / F controller 70 Operation panel 80 Fax control unit (FCU)
90 USB device 100 IEEE 1394 device 110 Engine unit 200 Client 201 Browser 210 Network 220 httpd (HyperText Transfer Protocol Daemon)
230 SetTitle object 240 Search object

特表平１１−５１２５４３号Special table Hei 11-512543

Claims

In a character string processing device that collates a character string encoded by an internal encoding method used in internal processing, using a character string encoded by one encoding method,
When the character string encoded by the internal encoding method is converted to the one encoding method, and the converted character string is collated using the character string encoded by the one encoding method, A character string matching means that handles the substitute code included in the converted character string as a character having a role of representing all characters ,
Storage means for storing a character string encoded by the one encoding method before being encoded by the inner encoding method;
Have
The character string collating means stores the character string encoded by the internal encoding method using the character string encoded by the one encoding method and stores it in the storage means. The character string encoded by the one encoding method before being encoded by the inner encoding method and the character string encoded by the one encoding method converted from the character string encoded by the inner encoding method If the encoded character string cannot be regarded as a match, the character string encoded by the one encoding method converted from the character string encoded by the inner encoding method is stored in the storage means. character string processing apparatus characterized by having a that.

Of the characters the encoded with one encoding method, the character string processing apparatus according to claim 1, further comprising a table set an example in which assign different character codes to the same character.

3. The character according to claim 1, wherein the substitute code is a character code in which a character string encoded by the internal encoding method is replaced with a character that cannot be converted to the one encoding method. Column processing unit.

An image forming apparatus having a character string collating function for collating a character string encoded by an internal encoding method used in internal processing using a character string encoded by one encoding method,
When the character string encoded by the internal encoding method is converted to the one encoding method, and the converted character string is collated using the character string encoded by the one encoding method, A character string matching means that handles the substitute code included in the converted character string as a character having a role of representing all characters ,
Storage means for storing a character string encoded by the one encoding method before being encoded by the inner encoding method;
Have
The character string collating means stores the character string encoded by the internal encoding method using the character string encoded by the one encoding method and stores it in the storage means. The character string encoded by the one encoding method before being encoded by the inner encoding method and the character string encoded by the one encoding method converted from the character string encoded by the inner encoding method If the encoded character string cannot be regarded as a match, the character string encoded by the one encoding method converted from the character string encoded by the inner encoding method is stored in the storage means. an image forming apparatus wherein a that.

5. The image forming apparatus according to claim 4, further comprising a table in which an example in which different character codes are assigned to the same character among characters encoded by the one encoding method is set.

6. The image forming apparatus according to claim 4 , wherein the substitute code is a character code in which a character that cannot be converted into the one encoding method or another encoding method is replaced.