JP2005175627A

JP2005175627A - System for taking proceedings

Info

Publication number: JP2005175627A
Application number: JP2003409479A
Authority: JP
Inventors: Yoshiro Aoyanagi; 好郎青柳
Original assignee: Fuji Photo Film Co Ltd
Current assignee: Fujifilm Holdings Corp
Priority date: 2003-12-08
Filing date: 2003-12-08
Publication date: 2005-06-30

Abstract

<P>PROBLEM TO BE SOLVED: To provide a simple system for taking proceedings available at any place. <P>SOLUTION: The system 2 for taking proceedings comprises a digital camera 10 for picking up the image of a white board 21 on which the content of a meeting is written, outputting digital image data, recording the speech of participants at the meeting and delivering digital voice data, a portable telephone 11 for transmitting the image data and the voice data to the outside, and a server 12 comprising a unit 91 for recognizing characters in the image data and converting the characters into first text data, a unit 92 for recognizing voice in the voice data and converting it into second text data, a unit 93 for editing the first and second text data automatically to make a file of proceedings, and a unit 95 for downloading the file of proceedings to the personal computer 15 of a client through the Internet 14. <P>COPYRIGHT: (C)2005,JPO&NCIPI

Description

本発明は、自動的に議事録ファイルを作成する議事録作成システムに関する。 The present invention relates to a minutes creation system that automatically creates a minutes file.

従来、議事録は、会議の参加者がメモを取っておき、会議終了後、メモと会議の内容とを照らし合わせながら作成していた。このため、議事録を作成する参加者は、メモを取ることに専念するあまり、議論に参加することができないという不都合が生じていた。また、会議の内容を思い出しながらメモを整理するという煩雑な作業を伴っていた。 Conventionally, minutes of meetings have been created by participants taking notes, and after the meeting, the notes are checked against the contents of the meeting. For this reason, the participants who create the minutes have been inconvenient because they cannot concentrate on taking notes because they are devoted to taking notes. In addition, it involved a complicated task of organizing notes while remembering the contents of the meeting.

上記のような問題を解決するために、会議の参加者の映像を撮像するカメラと、参加者の発言から参加者の位置情報を特定する音声処理手段と、この位置情報に基づいて、参加者を示す映像のアイコン、および発言内容を示すアイコンからなるノードを発言内容単位で作成し、このノードをパーソナルコンピュータのモニタに表示するように制御する制御手段とを備えた議事進行支援システムが提案されている（特許文献１参照）。 In order to solve the above problems, a camera that captures a video of a participant in a conference, audio processing means for identifying the participant's location information from the participant's remarks, and the participant based on this location information A proposal proceeding support system comprising a control means for creating a node composed of an icon of a video indicating an icon and an icon indicating the content of a speech in units of speech content and controlling the node to be displayed on a monitor of a personal computer has been proposed. (See Patent Document 1).

特許文献１に記載の議事進行支援システムによれば、いつ、誰が、誰に対して、何を発言したかをアイコンベースの簡単なインターフェースで入力することが可能なので、議事録作成の手間を軽減することができる。また、議事の進行状況を一見して把握することが可能となる。さらに、作成された議事録は、離れた地点にいる多くの参加者が共有可能なマルチメディア議事録として利用することができる。 According to the agenda progress support system described in Patent Document 1, it is possible to input what and who has spoken to whom with a simple icon-based interface, thus reducing the trouble of creating minutes. can do. It is also possible to grasp the progress of proceedings at a glance. Furthermore, the created minutes can be used as multimedia minutes that can be shared by many participants at remote locations.

国際公開第９６／２７９８８号パンフレットInternational Publication No. 96/27988 Pamphlet

しかしながら、特許文献１に記載の議事進行支援システムには、カメラ、カメラ制御装置、映像切替え装置、マイク、音源推定装置、音声切替え装置、音声認識装置、主制御装置など、非常に多くの装置が必要であり、システム構成が大掛かりなものとなるため、利用可能な場所が限定されるという問題があった。 However, the proceeding support system described in Patent Document 1 includes a large number of devices such as a camera, a camera control device, a video switching device, a microphone, a sound source estimation device, a voice switching device, a voice recognition device, and a main control device. Since it is necessary and the system configuration becomes large, there is a problem that the available places are limited.

本発明は、上記課題を鑑みてなされたものであり、簡単なシステム構成で、場所を選ばずに利用することができる議事録作成システムを提供することを目的とする。 The present invention has been made in view of the above problems, and an object of the present invention is to provide a minutes creation system that can be used without choosing a place with a simple system configuration.

上記目的を達成するために、本発明の議事録作成システムは、会議室に設置された表示機器の表示エリアを撮像して、デジタルの画像データを出力するとともに、会議の参加者の発言を収録して、デジタルの音声データを出力するデジタルカメラと、前記デジタルカメラに接続して、前記画像データおよび音声データを外部に送信する通信装置と、前記通信装置から送信される前記画像データ内の文字を文字認識して、第１のテキストデータに変換する文字認識装置と、前記通信装置から送信される前記音声データを音声認識して、第２のテキストデータに変換する音声認識装置と、前記第１および第２のテキストデータを自動編集して、議事録ファイルを作成する編集装置と、前記議事録ファイルを通信ネットワーク経由で顧客に配信する配信装置とから構成したことを特徴とする。 In order to achieve the above object, the minutes creation system of the present invention images the display area of a display device installed in a conference room, outputs digital image data, and records the comments of participants in the conference A digital camera that outputs digital audio data; a communication device that is connected to the digital camera and transmits the image data and audio data to the outside; and characters in the image data transmitted from the communication device A character recognition device that recognizes the character and converts it into first text data, a speech recognition device that recognizes the speech data transmitted from the communication device and converts it into second text data, and the first An editing apparatus that automatically edits the first and second text data to create a minutes file, and distributes the minutes file to customers via a communication network Characterized by being composed of a communication apparatus.

なお、前記文字認識装置、音声認識装置、編集装置、および配信装置は、前記通信装置に通信ネットワークを介して接続されたサーバ内に設けられていることが好ましい。また、前記デジタルカメラは、前記顧客により音声入力された操作命令を音声認識する音声認識手段と、音声認識した操作命令に応じた処理を実行させるべく各部を制御する制御手段とを備えることが好ましい。さらに、前記議事録ファイルの配信後、予め登録されている前記顧客の銀行口座から、システム利用料を自動的に引き落とす自動決済装置を備えることが好ましい。 The character recognition device, the speech recognition device, the editing device, and the distribution device are preferably provided in a server connected to the communication device via a communication network. The digital camera preferably includes voice recognition means for recognizing an operation command inputted by voice from the customer, and control means for controlling each unit to execute processing according to the operation command recognized by voice. . Furthermore, it is preferable to provide an automatic settlement apparatus that automatically withdraws the system usage fee from the bank account of the customer registered in advance after the delivery of the minutes file.

本発明の議事録作成システムによれば、会議室に設置された表示機器の表示エリアを撮像して、デジタルの画像データを出力するとともに、会議の参加者の発言を収録して、デジタルの音声データを出力するデジタルカメラと、デジタルカメラに接続して、画像データおよび音声データを外部に送信する通信装置と、通信装置から送信される画像データ内の文字を文字認識して、第１のテキストデータに変換する文字認識装置と、通信装置から送信される音声データを音声認識して、第２のテキストデータに変換する音声認識装置と、第１および第２のテキストデータを自動編集して、議事録ファイルを作成する編集装置と、議事録ファイルを通信ネットワーク経由で顧客に配信する配信装置とから構成したので、簡単なシステム構成で、場所を選ばずに利用することができる。また、会議の内容を網羅した議事録を、汎用性のあるデータファイルの形で、会議終了後即座に入手することができる。 According to the minutes creation system of the present invention, the display area of the display device installed in the conference room is imaged, digital image data is output, and the speech of the conference participant is recorded, and the digital audio is recorded. A digital camera that outputs data, a communication device that is connected to the digital camera and transmits image data and audio data to the outside, a character in the image data transmitted from the communication device is recognized, and the first text A character recognition device that converts data, a speech recognition device that recognizes speech data transmitted from the communication device and converts the speech data into second text data, and automatically edits the first and second text data, Since it consists of an editing device that creates the minutes file and a distribution device that distributes the minutes file to the customers via the communication network, It can be utilized to not selected. In addition, the minutes covering the contents of the meeting can be obtained immediately after the meeting in the form of a versatile data file.

図１において、本発明の議事録作成システム２は、デジタルカメラ１０、携帯電話１１、およびサーバ１２から構成される。携帯電話１１は、電話回線網１３を介してサーバ１２に接続している。また、サーバ１２は、インターネット１４を介して顧客２０（図２参照）のパーソナルコンピュータ（ＰＣ）１５に接続している。 In FIG. 1, the minutes creation system 2 of the present invention includes a digital camera 10, a mobile phone 11, and a server 12. The cellular phone 11 is connected to the server 12 via the telephone line network 13. The server 12 is connected to a personal computer (PC) 15 of the customer 20 (see FIG. 2) via the Internet 14.

議事録作成システム２は、デジタルカメラ１０で取得した画像データおよび音声データを、携帯電話１１を介してサーバ１２に送信し、送信された画像データおよび音声データを元に、サーバ１２で議事録ファイルを作成して、この議事録ファイルをインターネット１４経由で顧客２０のＰＣ１５に配信するものである。 The minutes creation system 2 transmits the image data and sound data acquired by the digital camera 10 to the server 12 via the mobile phone 11, and the minutes file is sent by the server 12 based on the transmitted image data and sound data. And the minutes file is distributed to the PC 15 of the customer 20 via the Internet 14.

図２は、議事録作成システム２を利用して会議を進行している様子を示している。顧客２０は、ホワイトボード２１全体が撮影可能な場所にデジタルカメラ１０を設置して、デジタルカメラ１０に携帯電話１１を接続し、ＰＣ１５をインターネット１４に接続する。そして、発表者２２がホワイトボード２１に書いた会議の内容をデジタルカメラ１０で撮影するとともに、顧客２０自身や、発表者２２、他の参加者２３の発言を収録する。 FIG. 2 shows a state in which a meeting is progressing using the minutes creation system 2. The customer 20 installs the digital camera 10 in a place where the entire whiteboard 21 can be photographed, connects the mobile phone 11 to the digital camera 10, and connects the PC 15 to the Internet 14. Then, the content of the conference written by the presenter 22 on the whiteboard 21 is photographed by the digital camera 10, and the statements of the customer 20, the presenter 22, and other participants 23 are recorded.

図３および図４において、デジタルカメラ１０の前面には、撮像レンズ３０、ファインダ対物窓３１、およびマイクロホン３２が設けられている。また、上面には、レリーズボタン３３が設けられ、側面には、メモリカード５６（図５参照）が着脱自在に装填されるメモリカードスロット３４、およびコネクタ３５（例えばＵＳＢコネクタ）が設けられている。さらに、背面には、ファインダ接眼窓３６、液晶表示器（ＬＣＤ）３７、および操作部３８が設けられている。 3 and 4, an imaging lens 30, a viewfinder objective window 31, and a microphone 32 are provided on the front surface of the digital camera 10. A release button 33 is provided on the upper surface, and a memory card slot 34 into which a memory card 56 (see FIG. 5) is detachably loaded and a connector 35 (for example, a USB connector) are provided on the side surface. . Further, a finder eyepiece window 36, a liquid crystal display (LCD) 37, and an operation unit 38 are provided on the back surface.

レリーズボタン３３は、２段階押しのスイッチとなっている。ファインダまたはＬＣＤ３７によるフレーミングの後に、レリーズボタン３３を軽く押圧（半押し）すると、ＣＰＵ４３（図５参照）で自動露光調整、自動焦点調整などの各種撮影準備処理が施される。この状態でレリーズボタン３３をもう１度強く押圧（全押し）すると、撮影準備処理が施された１画面分の撮像信号が画像データに変換された後、後述する画像処理および圧縮処理が施され、メモリカード５６に記録される。 The release button 33 is a two-stage push switch. When the release button 33 is lightly pressed (half-pressed) after framing by the viewfinder or the LCD 37, the CPU 43 (see FIG. 5) performs various photographing preparation processes such as automatic exposure adjustment and automatic focus adjustment. In this state, when the release button 33 is pressed again (fully pressed) once, the imaging signal for one screen subjected to the imaging preparation process is converted into image data, and then image processing and compression processing described later are performed. Recorded in the memory card 56.

デジタルカメラ１０では、静止画撮影を行う静止画撮影モード、動画撮影を行う動画撮影モード、撮影した画像をＬＣＤ３７に表示する再生モード、各種設定を行う設定モード、および静止画撮影とともに、常時マイクロホン３２で音声を収録する議事録作成モードが選択可能となっている。 In the digital camera 10, the microphone 32 is always used together with a still image shooting mode for shooting still images, a moving image shooting mode for shooting movies, a playback mode for displaying captured images on the LCD 37, a setting mode for setting various settings, and still image shooting. The minutes creation mode for recording audio can be selected.

また、デジタルカメラ１０は、マイクロホン３２を介して、カメラの操作命令を音声で受け付ける音声操作機能を備えている。この音声操作機能で扱われる操作命令には、撮影の指示、モード選択などの基本命令の他に、議事録作成モードで使用される会議開始／終了、音声収録開始／終了、データ送信などがある。 The digital camera 10 also has a voice operation function for receiving camera operation commands by voice via the microphone 32. In addition to basic commands such as shooting instructions and mode selection, the operation commands handled by this voice operation function include conference start / end, audio recording start / end, and data transmission used in the minutes creation mode. .

デジタルカメラ１０の電気的構成を示す図５において、撮像レンズ３０および絞り４０には、レンズモータ４１およびアイリスモータ４２が接続されている。これらのモータ４１、４２はステッピングモータからなり、ＣＰＵ４３に接続されたモータドライバ４４、４５から送信される駆動パルスにより動作制御され、レリーズボタン３３の半押しに伴う撮影準備処理を行う。 In FIG. 5 showing the electrical configuration of the digital camera 10, a lens motor 41 and an iris motor 42 are connected to the imaging lens 30 and the diaphragm 40. These motors 41 and 42 are stepping motors, which are controlled in operation by drive pulses transmitted from motor drivers 44 and 45 connected to the CPU 43, and perform photographing preparation processing in response to half-pressing of the release button 33.

レンズモータ４１は、操作部３８に設けられたズーム操作ボタンの操作に連動して、撮像レンズ３０のズームレンズをワイド側、あるいはテレ側に移動させ、撮像レンズ３０のズーミングを行う。また、被写体距離やズームレンズの変倍に応じて撮像レンズ３０のフォーカスレンズを移動させ、撮影条件が最適となるように撮像レンズ３０の焦点調整を行う。アイリスモータ４２は、絞り４０を動作させ、撮像レンズ３０の露出調整を行う。 The lens motor 41 performs zooming of the imaging lens 30 by moving the zoom lens of the imaging lens 30 to the wide side or the tele side in conjunction with the operation of the zoom operation button provided on the operation unit 38. Further, the focus lens of the imaging lens 30 is moved in accordance with the subject distance and zoom lens magnification, and the focus of the imaging lens 30 is adjusted so that the shooting conditions are optimized. The iris motor 42 operates the aperture 40 to adjust the exposure of the imaging lens 30.

撮像レンズ３０の背後には、撮像レンズ３０を透過した被写体光を光電変換して、撮像信号を出力するＣＣＤ４６が配置されている。ＣＣＤ４６には、ＣＰＵ４３によって制御される図示しないタイミングジェネレータが接続され、このタイミングジェネレータから入力されるタイミング信号（クロックパルス）により、電子シャッタのシャッタ速度が決定される。 Behind the imaging lens 30 is a CCD 46 that photoelectrically converts subject light transmitted through the imaging lens 30 and outputs an imaging signal. A timing generator (not shown) controlled by the CPU 43 is connected to the CCD 46, and the shutter speed of the electronic shutter is determined by a timing signal (clock pulse) input from the timing generator.

ＣＣＤ４６から出力された撮像信号は、相関二重サンプリング回路（ＣＤＳ）４７に入力され、ＣＣＤ４６の各セルの蓄積電荷量に正確に対応したＲ、Ｇ、Ｂの画像データとして出力される。ＣＤＳ４７から出力された画像データは、増幅器（ＡＭＰ）４８で増幅され、Ａ／Ｄ変換器（Ａ／Ｄ）４９でデジタルの画像データに変換される。 The imaging signal output from the CCD 46 is input to a correlated double sampling circuit (CDS) 47 and output as R, G, and B image data that accurately corresponds to the accumulated charge amount of each cell of the CCD 46. Image data output from the CDS 47 is amplified by an amplifier (AMP) 48 and converted to digital image data by an A / D converter (A / D) 49.

画像入力コントローラ５０は、バス５１を介してＣＰＵ４３に接続され、ＣＰＵ４３の制御命令に応じて、ＣＣＤ４６、ＣＤＳ４７、ＡＭＰ４８、およびＡ／Ｄ４９を制御する。Ａ／Ｄ４９から出力された画像データは、ＳＤＲＡＭ５２に一旦格納される。この画像データは、ＬＣＤドライバ５３を介してＬＣＤ３７に表示される。 The image input controller 50 is connected to the CPU 43 via the bus 51, and controls the CCD 46, the CDS 47, the AMP 48, and the A / D 49 in accordance with a control command from the CPU 43. The image data output from the A / D 49 is temporarily stored in the SDRAM 52. This image data is displayed on the LCD 37 via the LCD driver 53.

画像信号処理回路５４は、ＳＤＲＡＭ５２から画像データを読み出して、階調変換、ホワイトバランス補正、γ補正処理などの各種画像処理を施し、この画像データを再度ＳＤＲＡＭ５２に格納する。 The image signal processing circuit 54 reads the image data from the SDRAM 52, performs various image processing such as gradation conversion, white balance correction, and γ correction processing, and stores the image data in the SDRAM 52 again.

画像信号処理回路５４で各種処理を施された画像データは、ＳＤＲＡＭ５２から図示しないＹＣ変換処理回路に読み出され、輝度信号Ｙと色差信号Ｃｒ、Ｃｂとに変換される。変換された画像データは、図示しない圧縮伸長処理回路により、所定の圧縮形式（例えばＪＰＥＧ形式）で画像圧縮を施される。圧縮された画像データは、メディアコントローラ５５を経由してメモリカード５６に記録される。あるいは、外部Ｉ／Ｆ５７、コネクタ３５を経由して外部に送信される。 The image data that has been subjected to various processes by the image signal processing circuit 54 is read from the SDRAM 52 to a YC conversion processing circuit (not shown) and converted into a luminance signal Y and color difference signals Cr and Cb. The converted image data is subjected to image compression in a predetermined compression format (for example, JPEG format) by a compression / decompression processing circuit (not shown). The compressed image data is recorded on the memory card 56 via the media controller 55. Alternatively, it is transmitted to the outside via the external I / F 57 and the connector 35.

ＣＰＵ４３には、前述のレリーズボタン３３、操作部３８の他に、ＥＥＰＲＯＭ５８が接続されている。ＥＥＰＲＯＭ５８には、各種制御用のプログラムや設定情報が記録されている。また、音声操作機能を利用する顧客２０の操作命令の音声データ（語彙データおよび声紋データ）が予め登録されている。ＣＰＵ４３は、これらの情報をＥＥＰＲＯＭ５８から作業用メモリであるＳＤＲＡＭ５２に読み出して、各種処理を実行する。 In addition to the release button 33 and the operation unit 38 described above, an EEPROM 58 is connected to the CPU 43. In the EEPROM 58, various control programs and setting information are recorded. In addition, voice data (vocabulary data and voiceprint data) of the operation command of the customer 20 using the voice operation function is registered in advance. The CPU 43 reads these pieces of information from the EEPROM 58 to the SDRAM 52 which is a working memory, and executes various processes.

マイクロホン３２には、増幅器（ＡＭＰ）５９が接続されている。マイクロホン３２から入力された音声は、増幅器（ＡＭＰ）５９で増幅され、Ａ／Ｄ変換器（Ａ／Ｄ）６０でデジタルの音声データに変換される。Ａ／Ｄ６０から出力された音声データは、ＣＰＵ４３でノイズ除去などの各種信号処理を施された後、メモリカード５６に記録される。また、議事録作成モードでは、語彙変換回路６１および声紋変換回路６２に送信される。 An amplifier (AMP) 59 is connected to the microphone 32. The sound input from the microphone 32 is amplified by an amplifier (AMP) 59 and converted into digital sound data by an A / D converter (A / D) 60. The audio data output from the A / D 60 is recorded on the memory card 56 after various signal processing such as noise removal is performed by the CPU 43. In the minutes creation mode, the data is transmitted to the vocabulary conversion circuit 61 and the voiceprint conversion circuit 62.

語彙変換回路６１および声紋変換回路６２は、Ａ／Ｄ６０から出力された音声データに対して、語彙変換および声紋変換をそれぞれ施し、語彙データおよび声紋データを生成する。なお、語彙変換回路６１は、後述するサーバ１２の音声認識装置９２と同様の方式で語彙変換を行う。 The vocabulary conversion circuit 61 and the voiceprint conversion circuit 62 perform vocabulary conversion and voiceprint conversion on the voice data output from the A / D 60, respectively, and generate vocabulary data and voiceprint data. The vocabulary conversion circuit 61 performs vocabulary conversion in the same manner as a voice recognition device 92 of the server 12 described later.

第１、第２比較回路６３、６４には、ＥＥＰＲＯＭ５８に予め登録されている顧客２０の語彙データおよび声紋データが、ＣＰＵ４３を通じてプリセットされる。第１、第２比較回路６３、６４は、これらのプリセットされたデータと、語彙変換回路６１および声紋変換回路６２で生成された語彙データおよび声紋データとを比較する。そして、生成されたデータと登録されているデータとが一致した場合に、トリガ信号を出力する。ここで、第１比較回路６３は、トリガ信号とともに一致した語彙データをＣＰＵ４３に送信する。 The vocabulary data and voiceprint data of the customer 20 registered in advance in the EEPROM 58 are preset in the first and second comparison circuits 63 and 64 through the CPU 43. The first and second comparison circuits 63 and 64 compare the preset data with the vocabulary data and the voiceprint data generated by the vocabulary conversion circuit 61 and the voiceprint conversion circuit 62. Then, a trigger signal is output when the generated data matches the registered data. Here, the first comparison circuit 63 transmits the matched vocabulary data to the CPU 43 together with the trigger signal.

論理回路６５は、第１、第２比較回路６３、６４の出力の論理積をとり、第１、第２比較回路６３、６４でともにトリガ信号が発生したとき、つまり、語彙データ、声紋データともに、生成されたデータと登録されているデータとが一致したときにのみ、ＣＰＵ４３に割り込み信号を送信する。ＣＰＵ４３は、論理回路６５からの割り込み信号を受信して、第１比較回路６３から送信された語彙データが表す操作命令に対応した処理をデジタルカメラ１０の各部に実行させる。 The logic circuit 65 calculates the logical product of the outputs of the first and second comparison circuits 63 and 64, and when both the first and second comparison circuits 63 and 64 generate a trigger signal, that is, both vocabulary data and voiceprint data. An interrupt signal is transmitted to the CPU 43 only when the generated data matches the registered data. The CPU 43 receives the interrupt signal from the logic circuit 65 and causes each part of the digital camera 10 to execute processing corresponding to the operation command represented by the vocabulary data transmitted from the first comparison circuit 63.

ＣＰＵ４３は、顧客２０から会議開始命令が音声入力された場合、携帯電話１１および電話回線網１３を介して、サーバ１２との回線をオープンさせる。音声収録開始／終了命令が入力された場合は、マイクロホン３２による音声収録を開始／終了させる。また、データ送信命令が入力された場合は、メモリカード５６に記録されている画像データおよび音声データを、携帯電話１１を介してサーバ１２に送信する。会議終了命令が入力された場合は、画像データおよび音声データをサーバ１２に送信した後、サーバ１２との回線をクローズさせる。 The CPU 43 opens a line with the server 12 via the mobile phone 11 and the telephone line network 13 when a conference start command is inputted by voice from the customer 20. When a voice recording start / end command is input, voice recording by the microphone 32 is started / finished. When a data transmission command is input, the image data and audio data recorded on the memory card 56 are transmitted to the server 12 via the mobile phone 11. When a conference end command is input, the image data and audio data are transmitted to the server 12 and then the line with the server 12 is closed.

デジタルカメラ１０では、携帯電話１１および電話回線網１３を介して、議事録作成モードでメモリカード５６に記録された画像データおよび音声データを、随時一定の間隔でサーバ１２に送信する。また、議事録作成モード使用時に、メモリカード５６の記録容量が不足した場合には、ビープ音やランプを点灯させるなどして、顧客２０に対して事前に警告を発し、サーバ１２へのデータ送信、あるいはメモリカード５６の交換を促す。 In the digital camera 10, image data and audio data recorded on the memory card 56 in the minutes creation mode are transmitted to the server 12 at regular intervals through the mobile phone 11 and the telephone line network 13. Further, when the recording capacity of the memory card 56 is insufficient when the minutes creation mode is used, a warning is issued to the customer 20 in advance by turning on a beep sound or a lamp, and data transmission to the server 12 is performed. Or, it is urged to replace the memory card 56.

図６に示すように、携帯電話１１は、ＣＰＵ７０により各部を統括的に制御される。この携帯電話１１は、通信相手の音声や着信メロディを出力する受話スピーカ７１と、話し手の音声を集音する送話マイク７２と、各種選択キーやダイヤルキーからなる操作部７３とを備えている。また、携帯電話１１には、液晶表示器（ＬＣＤ）７４、アンテナ７５、コネクタ７６（例えばＵＳＢコネクタ）、およびメモリ７７が設けられている。 As shown in FIG. 6, the mobile phone 11 is controlled centrally by the CPU 70. The mobile phone 11 includes a reception speaker 71 that outputs a communication partner's voice and a ringing melody, a transmission microphone 72 that collects a speaker's voice, and an operation unit 73 including various selection keys and dial keys. . The mobile phone 11 is provided with a liquid crystal display (LCD) 74, an antenna 75, a connector 76 (for example, a USB connector), and a memory 77.

ＬＣＤ７４には、各種設定メニューからなるメニュー画面、着信相手の電話番号やメールアドレス、インターネットサーバからダウンロードした画像などが、ＬＣＤドライバ７８を介して表示される。アンテナ７５は、通信Ｉ／Ｆ７９を介して、他の携帯電話などからの電波信号を受信するとともに、携帯電話１１から発信される電波信号を外部に送信する。コネクタ７６は、デジタルカメラ１０のコネクタ３５とＵＳＢケーブルなどで接続され、外部Ｉ／Ｆ８０を介してデータの送受信を行う。メモリ７７には、デジタルカメラ１０から送信される画像データや音声データが一時的に格納される。 The LCD 74 displays a menu screen including various setting menus, a telephone number or mail address of the called party, an image downloaded from the Internet server, and the like via the LCD driver 78. The antenna 75 receives a radio signal from another mobile phone or the like via the communication I / F 79 and transmits a radio signal transmitted from the mobile phone 11 to the outside. The connector 76 is connected to the connector 35 of the digital camera 10 via a USB cable or the like, and transmits / receives data via the external I / F 80. The memory 77 temporarily stores image data and audio data transmitted from the digital camera 10.

図７に示すように、サーバ１２は、バッファメモリ９０、文字認識装置９１、音声認識装置９２、編集装置９３、配信装置９４、および自動決済装置９５を備えている。バッファメモリ９０は、デジタルカメラ１０で取得され、携帯電話１１から電話回線網１３を介して送信される画像データおよび音声データを一時的に格納し、これらのデータを時系列で並べて分別し、一定の間隔で文字認識装置９１および音声認識装置９２に送信する。 As shown in FIG. 7, the server 12 includes a buffer memory 90, a character recognition device 91, a voice recognition device 92, an editing device 93, a distribution device 94, and an automatic settlement device 95. The buffer memory 90 temporarily stores image data and audio data acquired by the digital camera 10 and transmitted from the mobile phone 11 via the telephone line network 13, and sorts these data by arranging them in time series. Are transmitted to the character recognition device 91 and the voice recognition device 92 at intervals of.

文字認識装置９１は、文字認識ソフトを備えており、送信された画像データ（ホワイトボード２１全体を撮影した画像データ）内の文字を文字認識して、第１のテキストデータに変換する。音声認識装置９２は、音声認識ソフトを備えており、送信された音声データ（会議の参加者の発言を収録した音声データ）を音声認識して、第２のテキストデータに変換する。 The character recognition device 91 includes character recognition software, recognizes characters in the transmitted image data (image data obtained by photographing the entire whiteboard 21), and converts the characters into first text data. The voice recognition device 92 is provided with voice recognition software, recognizes the transmitted voice data (voice data containing the speech of the conference participants), and converts it into second text data.

文字認識は、まず、画像データの中から文字が書かれた部分を抜き出すレイアウト解析を行い、これにより抜き出された部分から１文字１文字を切り出していき、この１文字１文字について、その特徴量、例えばエッジ、輪郭、方向寄与度などを抽出し、図示しない認識辞書に予め登録されている標準パターンと比較照合する。そして、マッチした数種の標準パターンを候補として出力し、この候補から前後の文脈などを加味しながら誤認識を訂正して、最終的に残った候補を第１のテキストデータに変換する。 In character recognition, first, layout analysis is performed to extract a portion in which characters are written from image data, and one character is extracted from the extracted portion. A quantity such as an edge, contour, direction contribution, etc. is extracted and compared with a standard pattern registered in advance in a recognition dictionary (not shown). Then, several types of matched standard patterns are output as candidates, and misrecognition is corrected while taking into account the context before and after the candidates, and finally the remaining candidates are converted into first text data.

一方、音声認識は、１５〜３０ｍｓ程度の音声データを１フレームとし、５〜２０ｍｓずつシフトしながら、高速フーリエ変換や線形予測法によりスペクトル分析を行って、その特徴量を算出する。次に、非線形伸縮パターンマッチングや隠れマルコフモデルを用いて、算出した特徴量と図示しない認識辞書に予め登録されている標準パターンとを比較照合する。そして、マッチした数種の標準パターンを候補として出力し、この候補から前後の文脈などを加味しながら誤認識を訂正して、最終的に残った候補を第２のテキストデータに変換する。なお、音声認識をより高精度に行うために、会議開始前に会議の参加者の音声を、デジタルカメラ１０を介して音声認識装置９２にサンプリングデータとして予め登録しておき、このサンプリングデータを元に音声認識を行ってもよい。 On the other hand, in speech recognition, speech data of about 15 to 30 ms is set as one frame, and spectrum analysis is performed by fast Fourier transform or linear prediction while shifting by 5 to 20 ms, and the feature amount is calculated. Next, using a non-linear expansion / contraction pattern matching or a hidden Markov model, the calculated feature value is compared with a standard pattern registered in advance in a recognition dictionary (not shown). Then, several types of matched standard patterns are output as candidates, and misrecognition is corrected while taking into consideration the context before and after the candidates, and finally the remaining candidates are converted into second text data. In order to perform voice recognition with higher accuracy, the voices of the participants of the conference are registered in advance as sampling data in the voice recognition device 92 via the digital camera 10 before the conference starts. Voice recognition may be performed.

編集装置９３は、文字認識装置９１および音声認識装置９２で変換された第１、および第２のテキストデータを自動編集して、議事録ファイルを作成する。配信装置９４には、顧客２０の電子メールアドレスが予め登録されている。この配信装置９４は、議事録ファイルのヘッダに作成日時、顧客２０の氏名を付記して、これをインターネット１４経由で顧客２０のＰＣ１５に電子メールの形で配信する。 The editing device 93 automatically edits the first and second text data converted by the character recognition device 91 and the speech recognition device 92 to create a minutes file. In the distribution device 94, the e-mail address of the customer 20 is registered in advance. The distribution device 94 adds the creation date and time and the name of the customer 20 to the header of the minutes file, and distributes them to the PC 15 of the customer 20 via the Internet 14 in the form of an e-mail.

自動決済装置９５には、顧客２０の銀行口座が予め登録されている。この自動決済装置９５は、配信装置９４による議事録ファイルの配信後、電子決済により、顧客２０の銀行口座からシステム利用料を自動的に引き落とす。 In the automatic settlement apparatus 95, the bank account of the customer 20 is registered in advance. The automatic settlement apparatus 95 automatically withdraws the system usage fee from the bank account of the customer 20 by electronic settlement after distribution of the minutes file by the distribution apparatus 94.

次に、上記実施形態による作用について、図８〜１０のフローチャートを参照して説明する。まず、図８に示すように、顧客２０は、ホワイトボード２１全体が撮影可能な場所にデジタルカメラ１０を設置し、ＵＳＢケーブルなどでコネクタ３５、７６を繋ぎ、デジタルカメラ１０に携帯電話１１を接続する。また、ＰＣ１５をインターネット１４に接続する。そして、デジタルカメラ１０の電源を投入し、議事録作成モードを選択する。 Next, the effect | action by the said embodiment is demonstrated with reference to the flowchart of FIGS. First, as shown in FIG. 8, the customer 20 installs the digital camera 10 in a place where the entire whiteboard 21 can be photographed, connects the connectors 35 and 76 with a USB cable or the like, and connects the mobile phone 11 to the digital camera 10. To do. Further, the PC 15 is connected to the Internet 14. Then, the digital camera 10 is turned on, and the minutes creation mode is selected.

会議の開始とともに、顧客２０により会議開始命令がマイクロホン３２に音声入力される。マイクロホン３２から入力された顧客２０の音声は、ＡＭＰ５９で増幅され、Ａ／Ｄ６０でデジタルの音声データに変換される。 At the start of the conference, the customer 20 inputs a conference start command to the microphone 32 by voice. The voice of the customer 20 input from the microphone 32 is amplified by the AMP 59 and converted into digital voice data by the A / D 60.

図９に示すように、デジタルカメラ１０では、議事録作成モードの選択に伴って、ＥＥＰＲＯＭ５８に予め登録されている顧客２０の語彙データおよび声紋データが、ＣＰＵ４３を通じて第１、第２比較回路６３、６４にプリセットされる。Ａ／Ｄ６０から出力された音声データは、語彙変換回路６１および声紋変換回路６２により、語彙変換および声紋変換をそれぞれ施され、語彙データおよび声紋データが生成される。次に、第１、第２比較回路６３、６４で、プリセットされたデータと、生成された語彙データおよび声紋データとが比較される。そして、生成されたデータと登録されているデータとが一致した場合に、第１、第２比較回路６３、６４からトリガ信号が出力される。ここで、第１比較回路６３からは、トリガ信号とともに一致した語彙データがＣＰＵ４３に送信される。 As shown in FIG. 9, in the digital camera 10, the vocabulary data and voiceprint data of the customer 20 registered in advance in the EEPROM 58 in accordance with the selection of the minutes creation mode are sent through the CPU 43 to the first and second comparison circuits 63, Preset to 64. The voice data output from the A / D 60 is subjected to vocabulary conversion and voiceprint conversion by the vocabulary conversion circuit 61 and the voiceprint conversion circuit 62, respectively, and vocabulary data and voiceprint data are generated. Next, the first and second comparison circuits 63 and 64 compare the preset data with the generated vocabulary data and voiceprint data. When the generated data matches the registered data, a trigger signal is output from the first and second comparison circuits 63 and 64. Here, from the first comparison circuit 63, the matched vocabulary data is transmitted to the CPU 43 together with the trigger signal.

論理回路６５では、第１、第２比較回路６３、６４の出力の論理積が算出され、第１、第２比較回路６３、６４でともにトリガ信号が発生したときにのみ、ＣＰＵ４３に割り込み信号が送信される。ＣＰＵ４３では、論理回路６５からの割り込み信号を受信して、携帯電話１１および電話回線網１３を介してサーバ１２との回線をオープンさせる。 In the logic circuit 65, the logical product of the outputs of the first and second comparison circuits 63 and 64 is calculated, and an interrupt signal is sent to the CPU 43 only when a trigger signal is generated in both the first and second comparison circuits 63 and 64. Sent. The CPU 43 receives an interrupt signal from the logic circuit 65 and opens a line with the server 12 via the mobile phone 11 and the telephone line network 13.

図８において、回線開通後、顧客２０により音声収録開始命令がマイクロホン３２に音声入力されると、図９に示す処理と同様の手順で音声認識処理が行われ、マイクロホン３２による会議の参加者の発言の収録が開始される。 8, when a voice recording start command is input to the microphone 32 by the customer 20 after the line is opened, voice recognition processing is performed in the same procedure as the processing shown in FIG. Recording of remarks begins.

マイクロホン３２で収録された音声は、上記同様にＡＭＰ５９で増幅され、Ａ／Ｄ６０でデジタルの音声データに変換される。Ａ／Ｄ６０から出力された音声データは、ＣＰＵ４３でノイズ除去などの各種信号処理を施された後、メモリカード５６に記録される。 The sound recorded by the microphone 32 is amplified by the AMP 59 as described above, and converted to digital sound data by the A / D 60. The audio data output from the A / D 60 is recorded on the memory card 56 after various signal processing such as noise removal is performed by the CPU 43.

デジタルカメラ１０の撮像レンズ３０、絞り４０を介して入射した被写体光は、ＣＣＤ４６により光電変換され、ＣＤＳ４７でサンプリングされる。ＣＤＳ４７から出力された画像データは、ＡＭＰ４８で増幅され、Ａ／Ｄ４９でデジタルの画像データに変換される。デジタル変換された画像データは、画像入力コントローラ５０を介してＳＤＲＡＭ５２に順次格納され、ＬＣＤ３７にスルー画像として表示される。 The subject light incident through the imaging lens 30 and the aperture 40 of the digital camera 10 is photoelectrically converted by the CCD 46 and sampled by the CDS 47. The image data output from the CDS 47 is amplified by the AMP 48 and converted to digital image data by the A / D 49. The digitally converted image data is sequentially stored in the SDRAM 52 via the image input controller 50 and displayed on the LCD 37 as a through image.

上記の状態で、発表者２２がホワイトボード２１に書いた会議の内容が１段落したときに、顧客２０により撮影命令がマイクロホン３２に音声入力されると、図９に示す処理と同様の手順で音声認識処理が行われ、そのときＳＤＲＡＭ５２に格納されている画像データ（ホワイトボード２１全体を撮影した画像データ）が画像信号処理回路５４に読み出され、各種画像処理が施される。 In the state described above, when the content of the conference written by the presenter 22 on the whiteboard 21 reaches one paragraph and the customer 20 inputs a shooting command into the microphone 32, the procedure similar to the process shown in FIG. Voice recognition processing is performed. At that time, image data (image data obtained by photographing the entire whiteboard 21) stored in the SDRAM 52 is read out to the image signal processing circuit 54, and various image processing is performed.

画像信号処理回路５４で各種処理を施された画像データは、ＳＤＲＡＭ５２から図示しないＹＣ変換処理回路に読み出され、輝度信号Ｙと色差信号Ｃｒ、Ｃｂとに変換される。変換された画像データは、図示しない圧縮伸長処理回路により、所定の圧縮形式（例えばＪＰＥＧ形式）で画像圧縮を施される。圧縮された画像データは、メディアコントローラ５５を経由してメモリカード５６に記録される。 The image data that has been subjected to various processes by the image signal processing circuit 54 is read from the SDRAM 52 to a YC conversion processing circuit (not shown) and converted into a luminance signal Y and color difference signals Cr and Cb. The converted image data is subjected to image compression in a predetermined compression format (for example, JPEG format) by a compression / decompression processing circuit (not shown). The compressed image data is recorded on the memory card 56 via the media controller 55.

メモリカード５６に記録された画像データおよび音声データは、携帯電話１１から電話回線網１３を介して、随時一定の間隔でサーバ１２に送信される。メモリカード５６の記録容量が不足した場合には、顧客２０に対して事前に警告が発せられ、サーバ１２へのデータ送信、あるいはメモリカード５６の交換が促される。 Image data and audio data recorded in the memory card 56 are transmitted from the mobile phone 11 to the server 12 via the telephone line network 13 at regular intervals. When the recording capacity of the memory card 56 is insufficient, a warning is issued to the customer 20 in advance, and data transmission to the server 12 or replacement of the memory card 56 is prompted.

顧客２０によりデータ送信命令がマイクロホン３２に音声入力されると、図９に示す処理と同様の手順で音声認識処理が行われ、メモリカード５６に記録されている画像データおよび音声データが、携帯電話１１を介してサーバ１２に送信される。これら一連の処理は、音声入力された操作命令に応じて、会議が終了するまで繰り返し行われる。 When the customer 20 inputs a data transmission command to the microphone 32 by voice, the voice recognition process is performed in the same procedure as the process shown in FIG. 9, and the image data and voice data recorded in the memory card 56 are transferred to the mobile phone. 11 is transmitted to the server 12 via 11. These series of processes are repeatedly performed until the conference is ended according to the operation command inputted by voice.

顧客２０により会議終了命令がマイクロホン３２に音声入力されると、図９に示す処理と同様の手順で音声認識処理が行われ、画像データおよび音声データがサーバ１２に送信された後、サーバ１２との回線がクローズされる。 When the customer 20 inputs a conference end command to the microphone 32, voice recognition processing is performed in the same procedure as the processing shown in FIG. 9, and after the image data and voice data are transmitted to the server 12, The line is closed.

図１０に示すように、サーバ１２側では、まず、顧客２０による会議開始命令を受信して、携帯電話１１を介してデジタルカメラ１０との回線がオープンされる。回線開通後、携帯電話１１を介してデジタルカメラ１０から送信された画像データおよび音声データは、バッファメモリ９０に一時的に格納される。バッファメモリ９０では、これらのデータが時系列で並べて分別され、一定の間隔で文字認識装置９１および音声認識装置９２に送信される。 As shown in FIG. 10, the server 12 side first receives a conference start command from the customer 20 and opens a line with the digital camera 10 via the mobile phone 11. After the line is opened, the image data and audio data transmitted from the digital camera 10 via the mobile phone 11 are temporarily stored in the buffer memory 90. In the buffer memory 90, these data are sorted in time series and sorted, and transmitted to the character recognition device 91 and the speech recognition device 92 at regular intervals.

デジタルカメラ１０から送信された画像データは、文字認識装置９１で画像データ内の文字が文字認識され、第１のテキストデータに変換される。一方、音声データは、音声認識装置９２で会議の参加者の発言が音声認識され、第２のテキストデータに変換される。会議終了命令が送信された場合は、編集装置９３で第１のテキストデータと第２のテキストデータが自動編集され、議事録ファイルが作成される。送信されたデータが上記のいずれでもない場合は、エラー処理が行われる。 In the image data transmitted from the digital camera 10, characters in the image data are recognized by the character recognition device 91, and converted into first text data. On the other hand, the speech data is speech-recognized by the speech recognition device 92 for speech of the participants in the conference and converted into second text data. When the conference end command is transmitted, the first text data and the second text data are automatically edited by the editing device 93, and a minutes file is created. If the transmitted data is none of the above, error processing is performed.

編集装置９３で作成された議事録ファイルは、配信装置９４でヘッダに作成日時、顧客２０の氏名が付記され、インターネット１４経由で顧客２０のＰＣ１５に電子メールの形で配信される。配信装置９４による議事録ファイルの配信後、自動決済装置９５により、顧客２０の銀行口座からシステム利用料が自動的に引き落とされる。 The minutes file created by the editing device 93 has the date and time of creation and the name of the customer 20 added to the header by the distribution device 94 and is distributed via the Internet 14 to the PC 15 of the customer 20 in the form of an e-mail. After the minutes file is distributed by the distribution device 94, the automatic settlement device 95 automatically deducts the system usage fee from the bank account of the customer 20.

上記のような構成であると、顧客２０はデジタルカメラ１０、携帯電話１１、およびＰＣ１５を用意するだけでよく、電話回線網１３に接続可能な環境であれば、どんな場所でも議事録作成システム２を利用することができる。また、音声操作機能を備えたデジタルカメラ１０を用いているので、顧客２０はハンズフリーで撮影、音声収録、データ送信などを行え、議論に参加することが可能となる。さらに、会議終了後、会議の参加者の各々のＰＣに議事録ファイルを電子メールにて転送するだけで、容易且つ確実に情報を共有することができる。 With the configuration as described above, the customer 20 only needs to prepare the digital camera 10, the mobile phone 11, and the PC 15, and the minutes generation system 2 can be used anywhere as long as it can be connected to the telephone network 13. Can be used. Further, since the digital camera 10 having the voice operation function is used, the customer 20 can perform hands-free shooting, voice recording, data transmission, and the like, and can participate in the discussion. Furthermore, after the conference is over, the information can be easily and reliably shared by simply transferring the minutes file to each PC of the conference participants by e-mail.

なお、上記実施形態では、デジタルカメラ１０で取得した画像データおよび音声データを携帯電話１１を介してサーバ１２に送信しているが、モジュラージャックを介して電話回線網１３に直接デジタルカメラ１０を接続してもよく、顧客２０のＰＣ１５にデジタルカメラ１０を接続して、インターネット１４経由でデータ送信を行ってもよい。また、サーバ１２内に文字認識装置９１、音声認識装置９２、編集装置９３、および配信装置９４を設けているが、これらを独立して設けてもよい。 In the above embodiment, image data and audio data acquired by the digital camera 10 are transmitted to the server 12 via the mobile phone 11, but the digital camera 10 is directly connected to the telephone line network 13 via a modular jack. Alternatively, the digital camera 10 may be connected to the PC 15 of the customer 20 and data transmission may be performed via the Internet 14. Further, although the character recognition device 91, the voice recognition device 92, the editing device 93, and the distribution device 94 are provided in the server 12, these may be provided independently.

さらに、サーバ１２の文字認識装置９１、音声認識回路９２で変換した第１、第２のテキストデータを、ＲＴＦ（RichText Format ）やＨＴＭＬ（HyperText Markup Language ）などの、より汎用性の高いファイル形式で出力してもよい。このようにすると、議事録ファイルの再編集を円滑に行うことができる。 Furthermore, the first and second text data converted by the character recognition device 91 and the speech recognition circuit 92 of the server 12 are converted into a more versatile file format such as RTF (RichText Format) or HTML (HyperText Markup Language). It may be output. In this way, the minutes file can be re-edited smoothly.

本発明の議事録作成システムの概略構成を示す図である。It is a figure which shows schematic structure of the minutes production system of this invention. 議事録作成システムを利用した会議の様子を示す説明図である。It is explanatory drawing which shows the mode of the meeting using the minutes creation system. デジタルカメラの正面外観斜視図である。It is a front external perspective view of a digital camera. デジタルカメラの背面外観斜視図である。It is a back external appearance perspective view of a digital camera. デジタルカメラの電気的構成を示すブロック図である。It is a block diagram which shows the electric constitution of a digital camera. 携帯電話の内部構成を示すブロック図である。It is a block diagram which shows the internal structure of a mobile telephone. サーバの内部構成を示すブロック図である。It is a block diagram which shows the internal structure of a server. デジタルカメラの処理手順を示すフローチャートである。It is a flowchart which shows the process sequence of a digital camera. デジタルカメラの音声操作機能の処理手順を示すフローチャートである。It is a flowchart which shows the process sequence of the voice operation function of a digital camera. サーバの処理手順を示すフローチャートである。It is a flowchart which shows the process sequence of a server.

Explanation of symbols

２議事録作成システム
１０デジタルカメラ
１１携帯電話
１２サーバ
１３電話回線網
１４インターネット
１５パーソナルコンピュータ（ＰＣ）
２０顧客
２１ホワイトボード
３０撮像レンズ
３２マイクロホン
３３レリーズボタン
３５コネクタ
３７液晶表示器（ＬＣＤ）
４３ＣＰＵ
４６ＣＣＤ
５２ＳＤＲＡＭ
５４画像信号処理回路
５６メモリカード
５８ＥＥＰＲＯＭ
６１語彙変換回路
６２声紋変換回路
６３、６４第１、第２比較回路
６５論理回路
７５アンテナ
７６コネクタ
７７メモリ
９１文字認識装置
９２音声認識装置
９３編集装置
９４配信装置
９５自動決済装置 2 Minutes creation system 10 Digital camera 11 Mobile phone 12 Server 13 Telephone network 14 Internet 15 Personal computer (PC)
20 Customer 21 Whiteboard 30 Imaging Lens 32 Microphone 33 Release Button 35 Connector 37 Liquid Crystal Display (LCD)
43 CPU
46 CCD
52 SDRAM
54 Image signal processing circuit 56 Memory card 58 EEPROM
61 vocabulary conversion circuit 62 voice print conversion circuit 63, 64 first and second comparison circuit 65 logic circuit 75 antenna 76 connector 77 memory 91 character recognition device 92 voice recognition device 93 editing device 94 distribution device 95 automatic settlement device

Claims

A digital camera that captures the display area of the display device installed in the conference room, outputs digital image data, records the remarks of participants in the conference, and outputs digital audio data;
A communication device connected to the digital camera and transmitting the image data and audio data to the outside;
A character recognition device that recognizes characters in the image data transmitted from the communication device and converts the characters into first text data;
A voice recognition device that recognizes the voice data transmitted from the communication device and converts the voice data into second text data;
An editing device that automatically edits the first and second text data to create a minutes file;
A minutes creation system comprising: a delivery device for delivering the minutes file to a customer via a communication network.

2. The minutes creation according to claim 1, wherein the character recognition device, the speech recognition device, the editing device, and the distribution device are provided in a server connected to the communication device via a communication network. system.

The digital camera includes voice recognition means for voice recognition of an operation command inputted by the customer, and control means for controlling each unit to execute processing according to the voice-recognized operation command. The minutes creation system according to claim 1 or 2.

The automatic settlement apparatus according to any one of claims 1 to 3, further comprising: an automatic settlement device that automatically deducts a system usage fee from the bank account of the customer registered in advance after distribution of the minutes file. Minutes creation system.