JP2005221565A - Voice data file storing method and sound-recording processor

Info

Publication number: JP2005221565A
Application number: JP2004026840A
Authority: JP
Inventors: Satoru Wakabayashi; 覚若林
Original assignee: NEC Saitama Ltd
Current assignee: NEC Saitama Ltd
Priority date: 2004-02-03
Filing date: 2004-02-03
Publication date: 2005-08-18

Abstract

PROBLEM TO BE SOLVED: To efficiently divide and store digital voice data.

SOLUTION: A sound-recording processor (100) is equipped with a voice processing part (17) which converts an input voice into a digital signal, a control part (18) which performs sound-recording processing for generating digital voice data based upon the converted digital signal and stores the generated voice data, and an operation part (13) which generates an event based upon a user's operation. The control part generates a management file each time a start event of sound-recording processing occurs, generates a data file for storing voice data each time sound-recording processing starts, and records additional information regarding the data file being generated in the management file. Further, the control part detects a voiceless section in the digital signal corresponding to the input voice and stops the sound-recording processing when the voiceless section is detected; when a voiced sound is detected in the digital signal after the processing has stopped, sound-recording processing using a new data file is started.

COPYRIGHT: (C)2005, JPO & NCIPI

Description

The present invention relates to a method for storing data files of audio data in digital recording processing.

Conventionally, techniques have been proposed in which digitally recorded audio data is divided into a plurality of files and stored, so that the user can easily select the audio data to be reproduced. One such technique is described in Patent Document 1, cited below. The method of Patent Document 1 detects silent sections in existing audio data, such as language teaching material, and sets boundaries in those silent sections, thereby dividing a series of data sentence by sentence.

In recent years, so-called IC recorder devices for digitally recording audio at conferences, performances, and the like have become widespread. When a series of audio data is to be divided and stored in such a device, the conventional method that uses silent sections could also be applied.
Patent Document 1: JP 2003-307997 A

However, using the conventional method described above requires a separate editing operation that performs the division after a series of audio data has been acquired by the recording process. It is also possible for the user to interrupt recording manually at arbitrary points during recording and thereby save the audio data in small pieces, but such operations are bothersome and inconvenient for a user who is participating in a conference or the like.

The present invention has been made in view of the above problems, and an object thereof is to provide a method capable of efficiently dividing and storing audio data generated by a recording process.

In the audio data file storage method according to the present invention, when a start event of a recording process that generates digital audio data based on input audio is detected, a management file and a data file for storing audio data are created, and additional information relating to the data file is recorded in the management file. When a silent section of the input audio is detected after the recording process has started, the recording process is stopped. When sound is detected after the stop, a recording process using a new data file is started, and new additional information relating to the new data file is recorded in the management file.

A recording processing apparatus according to the present invention includes an audio processing unit that converts input audio into a digital signal, a control unit that performs a recording process for generating digital audio data based on the converted digital signal and stores the generated audio data, and an operation unit that generates events based on user operations. The control unit has means for creating a management file each time a start event of the recording process occurs, means for creating a data file for storing audio data each time the recording process starts, means for recording additional information relating to the data file being created in the management file, and means for detecting a silent section from the digital signal corresponding to the input audio. The control unit stops the recording process when a silent section is detected and, when sound is detected from the digital signal after the stop, starts a recording process using a new data file.

According to the present invention, the additional information of each data file is recorded as the file is created, in parallel with the intermittent recording process that uses detection of silent sections. Audio data can therefore be divided and stored efficiently without a separate editing operation.

[Example]
Hereinafter, an embodiment of the present invention is described in detail with reference to the drawings. FIG. 1 is a block diagram showing the configuration of an embodiment of a recording processing apparatus according to the present invention. The recording processing apparatus of the embodiment is a mobile phone 100 having a wireless communication function. As shown in FIG. 1, it includes an antenna 10, a wireless unit 11 that transmits and receives radio waves to and from a base station (not shown) via the antenna 10, a display unit 12 that displays screens, an operation unit 13 for user operation, a speaker 14 and a receiver 15 as audio output means, a microphone 16 as audio input means, an audio processing unit 17 that performs audio input/output processing, and a control unit 18 that controls the operation of the mobile phone 100 by means of a CPU 18a and a memory 18b.

When the mobile phone 100 performs its wireless communication function, the wireless unit 11 selects the desired signal frequency from the signal received by the antenna 10, performs frequency conversion and amplification, demodulates the result to generate received data, and outputs the data to the control unit 18. The control unit 18 processes the received data and outputs an audio signal to the audio processing unit 17, which converts the audio signal into an analog signal and outputs the sound from the receiver 15. During a call, audio input from the microphone 16 is converted into a digital signal by the audio processing unit 17; the control unit 18 processes that signal into transmission data, and the wireless unit 11 modulates the data, amplifies it as a carrier wave of a prescribed frequency, and transmits it from the antenna 10.

When the mobile phone 100 performs its audio recording function, on the other hand, the control unit 18 starts the recording process upon detecting a recording start event issued by the operation unit 13, and ends the series of recording processes upon detecting a recording end event. These events are generated, for example, when the user performs a predetermined key operation on the mobile phone 100. In the recording process, the audio processing unit 17 converts the audio input from the microphone 16 into a digital signal. The control unit 18 converts this digital signal according to a predetermined audio data coding method until a silent section of a certain length is detected, thereby creating audio data. The control unit 18 then stores the created audio data, together with the additional information described later, in predetermined files in the memory 18b.
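The silent-section check mentioned above can be sketched as follows. This is only an illustration under stated assumptions: the source does not say how silence is measured, so the RMS amplitude test, the threshold value, the frame-based layout, and the names `frame_rms` and `find_silent_section` are all hypothetical.

```python
import math
from typing import Optional, Sequence


def frame_rms(frame: Sequence[int]) -> float:
    """Root-mean-square amplitude of one frame of PCM samples."""
    if not frame:
        return 0.0
    return math.sqrt(sum(s * s for s in frame) / len(frame))


def find_silent_section(
    frames: Sequence[Sequence[int]],
    threshold: float = 500.0,   # assumed amplitude below which a frame is "silent"
    min_frames: int = 3,        # assumed length of the "certain" silent section
) -> Optional[int]:
    """Return the index of the frame at which a silent section is confirmed.

    A silent section is confirmed once `min_frames` consecutive frames have
    an RMS amplitude below `threshold`. Returns None if that never happens.
    """
    run = 0
    for i, frame in enumerate(frames):
        if frame_rms(frame) < threshold:
            run += 1
            if run >= min_frames:
                return i
        else:
            run = 0  # any loud frame resets the silent run
    return None
```

Requiring several consecutive quiet frames keeps a brief pause between words from being treated as a silent section.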

The basic procedure of the recording process controlled by the control unit 18 is described with reference to FIG. 2. First, triggered by the recording start event EV1 from the operation unit 13, the microphone 16 is turned ON and recording of audio data starts; audio data A is then stored sequentially in audio data file a1. When a silent section lasting a certain time is detected in the audio data, the recording operation is paused and audio data file a1 is closed. When a sound level is subsequently detected, audio data B is stored in a new audio data file a2 until the next silent section is detected.

Audio data C is then stored in audio data file a3 by the same procedure. If, during recording of audio data D, the operation unit 13 issues a file save event EV2, that is, a divided-save instruction by user operation, the audio data D up to that point is stored in audio data file a4 and the subsequent audio data E is stored sequentially in a new audio data file a5. Finally, triggered by the recording stop event EV3 from the operation unit 13, the microphone 16 is turned OFF, recording of audio data stops, and audio data file a5 is closed.

In this way, each time a silent section is detected, the audio data is stored in a new audio data file, so the files are divided in parallel with the recording process. As a result, skip playback (cue playback) in units of data files can be performed immediately after recording ends.
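The division-while-recording idea described above can be sketched as a single pass over incoming frames. This is a minimal sketch, not the embodiment's implementation: frames are arbitrary Python objects, lists stand in for the data files a1, a2, and so on, and `split_on_silence`, `is_silent`, and `min_silent_frames` are hypothetical names.

```python
from typing import Callable, Iterable, List, Optional, TypeVar

T = TypeVar("T")


def split_on_silence(
    frames: Iterable[T],
    is_silent: Callable[[T], bool],
    min_silent_frames: int = 2,
) -> List[List[T]]:
    """Group frames into per-file lists, closing a file after a silent run."""
    segments: List[List[T]] = []
    current: Optional[List[T]] = []   # the open data file; None while paused
    silent_run = 0
    for frame in frames:
        if is_silent(frame):
            silent_run += 1
            if current is not None:
                if silent_run >= min_silent_frames:
                    segments.append(current)   # silent section: close the file
                    current = None             # recording is now paused
                else:
                    current.append(frame)      # short pause: keep recording
        else:
            silent_run = 0
            if current is None:
                current = []                   # sound after a pause: new file
            current.append(frame)
    if current is not None:
        segments.append(current)               # stop event: close the last file
    return segments
```

Because each file is closed at the moment the silent section is confirmed, the list of segments is complete as soon as the input ends, which mirrors the claim that no editing pass is needed afterwards.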

FIG. 3 shows the file structure of the memory 18b. The mobile phone 100 of the embodiment has a tree-type file structure in which, as shown in FIG. 3, a root directory 101 contains an audio data directory 102, and the audio data directory 102 contains audio data subdirectories 103. Each audio data subdirectory 103 stores the audio data of one series of recording processes, from the start to the end of recording as instructed by user operation. Thus, for example, when the user issues another recording start instruction after audio data subdirectory 103 (001) has been completed, a new audio data subdirectory 103 (ZZZ) is created.

Each audio data subdirectory 103 consists of a playlist management file 104, a track management file 105, and audio data files 106 (a1, ..., an) such as those described with reference to FIG. 2.

FIG. 4 shows a configuration example of the playlist management file 104. The playlist management file 104 stores a list of the audio data files 106 that are saved separately, together with information that relates the files 106 to one another, that is, information for identifying the order in which the files were created. In the illustrated example, the identification code of each audio data file 106 is recorded along with the count value at the start of recording of that audio data and the count value at the end of recording. The count values are provided by a timer mechanism (not shown) of the control unit 18. The information shown in FIG. 4 represents the records for the audio data files 106 (a1 to a5) in the basic procedure described with reference to FIG. 2; the most recent record shows that, following the file save event EV2, audio data E is being stored in the new audio data file (a5).

The playlist management file 104 described above corresponds to the management file according to the present invention, and the recorded information shown in FIG. 4 corresponds to the additional information according to the present invention. Creating this additional information makes it possible to record which audio data files 106 exist in the audio data subdirectory 103 and in what order they were created. Although the example shown in FIG. 4 records the recording start and end count values for each audio data file 106, the additional information according to the present invention is not limited to the illustrated form; any information from which the list of files can be obtained and their creation order identified may be used.
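One way to picture the additional information is as a list of per-file records holding an identifier and start and end count values. The sketch below is an assumption about layout only: the source does not give the on-disk format of the playlist management file, and `PlaylistEntry`, `Playlist`, and their methods are hypothetical names.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class PlaylistEntry:
    file_id: str                      # identification code of the data file, e.g. "a1"
    start_count: int                  # timer count when recording into the file began
    end_count: Optional[int] = None   # None while the file is still being recorded


class Playlist:
    """In-memory stand-in for the playlist management file."""

    def __init__(self) -> None:
        self.entries: List[PlaylistEntry] = []

    def open_file(self, file_id: str, count: int) -> None:
        # Record the new data file as soon as it is created.
        self.entries.append(PlaylistEntry(file_id, count))

    def close_file(self, count: int) -> None:
        # Record the end count of the file currently being recorded.
        self.entries[-1].end_count = count

    def creation_order(self) -> List[str]:
        # The start counts are enough to recover the creation order.
        ordered = sorted(self.entries, key=lambda e: e.start_count)
        return [e.file_id for e in ordered]
```

An entry whose `end_count` is still `None` corresponds to the last row of the FIG. 4 example, where the newest file is still being written.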

FIG. 5 shows a configuration example of the track management file 105 of the audio data subdirectory 103. The track management file 105 records information that is referred to when the audio data in the audio data subdirectory 103 is reproduced, for example, the file storage format, the coding type of the audio data, and the sampling rate at which the audio data is sampled. The illustrated example shows that audio data encoded by the PCM method at a sampling rate of 22 kHz has been saved in the well-known WAVE format.
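The role of the track information can be illustrated with Python's standard `wave` module, which writes PCM data in the WAVE container. This is a sketch under assumptions: the source states only "22 kHz", "PCM", and "WAVE", so the exact rate of 22,050 Hz, the mono 16-bit layout, the dictionary keys, and the function names are all illustrative choices.

```python
import io
import struct
import wave
from typing import Dict, Sequence

# Hypothetical track information; 22050 Hz, mono, and 16-bit are assumptions.
TRACK_INFO: Dict[str, int] = {
    "sample_rate_hz": 22050,
    "channels": 1,
    "sample_width_bytes": 2,
}


def encode_wave(samples: Sequence[int], info: Dict[str, int]) -> bytes:
    """Encode signed 16-bit PCM samples as WAVE bytes using the track info."""
    buf = io.BytesIO()
    with wave.open(buf, "wb") as w:
        w.setnchannels(info["channels"])
        w.setsampwidth(info["sample_width_bytes"])
        w.setframerate(info["sample_rate_hz"])
        # "<h" is a little-endian signed 16-bit integer, as WAVE PCM expects.
        w.writeframes(struct.pack("<%dh" % len(samples), *samples))
    return buf.getvalue()


def describe_wave(data: bytes) -> Dict[str, int]:
    """Read back the parameters a player would need for reproduction."""
    with wave.open(io.BytesIO(data), "rb") as r:
        return {
            "sample_rate_hz": r.getframerate(),
            "channels": r.getnchannels(),
            "sample_width_bytes": r.getsampwidth(),
            "frames": r.getnframes(),
        }
```

Keeping these parameters in one place per subdirectory means every data file in a recording session can be decoded with the same settings.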

The procedure by which the mobile phone 100 carries out the audio data file storage method of the present invention is described with reference to the flowchart of FIG. 6. First, when the recording start event EV1 of the operation unit 13 occurs (step S1: Yes), the control unit 18 creates the playlist management file 104 and the track management file 105, which are the files that hold the additional information, and the audio data file 106 (a1), which stores the audio data after the start of recording (step S2). The audio data subdirectory 103 is thereby formed.

Next, the control unit 18 records the recording start count value, as shown in FIG. 4, in the playlist management file 104 and opens the created audio data file 106 (a1) (step S3). The audio input from the microphone 16 is converted into a digital signal and buffered in predetermined amounts (step S4), converted into audio data, and stored sequentially in the audio data file 106 (a1) in the memory 18b (step S5).

When the control unit 18 detects a silent section of a certain duration in the created audio data, or detects the occurrence of the file save event EV2 from the operation unit 13 (step S6: Yes), it stops the recording process of the audio processing unit 17 and closes the audio data file 106 (a1) (step S7). At this time, it records the recording end count value, as shown in FIG. 4, in the playlist management file 104.

Thereafter, when a sound level is detected in the input from the microphone 16 (step S8: Yes), the control unit 18 creates a new audio data file 106 (a2) (step S9) and records additional information for this new file in the playlist management file 104 (step S10). The procedure from step S4 is then repeated, and additional information for the audio data files 106 (a3, ..., an) is recorded sequentially in the playlist management file 104 as the recording process proceeds.

If the recording stop event EV3 is issued (step S11: Yes) after the recording process has been stopped by detection of a silent section and before any sound is detected (step S8: No), the microphone 16 is turned OFF and the recording process ends. If the recording stop event EV3 is issued from the operation unit 13 (step S12: Yes) while the recording process is continuing, that is, with no silent section detected and no file save event EV2 (step S6: No), the control unit 18 closes the current audio data file 106 (step S13), records the end of recording in the playlist management file 104 (step S14), turns the microphone 16 OFF, and ends the recording process.
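The flow of FIG. 6 described above can be read as a small state machine with idle, recording, and paused states. The sketch below follows that reading and is not taken from the source: the event strings `"SILENCE"` and `"SOUND"`, the state names, the log format, and `run_recorder` are hypothetical, and file contents are omitted.

```python
from typing import Iterable, List, Tuple


def run_recorder(events: Iterable[str]) -> List[Tuple[str, str]]:
    """Replay events and return the ("open" | "close", file name) actions taken."""
    state = "IDLE"
    files: List[str] = []              # data file names in creation order
    log: List[Tuple[str, str]] = []

    def open_new() -> None:
        name = "a%d" % (len(files) + 1)
        files.append(name)
        log.append(("open", name))     # also where additional info would be recorded

    def close_current() -> None:
        log.append(("close", files[-1]))

    for ev in events:
        if state == "IDLE":
            if ev == "EV1":                    # S1-S3: recording start event
                open_new()
                state = "RECORDING"
        elif state == "RECORDING":
            if ev in ("SILENCE", "EV2"):       # S6-S7: silent section or save event
                close_current()
                state = "PAUSED"
            elif ev == "EV3":                  # S12-S14: stop while recording
                close_current()
                state = "IDLE"
        elif state == "PAUSED":
            if ev == "SOUND":                  # S8-S10: sound detected, new file
                open_new()
                state = "RECORDING"
            elif ev == "EV3":                  # S11: stop while paused
                state = "IDLE"
    return log
```

Treating the file save event EV2 like a silent section matches step S6, where both conditions lead to the same branch; the next detected sound then opens the following file.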

As described above, the mobile phone 100 of the embodiment divides and stores the audio data and updates the additional information in parallel with the recording process that runs from the microphone 16 being turned ON until it is turned OFF, so audio data can be recorded efficiently. As a result, audio data can be selectively reproduced immediately after recording, without a separate editing operation.

[Other embodiments]
In the embodiment described above, the audio data is divided by creating a new audio data file 106 each time a silent section is detected. In the present invention, the audio data can also be divided by the other methods described below with reference to FIGS. 7 and 8.

In the method shown in FIG. 7, when the audio input from the microphone 16 during the recording process contains a predetermined keyword, for example a conjunction such as "however", "in short", or "therefore", recording switches to a new audio data file 106 at that point. To implement this method, the predetermined keywords are entered through the operation unit 13 and registered in the memory 18b before recording of the audio data starts.

When the audio input from the microphone 16 has been converted into a digital signal by the audio processing unit 17, the control unit 18 compares that signal with the keywords registered in advance in the memory 18b and, if they match, creates a new audio data file 106. The audio data corresponding to the keyword is stored in this file, and subsequent audio data is stored in it sequentially until the next keyword is detected. Dividing the audio data at keywords such as the conjunctions above makes it possible to record a conversation divided by topic.
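The keyword-based division can be sketched at the level of recognized words. This is a simplification and an assumption: the source compares the digital signal itself with registered keywords and does not describe a recognizer, so the sketch presumes some hypothetical upstream step that already yields words, and `split_on_keywords` is an illustrative name.

```python
from typing import Iterable, List


def split_on_keywords(words: Iterable[str], keywords: Iterable[str]) -> List[List[str]]:
    """Start a new segment at each registered keyword.

    The keyword itself becomes the first item of the new segment, matching
    the description that audio for the keyword is stored in the new file.
    """
    registered = set(keywords)
    segments: List[List[str]] = [[]]
    for word in words:
        # Skip the split when the current segment is empty, so that a keyword
        # at the very start does not leave an empty file behind (a choice made
        # for this sketch, not stated in the source).
        if word in registered and segments[-1]:
            segments.append([])
        segments[-1].append(word)
    return segments
```

For example, with the keywords "however" and "therefore", each segment begins at a point where the speaker changes direction, which is the intent of dividing by topic.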

According to the present invention, a new audio data file 106 can also be created automatically whenever a predetermined period has elapsed. In that case, a time interval for dividing the audio data into a plurality of files is set in the timer mechanism (not shown) of the control unit 18 before recording of the audio data starts. This time interval is set either against absolute time after the start of recording, including silent sections, or against the cumulative actual recording time, excluding silent sections, and a new audio data file 106 is created when the period has elapsed.

In the method that uses the cumulative actual recording time, specifically, as shown in FIG. 8, when the recording time of audio data B in audio data file 106 (a2) reaches a preset upper limit, a new audio data file 106 (a3) is created and used as the storage location for audio data C from the subsequent recording process. Thus, even in situations where silent sections rarely occur in the audio, the audio data can be divided automatically when the preset period has elapsed.
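The two ways of measuring the predetermined period, absolute time including silent sections and cumulative recording time excluding them, can be compared in a short sketch. The input format of `(duration, is_silent)` chunks, the parameter names, and `split_by_duration` are assumptions, and the sketch leaves out the silence-triggered division so that only the timer behaviour is shown.

```python
from typing import Iterable, List, Tuple


def split_by_duration(
    chunks: Iterable[Tuple[float, bool]],
    limit_s: float,
    absolute: bool = False,
) -> List[List[int]]:
    """Assign voiced chunk indices to files, starting a new file every limit_s.

    chunks   -- (duration in seconds, is_silent) pairs in arrival order
    absolute -- if True, silent chunks also advance the timer;
                if False, only recorded (voiced) time is accumulated
    """
    files: List[List[int]] = [[]]
    elapsed = 0.0
    for index, (duration_s, is_silent) in enumerate(chunks):
        if elapsed >= limit_s:
            files.append([])      # period elapsed: switch to a new data file
            elapsed = 0.0
        if not is_silent:
            files[-1].append(index)
            elapsed += duration_s
        elif absolute:
            elapsed += duration_s  # silence is not recorded but still counts
    return files
```

With a 20-second limit and a silent chunk in the middle of the input, the cumulative mode fills each file with 20 seconds of actual speech, while the absolute mode switches files sooner because the silent chunk also consumes part of the period.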

The methods of dividing at predetermined keywords or at predetermined intervals described above are not limited to devices that divide at each detected silent section, as the mobile phone 100 of the embodiment does; they may also be applied to devices that do not detect silent sections.

Although the embodiment describes an example in which the present invention is applied to a mobile phone, the invention is not limited to this and can also be applied to, for example, an IC recorder device or a simple voice memo device.

FIG. 1 is a block diagram showing the configuration of a mobile phone according to an embodiment of the present invention.
FIG. 2 is an explanatory diagram of the basic procedure of the recording process of the embodiment.
FIG. 3 is an explanatory diagram of the file structure of the embodiment.
FIG. 4 is an explanatory diagram of the playlist management file of the embodiment.
FIG. 5 is an explanatory diagram of the track management file of the embodiment.
FIG. 6 is a flowchart showing the procedure of the audio data file storage method of the embodiment.
FIG. 7 is an explanatory diagram of the basic procedure of the recording process according to another embodiment.
FIG. 8 is an explanatory diagram of the basic procedure of the recording process according to another embodiment.

Explanation of symbols

100 Mobile phone
10 Antenna
11 Wireless unit
12 Display unit
13 Operation unit
14 Speaker
15 Receiver
16 Microphone
17 Audio processing unit
18 Control unit
18a CPU
18b Memory

Claims

1. An audio data file storage method comprising: when a start event of a recording process for generating digital audio data based on input audio is detected, creating a management file and a data file for storing audio data, and recording additional information relating to the data file in the management file; stopping the recording process when a silent section of the input audio is detected after the start of the recording process; and, when sound is detected after the stop, starting a recording process using a new data file and recording new additional information relating to the new data file in the management file.

2. The audio data file storage method according to claim 1, wherein, after detecting the recording process start event, a recording process using a new data file is started each time a predetermined keyword is detected from the input voice.

3. The audio data file storage method according to claim 1, wherein a new data file is created each time a predetermined period elapses after detection of the recording process start event.

4. The audio data file storage method according to claim 3, wherein the elapse of the predetermined period is monitored based on the cumulative execution time of the recording process.

5. An audio data file storage method comprising: when a start event of a recording process for generating digital audio data based on input audio is detected, creating a management file and a data file for storing audio data, and recording additional information relating to the data file in the management file; and, each time a predetermined keyword is detected from the input audio after detection of the recording process start event, starting a recording process using a new data file and recording new additional information relating to the new data file in the management file.

6. An audio data file storage method comprising: when a start event of a recording process for generating digital audio data based on input audio is detected, creating a management file and a data file for storing audio data, and recording additional information relating to the data file in the management file; and, each time a predetermined period elapses after detection of the recording process start event, starting a recording process using a new data file and recording new additional information relating to the new data file in the management file.

7. The audio data file storage method according to claim 1, wherein information for identifying a creation order of the data file is recorded as the additional information.

8. A recording processing apparatus comprising: an audio processing unit that converts input audio into a digital signal; a control unit that performs a recording process for generating digital audio data based on the converted digital signal and stores the generated audio data; and an operation unit that generates events based on user operations,
wherein the control unit includes means for creating a management file each time a start event of the recording process occurs, means for creating a data file for storing audio data each time the recording process starts, means for recording additional information relating to the data file being created in the management file, and means for detecting a silent section from the digital signal corresponding to the input audio, and wherein the control unit stops the recording process when a silent section is detected and, when sound is detected from the digital signal after the stop, starts a recording process using a new data file.

9. The recording processing apparatus according to claim 8, wherein the control unit further includes means for detecting a predetermined keyword from the digital signal of the input audio, and starts a recording process using a new data file each time the keyword is detected.

10. The recording processing apparatus according to claim 8, wherein the control unit includes means for monitoring the period after the start event of the recording process occurs, and creates a new data file each time a predetermined period elapses.

11. The recording processing apparatus according to claim 10, wherein the control unit monitors the elapse of the predetermined period based on a cumulative execution time of the recording process.

12. The recording processing apparatus according to claim 8, further comprising a wireless communication unit.