JP5392827B2

JP5392827B2 - Sound data processing device

Info

Publication number: JP5392827B2
Application number: JP2009131317A
Authority: JP
Inventors: 晋太郎高田; 裕二山本
Original assignee: NEC Casio Mobile Communications Ltd
Current assignee: NEC Casio Mobile Communications Ltd
Priority date: 2009-05-29
Filing date: 2009-05-29
Publication date: 2014-01-22
Anticipated expiration: 2029-05-29
Also published as: JP2010278918A

Description

本発明は、複数のマイクロホンを備える音データ処理装置に関する。 The present invention relates to a sound data processing apparatus including a plurality of microphones.

複数のマイクロホンで音声を取り込み、特定の音声を抽出あるいは抑圧する音源分離技術が知られている（例えば、非特許文献１）。この音源分離技術は、近年では、ビデオカメラ、携帯電話、ＩＣレコーダ等の携帯機器に広く適用されている。
例えば、特許文献１には、音源分離機能を備えるビデオカメラの録音装置が開示されている。この録音装置は、ビデオカメラの被写体へのフォーカシングに同期して、マイクロホンの指向特性を被写体にフォーカシングし、被写体からの音声信号を抽出して臨場感の高い録音を可能とする。
また、特許文献２には、動画撮影時に複数マイクロホンの音声を画像と共に記録し、撮影後に、画像内に表示された音源の音に対して音源分離を行う音声データ編集装置が開示されている。 A sound source separation technique is known in which sound is captured by a plurality of microphones and specific sound is extracted or suppressed (for example, Non-Patent Document 1). In recent years, this sound source separation technique has been widely applied to portable devices such as video cameras, mobile phones, and IC recorders.
For example, Patent Document 1 discloses a recording device for a video camera having a sound source separation function. This recording device synchronizes with the subject of the video camera to the subject, focuses the directional characteristic of the microphone on the subject, extracts the audio signal from the subject, and enables recording with high presence.
Patent Document 2 discloses an audio data editing apparatus that records sound of a plurality of microphones together with an image during moving image shooting, and performs sound source separation on the sound of the sound source displayed in the image after shooting.

上記のような、音源分離の性能・解像度の保持のためには、一定数のマイクロホンが必要である。また、各マイクロホンの間隔も一定の間隔が必要な場合がある。さらに、音源の分析方向の範囲を広げるためには、マイクロホンを平面的に配置するだけでなく、立体的に配置する必要がある。しかし、ビデオカメラや、携帯電話等の携帯端末においては、マイクロホンを設置可能な部位が限られているため、複数のマイクロホンを理想的な位置に設置することは難しい。 In order to maintain the sound source separation performance and resolution as described above, a certain number of microphones are required. In addition, there may be a need for a certain interval between the microphones. Furthermore, in order to expand the range of the analysis direction of the sound source, it is necessary to arrange the microphones not only in a plane but also in a three-dimensional manner. However, in a portable terminal such as a video camera or a mobile phone, it is difficult to install a plurality of microphones at ideal positions because there are only limited parts where microphones can be installed.

そこで、可動部（例えば、ビデオカメラのモニタ部を有した筐体、折り畳みタイプやスライドタイプの携帯電話のモニタ部を有した筐体）を有する携帯端末にあっては、可動部にもマイクロホンを配置している。 Therefore, in a portable terminal having a movable part (for example, a case having a monitor part of a video camera, a case having a monitor part of a folding type or slide type mobile phone), a microphone is also arranged in the movable part. doing.

可動部にマイクロホンを配置した場合、可動部の動きにより、マイクロホンが予定位置からずれてしまい、音源処理を適切に実行できず、音源分離処理後の音データの品質が低下する場合がある。さらに、原音声をも劣化させてしまう事態が想定される。しかし、上述した特許文献１及び２に開示されている技術は、マイクロホンの位置（マイク位置）が撮影時に変化することは考慮していない。 When the microphone is arranged in the movable part, the microphone is displaced from the planned position due to the movement of the movable part, so that the sound source process cannot be appropriately performed, and the quality of the sound data after the sound source separation process may be deteriorated. Furthermore, it is assumed that the original voice is degraded. However, the techniques disclosed in Patent Documents 1 and 2 described above do not take into consideration that the position of the microphone (microphone position) changes during shooting.

特開平５−３０８５５３号公報Japanese Patent Laid-Open No. 5-308553 特開２００８−１３１１６８号公報JP 2008-131168 A

戸上真人，天野明雄，新庄広，鴨志田亮太，“人間共生ロボットＥＭＩＥＷの聴覚機能”，人工知能学会，ｐｐ．５９−６４，２００５／１０／１４Masato Togami, Akio Amano, Hiroshi Shinjo, Ryota Kamoshida, “Hearing Function of Human Symbiotic Robot EMIEW”, Japanese Society for Artificial Intelligence, pp. 59-64, 2005/10/14

本発明は、上述した問題に鑑みてなされたものであり、録音時のマイクロホンの位置が予め設定された位置と異なっても、音源分離後の音データの劣化を防ぐことを目的とする。
また、本発明は、可動部にマイクロホンが設置された電子機器において、適切に音源分離処理を可能とすることを他の目的とする。 The present invention has been made in view of the above-described problems, and an object of the present invention is to prevent deterioration of sound data after sound source separation even when the position of a microphone at the time of recording is different from a preset position.
Another object of the present invention is to enable sound source separation processing appropriately in an electronic device in which a microphone is installed in a movable part.

上記目的を達成するため、本発明の第１の観点に係る音データ処理装置は、
第１の筐体と、
第１の筐体に対して相対的に位置を変更することが可能な第２の筐体と、
前記第１の筐体に設けられる少なくとも１つ以上のマイクロホンと、
前記第２の筐体に設けられる少なくとも１つ以上のマイクロホンと、
前記マイクロホンの位置関係を特定するための情報を取得するマイク位置特定情報取得手段と、
前記マイクロホンから取り込んだ音データと、前記マイク位置特定情報取得手段で取得したマイク位置特定情報とを記録媒体に記録させる記憶手段と、
を備えることを特徴とする。 In order to achieve the above object, a sound data processing apparatus according to the first aspect of the present invention provides:
A first housing;
A second housing capable of changing its position relative to the first housing;
At least one microphone provided in the first housing;
At least one microphone provided in the second housing;
Microphone position specifying information acquiring means for acquiring information for specifying the positional relationship of the microphone;
Storage means for recording sound data captured from the microphone and microphone position specifying information acquired by the microphone position specifying information acquiring means on a recording medium;
It is characterized by providing.

本発明の第２の観点に係る音データ処理装置は、
第１の筐体と、
第１の筐体に対して相対的に位置を変更することが可能な第２の筐体と、
前記第１の筐体に設けられる少なくとも２つ以上のマイクロホンと、
前記第２の筐体に設けられる少なくとも１つ以上のマイクロホンと、
前記第１の筐体と前記第２の筐体との位置関係が所定の位置関係であるか否かを判別する位置判別手段と、
前記第１の筐体と前記第２の筐体との前記位置関係が前記所定の位置関係と同じ場合、前記第１及び第２の筐体に設けられたマイクロホンを選択し、前記第１の筐体と前記第２の筐体との前記位置関係が前記所定の位置関係と異なる場合、前記第１の筐体に設けられたマイクロホンを選択するマイクロホン選択手段と、
前記マイクロホン選択手段で選択した前記マイクロホンの位置関係を特定するための情報を取得するマイク位置特定情報取得手段と、
前記マイクロホン選択手段で選択した前記マイクロホンから取り込んだ音データと、前記マイク位置特定情報取得手段で取得したマイク位置特定情報とを記録媒体に記録させる記憶手段と、
を備えることを特徴とする。 The sound data processing device according to the second aspect of the present invention is:
A first housing;
A second housing capable of changing its position relative to the first housing;
At least two or more microphones provided in the first housing;
At least one microphone provided in the second housing;
Position determining means for determining whether or not a positional relationship between the first casing and the second casing is a predetermined positional relationship;
If the positional relationship between the first housing and the second housing is the same as the predetermined positional relationship, a microphone provided in the first and second housings is selected, and the first housing A microphone selecting means for selecting a microphone provided in the first housing when the positional relationship between the housing and the second housing is different from the predetermined positional relationship;
Microphone position specifying information acquiring means for acquiring information for specifying the positional relationship of the microphone selected by the microphone selecting means;
Storage means for recording sound data captured from the microphone selected by the microphone selecting means and microphone position specifying information acquired by the microphone position specifying information acquiring means on a recording medium;
It is characterized by providing.

好ましくは、
前記マイク位置特定情報取得手段は、前記第１の筐体と前記第２の筐体との前記位置関係を検出する位置検出手段をさらに備え、
前記マイク位置特定情報は、前記第１の筐体と前記第２の筐体との前記位置関係を示す情報を含む、
ことを特徴とする。 Preferably,
The microphone position specifying information acquisition means further includes position detection means for detecting the positional relationship between the first casing and the second casing,
The microphone position specifying information includes information indicating the positional relationship between the first casing and the second casing.
It is characterized by that.

また、好ましくは、
前記第１の筐体と前記第２の筐体との前記位置関係を検出する位置検出手段と、
前記位置検出手段により検出された位置関係から、前記第１の筐体に設けられる少なくとも２つ以上のマイクロホンと前記第２の筐体に設けられる少なくとも１つ以上のマイクロホンとから構成される複数のマイクロホンの位置関係を求めるマイク位置取得手段と、
をさらに備え、
前記マイク位置特定情報は、前記マイク位置取得手段により取得された前記複数のマイクロホンの前記位置関係を特定する情報を含む、
ことを特徴とする。 Also preferably,
Position detecting means for detecting the positional relationship between the first casing and the second casing;
Based on the positional relationship detected by the position detection means, a plurality of microphones including at least two or more microphones provided in the first casing and at least one or more microphones provided in the second casing. Microphone position acquisition means for obtaining the positional relationship of the microphone;
Further comprising
The microphone position specifying information includes information for specifying the positional relationship of the plurality of microphones acquired by the microphone position acquiring unit.
It is characterized by that.

さらに好ましくは、
前記マイク位置特定情報は、前記マイクロホンの位置関係が変化したタイミングを特定する情報、または、録音時点でのマイクロホンの位置関係を特定する情報、を含む、
ことを特徴とする。 More preferably,
The microphone position specifying information includes information for specifying timing when the positional relationship of the microphone has changed, or information for specifying the positional relationship of the microphone at the time of recording,
It is characterized by that.

また、さらに好ましくは、
前記第１の筐体と第２の筐体との前記位置関係が前記所定の位置関係と異なる場合に警告を行う警告手段
をさらに備えることを特徴とする。 More preferably,
The apparatus further comprises warning means for giving a warning when the positional relationship between the first housing and the second housing is different from the predetermined positional relationship.

また、さらに好ましくは、
音データ処理装置により記録媒体に記録された音データを読み出す読み出し手段と、
前記記録媒体に記録されているマイク位置特定情報に基づいて録音時のマイクロホンの位置関係を特定するマイク位置特定手段と、
前記読み出し手段により読み出された音データに、前記マイク位置特定手段で特定されたマイクロホンの位置関係に基づいて、音源分離処理を施す音源分離手段と、
前記音源分離処理を施した音データを再生する再生手段と、
を備えることを特徴とする。 More preferably,
Reading means for reading out sound data recorded on the recording medium by the sound data processing device;
Microphone position specifying means for specifying a microphone positional relationship during recording based on microphone position specifying information recorded on the recording medium;
Sound source separation means for performing sound source separation processing on the sound data read by the reading means based on the positional relationship of the microphones specified by the microphone position specifying means;
Reproducing means for reproducing the sound data subjected to the sound source separation processing;
It is characterized by providing.

また、さらに好ましくは、
音データ処理装置により記録媒体に記録された音データを読み出す読み出し手段と、
前記記録媒体に記録されているマイク位置特定情報に基づいて録音時のマイクロホンの位置関係を特定するマイク位置特定手段と、
前記読み出し手段により読み出された音データに、前記マイク位置特定手段で特定されたマイクロホンの位置関係に基づいて、音源分離処理を施す音源分離手段と、
前記音源分離処理を施した音データを再生する再生手段と、
を備え、
音データ処理装置により記録媒体に前記マイクロホンの位置関係が変化したタイミングを特定する情報に基づいて、前記読み出し手段により読み出した音データが、前記マイクロホンの位置関係が変化したタイミングに達することを契機として、前記位置関係から前記複数のマイクロホンの位置関係を求めるマイク位置取得手段と、
をさらに備えることを特徴とする。 More preferably,
Reading means for reading out sound data recorded on the recording medium by the sound data processing device;
Microphone position specifying means for specifying a microphone positional relationship during recording based on microphone position specifying information recorded on the recording medium;
Sound source separation means for performing sound source separation processing on the sound data read by the reading means based on the positional relationship of the microphones specified by the microphone position specifying means;
Reproducing means for reproducing the sound data subjected to the sound source separation processing;
With
On the basis of the information specifying the timing when the positional relationship of the microphone is changed on the recording medium by the sound data processing device, the sound data read by the reading means reaches the timing when the positional relationship of the microphone changes. , A microphone position acquisition means for obtaining a positional relationship of the plurality of microphones from the positional relationship;
Is further provided.

また、さらに好ましくは、
音データ処理装置により記録媒体に記録された音データを読み出す読み出し手段と、
前記記録媒体に記録されているマイク位置特定情報に基づいて録音時のマイクロホンの位置関係を特定するマイク位置特定手段と、
前記読み出し手段により読み出された音データに、前記マイク位置特定手段で特定されたマイクロホンの位置関係に基づいて、音源分離処理を施す音源分離手段と、
前記音源分離手段により音源分離処理が施された音データを記憶する音源分離データ記憶手段と、
を備えることを特徴とする。 More preferably,
Reading means for reading out sound data recorded on the recording medium by the sound data processing device;
Microphone position specifying means for specifying a microphone positional relationship during recording based on microphone position specifying information recorded on the recording medium;
Sound source separation means for performing sound source separation processing on the sound data read by the reading means based on the positional relationship of the microphones specified by the microphone position specifying means;
Sound source separation data storage means for storing sound data subjected to sound source separation processing by the sound source separation means;
It is characterized by providing.

また、さらに好ましくは、
音データ処理装置により記録媒体に記録された音データを読み出す読み出し手段と、
前記記録媒体に記録されているマイク位置特定情報に基づいて録音時のマイクロホンの位置関係を特定するマイク位置特定手段と、
前記読み出し手段により読み出された音データに、前記マイク位置特定手段で特定されたマイクロホンの位置関係に基づいて、音源分離処理を施す音源分離手段と、
前記音源分離手段により音源分離処理が施された音データを記憶する音源分離データ記憶手段と、
を備え、
音データ処理装置により記録媒体に前記マイクロホンの位置関係が変化したタイミングを特定する情報に基づいて、前記読み出し手段により読み出した音データが、前記マイクロホンの位置関係が変化したタイミングに達することを契機として、前記位置関係から前記複数のマイクロホンの位置関係を求めるマイク位置取得手段と、
をさらに備えることを特徴とする。 More preferably,
Reading means for reading out sound data recorded on the recording medium by the sound data processing device;
Microphone position specifying means for specifying a microphone positional relationship during recording based on microphone position specifying information recorded on the recording medium;
Sound source separation means for performing sound source separation processing on the sound data read by the reading means based on the positional relationship of the microphones specified by the microphone position specifying means;
Sound source separation data storage means for storing sound data subjected to sound source separation processing by the sound source separation means;
With
On the basis of the information specifying the timing when the positional relationship of the microphone is changed on the recording medium by the sound data processing device, the sound data read by the reading means reaches the timing when the positional relationship of the microphone changes. , A microphone position acquisition means for obtaining a positional relationship of the plurality of microphones from the positional relationship;
Is further provided.

また、さらに好ましくは、
前記記録媒体に記録された音データ、又は、前記音源分離手段により音源分離処理が施された音データに対し音編集処理を施す音編集手段
をさらに備えることを特徴とする。 More preferably,
It further comprises sound editing means for performing sound editing processing on sound data recorded on the recording medium or sound data subjected to sound source separation processing by the sound source separation means.

本発明によれば、可動部に設置されたマイクロホンを有する携帯機器において、可動部が撮影又は録音中に動かされたとしても、音源分離後の音データの劣化を防ぐことが可能となる。 ADVANTAGE OF THE INVENTION According to this invention, even if a movable part is moved during imaging | photography or recording in the portable apparatus which has the microphone installed in the movable part, it becomes possible to prevent deterioration of the sound data after sound source separation.

本発明の実施形態１に係る携帯装置の機能構成の一例を示すブロック図である。It is a block diagram which shows an example of a function structure of the portable apparatus which concerns on Embodiment 1 of this invention. （ａ）及び（ｂ）は、本発明の実施形態１に係る携帯装置の外観上の構成例を示す図である。(A) And (b) is a figure which shows the structural example on the external appearance of the portable apparatus which concerns on Embodiment 1 of this invention. 本発明の実施形態１に係る携帯装置の録画処理の流れを示すフローチャート図である。It is a flowchart figure which shows the flow of the video recording process of the portable apparatus which concerns on Embodiment 1 of this invention. （ａ）は、マイクロホンの基準位置を示すテーブルの例であり、（ｂ）は、ヒンジ角θとφの変化とマイクロホンの位置との関係を示すテーブルの例である。(A) is an example of the table which shows the reference position of a microphone, (b) is an example of the table which shows the relationship between the change of hinge angle (theta) and (phi), and the position of a microphone. 本発明の実施形態２に係る携帯装置の機能構成の一例を示すブロック図である。It is a block diagram which shows an example of a function structure of the portable apparatus which concerns on Embodiment 2 of this invention. 本発明の実施形態２に係る携帯装置の録画処理の流れを示すフローチャート図である。It is a flowchart figure which shows the flow of the video recording process of the portable apparatus which concerns on Embodiment 2 of this invention. 本発明の実施形態２に係る録画データの例を示す図である。It is a figure which shows the example of the video recording data which concern on Embodiment 2 of this invention. 本発明の実施形態２に係る携帯装置の再生編集処理の流れを示すフローチャート図である。It is a flowchart figure which shows the flow of the reproduction | regeneration edit process of the portable apparatus which concerns on Embodiment 2 of this invention. 本発明の実施形態３に係る携帯装置の機能構成の一例を示すブロック図である。It is a block diagram which shows an example of a function structure of the portable apparatus which concerns on Embodiment 3 of this invention. （ａ）及び（ｂ）は、本発明の実施形態３に係る携帯装置の外観上の構成例を示す図である。(A) And (b) is a figure which shows the structural example on the external appearance of the portable apparatus which concerns on Embodiment 3 of this invention. 本発明の実施形態３に係る携帯装置の録画処理の流れを示すフローチャート図である。It is a flowchart figure which shows the flow of the video recording process of the portable apparatus which concerns on Embodiment 3 of this invention. （ａ）は、携帯装置９０１ａにおける、スイッチ判別信号とマイクロホンの位置との対応関係の例を示す図であり、（ｂ）は、携帯装置９０１ｂにおける、スイッチ判別信号とマイクロホンの位置との対応関係の例を示す図である。(A) is a figure which shows the example of the correspondence of the switch discrimination | determination signal and the position of a microphone in the portable apparatus 901a, (b) is the correspondence of the switch discrimination | determination signal and the position of a microphone in the portable apparatus 901b. It is a figure which shows the example of. 本発明の実施形態４に係る携帯装置の機能構成の一例を示すブロック図である。It is a block diagram which shows an example of a function structure of the portable apparatus which concerns on Embodiment 4 of this invention. 本発明の実施形態４に係る携帯装置の録画処理の流れを示すフローチャート図である。It is a flowchart figure which shows the flow of the video recording process of the portable apparatus which concerns on Embodiment 4 of this invention. 本発明の実施形態４に係る録画データの例を示す図である。It is a figure which shows the example of the video recording data which concern on Embodiment 4 of this invention. 本発明の実施形態４に係る携帯装置の再生編集処理の流れを示すフローチャート図である。It is a flowchart figure which shows the flow of the reproduction | regeneration edit process of the portable apparatus which concerns on Embodiment 4 of this invention.

以下、音データ処理機能及び動画撮影機能を有する折り畳み型の携帯電話等の携帯装置に本発明が適用される実施の形態を説明する。また、特定の音源のみの抽出又は抑圧を行う技術を総称して音源分離処理と呼ぶ。 Hereinafter, an embodiment in which the present invention is applied to a portable device such as a folding cellular phone having a sound data processing function and a moving image photographing function will be described. Further, techniques for extracting or suppressing only specific sound sources are collectively referred to as sound source separation processing.

（実施形態１）
＜録画時に、可動部を有する携帯装置の姿勢を判別し、判別した姿勢に従って音源分離処理を適切に行う例＞
本実施形態では、本体部と可動部とを備える携帯装置１０１において、本体部と可動部とがなす角度に応じて、音源分離処理を適切に行い、音源分離処理を施した音データを画像と共に記憶する携帯装置１０１について説明する。 (Embodiment 1)
<Example of discriminating the attitude of a portable device having a movable part during recording and appropriately performing sound source separation processing according to the discriminated attitude>
In the present embodiment, in a portable device 101 including a main body and a movable portion, sound source separation processing is appropriately performed according to an angle formed by the main body portion and the movable portion, and sound data subjected to the sound source separation processing is displayed together with an image. The portable device 101 to be stored will be described.

本実施形態の携帯装置１０１は、図１に示すように、マイクロホン１０２〜１０５と、ＡＤＣ（Analog to Digital Converter）１０６と、撮影部１０７と、入力部１０８と、角度センサ１０９と、制御部１１０と、記憶部１１３と、表示部１１４と、ＤＡＣ（Digital to Analog Converter）１１５と、スピーカ１１６と、を備える。 As shown in FIG. 1, the portable device 101 according to the present embodiment includes microphones 102 to 105, an ADC (Analog to Digital Converter) 106, a photographing unit 107, an input unit 108, an angle sensor 109, and a control unit 110. A storage unit 113, a display unit 114, a DAC (Digital to Analog Converter) 115, and a speaker 116.

マイクロホン１０２〜１０５は、音を集音し、集音した音をアナログ信号に変換する。後述するように、マイクロホン１０２〜１０４は、本体に、マイクロホン１０５は、可動部に配置されている。 The microphones 102 to 105 collect sound and convert the collected sound into an analog signal. As will be described later, the microphones 102 to 104 are disposed in the main body, and the microphone 105 is disposed in the movable portion.

ＡＤＣ１０６はマイクロホン１０２〜１０５から入力された音データのアナログ信号をデジタル信号に変換し、そのデジタル信号を制御部１１０に送る。 The ADC 106 converts an analog signal of sound data input from the microphones 102 to 105 into a digital signal, and sends the digital signal to the control unit 110.

撮影部１０７は、本体に設置され、ＣＣＤ（Charge Coupled Device）カメラ、ＣＭＯＳ（Complimentary MOS）センサ等から構成され、映像を電気信号に変換し、映像信号を制御部１１０に送る。 The photographing unit 107 is installed in the main body, and is composed of a CCD (Charge Coupled Device) camera, a CMOS (Complimentary MOS) sensor, and the like, converts an image into an electric signal, and sends the image signal to the control unit 110.

入力部１０８は、電源スイッチや録音又は録画ボタン、映像ズーム倍率の指定、音源抽出や抑圧等の音源分離処理の効果（音源分離効果）を指定するためのボタン等から構成される。ユーザよりズーム制御や音源分離処理等の各種制御処理の指示を受け付け、その指示情報を制御部１１０に送る。 The input unit 108 includes a power switch, a recording or recording button, a button for designating an image zoom magnification, and a sound source separation process effect (sound source separation effect) such as sound source extraction and suppression. An instruction of various control processes such as zoom control and sound source separation process is received from the user, and the instruction information is sent to the control unit 110.

角度センサ１０９は、ロータリー・エンコーダ、ジャイロセンサ等から構成され、表示部１１４を備える可動部に設置される。角度センサ１０９は、本体部と可動部とのなす角度（後述するヒンジ部の角度（ヒンジ角度））を検出し、その角度情報を制御部１１０に送る。 The angle sensor 109 is composed of a rotary encoder, a gyro sensor, and the like, and is installed in a movable part including the display unit 114. The angle sensor 109 detects an angle formed by the main body unit and the movable unit (an angle of a hinge unit (hinge angle) described later) and sends the angle information to the control unit 110.

制御部１１０は、ＣＰＵ（Central Processing Unit）と、ＲＯＭ（Read Only Memory）と、ＲＡＭ（Random Access Memory）とから構成される。制御部１１０は、記憶部１１３に格納されているプログラムを実行し、携帯装置１０１本来の機能を実行すると共に、音データ処理装置としての動作も実行し、例えば、図３のフローチャートに示す処理を実行する。 The control unit 110 includes a CPU (Central Processing Unit), a ROM (Read Only Memory), and a RAM (Random Access Memory). The control unit 110 executes a program stored in the storage unit 113, executes the original function of the portable device 101, and also executes an operation as a sound data processing device. For example, the processing shown in the flowchart of FIG. Run.

制御部１１０は、機能的に、音源分離処理部１１１及びマイク位置取得部１１２を有する。 The control unit 110 functionally includes a sound source separation processing unit 111 and a microphone position acquisition unit 112.

音源分離処理部１１１はＡＤＣ１０６から送られた複数の音データに対し、入力部１０８から送られた指示に基づいて音源分離処理を施す。音源分離処理部１１１は、角度センサ１０９の検出値に基づいて、その時点におけるマイクロホン１０２〜１０５の実際の位置に基づいて、音源分離を行う。
なお、音源分離処理部１１１で行われる音源分離は、所定のデジタル信号処理に基づいた処理や、あるいはそれ以外のアルゴリズムを任意に適用可能である。 The sound source separation processing unit 111 performs sound source separation processing on a plurality of sound data sent from the ADC 106 based on an instruction sent from the input unit 108. The sound source separation processing unit 111 performs sound source separation based on the actual values of the microphones 102 to 105 at that time based on the detection value of the angle sensor 109.
Note that the sound source separation performed by the sound source separation processing unit 111 can arbitrarily apply processing based on predetermined digital signal processing or other algorithms.

マイク位置取得部１１２は、可動部の位置が基準位置(例えば、図２（ｂ）においてθ=９０°、φ=０°等)でない場合に、角度センサ１０９が検出したヒンジ角度を基に、マイクロホン１０２〜１０５の位置を、幾何学的な計算で求め、音源分離処理部１１１にその情報を送る。なお、マイク位置の求め方自体は任意である。 The microphone position acquisition unit 112 is based on the hinge angle detected by the angle sensor 109 when the position of the movable unit is not the reference position (for example, θ = 90 °, φ = 0 ° in FIG. 2B). The positions of the microphones 102 to 105 are obtained by geometric calculation, and the information is sent to the sound source separation processing unit 111. Note that the method for obtaining the microphone position itself is arbitrary.

記憶部１１３は、ＲＯＭ、フラッシュメモリ、ＨＤＤ（Hard Disk Drive）等から構成され、撮影部１０７から取り込んだ画像データ、音源分離処理後の音データ、マイクロホンの基準位置（可動部が基準位置にあるときのマイクロホンの位置）の情報等の音源分離処理の際に使用される諸情報（音源分離パラメータ）、及び音源分離処理部１１１から得た情報等を格納する。また、制御部１１０が行う処理に係るプログラム等を格納する。
予め記憶部１１３に記憶される音源分離パラメータは、マイクロホンの基準位置の情報と、音源分離効果に関する情報（音源の強調量、周囲音の抑圧量等）と、音源到来方向の範囲（その範囲内に位置する音源からの音を強調又は減衰する）等から構成される。例えば、音源到来方向の範囲は撮影時の映像ズームの段階と、予め対応付けて記憶されてもよい。 The storage unit 113 includes a ROM, a flash memory, an HDD (Hard Disk Drive), and the like. The image data captured from the imaging unit 107, the sound data after the sound source separation process, and the microphone reference position (the movable unit is at the reference position). Various information (sound source separation parameters) used in sound source separation processing such as information on the position of the microphone at the time, information obtained from the sound source separation processing unit 111, and the like are stored. In addition, a program or the like related to processing performed by the control unit 110 is stored.
The sound source separation parameter stored in advance in the storage unit 113 includes information on the reference position of the microphone, information on the sound source separation effect (sound source enhancement amount, ambient sound suppression amount, etc.), and sound source arrival direction range (within that range). The sound from the sound source located at (1) is emphasized or attenuated). For example, the range of the sound source arrival direction may be stored in advance in association with the stage of image zoom at the time of shooting.

表示部１１４は、ＬＣＤ（Liquid Crystal Display）又は有機ＥＬディスプレイ（organic Electro-Luminescence display）、及びドライバ等から構成され、撮影部１０７で撮影した映像や、撮影モード等の諸情報を表示する。 The display unit 114 includes an LCD (Liquid Crystal Display) or an organic EL display (organic Electro-Luminescence display), a driver, and the like, and displays various images such as an image captured by the imaging unit 107 and an imaging mode.

ＤＡＣ１１５は音データを出力する際に、デジタル信号をアナログ信号に変換し音出力部１１６に送る。 When the DAC 115 outputs sound data, the DAC 115 converts the digital signal into an analog signal and sends the analog signal to the sound output unit 116.

音出力部１１６は増幅器、スピーカ等から構成され、ＤＡコンバータ１１５からの信号に基づいて音を出力する。 The sound output unit 116 includes an amplifier, a speaker, and the like, and outputs sound based on a signal from the DA converter 115.

次に、携帯装置１０１の機械的構成について説明する。
携帯装置１０１は、図２（ａ）、（ｂ）に例示するように、本体部２０１と、可動部２０２と、これらを連結するヒンジ部と、から構成される。
本体部２０１には、マイクロホン１０２〜１０５，撮影部１０７が固定されており、可動部２０２には、マイクロホン１０５が配置されている。
図２（ａ）は、可動部２０２が一方向に開閉する携帯装置１０１ａを、図２（ｂ）は、可動部２０２が２軸の周りを動く携帯装置１０１ｂを、それぞれ、示している。 Next, the mechanical configuration of the mobile device 101 will be described.
The portable apparatus 101 is comprised from the main-body part 201, the movable part 202, and the hinge part which connects these so that it may illustrate to Fig.2 (a), (b).
Microphones 102 to 105 and the photographing unit 107 are fixed to the main body 201, and the microphone 105 is disposed on the movable unit 202.
2A shows a portable device 101a in which the movable unit 202 opens and closes in one direction, and FIG. 2B shows a portable device 101b in which the movable unit 202 moves around two axes.

図２（ａ）に示す携帯装置１０１ａにおいて、ヒンジ部は、１軸型のヒンジ機構から構成され、角度センサ１０９はヒンジ部に設置され、本体部２０１と可動部２０２がなす角度θを測定する。可動部２０２に設置されたマイクロホン１０５の位置はヒンジ角度θから求まる。 In the portable device 101a shown in FIG. 2A, the hinge portion is configured by a uniaxial hinge mechanism, the angle sensor 109 is installed on the hinge portion, and measures an angle θ formed by the main body portion 201 and the movable portion 202. . The position of the microphone 105 installed on the movable unit 202 is obtained from the hinge angle θ.

一方、図２（ｂ）に示す携帯装置１０１ｂにおいて、ヒンジ部は２軸型ヒンジ機構から構成される。２軸型ヒンジ機構は相互に直交する軸（回転軸ｘ、回転軸ｙ）を有し、可動部２０２は回転軸ｘ周りに回転可能であると共に回転軸ｙ周りに回転可能である。角度センサ１０９は、回転軸ｘ周りにおける本体部２０１と可動部２０２とがなす角度θと、回転軸ｙ周りにおける本体部２０１と可動部２０２とがなす角度φと、を検出する。可動部２０２に設置されたマイクロホン１０５の位置はヒンジ角度θ及びφから求まる。 On the other hand, in the portable device 101b shown in FIG. 2 (b), the hinge portion is composed of a biaxial hinge mechanism. The biaxial hinge mechanism has axes (rotation axis x, rotation axis y) orthogonal to each other, and the movable portion 202 can rotate about the rotation axis x and can rotate about the rotation axis y. The angle sensor 109 detects an angle θ formed between the main body 201 and the movable part 202 around the rotation axis x and an angle φ formed between the main body 201 and the movable part 202 around the rotation axis y. The position of the microphone 105 installed on the movable part 202 is obtained from the hinge angles θ and φ.

なお、以下の説明では、理解を容易にするため、携帯装置１０１ｂに関して説明する。 In the following description, the portable device 101b will be described for easy understanding.

マイクロホン１０２〜１０４は本体部２０１に配置され、マイクロホン１０５は可動部２０２に配置される。ここで、図２（ｂ）に示すように、撮影部１０７の光軸をｚ軸とし、互いに垂直なｘ軸、ｙ軸、及びｚ軸から構成される直交座標系を設定する。マイクロホン１０２〜１０５の３次元空間上の位置は、この直交座標系で定義される。マイクロホン１０５の位置及びマイクロホン１０２〜１０５の位置関係は、携帯装置１０１ｂの姿勢によって変化する。 The microphones 102 to 104 are arranged in the main body unit 201, and the microphone 105 is arranged in the movable unit 202. Here, as shown in FIG. 2B, the optical axis of the photographing unit 107 is set as the z axis, and an orthogonal coordinate system including the x axis, the y axis, and the z axis perpendicular to each other is set. The positions of the microphones 102 to 105 in the three-dimensional space are defined by this orthogonal coordinate system. The position of the microphone 105 and the positional relationship between the microphones 102 to 105 change depending on the posture of the portable device 101b.

以下、携帯装置１０１ｂにおいて実行される録画処理について説明する。なお、映像の撮影・保存等の一般的な撮影装置としての動作は従来と同一であるため、本実施形態に特有な音源分離処理を中心に説明する。 Hereinafter, the recording process executed in the portable device 101b will be described. Note that operations as a general image capturing apparatus such as image capturing / storing are the same as those in the past, and thus the description will focus on sound source separation processing unique to the present embodiment.

ユーザが入力部１０８に録画処理開始及び音源分離処理開始の指示を入力すると、入力部１０８はその指示を制御部１１０に送る。制御部１１０は、録画処理開始及び音源分離処理開始の指示に応答し、図３のフローチャートに示す録画処理を開始する。 When the user inputs an instruction to start recording processing and sound source separation processing to the input unit 108, the input unit 108 sends the instructions to the control unit 110. In response to the instruction to start the recording process and the sound source separation process, the control unit 110 starts the recording process shown in the flowchart of FIG.

制御部１１０は、まず、撮影部１０７に画像の取り込みを開始させ、マイクロホン１０２〜１０５に音の取り込みを開始させる。さらに、制御部１１０は、撮影部１０７及びマイクロホン１０２〜１０５から取り込んだデータを一時的に制御部１１０のＲＡＭ等に記憶させる（ステップＳ３０１）。 First, the control unit 110 causes the photographing unit 107 to start capturing an image and causes the microphones 102 to 105 to start capturing sound. Further, the control unit 110 temporarily stores the data acquired from the imaging unit 107 and the microphones 102 to 105 in the RAM or the like of the control unit 110 (step S301).

続いて、制御部１１０は、予め記憶部１１３に記憶された音源分離パラメータをＲＡＭ等に読み出す（ステップＳ３０２）。 Subsequently, the control unit 110 reads the sound source separation parameter stored in advance in the storage unit 113 into a RAM or the like (step S302).

次に、制御部１１０は、角度センサ１０９から送られたヒンジ角度θとφが基準角度と同一か否か、即ち、θ＝９０°、φ＝０°であるか否かを判別する（ステップＳ３０３）。 Next, the controller 110 determines whether or not the hinge angles θ and φ sent from the angle sensor 109 are the same as the reference angle, that is, whether θ = 90 ° and φ = 0 ° (step). S303).

ヒンジ角度θ、φが共に基準角度と同一ならば（ステップＳ３０３；Ｎｏ）、制御部１１０は、ステップＳ３０２で読み出しておいたマイクロホンの基準位置の情報を音源分離処理部１１１に送り、音源分離処理部１１１は初期マイクロホンの基準位置の情報を音源分離パラメータとして設定する（ステップＳ３１０）。
続いて、音源分離処理部１１１は音源分離パラメータを基に、ＡＤＣ１０６を介して供給されるマイクロホン１０２〜１０５からの音データに、音源分離処理を施す（ステップＳ３０７）。例えば、ユーザが指示した分離対象音源が撮影部１０７のフォーカス位置の音源、音源分離処理の効果を「抽出（強調）」であるとすると、制御部１１０は、マイクロホン１０２〜１０５からＡＣＤＣ１０６を介して供給される音データに、フォーカス位置からの音を抽出する音源分離処理を実行する。
制御部１１０は、画像データと、音源分離処理後の音データとを記憶部１１３に書き込む（ステップＳ３０８）。 If the hinge angles θ and φ are both the same as the reference angle (step S303; No), the control unit 110 sends the information on the reference position of the microphone read in step S302 to the sound source separation processing unit 111, and the sound source separation processing. The unit 111 sets information on the reference position of the initial microphone as a sound source separation parameter (step S310).
Subsequently, the sound source separation processing unit 111 performs sound source separation processing on the sound data from the microphones 102 to 105 supplied via the ADC 106 based on the sound source separation parameter (step S307). For example, if the separation target sound source instructed by the user is the sound source at the focus position of the photographing unit 107 and the effect of the sound source separation processing is “extraction (emphasis)”, the control unit 110 transmits the microphones 102 to 105 via the ACDC 106. A sound source separation process for extracting sound from the focus position is performed on the supplied sound data.
The control unit 110 writes the image data and the sound data after the sound source separation process in the storage unit 113 (step S308).

次に、制御部１１０は、録画処理終了の指示の有無を判別する（ステップＳ３０９）。録画処理終了の指示がない場合（ステップＳ３０９；Ｎｏ）、制御部１１０は、ヒンジ角度θ、φに変更があるか否かを判別する（ステップ３１１）。
ヒンジ角度θ、φが変更された場合（ステップＳ３１１；Ｙｅｓ）、制御部１１０は、ステップＳ３０３に進み、ヒンジ角度θ、φに変更がない場合は（ステップＳ３１１；Ｎｏ）、音データに対して引き続き同一の音源分離処理を施す（ステップＳ３０７）。 Next, the control unit 110 determines whether there is an instruction to end the recording process (step S309). When there is no instruction to end the recording process (step S309; No), the control unit 110 determines whether or not the hinge angles θ and φ are changed (step 311).
When the hinge angles θ and φ are changed (step S311; Yes), the control unit 110 proceeds to step S303. When the hinge angles θ and φ are not changed (step S311; No), the control unit 110 performs the processing on the sound data. Subsequently, the same sound source separation process is performed (step S307).

一方、録画開始時点で、ヒンジ角度θとφの少なくも一方が基準角度でない場合（ステップＳ３０３；Ｙｅｓ）、或いは、録画開始後、可動部２０２が操作されて、ヒンジ角度θとφの少なくも一方が基準角度でなくなった場合（ステップＳ３１１；Ｙｅｓ，ステップＳ３０３；Ｙｅｓ）、制御部１１０は、可動部２０２が基準位置ではない旨を表示部１１４に表示させる（ステップＳ３０４）。例えば、撮影中の表示部１１４に“マイクの位置が動かされました。”等のメッセージを表示する。 On the other hand, if at least one of the hinge angles θ and φ is not the reference angle at the start of recording (step S303; Yes), or after the start of recording, the movable unit 202 is operated to reduce at least the hinge angles θ and φ. When one side is no longer the reference angle (step S311; Yes, step S303; Yes), the control unit 110 displays on the display unit 114 that the movable unit 202 is not the reference position (step S304). For example, a message such as “The microphone position has been moved” is displayed on the display unit 114 during shooting.

次に、マイク位置取得部１１２はヒンジ角度θ、φの値に基づいて、その時点のマイクロホン１０５の位置を新たに取得する（ステップＳ３０５）。例えば、図２（ｂ）の座標系において、可動部の基準位置を、θ＝９０°、φ＝０°とし、基準位置でのマイクロホン１０５の座標を（ｘ０，ｙ０，ｚ０）と定義する。ここで、可動部が閉じた状態にある時は、θ＝０°、可動部が図２（ｂ）の状態にある時は、θ＝９０°である。可動部が、φ＝０°のままで、ｘ軸周りに移動した時のヒンジ角度θの差分を、Δθ＝θ’−θ（θ：移動前のヒンジ角度、θ’：移動後のヒンジ角度）とすると、移動後のマイクロホン１０５の座標（ｘ１，ｙ１，ｚ１）は、（ｘ０，ｙ０・ｃｏｓΔθ，ｙ０・ｓｉｎΔθ）と求まる。また、可動部が、θ＝９０°のままで、ｙ軸周りに移動したときのヒンジ角度φの差分を、Δφ＝φ’−φ（φ：移動前のヒンジ角度、φ’：移動後のヒンジ角度）とすると、移動後のマイクロホン１０５の座標（ｘ２，ｙ２，ｚ２）は、（ｘ０・ｃｏｓΔφ，ｙ０，ｘ０・ｓｉｎΔφ）と求まる。
その後、マイク位置取得部１１２は、取得した新たなマイク位置の情報を音源分離処理部１１１に送り、音源分離処理部１１１は受け付けた新たなマイク位置を音源分離パラメータとして設定する（ステップＳ３０６）。 Next, the microphone position acquisition unit 112 newly acquires the position of the microphone 105 at that time based on the values of the hinge angles θ and φ (step S305). For example, in the coordinate system of FIG. 2B, the reference position of the movable part is defined as θ = 90 ° and φ = 0 °, and the coordinates of the microphone 105 at the reference position are defined as (x0, y0, z0). Here, when the movable part is in the closed state, θ = 0 °, and when the movable part is in the state of FIG. 2B, θ = 90 °. The difference of the hinge angle θ when the movable part moves around the x axis while φ = 0 ° is expressed as Δθ = θ′−θ (θ: hinge angle before movement, θ ′: hinge angle after movement) ), The coordinates (x1, y1, z1) of the microphone 105 after movement are obtained as (x0, y0 · cos Δθ, y0 · sin Δθ). Further, the difference of the hinge angle φ when the movable part moves around the y axis while θ = 90 ° is expressed as Δφ = φ′−φ (φ: hinge angle before movement, φ ′: after movement) Assuming that (hinge angle), the coordinates (x2, y2, z2) of the microphone 105 after movement are obtained as (x0 · cosΔφ, y0, x0 · sinΔφ).
Thereafter, the microphone position acquisition unit 112 sends the acquired information on the new microphone position to the sound source separation processing unit 111, and the sound source separation processing unit 111 sets the received new microphone position as a sound source separation parameter (step S306).

音源分離処理部１１１は、設定されたマイク位置を含む音源分離パラメータに基づいて音源分離処理を行う（ステップＳ３０７）。前述の例で説明すると、撮影部１０７のフォーカス位置からの音をマイクロホン１０２〜１０５の新たな位置関係を用いて抽出する。
制御部１１０は、画像データと、音源分離処理後の音データとを記憶部１１３に書き込む（ステップＳ３０８）。 The sound source separation processing unit 111 performs sound source separation processing based on the sound source separation parameter including the set microphone position (step S307). In the example described above, the sound from the focus position of the photographing unit 107 is extracted using the new positional relationship of the microphones 102 to 105.
The control unit 110 writes the image data and the sound data after the sound source separation process in the storage unit 113 (step S308).

録画処理終了の指示を受け付けた場合は（ステップＳ３０９：Ｙｅｓ）、録画処理を終了する。 If an instruction to end the recording process is received (step S309: Yes), the recording process is ended.

本実施形態によれば、可動部２０２を有する携帯装置１０１において、音源分離処理後の音データを記憶する場合に、マイクロホンが設置された可動部２０２が録画又は録音中に基準位置から動いても、角度センサ１０９の情報から可動部、マイク位置を取得し、そのマイクロホン位置を音源分離処理に適用することで、音源分離処理後のデータの品質を保ったままで録音を行うことができる。また、マイクロホンが設置された可動部が撮影中に動いても、撮影者に可動部２０２の位置が基準位置でないことを知らせることができる。 According to the present embodiment, in the portable device 101 having the movable part 202, when storing the sound data after the sound source separation process, even if the movable part 202 provided with the microphone moves from the reference position during recording or recording. By acquiring the movable part and the microphone position from the information of the angle sensor 109 and applying the microphone position to the sound source separation process, recording can be performed while maintaining the quality of the data after the sound source separation process. In addition, even if the movable part in which the microphone is installed moves during photographing, the photographer can be notified that the position of the movable part 202 is not the reference position.

なお、図２（ｂ）に示す構成を例に実施形態を説明したが、図２（ａ）に示す構成の場合には、φ＝０と設定すればよい。 Although the embodiment has been described by taking the configuration shown in FIG. 2B as an example, in the configuration shown in FIG. 2A, φ = 0 may be set.

また、以上の説明においては、ヒンジ角θ或いはφが基準角度以外の場合に、ステップＳ３０５で、マイクロホン１０５の位置を演算処理で求めたが、例えば、ヒンジ角θとφについて、所定の分解能で、マイクロホン１０５の位置を予め求めておき、図４（ｂ）に示すようにテーブル化して、記憶部１１３に格納しておき、ステップＳ３０２で、このテーブルをＲＡＭ上に展開し、ステップＳ３０５では、ヒンジ角θとφをキーにこのテーブルを検索してマイクロホン１０５の位置を求めるようにしてもよい。 In the above description, when the hinge angle θ or φ is other than the reference angle, the position of the microphone 105 is obtained by calculation processing in step S305. For example, the hinge angles θ and φ are determined with a predetermined resolution. , The position of the microphone 105 is obtained in advance, converted into a table as shown in FIG. 4B and stored in the storage unit 113. In step S302, this table is expanded on the RAM. In step S305, The position of the microphone 105 may be obtained by searching this table using the hinge angles θ and φ as keys.

なお、本発明は、折り畳み型携帯電話に限定されるものではなく、可動部を有した録画装置、又は録音装置全般に適用できる。例えば、スライド型の携帯電話、その他複数の使用スタイルを選択できる携帯電話、ビデオカメラ、電子カメラ、ムービ、ＰＤＡ、ノートパソコン、ウェアラブルパソコン、電卓、電子辞書等に適用できる。また、アプリケーションも動画撮影や録音のみに限定されず、ＴＶ電話や通話、音声認識アプリケーション等にも適用可能である。 Note that the present invention is not limited to a foldable mobile phone, and can be applied to a recording apparatus having a movable portion or a general recording apparatus. For example, the present invention can be applied to a slide-type mobile phone, a mobile phone capable of selecting a plurality of usage styles, a video camera, an electronic camera, a movie, a PDA, a notebook computer, a wearable personal computer, a calculator, an electronic dictionary, and the like. In addition, the application is not limited to moving image shooting and recording, but can be applied to a TV phone, a telephone call, a voice recognition application, and the like.

また、本実施形態では、可動部２０２が回転軸ｘ、回転軸ｙの２軸周りを可動する２軸型ヒンジを備える携帯装置について説明したが、可動する範囲はこれに限定されるものでは無い。さらに、可動部２０２が基準位置でない場合、警告の方法は、モニタへのメッセージ表示に限定されるものでは無く、筐体に設置されたランプの点滅や、筐体部分の振動、スピーカ１１６からの警告音等の方法でもよい。 In the present embodiment, the portable unit 202 has been described with respect to the portable device including the two-axis hinge that can move around the two axes of the rotation axis x and the rotation axis y. However, the movable range is not limited to this. . Further, when the movable unit 202 is not at the reference position, the warning method is not limited to the message display on the monitor, but the blinking of the lamp installed in the casing, the vibration of the casing, A warning sound or the like may be used.

また、本実施形態では、４つのマイクロホンを図２（ａ）及び図２（ｂ）のように配置したが、マイクロホンの数が２つ以上でかつ、マイクロホンが本体部２０１と可動部２０２のそれぞれに配置されている場合であるならば、本発明を適用可能である。 In the present embodiment, four microphones are arranged as shown in FIGS. 2A and 2B. However, the number of microphones is two or more, and the microphones are the main body 201 and the movable part 202, respectively. If this is the case, the present invention is applicable.

（実施形態２）
＜撮影後、再生編集時に音源分離を行う例＞
実施形態１においては、録画・録音時に音源分離処理を実行し、音源分離処理を施した音データを記憶部１１３に記録したが、録画・録音時には、生の音データを記録し、再生・編集時に音源分離処理を施すことも可能である。このような構成の携帯装置５０１を以下に説明する。 (Embodiment 2)
<Example of sound source separation after shooting and during playback editing>
In the first embodiment, sound source separation processing is performed during recording / recording, and sound data subjected to sound source separation processing is recorded in the storage unit 113. However, during recording / recording, raw sound data is recorded and played back / edited. Sometimes sound source separation processing can be applied. The portable device 501 having such a configuration will be described below.

携帯装置５０１は、図５に示すように、マイクロホン１０２〜１０５と、ＡＤＣ１０６と、撮影部１０７と、入力部１０８と、角度センサ１０９と、制御部１１０と、記憶部１１３と、表示部１１４と、ＤＡＣ１１５と、スピーカ１１６と、から構成される。
制御部１１０以外は、実施形態１の携帯装置１０１と同様の機能構成を有する。また、実施形態２における携帯装置５０１の外観の構成例は図２に示すものと同様とする。以下、実施形態１と異なる機能構成を有する制御部１１０について説明する。 As illustrated in FIG. 5, the portable device 501 includes a microphone 102 to 105, an ADC 106, an imaging unit 107, an input unit 108, an angle sensor 109, a control unit 110, a storage unit 113, and a display unit 114. , DAC 115 and speaker 116.
Except for the control unit 110, it has the same functional configuration as the portable device 101 of the first embodiment. In addition, the configuration example of the appearance of the portable device 501 in the second embodiment is the same as that shown in FIG. Hereinafter, the control unit 110 having a functional configuration different from that of the first embodiment will be described.

制御部１１０は、機能的に、音源分離処理部１１１と、マイク位置取得部１１２と、再生編集処理部５１１と、計時部５１２と、を有する。音源分離処理部１１１及びマイク位置取得部１１２は実施形態１と同様の機能を有する。
制御部１１０は、さらに、角度センサ１０９により計測されたヒンジ角度θ、φが変更されたか否かを判別する。そして、変更されたタイミングを表す情報とヒンジ角度θ、φを対応付けて、記憶部１１３に記憶させる。 The control unit 110 functionally includes a sound source separation processing unit 111, a microphone position acquisition unit 112, a reproduction / editing processing unit 511, and a timer unit 512. The sound source separation processing unit 111 and the microphone position acquisition unit 112 have the same functions as those in the first embodiment.
The controller 110 further determines whether or not the hinge angles θ and φ measured by the angle sensor 109 have been changed. Then, the information indicating the changed timing and the hinge angles θ and φ are associated with each other and stored in the storage unit 113.

再生編集処理部５１１は、記憶部１１３に記憶されている画像データや音データの再生編集や、音源分離処理部１１１が処理した音データの再生又は書き込み等を行う。例えば、再生編集処理部５１１は、記憶部１１３に記憶されている音データや音源分離処理部１１１が処理した音データに対して、特定の周波数帯域の強調や音量変換等を行う。 The reproduction editing processing unit 511 performs reproduction editing of image data and sound data stored in the storage unit 113, reproduction or writing of the sound data processed by the sound source separation processing unit 111, and the like. For example, the reproduction / edit processing unit 511 performs enhancement of a specific frequency band, volume conversion, and the like on the sound data stored in the storage unit 113 and the sound data processed by the sound source separation processing unit 111.

計時部５１２は、録画処理開始からの経過時間を計測する。制御部１１０は、ヒンジ角度θ、φが変更された場合に、計時部５１２が計測した経過時間を新たなヒンジ角度θ、φの情報に対応付けて記憶部１１３に記憶させる。 The timer unit 512 measures an elapsed time from the start of the recording process. When the hinge angles θ and φ are changed, the control unit 110 causes the storage unit 113 to store the elapsed time measured by the time measuring unit 512 in association with information on the new hinge angles θ and φ.

また、記憶部１１３に記憶される録画データのデータ構造の例を図７に示す。録画データは、撮影部１０７から取り込んだ画像データと、マイクロホン１０２〜１０５から取り込んだ音データと、マイクロホンの基準位置等の音源分離パラメータと、録画処理開始時の可動部のヒンジ角度、φ（可動部の角度１）と、ヒンジ角度θ、φが変更された時刻及び変更されたヒンジ角度θ、φと（可動部の角度２、・・・）、から構成される。 An example of the data structure of the recording data stored in the storage unit 113 is shown in FIG. The recorded data includes image data captured from the photographing unit 107, sound data captured from the microphones 102 to 105, sound source separation parameters such as a reference position of the microphone, hinge angle of the movable unit at the start of the recording process, φ (movable Part angle 1), the time when the hinge angles θ and φ are changed, and the changed hinge angle θ and φ (the angle 2 of the movable part,...).

以下、携帯装置５０１が行う、録画処理の動作と、再生編集処理の動作について説明する。 Hereinafter, an operation of a recording process and an operation of a reproduction / editing process performed by the portable device 501 will be described.

ユーザが入力部１０８に録画処理開始の指示を入力すると、入力部１０８は録画処理開始の指示を制御部１１０に送る。制御部１１０は、録画処理開始の指示を受け付けると、図６のフローチャートに示す録画処理を開始する。 When the user inputs an instruction to start recording processing to the input unit 108, the input unit 108 sends an instruction to start recording processing to the control unit 110. When receiving an instruction to start the recording process, control unit 110 starts the recording process shown in the flowchart of FIG.

制御部１１０は、撮影部１０７に画像の取り込みと、マイクロホン１０２〜１０５に音の取り込みとを開始させ、撮影部１０７及びマイクロホン１０２〜１０５から取り込んだデータを一時的に制御部１１０のＲＡＭ等に記憶させる（ステップＳ３０１）。 The control unit 110 causes the image capturing unit 107 to start capturing an image and the microphones 102 to 105 to start capturing sound, and temporarily stores the data captured from the image capturing unit 107 and the microphones 102 to 105 into the RAM of the control unit 110 or the like. Store (step S301).

続いて、制御部１１０は、マイクロホンの数とそれらの３次元空間上の基準位置とを、図７に示すように、記憶部１１３に記憶する録画データのヘッダ領域に書き込む（ステップＳ６０２）。 Subsequently, the control unit 110 writes the number of microphones and their reference positions in the three-dimensional space in the header area of the recording data stored in the storage unit 113 as shown in FIG. 7 (step S602).

次に、制御部１１０は、角度センサ１０９から送られたヒンジ角度θ、φが、基準角度と同一か否かの判別を行う（ステップＳ３０３）。 Next, the control unit 110 determines whether or not the hinge angles θ and φ sent from the angle sensor 109 are the same as the reference angle (step S303).

ヒンジ角度θ、φが基準角度と同じならば（ステップＳ３０３；Ｎｏ）、制御部１１０は、計時部５１２が計測した経過時間と、角度センサ１０９が検出したヒンジ角度θ、φの値とを録画データのヘッダ領域に書き込む（ステップ６０５）。ここで、録画開始時には、開始時のヒンジ角度θ、φが書き込まれる（図７の例では、経過時間＝０[sec]、θ＝９０°、φ＝０°）。 If the hinge angles θ and φ are the same as the reference angle (step S303; No), the control unit 110 records the elapsed time measured by the time measuring unit 512 and the values of the hinge angles θ and φ detected by the angle sensor 109. Write in the header area of the data (step 605). Here, at the start of recording, the hinge angles θ and φ at the start are written (in the example of FIG. 7, elapsed time = 0 [sec], θ = 90 °, φ = 0 °).

一方、ヒンジ角度θ又はφが基準角度でない場合（ステップＳ３０３；Ｙｅｓ）、制御部１１０は、可動部２０２が基準位置ではない旨を表示部１１４に表示する（ステップＳ３０４）。そして、制御部１１０は、計時部５１２が計測した経過時間と、角度センサ１０９が検出したヒンジ角度θ、φの値とを、録画データのヘッダ領域に書き込む（ステップＳ６０５）。例えば、可動部２０２が撮影開始から８秒後に基準位置から動かされ、ヒンジ角度θが４５°になったとすると、制御部１１０は、図７に示すように、可動部２０２の位置が変更された時の経過時間（変更時間）と、新たなヒンジ角度θ、φとを録画データのヘッダに書き込む。以後、可動部２０２の位置が変更された場合は、“可動部の位置の変更３”、“可動部の位置４”…、とその都度、経過時間（変更時間）とヒンジ角度θ、φを記憶する。 On the other hand, when the hinge angle θ or φ is not the reference angle (step S303; Yes), the control unit 110 displays on the display unit 114 that the movable unit 202 is not the reference position (step S304). Then, the control unit 110 writes the elapsed time measured by the time measuring unit 512 and the values of the hinge angles θ and φ detected by the angle sensor 109 in the header area of the recording data (step S605). For example, if the movable unit 202 is moved from the reference position 8 seconds after the start of photographing and the hinge angle θ is 45 °, the control unit 110 changes the position of the movable unit 202 as shown in FIG. The elapsed time (change time) and the new hinge angles θ and φ are written in the header of the recording data. Thereafter, when the position of the movable portion 202 is changed, “change of the movable portion position 3”, “position of the movable portion 4”, and the elapsed time (change time) and the hinge angles θ, φ are set in each case. Remember.

ヘッダへの書き込みが終了すると、制御部１１０は、記憶部１１３に画像データと、音データとを書き込む（ステップＳ６０６）。 When the writing to the header is completed, the control unit 110 writes the image data and the sound data in the storage unit 113 (step S606).

次に、制御部１１０は、録画処理終了の指示の有無を判別する（ステップＳ３０９）。録画処理終了の指示がなければ（Ｓ３０９；Ｎｏ）、制御部１１０は、ヒンジ角度θ、φに変更があるか否かを判別する（ステップ３１１）。ヒンジ角度θ、φに変更がある場合（ステップＳ３１１；Ｙｅｓ）、制御部１１０は、ステップＳ３０３以降の処理を繰り返す。ヒンジ角度θ、φに変更がない場合は（ステップＳ３１１；Ｎｏ）、引き続き画像データと音データとを記憶部１１３に書き込む（ステップＳ６０６）。 Next, the control unit 110 determines whether there is an instruction to end the recording process (step S309). If there is no instruction to end the recording process (S309; No), the control unit 110 determines whether or not the hinge angles θ and φ are changed (step 311). When there is a change in the hinge angles θ and φ (step S311; Yes), the control unit 110 repeats the processing after step S303. If the hinge angles θ and φ are not changed (step S311; No), the image data and the sound data are continuously written in the storage unit 113 (step S606).

録画処理終了の指示を受け付けた場合（ステップＳ３０９；Ｙｅｓ）、録画処理を終了する。 When an instruction to end the recording process is received (step S309; Yes), the recording process ends.

このようにして、記録部１１３には、録画・録音開始後の各時点における、画像データと音データ、さらに、各マイクロホンの位置を特定し得る情報が蓄積される。 In this manner, the recording unit 113 stores image data and sound data at each time point after recording / recording start, and information that can specify the position of each microphone.

次に、記憶部１１３に記憶されたデータを再生、編集する動作について図８を用いて説明する。 Next, operations for reproducing and editing data stored in the storage unit 113 will be described with reference to FIG.

ユーザが入力部１０８に、録画データの選択、再生編集の開始及び音源分離処理開始の指示を入力すると、入力部１０８はその指示を制御部１１０に送る。制御部１１０は、指示を受け付けると、図８のフローチャートに示す処理を開始する。 When the user inputs instructions for selecting recording data, starting reproduction editing and starting sound source separation processing to the input unit 108, the input unit 108 sends the instructions to the control unit 110. When control unit 110 accepts the instruction, control unit 110 starts the processing shown in the flowchart of FIG.

制御部１１０は、まず、記憶部１１３に記憶された録画データを選択し、必要な情報の読み込み等を開始する（ステップＳ８０１）。続いて、制御部１１０は、録画時におけるマイクロホンの数とそれらの３次元空間上の基準位置等のパラメータを録画データのヘッダ領域から呼び出す（ステップＳ８０２）。次に、ユーザより音源分離処理を行う指示が与えられているかどうかの判別を行う（ステップＳ８０３）。 First, the control unit 110 selects recorded data stored in the storage unit 113 and starts reading necessary information (step S801). Subsequently, the control unit 110 calls parameters such as the number of microphones at the time of recording and their reference positions in the three-dimensional space from the header area of the recording data (step S802). Next, it is determined whether or not an instruction to perform sound source separation processing is given by the user (step S803).

ユーザより音源分離指示が与えられていなければ（ステップＳ８０３；Ｎｏ）、記憶された音データを呼び出し（ステップＳ８１２）、画像データとともに、音データをそのまま再生する（ステップＳ８１０）。例えば、ユーザがデータの再生後しばらくは録音された音データをそのまま再生したい場合は、モノラル再生の場合は記憶部１１３に記憶された複数の音データの内一つを再生し、ステレオ再生の場合は、二つの音データを再生する。例えば、マイクロホン１０２〜１０５から取り込んだ音データを再生する。 If no sound source separation instruction is given from the user (step S803; No), the stored sound data is called (step S812), and the sound data is reproduced as it is together with the image data (step S810). For example, when the user wants to play the recorded sound data as it is for a while after playing the data, in the case of monaural playback, one of a plurality of sound data stored in the storage unit 113 is played back, and in the case of stereo playback Plays two sound data. For example, sound data captured from the microphones 102 to 105 is reproduced.

ユーザより音源分離処理を行う指示が与えられると（ステップＳ８０３；Ｙｅｓ）、制御部１１０は、記憶部１１３に記憶された複数の音データを用いて音源分離処理を行う。まず、制御部１１０は、ユーザより指示された音源分離効果に対応した音源分離パラメータを取得する（ステップＳ８０４）。例えば、ユーザがタッチパネル等で画面内に存在する音源に接触して指定し、その音源のみを抽出する指示をした場合、制御部１１０は、その画面内に存在する音源の位置から、抽出すべき音源の到来方向の範囲等を取得する。 When an instruction to perform sound source separation processing is given from the user (step S803; Yes), the control unit 110 performs sound source separation processing using a plurality of sound data stored in the storage unit 113. First, the control unit 110 acquires sound source separation parameters corresponding to the sound source separation effect designated by the user (step S804). For example, when the user touches and designates a sound source existing in the screen with a touch panel or the like and instructs to extract only the sound source, the control unit 110 should extract from the position of the sound source existing in the screen. Get the range of arrival direction of the sound source.

次に、制御部１１０は、再生編集中の録画データの経過時間が、可動部２０２の位置が変更された変更時間に達しているかどうかの判別を行う（ステップＳ８０５）。経過時間が変更時間に達した場合（ステップＳ８０５；Ｙｅｓ）、そのタイミングにおけるヒンジ角度θ２０３、φ２０４に基づき、マイク位置を取得する（ステップＳ８０６）。この時、ヒンジ角度θ２０３が９０°、φ２０４が０°の場合を基準位置であるとした場合に、ヒンジ角度θ２０３が９０°、φ２０４が０°であるならば、マイク位置を新たに取得する必要は無く、録画データのヘッダ領域に書き込まれている初期マイク位置を読み出せばよい。また、実施形態１と同様に、マイク位置の取得方法は、θ２０３、φ２０４の値に基づいて幾何学的な計算で求めてもよいし、図４のようなテーブルから求めるようにしてもよい。そして、制御部１１０は求めたマイク位置を音源分離処理部１１１に送り、音源分離処理部１１１は、受け付けたマイク位置を音源分離パラメータとして設定する（ステップ８０７）。 Next, the control unit 110 determines whether or not the elapsed time of the recording data being reproduced and edited has reached the change time when the position of the movable unit 202 has been changed (step S805). If the elapsed time reaches the change time (step S805; Yes), the microphone position is acquired based on the hinge angles θ203 and φ204 at that timing (step S806). At this time, when the hinge angle θ203 is 90 ° and φ204 is 0 ° as the reference position, if the hinge angle θ203 is 90 ° and φ204 is 0 °, a new microphone position needs to be acquired. There is no need to read the initial microphone position written in the header area of the recorded data. As in the first embodiment, the microphone position acquisition method may be obtained by geometric calculation based on the values of θ203 and φ204, or may be obtained from a table as shown in FIG. Then, the control unit 110 sends the obtained microphone position to the sound source separation processing unit 111, and the sound source separation processing unit 111 sets the received microphone position as a sound source separation parameter (step 807).

一方、再生編集中の録画データの経過時間が変更時間に達していない場合（ステップＳ８０５；Ｎｏ）、すでに求めたマイク位置を音源分離パラメータとして設定する（ステップＳ８１３）。 On the other hand, if the elapsed time of the recording data being reproduced / edited has not reached the change time (step S805; No), the already determined microphone position is set as the sound source separation parameter (step S813).

音源分離処理部１１１は、記憶部１１３に記憶されたマイクロホン１０２〜１０５より取り込んだ全音データを読み出す（ステップＳ８０８）。その後、音源分離処理部１１１は、音源分離パラメータを基に、複数の音データに対し音源分離を行う（ステップＳ８０９）。 The sound source separation processing unit 111 reads all sound data captured from the microphones 102 to 105 stored in the storage unit 113 (step S808). Thereafter, the sound source separation processing unit 111 performs sound source separation on a plurality of sound data based on the sound source separation parameter (step S809).

再生編集処理部５１１は、音源分離処理後の音データを記憶された画像データとともに再生を行い、又は音源分離処理後の音データを記憶部１１３に記憶させる（ステップＳ８１０）。 The reproduction editing processing unit 511 reproduces the sound data after the sound source separation process together with the stored image data, or stores the sound data after the sound source separation process in the storage unit 113 (step S810).

次に、制御部１１０はユーザによる再生編集終了の指示があるか否かを判別する（ステップＳ８１１）。再生編集終了の指示を受け付けた場合（ステップＳ８１１；Ｙｅｓ）、再生編集処理を終了する。制御部１１０は、再生編集終了の指示を受け付けない場合（ステップＳ８１１；Ｎｏ）、ステップＳ８０３からの処理を繰り返す。 Next, the control unit 110 determines whether or not there is an instruction to end reproduction editing by the user (step S811). If an instruction to end reproduction / editing is received (step S811; Yes), the reproduction / editing process is terminated. When the control unit 110 does not accept an instruction to end reproduction / editing (step S811; No), the processing from step S803 is repeated.

本実施形態によれば、可動部２０２を有する携帯装置１０１において、再生編集時に音源分離処理を行う場合に、可動部２０２の位置が基準位置と異なっていても、保存された経過時間とマイク位置を用いることによって、音源分離処理後の音データの品質劣化を防ぐことが可能となる。また、撮影時に、撮影者に可動部２０２の位置が基準位置でないことを知らせることができる。 According to the present embodiment, in the portable device 101 having the movable unit 202, when performing sound source separation processing at the time of playback and editing, even if the position of the movable unit 202 is different from the reference position, the stored elapsed time and microphone position By using, it becomes possible to prevent the quality deterioration of the sound data after the sound source separation processing. Further, at the time of shooting, the photographer can be notified that the position of the movable portion 202 is not the reference position.

なお、本実施形態において、録画、録音時に可動部２０２が動かされた場合に、その経過時間とヒンジ角度θ２０３、φ２０４の値を記憶する例を挙げたが、一定時間毎に逐一ヒンジ角度θ２０３、φ２０４の値を記憶してもよい。さらに、録画、録音時にθ、φの値を保存するのではなく、録画、録音の時点でθ、φの値からマイク位置を取得し、記憶部１１３にマイク位置そのものを保存してもよい。これによって、データ記憶時のファイル容量は増加するが、再生、編集時に音源分離処理を行う際のマイク位置取得処理は省けるため、計算リソースが限られていて、リアルタイム性が求められる場合等に有効である。 In this embodiment, when the movable unit 202 is moved during recording and recording, an example of storing the elapsed time and the values of the hinge angles θ203 and φ204 is given. However, the hinge angle θ203, The value of φ204 may be stored. Further, instead of storing the values of θ and φ at the time of recording and recording, the microphone position may be acquired from the values of θ and φ at the time of recording and recording, and the microphone position itself may be stored in the storage unit 113. This increases the file capacity when storing data, but it can save the microphone position acquisition processing when performing sound source separation processing during playback and editing, so it is effective when the calculation resources are limited and real-time performance is required. It is.

また、実施形態２では、計時部５１２は録画処理開始からの経過時間を計測したが、記憶された音データにおいて何時ヒンジ角度が変更されたかがわかればよいので、例えば日時等を計測するようにしてもよい。 In the second embodiment, the timing unit 512 measures the elapsed time from the start of the recording process. However, since it is only necessary to know when the hinge angle is changed in the stored sound data, for example, the date and time are measured. Also good.

また、実施形態２では、音源分離効果の例を任意の音源の抽出としたが、任意の音源を抑圧する場合も同様に実施可能である。さらに、任意の１点の音源だけでなく、複数の音源に対しても、抽出、抑圧の音源分離効果を指定できる。 In the second embodiment, the example of the sound source separation effect is extraction of an arbitrary sound source. However, the present invention can be similarly implemented when suppressing an arbitrary sound source. Furthermore, the sound source separation effect of extraction and suppression can be designated not only for one arbitrary sound source but also for a plurality of sound sources.

また、撮影後、記憶データの編集処理を行う場合において、編集後のデータは、元の複数の音声データを上書きする場合や、別のデータ領域に記憶する方法が考えられ、それらの判断は編集時にユーザが決めることができる。元の音声データを上書きせずに残しておいた場合は、何度でも音源分離効果の編集処理が可能である。保存後の音データについて、一般的な動画再生機器で再生できる形式（例えば、ＰＣで再生可能な動画フォーマットＡＶＩ（Audio Video Interleave）やビデオカメラの録画フォーマットであるＡＶＣＨＤ（Advanced Video Codec High Definition）など）で画像データと合わせて記憶することもできる。このような一般的な動画記録形式で保存しておけば、再生機器を問わず録画データの再生が可能となる。 In addition, when editing the stored data after shooting, the edited data can be overwritten with a plurality of original audio data, or stored in a separate data area. Sometimes the user can decide. If the original audio data is left without being overwritten, the sound source separation effect can be edited any number of times. The saved sound data can be played back on a general video playback device (for example, a video format AVI (Audio Video Interleave) that can be played back on a PC, an AVCHD (Advanced Video Codec High Definition) that is a video camera recording format, etc. ) Can be stored together with the image data. If stored in such a general moving image recording format, the recorded data can be played back regardless of the playback device.

また、実施形態１で述べたものと同様に、マイクロホンが２つ以上であるならば、本実施形態は適用でき、可動部の可動方向については図２における回転軸ｘ、回転軸ｙの軸に限定されるものでは無く、さらに可動部が撮影時に所定の位置では無い旨を伝える手段も本実施形態に限定されるものではない。 As in the first embodiment, if there are two or more microphones, this embodiment can be applied, and the movable direction of the movable portion is the axis of the rotation axis x and the rotation axis y in FIG. The means for notifying that the movable part is not at the predetermined position at the time of shooting is not limited to the present embodiment.

（実施形態３）
＜可動部が所定の位置かどうかのみが判別できるスイッチ判別機構を有する携帯装置において、音源分離と同時に録画を行う例＞
実施形態１及び実施形態２では、角度センサを有している携帯装置について説明したが、コスト、デザイン等の点で、角度センサが設置できないような場合が想定される。
実施形態３では、可動部の位置が基準位置かどうかのみがわかるスイッチ判別機構を有し、録画もしくは録音時に音源分離処理を行い、音源分離処理を施した音データを記憶する携帯装置９０１について説明する。 (Embodiment 3)
<Example of recording simultaneously with sound source separation in a portable device having a switch discriminating mechanism that can discriminate only whether or not the movable part is at a predetermined position>
In the first and second embodiments, the portable device having the angle sensor has been described. However, it is assumed that the angle sensor cannot be installed in terms of cost, design, and the like.
In the third embodiment, a portable device 901 having a switch discrimination mechanism that only knows whether the position of the movable part is the reference position, performing sound source separation processing during recording or recording, and storing sound data subjected to sound source separation processing will be described. To do.

携帯装置９０１は、図９に示すように、マイクロホン１０２〜１０５と、ＡＤＣ１０６と、撮影部１０７と、入力部１０８と、スイッチ判別機構９０２と、制御部１１０と、記憶部１１３と、表示部１１４と、ＤＡＣ１１５と、スピーカ１１６と、から構成される。 As illustrated in FIG. 9, the portable device 901 includes microphones 102 to 105, an ADC 106, an imaging unit 107, an input unit 108, a switch determination mechanism 902, a control unit 110, a storage unit 113, and a display unit 114. And a DAC 115 and a speaker 116.

また、本実施形態の携帯装置９０１は、外観上、携帯装置９０１ａ及び携帯装置９０１ｂの２つの構成を取り得る。携帯装置９０１ａは一般的なビデオカメラや、動画撮影機能付きの携帯電話等に用いられる形態である。携帯装置９０１ａは本体部２０１と可動部２０２と、携帯装置９０１ｂは本体部２０５と可動部２０６とから構成され、可動部２０２、２０５はヒンジ機構を介して本体部２０１、２０５に設置される。携帯装置９０１ａにおいて、マイクロホン１０２、１０３、１０４は本体部２０１に、マイクロホン１０５は可動部２０２に設置される。一方、携帯装置９０１ｂにおいては、マイクロホン１０２は本体部２０５に設置され、マイクロホン１０３、１０４、１０５は可動部２０６に設置される。実施形態１と同様に、撮影部１０７を基準に直行座標系を設定し、この座標系でマイク位置を決定する。 In addition, the mobile device 901 of the present embodiment can take two configurations of the mobile device 901a and the mobile device 901b in appearance. The portable device 901a is a form used for a general video camera, a mobile phone with a moving image photographing function, or the like. The portable device 901a includes a main body unit 201 and a movable unit 202, and the portable device 901b includes a main body unit 205 and a movable unit 206. The movable units 202 and 205 are installed in the main body units 201 and 205 via a hinge mechanism. In the portable device 901 a, the microphones 102, 103, and 104 are installed in the main body unit 201, and the microphone 105 is installed in the movable unit 202. On the other hand, in the portable device 901b, the microphone 102 is installed in the main body 205, and the microphones 103, 104, and 105 are installed in the movable unit 206. As in the first embodiment, an orthogonal coordinate system is set based on the photographing unit 107, and the microphone position is determined based on this coordinate system.

以下、携帯装置９０１の特徴的な機能を提供するスイッチ判別機構９０２について説明する。 Hereinafter, a switch discrimination mechanism 902 that provides a characteristic function of the portable device 901 will be described.

スイッチ判別機構９０２は、可動部２０２、２０６の位置が基準位置か否かの判別ができる機構であり、マイクロスイッチ或いはマグネットスイッチ等から構成される。スイッチ判別機構９０２は、判別結果（スイッチ判別信号）を制御部１１０に送る。例えば、可動部２０２、２０６が基準位置にある場合は、スイッチ判別機構９０２は、制御部１１０に“１”の信号を送り、そうでない場合は“０”の信号を送る。 The switch discriminating mechanism 902 is a mechanism that can discriminate whether or not the position of the movable parts 202 and 206 is the reference position, and is configured by a micro switch or a magnet switch. The switch determination mechanism 902 sends a determination result (switch determination signal) to the control unit 110. For example, when the movable units 202 and 206 are at the reference position, the switch determination mechanism 902 sends a signal “1” to the control unit 110, and sends a signal “0” otherwise.

スイッチ判別機構９０２を有する携帯装置９０１において、携帯装置９０１の姿勢が変更されると、本体部２０１、２０５と可動部２０２、２０６の位置関係を検出することができない。したがって、基準となる撮影部１０７と異なる筐体に設置されたマイクロホンのマイク位置を検出することができない。このような場合、マイク位置を検出できないマイクロホンから取り込んだ音データを用いて音源分離処理を施すと、音源分離処理後の音データの品質が低下するおそれがある。以下、上記マイクロホンから取り込んだ音データを音源分離処理に使用しない録画処理について説明する。 In the portable device 901 having the switch determination mechanism 902, if the posture of the portable device 901 is changed, the positional relationship between the main body portions 201 and 205 and the movable portions 202 and 206 cannot be detected. Therefore, it is impossible to detect the microphone position of a microphone installed in a housing different from the reference imaging unit 107. In such a case, if sound source separation processing is performed using sound data acquired from a microphone whose microphone position cannot be detected, the quality of sound data after the sound source separation processing may be degraded. Hereinafter, a recording process in which the sound data captured from the microphone is not used for the sound source separation process will be described.

ユーザが入力部１０８に録画処理開始及び音源分離処理開始の指示を入力すると、入力部１０８はその指示を制御部１１０に送る。制御部１１０は、録画処理開始及び音源分離処理開始の指示を受け付けると、図１１のフローチャートに示す録画処理を開始する。 When the user inputs an instruction to start recording processing and sound source separation processing to the input unit 108, the input unit 108 sends the instructions to the control unit 110. Upon receiving instructions for starting the recording process and starting the sound source separation process, the control unit 110 starts the recording process shown in the flowchart of FIG.

制御部１１０は、まず、撮影部１０７に画像の取り込みと、マイクロホン１０２〜１０５に音の取り込みとを開始させ、一時的に制御部１１０のＲＡＭ等に記憶させる（ステップＳ３０１）。 First, the control unit 110 causes the photographing unit 107 to start capturing images and the microphones 102 to 105 to start capturing sound, and temporarily stores them in the RAM or the like of the control unit 110 (step S301).

続いて、制御部１１０は、予め記憶部１１３に記憶された音源分離パラメータをＲＡＭ等に呼び出す（ステップＳ３０２）。 Subsequently, the control unit 110 calls the sound source separation parameter stored in advance in the storage unit 113 to the RAM or the like (step S302).

次に、制御部１１０は、スイッチ判別機構９０２から送られたスイッチ判別信号を判別する（ステップＳ１１０３）。 Next, the control unit 110 determines the switch determination signal sent from the switch determination mechanism 902 (step S1103).

スイッチ判別信号が“１”の（可動部の位置が基準位置である）場合（Ｓ１１０３；Ｎｏ）、制御部１１０は、初期マイク位置を音源分離処理部１１１に送り、音源分離処理部１１１は初期マイク位置を音源分離パラメータとして設定する（ステップＳ３１０）。 When the switch determination signal is “1” (the position of the movable part is the reference position) (S1103; No), the control unit 110 sends the initial microphone position to the sound source separation processing unit 111, and the sound source separation processing unit 111 is initialized. The microphone position is set as a sound source separation parameter (step S310).

一方、スイッチ判別信号が“０”の（可動部の位置が基準位置でない）場合（ステップＳ１１０３；Ｙｅｓ）、制御部１１０は、可動部２０２、２０６が基準位置ではない旨を表示部１１４に表示させる（ステップＳ３０４）。例えば、撮影中の表示部１１４に“マイクの位置を基準位置にして下さい。”等のメッセージを表示する。 On the other hand, when the switch determination signal is “0” (the position of the movable part is not the reference position) (step S1103; Yes), the control unit 110 displays on the display unit 114 that the movable parts 202 and 206 are not the reference position. (Step S304). For example, a message such as “Please set the microphone position to the reference position” is displayed on the display unit 114 during shooting.

次に、制御部１１０は、スイッチ判別機構９０２の結果に基づいて、音源分離処理に用いる音データを取り込むマイクロホンを選択する（ステップＳ１１０５）。携帯装置９０１ａにおいて、撮影部１０７を備える本体部２０１に設置されたマイクロホン１０２〜１０４は、撮影部１０７との位置関係に変更が生じずマイクロホンの基準位置のままである。一方、可動部２０２が基準位置から動かされるとマイクロホン１０５の位置が初期マイク位置からずれる。従って、スイッチ判別信号＝“１”の場合、制御部１１０はマイクロホン１０２〜１０５を選択し、スイッチ判別信号＝“０”の場合、制御部１１０はマイクロホン１０２〜１０４を選択する。 Next, the control unit 110 selects a microphone that captures sound data used for sound source separation processing based on the result of the switch determination mechanism 902 (step S1105). In the portable device 901a, the microphones 102 to 104 installed in the main body 201 including the photographing unit 107 do not change in the positional relationship with the photographing unit 107 and remain at the reference position of the microphone. On the other hand, when the movable unit 202 is moved from the reference position, the position of the microphone 105 is shifted from the initial microphone position. Therefore, when the switch determination signal = “1”, the control unit 110 selects the microphones 102 to 105, and when the switch determination signal = “0”, the control unit 110 selects the microphones 102 to 104.

また、携帯装置９０１ｂの場合には、可動部２０６が動かされても、可動部２０６に設置されたマイクロホン１０３〜１０５と撮影部１０７の位置関係に変更は生じない。一方、本体部２０５に設置されたマイクロホン１０２は、撮影部１０７との位置関係が変化するので、マイクロホンの基準位置からずれることになる。従って、スイッチ判別信号＝“１”の場合、制御部１１０はマイクロホン１０２〜１０５を選択し、スイッチ判別信号＝“０”の場合、制御部１１０はマイクロホン１０３〜１０５を選択する。 In the case of the portable device 901b, even if the movable unit 206 is moved, the positional relationship between the microphones 103 to 105 installed in the movable unit 206 and the photographing unit 107 does not change. On the other hand, the microphone 102 installed in the main body 205 is displaced from the reference position of the microphone because the positional relationship with the imaging unit 107 changes. Therefore, when the switch determination signal = “1”, the control unit 110 selects the microphones 102 to 105, and when the switch determination signal = “0”, the control unit 110 selects the microphones 103 to 105.

さらに、制御部１１０は、選択したマイクロホンのマイク位置を求める。音源分離処理用の音データを取り込むマイクロホンは、筐体の姿勢から求まる。したがって、スイッチ判別信号とマイク位置の対応関係を、図１２（ａ）（携帯装置９０１ａの場合）又は図１２（ｂ）（携帯装置９０１ｂの場合）のようなテーブルにしておき、予め音源分離処理部１１１や、記憶部１１３に記憶させておくことができる。制御部１１０は、スイッチ判別信号の値をキーにしてマイク位置の情報を求める。 Further, the control unit 110 obtains the microphone position of the selected microphone. A microphone for capturing sound data for sound source separation processing is obtained from the attitude of the housing. Accordingly, the correspondence relationship between the switch discrimination signal and the microphone position is set in a table as shown in FIG. 12A (in the case of the portable device 901a) or FIG. 12B (in the case of the portable device 901b), and the sound source separation process is performed in advance. It can be stored in the unit 111 or the storage unit 113. Control unit 110 obtains microphone position information using the value of the switch determination signal as a key.

制御部１１０は選択したマイクロホンとマイク位置の情報とを音源分離処理部１１１に送り、音源分離処理部１１１は選択したマイクロホンのマイク位置を音源分離パラメータとして設定する（ステップＳ１１０６）。 The control unit 110 sends the selected microphone and microphone position information to the sound source separation processing unit 111, and the sound source separation processing unit 111 sets the microphone position of the selected microphone as a sound source separation parameter (step S1106).

音源分離処理部１１１は、音源分離パラメータを基に選択されたマイクロホンから取り込んだ音データに対して音源分離を行う（ステップＳ１１０７）。すなわち、携帯装置９０１ａにおいては、初期マイク位置からずれたマイクロホン１０５以外から取り込んだ音データを、携帯装置９０１ｂにおいては、初期マイク位置からずれたマイクロホン１０２以外から取り込んだ音データを、音源分離処理に用いる。これにより音源分離後の音データの品質劣化を防ぐことができる。制御部１１０は、画像データと、音源分離処理後の音データとを記憶部１１３に書き込む（ステップＳ３０８）。 The sound source separation processing unit 111 performs sound source separation on the sound data acquired from the microphone selected based on the sound source separation parameter (step S1107). That is, in the portable device 901a, sound data captured from other than the microphone 105 deviated from the initial microphone position is used, and in the portable device 901b, sound data acquired from other than the microphone 102 deviated from the initial microphone position is subjected to sound source separation processing. Use. As a result, quality deterioration of the sound data after the sound source separation can be prevented. The control unit 110 writes the image data and the sound data after the sound source separation process in the storage unit 113 (step S308).

次に、制御部１１０は、録画処理終了の指示の有無を判別する（ステップＳ３０９）。録画処理終了の指示がない場合（ステップＳ３０９；Ｎｏ）、制御部１１０は、スイッチ判別信号に変更があるか否かを判別する（ステップＳ１１１１）。スイッチ判別信号に変更がある場合（ステップＳ１１１１；Ｙｅｓ）、ステップＳ１１０３以降の処理を繰り返す。スイッチ判別信号に変更がない場合（ステップＳ１１１１；Ｎｏ）、音データに対して引き続き音源分離処理を施し（ステップＳ１１０７）、処理後の音データを記憶部１１３に書き込む（ステップＳ３０８）。 Next, the control unit 110 determines whether there is an instruction to end the recording process (step S309). When there is no instruction to end the recording process (step S309; No), the control unit 110 determines whether there is a change in the switch determination signal (step S1111). If there is a change in the switch determination signal (step S1111; Yes), the processing after step S1103 is repeated. If there is no change in the switch determination signal (step S1111; No), the sound source separation process is continuously performed on the sound data (step S1107), and the processed sound data is written in the storage unit 113 (step S308).

本実施形態によれば、可動部２０２、２０６を撮影中に動かしてしまっても、可動部２０２、２０６の位置が基準位置か否かのみが判別できる廉価なスイッチ判別機構を用いて、音源分離処理後の音データの品質劣化を防ぐことができる。また、モニタ上に警告等を表示し基準位置に戻すことを促すことができる。 According to this embodiment, even if the movable parts 202 and 206 are moved during photographing, the sound source separation is performed by using an inexpensive switch discriminating mechanism that can discriminate whether or not the positions of the movable parts 202 and 206 are the reference positions. It is possible to prevent quality degradation of the processed sound data. Further, a warning or the like can be displayed on the monitor to prompt the user to return to the reference position.

なお、本実施形態で説明した発明は、図１０に示したような外観上の構成例に限定されるものではなく、複数のマイクロホンが本体部と可動部のそれぞれに設置された情報処理装置に適用可能である。 Note that the invention described in the present embodiment is not limited to the configuration example in appearance as shown in FIG. 10, and an information processing apparatus in which a plurality of microphones are installed in each of the main body unit and the movable unit. Applicable.

（実施形態４）
＜可動部が所定の位置かどうかのみが判別できるスイッチ判別機構を有する携帯装置において、撮影後、再生編集時に音源分離を行う例＞
実施形態４では、可動部の位置が基準位置かどうかのみがわかるスイッチ判別機構を有し、複数のマイクロホンの取り込んだ音データを記憶部に記憶し、再生編集時に音源分離処理を行う携帯装置１３０１について説明する。 (Embodiment 4)
<Example of performing sound source separation at the time of playback and editing after shooting in a portable device having a switch discriminating mechanism that can discriminate only whether or not the movable part is at a predetermined position>
In the fourth embodiment, the portable device 1301 has a switch determination mechanism that can know only whether the position of the movable portion is the reference position, stores sound data captured by a plurality of microphones in the storage portion, and performs sound source separation processing during reproduction editing. Will be described.

携帯装置１３０１は、図１３に示すように、マイクロホン１０２〜１０５と、ＡＤＣ１０６と、撮影部１０７と、入力部１０８と、スイッチ判別機構９０２と、制御部１１０と、記憶部１１３と、表示部１１４と、ＤＡＣ１１５と、スピーカ１１６と、から構成される。
制御部１１０は実施形態２と同様の機能構成を有する。
制御部１１０以外は、実施形態３の携帯装置９０１と同様の機能構成を有する。
また、携帯装置１３０１の外観上の構成は図１０に示すものと同様とする。 As illustrated in FIG. 13, the portable device 1301 includes a microphone 102 to 105, an ADC 106, an imaging unit 107, an input unit 108, a switch determination mechanism 902, a control unit 110, a storage unit 113, and a display unit 114. And a DAC 115 and a speaker 116.
The control unit 110 has the same functional configuration as that of the second embodiment.
Except for the control unit 110, it has the same functional configuration as the portable device 901 of the third embodiment.
The external configuration of the portable device 1301 is the same as that shown in FIG.

以下、スイッチ判別機構９０２を備える携帯装置１３０１が行う録画処理について説明する。 Hereinafter, recording processing performed by the portable device 1301 including the switch determination mechanism 902 will be described.

ユーザが入力部１０８に録画処理開始の指示を入力すると、入力部１０８は録画処理開始の指示を制御部１１０に送る。制御部１１０は、録画処理開始の指示を受け付けると、図１４のフローチャートに示す処理を開始する。 When the user inputs an instruction to start recording processing to the input unit 108, the input unit 108 sends an instruction to start recording processing to the control unit 110. When control unit 110 receives an instruction to start the recording process, control unit 110 starts the process shown in the flowchart of FIG.

続いて、制御部１１０は、マイクロホンの数とそれらの３次元空間上の基準位置とを、記憶部１１３に記憶する録画データのヘッダ領域に書き込む（ステップＳ６０２）。 Subsequently, the control unit 110 writes the number of microphones and their reference positions in the three-dimensional space in the header area of the recording data stored in the storage unit 113 (step S602).

次に、制御部１１０はスイッチ判別機構９０２から送られたスイッチ判別信号を判別する（ステップＳ１１０３）。スイッチ判別信号が“１”の場合（ステップＳ１１０３；Ｎｏ）、計時部５１２が計測した時間とスイッチ判別信号の値“１”を、図１５のように、録画データのヘッダ領域に書き込む（ステップＳ１４０５）。 Next, the control unit 110 determines the switch determination signal sent from the switch determination mechanism 902 (step S1103). When the switch discrimination signal is “1” (step S1103; No), the time measured by the timer unit 512 and the switch discrimination signal value “1” are written in the header area of the recorded data as shown in FIG. 15 (step S1405). ).

一方、スイッチ判別信号が“０”の場合（ステップＳ１１０３；Ｙｅｓ）、制御部１１０は、可動部２０２、２０６が基準位置ではない旨を表示部１１４に表示させる（ステップＳ３０４）。そして、制御部１１０は、計時部５１２が計測した経過時間とスイッチ判別信号の値“０”を録画データのヘッダ領域に書き込む（ステップＳ１４０５）。例えば、可動部２０２が撮影開始から８秒後に基準位置から動かされたとすると、制御部１１０は、図１５に示すように、可動部２０２の位置が変更された時の経過時間（変更時間）と、スイッチ判別信号“０”を録画データのヘッダに書き込む。 On the other hand, when the switch determination signal is “0” (step S1103; Yes), the control unit 110 causes the display unit 114 to display that the movable units 202 and 206 are not the reference position (step S304). Then, the control unit 110 writes the elapsed time measured by the time measuring unit 512 and the value “0” of the switch determination signal in the header area of the recording data (step S1405). For example, assuming that the movable unit 202 is moved from the reference position 8 seconds after the start of imaging, the control unit 110 calculates the elapsed time (change time) when the position of the movable unit 202 is changed as shown in FIG. The switch discrimination signal “0” is written in the header of the recording data.

ヘッダへの書き込みが終了すると、制御部１１０は、ステップＳ１１０５と同様に、スイッチ判別信号の値に基づいて、記憶する音データを取り込んだマイクロホンを選択する（ステップＳ１４０６）。 When the writing to the header is completed, the control unit 110 selects the microphone that has captured the sound data to be stored based on the value of the switch determination signal, similarly to step S1105 (step S1406).

そして、制御部１１０は、画像データと、選択したマイクロホンから取り込んだ音データとを記憶部１１３に書き込む（ステップＳ１４０７）。 Then, the control unit 110 writes the image data and the sound data captured from the selected microphone in the storage unit 113 (step S1407).

次に、制御部１１０は、録画処理終了の指示の有無を判別する（ステップＳ３０９）。録画処理終了の指示がない場合（Ｓ３０９；Ｎｏ）、制御部１１０は、スイッチ判別信号に変更があるか否かを判別する（ステップＳ１１１１）。スイッチ判別信号に変更がある場合（ステップＳ１１１１；Ｙｅｓ）、ステップＳ１１０３以降の処理を繰り返す。スイッチ判別信号に変更がない場合（ステップＳ１１１１；Ｎｏ）、引き続き画像データと音データとを記憶部１１３に書き込む（ステップＳ１４０７）。 Next, the control unit 110 determines whether there is an instruction to end the recording process (step S309). When there is no instruction to end the recording process (S309; No), the control unit 110 determines whether or not there is a change in the switch determination signal (step S1111). If there is a change in the switch determination signal (step S1111; Yes), the processing after step S1103 is repeated. If there is no change in the switch determination signal (step S1111; No), the image data and the sound data are continuously written in the storage unit 113 (step S1407).

次に、記憶部１１３に記憶されたデータを再生、編集する動作について図１６を用いて説明する。 Next, operations for reproducing and editing data stored in the storage unit 113 will be described with reference to FIG.

ユーザが入力部１０８に、録画データの選択、再生編集の開始及び音源分離処理開始の指示を入力すると、入力部１０８はその指示を制御部１１０に送る。制御部１１０は、指示を受け付けると、図１６に示すフローチャートの処理を開始する。 When the user inputs instructions for selecting recording data, starting reproduction editing and starting sound source separation processing to the input unit 108, the input unit 108 sends the instructions to the control unit 110. When control unit 110 accepts the instruction, control unit 110 starts the processing of the flowchart shown in FIG.

ユーザより音源分離指示が与えられていなければ（ステップＳ８０３；Ｎｏ）、記憶された音データを呼び出し（ステップＳ８１２）、画像データとともに、音データをそのまま再生する（ステップＳ８１０）。 If no sound source separation instruction is given from the user (step S803; No), the stored sound data is called (step S812), and the sound data is reproduced as it is together with the image data (step S810).

ユーザより音源分離処理を行う指示が与えられると（ステップＳ８０３；Ｙｅｓ）、制御部１１０は、記憶部１１３に記憶された複数の音データを用いて音源分離処理を行う。まず、ユーザより指示された音源分離処理の種類に対応した音源分離パラメータを取得する（ステップＳ８０４）。 When an instruction to perform sound source separation processing is given from the user (step S803; Yes), the control unit 110 performs sound source separation processing using a plurality of sound data stored in the storage unit 113. First, a sound source separation parameter corresponding to the type of sound source separation processing instructed by the user is acquired (step S804).

次に、制御部１１０は、再生編集中の録画データの経過時間が、可動部２０２の位置が変更された変更時間に達しているかどうかの判別を行う（ステップＳ８０５）。経過時間が変更時間に達した場合（ステップＳ８０５；Ｙｅｓ）、音源分離に悪影響を与えないマイクロホンのマイク位置を、ヘッダ領域に記録されたスイッチ判別信号と図１２（ａ）又は図１２（ｂ）のテーブルとから求める（Ｓ１６０６）。そして、音源分離処理部１１１は求めたマイク位置の情報を音源分離パラメータとして設定する（Ｓ１６０７）。 Next, the control unit 110 determines whether or not the elapsed time of the recording data being reproduced and edited has reached the change time when the position of the movable unit 202 has been changed (step S805). When the elapsed time reaches the change time (step S805; Yes), the microphone position of the microphone that does not adversely affect the sound source separation, the switch determination signal recorded in the header area, and FIG. 12 (a) or FIG. 12 (b). (S1606). Then, the sound source separation processing unit 111 sets the obtained microphone position information as a sound source separation parameter (S1607).

次に、選択されたマイクロホンの音データを、記憶部１１３から呼び出す（Ｓ１６０８）。その後、音源分離処理部１１１は、音源分離パラメータを基に、複数の音データに対し音源分離を行う（ステップＳ８０９）。 Next, the sound data of the selected microphone is called from the storage unit 113 (S1608). Thereafter, the sound source separation processing unit 111 performs sound source separation on a plurality of sound data based on the sound source separation parameter (step S809).

以下、実施形態２のステップ８１０以降と同様の処理を行う。 Thereafter, the same processing as that after step 810 in the second embodiment is performed.

本実施形態によれば、可動部２０２及び廉価なスイッチ判別機構を有する携帯装置１３０１において、再生編集時に音源分離処理を行う場合に、可動部２０２の位置が基準位置と異なっていても、保存された経過時間とマイク位置を用いることによって、音源分離処理後の音データの品質劣化を防ぐことが可能となる。 According to the present embodiment, in the portable device 1301 having the movable unit 202 and an inexpensive switch discriminating mechanism, when the sound source separation process is performed at the time of reproduction editing, the movable unit 202 is stored even if it is different from the reference position. By using the elapsed time and the microphone position, it is possible to prevent the quality deterioration of the sound data after the sound source separation process.

なお、本実施形態では録画時の動作において、可動部の位置にかかわらず、マイクロホン４個の音声データ全てを常に記憶したが、可動部の位置が所定の位置で無い場合、音源分離に悪影響を与えるマイクロホンの入力データは音源分離には使用しないので、記憶をしなくてもよい。これによって保存データの容量を削減することができる。 In this embodiment, in the operation at the time of recording, all the sound data of the four microphones are always stored regardless of the position of the movable part. However, if the position of the movable part is not a predetermined position, the sound source separation is adversely affected. Since the input data of the given microphone is not used for sound source separation, it need not be stored. As a result, the capacity of stored data can be reduced.

１０１、１０１ａ、１０１ｂ、５０１、９０１、１３０１…携帯装置、１０２、１０３、１０４、１０５…マイクロホン、１０６…ＡＤＣ、１０７…撮影部、１０８…入力部、１０９…角度センサ、１１０…制御部、１１１…音源分離処理部、１１２…マイク位置取得部、１１３…記憶部、１１４…表示部、１１５…ＤＡＣ、１１６…スピーカ、２０１、２０５…本体部、２０２、２０６…可動部、２０３…回転軸ｘ周りに本体部と可動部がなすヒンジ角度θ、２０４…回転軸ｙ周りに本体部と可動部がなすヒンジ角度φ、５１１…再生編集処理部、５１２…計時部、９０２…スイッチ判別機構 101, 101a, 101b, 501, 901, 1301 ... portable device, 102, 103, 104, 105 ... microphone, 106 ... ADC, 107 ... shooting unit, 108 ... input unit, 109 ... angle sensor, 110 ... control unit, 111 ... sound source separation processing unit, 112 ... microphone position acquisition unit, 113 ... storage unit, 114 ... display unit, 115 ... DAC, 116 ... speaker, 201, 205 ... main body unit, 202, 206 ... movable unit, 203 ... rotation axis x Hinge angle θ formed by the main body part and the movable part around, 204... Hinge angle φ formed by the main body part and the movable part around the rotation axis y, 511... Playback editing processing part, 512.

Claims

A first housing;
A second housing capable of changing its position relative to the first housing;
At least one microphone provided in the first housing;
At least one microphone provided in the second housing;
Microphone position specifying information acquiring means for acquiring information for specifying the positional relationship of the microphone;
Storage means for recording sound data captured from the microphone and microphone position specifying information acquired by the microphone position specifying information acquiring means on a recording medium;
A sound data processing apparatus comprising:

A first housing;
A second housing capable of changing its position relative to the first housing;
At least two or more microphones provided in the first housing;
At least one microphone provided in the second housing;
Position determining means for determining whether or not a positional relationship between the first casing and the second casing is a predetermined positional relationship;
If the positional relationship between the first housing and the second housing is the same as the predetermined positional relationship, a microphone provided in the first and second housings is selected, and the first housing A microphone selecting means for selecting a microphone provided in the first housing when the positional relationship between the housing and the second housing is different from the predetermined positional relationship;
Microphone position specifying information acquiring means for acquiring information for specifying the positional relationship of the microphone selected by the microphone selecting means;
Storage means for recording sound data captured from the microphone selected by the microphone selecting means and microphone position specifying information acquired by the microphone position specifying information acquiring means on a recording medium;
A sound data processing apparatus comprising:

The microphone position specifying information acquisition means further includes position detection means for detecting the positional relationship between the first casing and the second casing,
The microphone position specifying information includes information indicating the positional relationship between the first casing and the second casing.
The sound data processing device according to claim 1 or 2, characterized in that.

The microphone position specifying information acquisition means includes:
Position detecting means for detecting the positional relationship between the first casing and the second casing;
Based on the positional relationship detected by the position detection means, a plurality of microphones including at least two or more microphones provided in the first casing and at least one or more microphones provided in the second casing. A microphone position obtaining means for obtaining a positional relationship of the microphone, and
The microphone position specifying information includes information for specifying the positional relationship of the plurality of microphones acquired by the microphone position acquiring unit.
The sound data processing device according to claim 1 or 2, characterized in that.

The microphone position specifying information includes information for specifying timing when the positional relationship of the microphone has changed, or information for specifying the positional relationship of the microphone at the time of recording,
The sound data processing device according to any one of claims 1 to 4, characterized in that.

Either one of claims 1 to 5, characterized in that the positional relationship between the first casing and the second casing further comprises a warning means for performing warning when different from the predetermined positional relationship 1 The sound data processing device according to the item.

Reading means for reading out sound data recorded on a recording medium by the sound data processing device according to any one of claims 1 to 5 ,
Microphone position specifying means for specifying a microphone positional relationship during recording based on microphone position specifying information recorded on the recording medium;
Sound source separation means for performing sound source separation processing on the sound data read by the reading means based on the positional relationship of the microphones specified by the microphone position specifying means;
Reproducing means for reproducing the sound data subjected to the sound source separation processing;
A sound data processing apparatus comprising:

Reading means for reading sound data recorded on a recording medium by the sound data processing device according to claim 5 ;
Microphone position specifying means for specifying a microphone positional relationship during recording based on microphone position specifying information recorded on the recording medium;
Sound source separation means for performing sound source separation processing on the sound data read by the reading means based on the positional relationship of the microphones specified by the microphone position specifying means;
Reproducing means for reproducing the sound data subjected to the sound source separation processing;
With
The timing at which the sound data read by the reading means changes in the positional relationship of the microphone based on the information specifying the timing at which the positional relationship of the microphone has changed on the recording medium by the sound data processing device according to claim 5. Trigger position acquisition means for obtaining the positional relationship of the plurality of microphones from the positional relationship,
The sound data processing device further comprising:

Reading means for reading out sound data recorded on a recording medium by the sound data processing device according to any one of claims 1 to 5 ,
Microphone position specifying means for specifying a microphone positional relationship during recording based on microphone position specifying information recorded on the recording medium;
Sound source separation means for performing sound source separation processing on the sound data read by the reading means based on the positional relationship of the microphones specified by the microphone position specifying means;
Sound source separation data storage means for storing sound data subjected to sound source separation processing by the sound source separation means;
A sound data processing apparatus comprising:

Reading means for reading sound data recorded on a recording medium by the sound data processing device according to claim 5 ;
Microphone position specifying means for specifying a microphone positional relationship during recording based on microphone position specifying information recorded on the recording medium;
Sound source separation means for performing sound source separation processing on the sound data read by the reading means based on the positional relationship of the microphones specified by the microphone position specifying means;
Sound source separation data storage means for storing sound data subjected to sound source separation processing by the sound source separation means;
With
The timing at which the sound data read by the reading means changes in the positional relationship of the microphone based on the information specifying the timing at which the positional relationship of the microphone has changed on the recording medium by the sound data processing device according to claim 5. Trigger position acquisition means for obtaining the positional relationship of the plurality of microphones from the positional relationship,
The sound data processing device further comprising:

Sound data recorded on the recording medium, or to claim 9 or 10, further comprising a sound editing means for performing a sound editing processing on sound data source separation process is performed by the sound source separation unit The sound data processing device described.