JP6236765B2 - Music data editing apparatus and music data editing method

Info

Publication number: JP6236765B2
Application number: JP2012244710A
Authority: JP
Inventors: 入山　達也
Original assignee: Yamaha Corp
Current assignee: Yamaha Corp
Priority date: 2011-11-29
Filing date: 2012-11-06
Publication date: 2017-11-29
Anticipated expiration: 2032-11-06
Also published as: JP2013137520A

Description

The present invention relates to a technique for editing music data that specifies a time series of notes.

Techniques for displaying and editing music data used for sound synthesis, such as speech synthesis and musical tone synthesis, have been proposed. For example, music data applied to speech synthesis specifies, for each note, the pitch and sounding period of the synthesized sound, a speech code (for example, a lyric character), and control information. The control information is applied to speech synthesis to control the characteristics of the synthesized sound; it specifies, for example, the pitch variation immediately after the start of sounding (bend) and the form of vibrato (type and duration). Patent Document 1 discloses a technique in which a graphic representing each note specified by the music data (hereinafter a "note graphic") is arranged in a piano-roll score area having a pitch axis and a time axis, and the music data is edited according to the user's instructions on each note graphic. A graphic expressing the control information of each note (for example, a graphic expressing vibrato) is arranged near that note's note graphic.

Patent Document 1: Japanese Patent No. 4456088

With the technique of Patent Document 1, a configuration may be adopted in which, for example, a setting screen for editing the control information of a note selected by the user is displayed separately from the score area, and the control information is edited according to the user's instructions on that setting screen. However, when operating a setting screen that is independent of the note graphics, it is difficult for the user to grasp the control information of each note intuitively and set it to a desired value. In view of these circumstances, an object of the present invention is to enable intuitive editing of control information.

The means adopted by the present invention to solve the above problem are described below. To facilitate understanding, the following description notes in parentheses the correspondence between elements of the present invention and elements of the embodiments described later; this is not intended to limit the scope of the present invention to the illustrated embodiments.

The music data editing apparatus of the present invention edits music data that specifies, for each note, the pitch and sound generation time of a synthesized sound and control information applied to sound synthesis. The apparatus comprises display control means (for example, display control unit 32) and editing processing means (for example, editing processing unit 34). The display control means displays a note graphic for each note at the position, within a score area having a pitch axis and a time axis, that corresponds to the pitch and sound generation time specified by the music data, and arranges, at a position corresponding to each note graphic in the score area, an editing image (for example, transition image QA, variable instruction image QB, process selection image QC, or transition image QD) that accepts an instruction to change the control information of the note indicated by that note graphic. The editing processing means edits the control information of each note according to the user's instruction on that note's editing image. In this configuration, the control information is edited according to operations on editing images arranged in the score area, that is, by direct operation on the score area without going through a setting screen or the like that is separate from the score area. It is therefore possible to edit the control information of each note intuitively and easily while, for example, checking each note graphic in the score area.

In a preferred aspect of the present invention, the display control means arranges in the score area an editing image that includes a transition image (for example, transition image QA or transition image QD) expressing the temporal change of a feature quantity (for example, pitch or volume) of the synthesized sound to which the control information is applied, and changes the transition image according to the user's instruction on it. The editing processing means edits the control information so that it corresponds to the change made to the transition image in response to the user's instruction. In this aspect, because the control information is edited to match the change in the transition image that expresses the temporal change of the feature quantity, that temporal change can be edited intuitively. Specific examples of this aspect are described later as, for example, the first and fifth embodiments.

In a specific example of the aspect that displays a transition image, the display control means stretches or shrinks the transition image in the pitch-axis direction according to the user's instruction. This aspect has the advantage that the temporal change of the feature quantity can be examined in detail by stretching the transition image in the pitch-axis direction. A specific example of this aspect is described later as, for example, the fourth embodiment.

In a preferred aspect of the present invention, the display control means arranges in the display area an editing image that includes a variable instruction image (for example, variable instruction image QB) showing a numerical value of the control information, and changes the value shown by the variable instruction image according to the user's instruction. The editing processing means edits the control information so that it corresponds to the change in the value of the variable instruction image. In this aspect, the user can directly specify the numerical value of the control information by operating the variable instruction image. A specific example of this aspect is described later as, for example, the second embodiment.

In a preferred aspect of the present invention, the display control means arranges in the display area an editing image that includes a process selection image (for example, process selection image QC) showing whether a specific process is executed during sound synthesis, and changes the execution state shown by the process selection image according to the user's instruction. The editing processing means edits the control information so that it corresponds to the execution state shown by the process selection image. In this aspect, the user can directly specify whether the specific process is executed by operating the process selection image. A specific example of this aspect is described later as, for example, the third embodiment.

In a preferred aspect of the present invention, a configuration is also suitable in which whether the editing image is displayed, or which types of control information can be edited, differs between the case where the display magnification of the score area is below a threshold (first display state) and the case where it is above the threshold (second display state). For example, the editing image may be hidden when the display magnification of the score area is below the threshold. Alternatively, when the display magnification is below the threshold, an editing image may be arranged that accepts change instructions for fewer types of control information than when the display magnification is above the threshold.

The music data editing apparatus according to each of the above aspects may be realized by hardware (electronic circuitry) such as a DSP (Digital Signal Processor) dedicated to displaying music data, or by cooperation between a general-purpose processing unit such as a CPU (Central Processing Unit) and a program. The program of the present invention edits music data that specifies, for each note, the pitch and sound generation time of a synthesized sound and control information applied to sound synthesis. To do so, it causes a computer to execute a display control process and an editing process. The display control process displays a note graphic for each note at the position, within a score area having a pitch axis and a time axis, that corresponds to the pitch and sound generation time specified by the music data, and, when the display magnification of the score area exceeds a threshold, arranges at a position corresponding to each note graphic in the score area an editing image that accepts an instruction to change the control information of the note indicated by that note graphic. The editing process edits the control information of each note according to the user's instruction on that note's editing image. This program achieves the same operation and effects as the music data editing apparatus of the present invention. The program may be provided stored on a computer-readable recording medium and installed on a computer, or provided by distribution over a communication network and installed on a computer.

FIG. 1 is a block diagram of a speech synthesizer according to a first embodiment of the present invention.
FIG. 2 is a schematic diagram of music data.
FIG. 3 is a schematic diagram of the editing screen in the first display state.
FIG. 4 is a schematic diagram of the setting screens in the first display state.
FIG. 5 is a schematic diagram of the editing screen in the second display state.
FIG. 6 is an explanatory diagram of a change in a transition image.
FIG. 7 is a schematic diagram of the editing screen in the second display state in the second embodiment.
FIG. 8 is a schematic diagram of the editing screen in the second display state in the third embodiment.
FIG. 9 is a schematic diagram of the editing screen in the second display state in the fourth embodiment.
FIG. 10 is a schematic diagram of the editing screen in the second display state in the fourth embodiment.
FIG. 11 is a schematic diagram of the editing screen in the second display state in the fifth embodiment.
FIG. 12 is a schematic diagram of the editing screen in the second display state in a modification.
FIG. 13 is a schematic diagram of the editing screen in a modification.

<First Embodiment>
FIG. 1 is a block diagram of a speech synthesizer 100 according to the first embodiment of the present invention. The speech synthesizer 100 is a signal processing device that generates a speech signal S of a singing voice by concatenative (segment-connection) speech synthesis. As shown in FIG. 1, it is realized by a computer system comprising an arithmetic processing device 12, a storage device 14, a display device 22, an input device 24, and a sound emitting device 26. The speech synthesizer 100 is realized by, for example, a stationary information processing apparatus (personal computer) or a portable information processing apparatus (mobile phone or portable information terminal).

The arithmetic processing device 12 realizes a plurality of functions (display control unit 32, editing processing unit 34, and speech synthesis unit 36) by executing the program PGM stored in the storage device 14. A configuration in which the functions of the arithmetic processing device 12 are distributed over a plurality of integrated circuits, or in which a dedicated electronic circuit (for example, a DSP) realizes some of the functions, may also be adopted.

The display device 22 (for example, a liquid crystal display) displays images as instructed by the arithmetic processing device 12. The input device 24 is a device that accepts instructions from the user (for example, a keyboard or a pointing device such as a mouse). A touch panel integrated with the display device 22 may also be used as the input device 24. The sound emitting device 26 (for example, headphones or a speaker) emits sound waves corresponding to the speech signal S generated by the arithmetic processing device 12.

The storage device 14 stores the program PGM executed by the arithmetic processing device 12 and the various data it uses (speech segment group DA and music data DB). A known recording medium such as a semiconductor or magnetic recording medium, or a combination of several types of recording media, is used as the storage device 14.

The speech segment group DA is a speech synthesis library used as the material for speech synthesis. It consists of a plurality of segment data items (for example, sample sequences of speech segment waveforms), each corresponding to a different speech segment. A speech segment is either a phoneme (for example, a vowel or a consonant), which is the smallest unit that distinguishes linguistic meaning, or a phoneme chain (for example, a diphone or a triphone) in which a plurality of phonemes are concatenated.

The music data DB is data that specifies the time series of notes making up a song and, as shown in FIG. 2, includes a plurality of unit data U, each corresponding to a different note in the song. Each unit data U specifies a pitch X1, a sound generation time X2, a duration X3, a speech code X4, and control information X5. The pitch X1 is the pitch of the note (in practice, the note number assigned to each pitch). The sound generation time X2 is the time at which sounding starts, and the duration X3 is the length of time for which the note continues to sound (note value). That is, the sound generation time X2 and the duration X3 together define the sounding period of the note. The sounding period may instead be defined by each note's sound generation time X2 and its mute time. The speech code X4 is a code indicating the content to be sounded, such as the lyrics of the song. The following description uses the pronounced characters (graphemes) of the lyrics as the speech code X4, but phoneme symbols, for example, may be specified as the speech code X4 instead.

The control information X5 consists of variables (expression parameters) that are applied to speech synthesis to control the musical characteristics of the synthesized sound. The first embodiment uses two groups of variables as examples of the control information X5: variables that define the fine pitch change immediately after the sound generation time X2 (pitch variation width Z1 and variation length Z2), and variables that define the vibrato added to the note (vibrato duration Z3 and type Z4). As shown in FIG. 2, the variation width Z1 defines the amount of pitch variation (the difference between the pitch at the start of sounding and the target pitch X1) within the section of the sounding period from the sound generation time X2 to the point at which the target pitch X1 is reached (hereinafter the "start section"), and the variation length Z2 defines the time length of the start section. The vibrato duration Z3 defines the time length of the section of the sounding period to which vibrato is added; for example, the ratio of the vibrato time length to the duration X3 of the sounding period is specified as the duration Z3. The vibrato type Z4 is set to one of a plurality of candidates prepared in advance (no vibrato, normal vibrato, large-amplitude vibrato, small-amplitude vibrato, long-period vibrato, or short-period vibrato).
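The layout of the music data DB described above can be sketched as a pair of small record types. This is only an illustrative sketch: the class and field names, the units (semitones, seconds), and the default values are assumptions and do not come from the specification, which defines only the items X1 to X5 and Z1 to Z4.

```python
from dataclasses import dataclass, field
from enum import Enum


class VibratoType(Enum):
    """Z4: the six candidates listed in the text (names are illustrative)."""
    NONE = "none"
    NORMAL = "normal"
    LARGE_AMPLITUDE = "large_amplitude"
    SMALL_AMPLITUDE = "small_amplitude"
    LONG_PERIOD = "long_period"
    SHORT_PERIOD = "short_period"


@dataclass
class ControlInfo:
    """X5: expression parameters of one note."""
    variation_width: float = 0.0   # Z1, assumed to be in semitones
    variation_length: float = 0.0  # Z2, assumed to be in seconds
    vibrato_ratio: float = 0.0     # Z3, fraction of X3 that carries vibrato
    vibrato_type: VibratoType = VibratoType.NONE  # Z4


@dataclass
class UnitData:
    """U: one note of the music data DB."""
    pitch: int         # X1, note number
    onset: float       # X2, sound generation time
    duration: float    # X3, note value
    speech_code: str   # X4, for example a lyric character
    control: ControlInfo = field(default_factory=ControlInfo)  # X5

    @property
    def mute_time(self) -> float:
        # The sounding period is [X2, X2 + X3].
        return self.onset + self.duration


music_data = [
    UnitData(60, 0.0, 1.0, "la",
             ControlInfo(1.0, 0.1, 0.5, VibratoType.NORMAL)),
    UnitData(62, 1.0, 0.5, "la"),
]
```

Storing Z3 as a ratio of X3 follows the example given in the text; an absolute time length would work equally well in this sketch.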

The speech synthesis unit 36 in FIG. 1 generates the speech signal S using the speech segment group DA and the music data DB. Specifically, the speech synthesis unit 36 first selects sequentially, from the speech segment group DA, the segment data of the speech segments corresponding to the speech code X4 that the music data DB specifies for each note. Second, it adjusts each segment data item to the pitch X1 and duration X3 specified by the unit data U and adjusts the sound characteristics according to the control information X5. Third, it generates the speech signal S by placing the adjusted segment data at the sound generation time X2 specified by each unit data U and concatenating them. The speech signal S generated by the speech synthesis unit 36 is supplied to the sound emitting device 26 and reproduced as sound waves.

The display control unit 32 in FIG. 1 causes the display device 22 to display the editing screen 50 of FIG. 3, on which the user checks the contents of the music data DB. As shown in FIG. 3, the editing screen 50 of the first embodiment includes a score area 51. The score area 51 is a piano-roll coordinate plane with mutually intersecting time (horizontal) and pitch (vertical) axes. The vertical broken lines L arranged at equal intervals along the time axis in FIG. 3 are the boundaries of periods that each correspond to one beat of the song (hereinafter "beat lines"). That is, the interval between two adjacent beat lines L on the time axis corresponds to the time length of one beat of the song.

The display control unit 32 arranges, in the score area 51, a note graphic V representing each note specified by the music data DB. The note graphic V of the first embodiment is a rectangle. Note graphics V are displayed in the score area 51 for the notes within a partial section of the song corresponding to the music data DB (hereinafter the "display target section"). The position of a note graphic V in the pitch-axis direction is set according to the pitch X1 of the music data DB, and its position in the time-axis direction is set according to the sound generation time X2. The display length of the note graphic V in the time-axis direction is set according to the duration X3. The speech code X4 of the music data DB is placed inside the note graphic V.
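The placement rule above amounts to a linear mapping from (X1, X2, X3) to a rectangle. The following is a minimal sketch under stated assumptions: the function and parameter names are hypothetical, time is measured in beats, the scale is in pixels per beat, and one pitch occupies one row of fixed height with higher pitches drawn nearer the top. None of these details are given in the specification.

```python
from dataclasses import dataclass


@dataclass
class Rect:
    x: float
    y: float
    width: float
    height: float


def note_rect(pitch, onset_beats, duration_beats, *,
              view_start_beats, px_per_beat, top_pitch, row_height):
    """Map one note (X1, X2, X3) to a rectangle in the score area.

    All keyword parameters are illustrative assumptions:
    view_start_beats is the left edge of the display target section,
    px_per_beat is the time-axis scale, and top_pitch is the note
    number drawn in the top row.
    """
    x = (onset_beats - view_start_beats) * px_per_beat  # from X2
    y = (top_pitch - pitch) * row_height                # from X1
    width = duration_beats * px_per_beat                # from X3
    return Rect(x, y, width, row_height)


rect = note_rect(60, 2.0, 1.0, view_start_beats=1.0,
                 px_per_beat=40.0, top_pitch=72, row_height=10.0)
```

With these example values the note starts one beat into the visible section, so `rect` is 40 pixels from the left edge and 40 pixels wide.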

The editing processing unit 34 in FIG. 1 edits the music data DB according to the user's instructions on the score area 51. For example, when the user instructs a change in the position of an existing note graphic V in the score area 51, the pitch X1 and sound generation time X2 of the unit data U corresponding to that note graphic V are changed; when the user instructs a change in the display length of a note graphic V, the duration X3 of the unit data U is changed. When the user instructs a change to the speech code X4 of a note graphic V, the speech code X4 of the corresponding unit data U is changed. When the user instructs the addition of a note graphic V, unit data U corresponding to that note graphic V is added to the music data DB.
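The four kinds of edit listed above can be sketched as a small dispatcher over the unit data. This is a hypothetical illustration: the dictionary keys, the edit kinds `"move"`, `"resize"`, `"relabel"`, and `"add"`, and the function name are all assumptions, not names from the specification.

```python
def apply_edit(music_data, edit):
    """Apply one user edit (a plain dict) to a list of unit-data dicts in place."""
    kind = edit["kind"]
    if kind == "add":
        # A new note graphic V adds new unit data U.
        music_data.append(dict(edit["unit"]))
        return
    unit = music_data[edit["index"]]
    if kind == "move":
        # Moving a note graphic changes X1 and X2.
        unit["pitch"] = edit["pitch"]
        unit["onset"] = edit["onset"]
    elif kind == "resize":
        # Changing the display length changes X3.
        unit["duration"] = edit["duration"]
    elif kind == "relabel":
        # Changing the text inside the graphic changes X4.
        unit["speech_code"] = edit["speech_code"]
    else:
        raise ValueError(f"unknown edit kind: {kind!r}")


song = [{"pitch": 60, "onset": 0.0, "duration": 1.0, "speech_code": "la"}]
apply_edit(song, {"kind": "move", "index": 0, "pitch": 62, "onset": 1.0})
apply_edit(song, {"kind": "resize", "index": 0, "duration": 2.0})
apply_edit(song, {"kind": "relabel", "index": 0, "speech_code": "do"})
apply_edit(song, {"kind": "add", "unit": {"pitch": 64, "onset": 3.0,
                                          "duration": 1.0, "speech_code": "re"}})
```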

As shown in FIG. 3, the editing screen 50 includes a control image (slider) 52 with which the user changes the display magnification Rt of the score area 51 in the time-axis direction. The user can operate the control image 52 as desired using the input device 24. The display control unit 32 stretches or shrinks the displayed image in the score area 51 along the time axis so that it matches the display magnification Rt that the user specified by operating the control image 52.

The display magnification Rt corresponds to the on-screen length of a unit time of the song (for example, the time length of one beat) within the score area 51. Accordingly, as the display magnification Rt increases (the on-screen length of a unit time in the score area 51 becomes longer), the display target section of the song becomes shorter: the number of bars and beats displayed in the score area 51 decreases (the interval between beat lines L widens), and each note graphic V stretches along the time axis. Conversely, as the display magnification Rt decreases (the on-screen length of a unit time becomes shorter), the display target section becomes longer: the number of bars and beats displayed in the score area 51 increases (the interval between beat lines L narrows), and each note graphic V shrinks along the time axis. The display size of the score area 51 itself does not change when the display magnification Rt is changed.
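Because the score area keeps a fixed size, the length of the display target section is inversely proportional to Rt. A minimal sketch of that relation follows, assuming Rt is expressed in pixels per beat and the score area width in pixels; the function name and units are assumptions made for illustration.

```python
def visible_beats(area_width_px: float, rt_px_per_beat: float) -> float:
    """Length of the display target section, in beats, for a fixed-width score area."""
    if rt_px_per_beat <= 0:
        raise ValueError("display magnification must be positive")
    return area_width_px / rt_px_per_beat


# Doubling Rt halves the number of beats that fit in the same score area,
# while every note graphic becomes twice as long on screen.
zoomed_out = visible_beats(800.0, 40.0)
zoomed_in = visible_beats(800.0, 80.0)
```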

The display control unit 32 changes the display mode of the score area 51 according to the display magnification Rt. Specifically, the display control unit 32 compares the display magnification Rt set by the user with a predetermined threshold TH, and uses different display modes for the score area 51 in a first display state, in which the display magnification Rt is below the threshold TH, and a second display state, in which the display magnification Rt is above the threshold TH. That is, the second display state is a state in which the score area 51 is shown enlarged relative to the first display state. FIG. 3 is a display example of the editing screen 50 in the first display state, and FIG. 5 is a display example of the editing screen 50 in the second display state.
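The state selection described above is a single comparison against TH. In the sketch below the names are hypothetical, and the handling of Rt exactly equal to TH is an assumption: the text specifies only the "below" and "above" cases, so equality is arbitrarily assigned to the first display state here.

```python
from enum import Enum


class DisplayState(Enum):
    FIRST = 1   # zoomed out: expression graphics E, separate setting screens
    SECOND = 2  # zoomed in: transition images QA editable in the score area


def display_state(rt: float, th: float) -> DisplayState:
    """Choose the display state from the display magnification Rt and threshold TH.

    Rt == TH is not covered by the text; treating it as the first
    display state is an assumption of this sketch.
    """
    return DisplayState.SECOND if rt > th else DisplayState.FIRST
```

For example, with TH set to 50, an Rt of 80 selects the second display state and an Rt of 20 selects the first.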

As shown in FIG. 3, in the first display state the display control unit 32 arranges an expression graphic E around each note graphic V. The expression graphic E is a graphic with which the user starts editing the control information X5 of each note. It includes a first portion E1 corresponding to the start section of the sounding period and a second portion E2 corresponding to the section in which vibrato can be added (that is, the section at the tail end of the sounding period). The second portion E2 of a note with vibrato is drawn as a wavy line (a shape expressing the pitch change caused by vibrato), and the second portion E2 of a note without vibrato is drawn as a straight line (a shape expressing that the pitch is held constant).

When the user selects the first portion E1 of the expression graphic E for a desired note by operating the input device 24 (for example, by clicking with a mouse), the setting screen 61 shown in part (A) of FIG. 4, which is used to set the control information X5 (Z1, Z2) for the start section of that note, is displayed separately from the editing screen 50. By operating the setting screen 61, the user can freely set the pitch variation width Z1 and variation length Z2 within the start section. The editing processing unit 34 changes the variation width Z1 and variation length Z2 specified by the control information X5 of the music data DB to the values set on the setting screen 61.

Likewise, when the user selects the second portion E2 of the expression graphic E for a desired note, the setting screen 62 shown in part (B) of FIG. 4, which is used to set the control information X5 (Z3, Z4) for the vibrato of that note, is displayed separately from the editing screen 50. By operating the setting screen 62, the user can freely set the vibrato duration Z3 and select the vibrato type Z4 from a plurality of candidates. The editing processing unit 34 changes the vibrato duration Z3 and type Z4 specified by the control information X5 of the music data DB to the values set on the setting screen 62.

The first portion E1 and second portion E2 of the expression graphic E arranged in the score area 51 in the first display state are drawn with predetermined graphics prepared in advance. That is, the image of the first portion E1 does not depend on the control information X5 and is common to all expression graphics E. The image of the second portion E2 is likewise common to all expression graphics E, except that its display length in the time-axis direction is set according to the vibrato duration Z3. For example, the amplitude and period of the wavy line of the second portion E2 are the same for all expression graphics E.

On the other hand, in the second display state (enlarged display), in which the display magnification Rt is higher than in the first display state, the display control unit 32 arranges a transition image QA at a position corresponding to each note graphic V in the score area 51 (directly below the note graphic V), as shown in FIG. 5. The transition image QA is an image (a curve or a polyline) that expresses the temporal change of the pitch of each note with the control information X5 actually reflected. Accordingly, the form of the transition image QA (the temporal change of the feature quantity) can differ from note to note. Specifically, as shown in FIG. 6, the pitch indicated by the transition image QA changes by the fluctuation range Z1 within the start section of fluctuation length Z2 to reach the pitch X1, and fluctuates over the duration Z3 with an amplitude and a period corresponding to the type Z4.
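The pitch trajectory drawn as the transition image QA can be illustrated with a minimal sketch. The units (semitones, seconds), the linear rise from below during the start section, the placement of the vibrato at the end of the note, and the sinusoidal vibrato shape are all assumptions made for illustration; the description only states that the pitch changes by Z1 over Z2 to reach X1 and fluctuates over Z3 according to Z4. The names `ControlInfo` and `pitch_at` are hypothetical.

```python
import math
from dataclasses import dataclass


@dataclass
class ControlInfo:
    z1: float          # fluctuation range Z1 (semitones, assumed unit)
    z2: float          # fluctuation length Z2 (seconds, assumed unit)
    z3: float          # vibrato duration Z3 (seconds, assumed unit)
    vib_depth: float   # amplitude implied by vibrato type Z4 (assumed)
    vib_period: float  # period implied by vibrato type Z4 (assumed)


def pitch_at(t: float, x1: float, note_length: float, c: ControlInfo) -> float:
    """Pitch at time t (measured from the note onset) for a note of pitch X1."""
    if t < c.z2:
        # Start section: move linearly across Z1 and arrive at X1 (assumed shape).
        return x1 - c.z1 * (1.0 - t / c.z2)
    vib_start = note_length - c.z3
    if t >= vib_start:
        # Vibrato section: sinusoid around X1 (position at note end is assumed).
        phase = 2.0 * math.pi * (t - vib_start) / c.vib_period
        return x1 + c.vib_depth * math.sin(phase)
    return x1  # Steady section between the bend and the vibrato.
```

Sampling `pitch_at` at regular intervals over the note would give the points of the curve or polyline; because every note carries its own Z1 to Z4, the resulting shape differs per note.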

By operating the input device 24 as appropriate (for example, by dragging part of the transition image QA with a mouse), the user can instruct a change to the transition image QA corresponding to a desired note. The display control unit 32 changes the shape of the transition image QA in accordance with the instruction from the user. In FIG. 6, the change to the transition image QA is illustrated by a broken line. The edit processing unit 34 edits the control information X5 of the note indicated by the transition image QA that the user operated so that it corresponds to the change in that transition image QA.

For example, when the user moves the left end of the transition image QA in the pitch axis direction, as indicated by the broken line in FIG. 6, the pitch fluctuation range Z1 in the control information X5 is changed, and when the user stretches or shrinks the start section of the transition image QA, the pitch fluctuation length Z2 in the control information X5 is changed. Similarly, when the user stretches or shrinks the vibrato section of the transition image QA, the vibrato duration Z3 in the control information X5 is changed, and when the user changes the amplitude or period of the vibrato in the transition image QA, the vibrato type Z4 in the control information X5 is changed.
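The correspondence between operations on the transition image QA and the variables of the control information X5 can be sketched as a small dispatch function. The region names, the drag deltas `dx` and `dy`, and the clamping at zero are hypothetical choices for illustration, not details taken from the description.

```python
from dataclasses import dataclass, replace
from typing import Optional


@dataclass(frozen=True)
class ControlInfo:
    z1: float  # pitch fluctuation range Z1
    z2: float  # pitch fluctuation length Z2
    z3: float  # vibrato duration Z3
    z4: str    # vibrato type Z4


def apply_drag(info: ControlInfo, region: str, dx: float = 0.0,
               dy: float = 0.0, new_type: Optional[str] = None) -> ControlInfo:
    """Return the control information edited to match a drag on image QA."""
    if region == "left_end":          # left end moved along the pitch axis
        return replace(info, z1=max(0.0, info.z1 + dy))
    if region == "start_section":     # start section stretched or shrunk
        return replace(info, z2=max(0.0, info.z2 + dx))
    if region == "vibrato_section":   # vibrato section stretched or shrunk
        return replace(info, z3=max(0.0, info.z3 + dx))
    if region == "vibrato_shape" and new_type is not None:
        return replace(info, z4=new_type)  # amplitude/period change selects a type
    return info  # Unknown region: leave the control information untouched.
```

Returning a new immutable object rather than mutating in place is one possible design; it makes it easy to redraw the broken-line preview before committing the edit.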

As described above, in the first display state the control information X5 is edited by operating the setting screens (61, 62) that are displayed in response to an instruction on the expression graphic E, whereas in the second display state the control information X5 is edited by operating the transition image QA arranged in the score area 51 (that is, by a direct operation on the score area 51). It is therefore possible, for example, to edit the control information X5 of each note intuitively while checking the relationship among the notes in the score area 51.

If the transition image QA were displayed in the score area 51 and user operations were accepted even in the first display state, in which the display magnification Rt is low, a large number of note graphics V and transition images QA would be mixed together at a small size in the score area 51, making it difficult for the user to operate a transition image QA accurately enough to edit the control information X5 to a desired value. In the first embodiment, the transition image QA is displayed only in the second display state (enlarged display), in which the display magnification Rt exceeds the threshold TH, so there is the advantage that the transition image QA is easy to operate accurately. In the first display state, on the other hand, the control information X5 can be set accurately by operating the setting screens (61, 62).
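The switch between the two display states reduces to a comparison of the display magnification Rt with the threshold TH. The sketch below assumes a strict comparison (Rt must exceed TH); the function names and the string labels are hypothetical.

```python
def display_state(rt: float, th: float) -> str:
    """Second display state only when the magnification Rt exceeds threshold TH."""
    return "second" if rt > th else "first"


def editing_route(rt: float, th: float) -> str:
    """How control information X5 is edited in the current display state."""
    if display_state(rt, th) == "second":
        # Direct manipulation of the transition image QA in the score area.
        return "transition_image"
    # Setting screens 61/62 opened from the expression graphic E.
    return "setting_screen"
```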

<Second Embodiment>
A second embodiment of the present invention will be described. In each of the embodiments illustrated below, elements whose operation and function are equivalent to those of the first embodiment are denoted by the reference signs used in the above description, and detailed description of each is omitted as appropriate.

FIG. 7 is a schematic diagram of the editing screen 50 (score area 51) in the second display state in the second embodiment. In the second embodiment, variable instruction images QB (QB1 to QB4) are arranged at a position corresponding to each note graphic V in the score area 51, in addition to the same transition image QA as in the first embodiment. The variable instruction images QB corresponding to each note are images that show the values of the control information X5 of that note.

Specifically, the variable instruction image QB1 displays the pitch fluctuation range Z1, and the variable instruction image QB2 displays the pitch fluctuation length Z2. The variable instruction image QB3 displays the vibrato duration Z3, and the variable instruction image QB4 displays the vibrato type Z4. The display in the first display state is the same as in the first embodiment. That is, in the first display state, neither the transition image QA nor the variable instruction images QB are displayed in the score area 51.

By operating the input device 24 as appropriate, the user can specify the value of a desired variable instruction image QB. The display control unit 32 changes the value shown in the variable instruction image QB to the value specified by the user. The edit processing unit 34 edits the control information X5 of the note corresponding to that variable instruction image QB in accordance with the value the user specified.

For example, when the value of the variable instruction image QB1 is changed, the pitch fluctuation range Z1 in the control information X5 is updated to the changed value, and when the value of the variable instruction image QB2 is changed, the pitch fluctuation length Z2 in the control information X5 is updated to the changed value. Similarly, the vibrato duration Z3 in the control information X5 is updated in response to an instruction on the variable instruction image QB3, and the vibrato type Z4 in the control information X5 is updated in response to an instruction on the variable instruction image QB4 (for example, an instruction selecting a vibrato type from a plurality of candidates in a pull-down menu).

When the control information X5 is updated in response to an instruction from the user on a variable instruction image QB, the display control unit 32 updates the transition image QA so that it corresponds to the updated control information X5. As in the first embodiment, the user can also change the transition image QA by operating the input device 24. When the control information X5 is updated to correspond to a change in the transition image QA, the display control unit 32 changes the value of each variable instruction image QB to the updated content of the control information X5.
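The two-way update between the variable instruction images QB and the transition image QA can be sketched with a shared model that notifies every view whenever the control information changes. The class `NoteControlModel`, its method names, and the `source` tag are hypothetical; the description does not specify how the display control unit 32 and the edit processing unit 34 are implemented.

```python
class NoteControlModel:
    """Holds the control information X5 of one note and notifies listeners."""

    def __init__(self, **values):
        self._values = dict(values)
        self._listeners = []

    def subscribe(self, callback):
        # callback(key, value, source) is called after every update.
        self._listeners.append(callback)

    def get(self, key):
        return self._values[key]

    def update(self, key, value, source):
        # 'source' names the image that triggered the edit (e.g. "QB1" or "QA"),
        # so a view can skip redrawing itself in response to its own edit.
        self._values[key] = value
        for callback in self._listeners:
            callback(key, value, source)
```

A view for QA would subscribe and redraw its curve when the source is one of QB1 to QB4, while the QB views would refresh their displayed numbers when the source is QA.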

As described above, in the second display state of the second embodiment, the control information X5 is edited by operating the variable instruction images QB (QB1 to QB4) arranged in the score area 51 (that is, by a direct operation on the score area 51). Therefore, as in the first embodiment, it is possible, for example, to edit the control information X5 of each note intuitively while checking the relationship among the notes in the score area 51. Furthermore, because the variable instruction images QB are displayed in the second display state, in which the display magnification Rt is high, changing their values is easier than in a configuration that displays the variable instruction images QB even in the first display state, in which the display magnification Rt is low. In the first display state, the variable instruction images QB are omitted and the score area 51 is simplified, so there is the further advantage that the user can easily check each note graphic V.

<Third Embodiment>
The control information X5 of the third embodiment includes process selection information Z5 in addition to the same variables (Z1 to Z4) as in the first embodiment. For each of a plurality of types of predetermined processes (hereinafter referred to as "specific processes"), the process selection information Z5 in the control information X5 of each note specifies whether that process is executed when the synthesized sound of the note is generated. A specific process is a process that the speech synthesizer 36 can execute during speech synthesis. In the following description, portamento processing and automatic segment determination processing are given as examples of specific processes. Portamento processing continuously connects the pitches of two successive notes. Automatic segment determination processing automatically selects the speech segment corresponding to the speech code X4 (the pronounced character) specified by the user. When automatic segment determination processing is not executed, a speech segment arbitrarily specified by the user is selected regardless of the speech code X4. That is, the user's selection of the speech segment is protected.

FIG. 8 is a schematic diagram of the editing screen 50 (score area 51) in the second display state in the third embodiment. In the third embodiment, process selection images QC (QC1, QC2) are arranged at a position corresponding to each note graphic V in the score area 51, in addition to the same transition image QA as in the first embodiment. The process selection images QC corresponding to each note indicate whether the specific processes are executed when the synthesized sound of that note is generated. The process selection image QC1 indicates whether portamento processing is executed, and the process selection image QC2 indicates whether automatic segment determination processing is executed. The display control unit 32 displays each process selection image QC (QC1, QC2) in accordance with the process selection information Z5 of the control information X5. A state in which the check box of the process selection image QC1 is set to on (checked) means that portamento processing is executed, and a state in which the check box of the process selection image QC2 is set to on means that automatic segment determination processing is not executed. The display in the first display state is the same as in the first embodiment. That is, in the first display state, neither the transition image QA nor the process selection images QC are displayed in the score area 51.

By operating the input device 24 as appropriate (for example, by clicking the check box of a process selection image QC with a mouse), the user can specify whether the specific process corresponding to each process selection image QC is executed. The display control unit 32 sets the check box of each process selection image QC to on or off in accordance with the instruction from the user. The edit processing unit 34 edits the control information X5 (process selection information Z5) of the corresponding note in accordance with the user's instruction on each process selection image QC.
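The check-box semantics are easy to get backwards, because QC1 checked means that portamento processing runs while QC2 checked means that automatic segment determination does not run. A minimal sketch of that mapping follows; the class `ProcessSelection`, its field names, and the default values are assumptions made for illustration.

```python
from dataclasses import dataclass


@dataclass
class ProcessSelection:
    """Process selection information Z5 (field names and defaults are assumed)."""
    portamento: bool = False    # run portamento processing
    auto_segment: bool = True   # run automatic segment determination


def checkbox_states(z5: ProcessSelection) -> dict:
    """Check-box display derived from Z5; QC2 is shown inverted."""
    return {"QC1": z5.portamento, "QC2": not z5.auto_segment}


def toggle(z5: ProcessSelection, box: str) -> ProcessSelection:
    """Apply a click on QC1 or QC2 to the process selection information."""
    if box == "QC1":
        z5.portamento = not z5.portamento
    elif box == "QC2":
        # Checking QC2 protects the user's segment choice (auto selection off).
        z5.auto_segment = not z5.auto_segment
    else:
        raise ValueError(f"unknown process selection image: {box}")
    return z5
```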

As described above, in the second display state of the third embodiment, the control information X5 is edited by operating the process selection images QC (QC1, QC2) arranged in the score area 51 (that is, by a direct operation on the score area 51). Therefore, as in the first embodiment, it is possible, for example, to edit the control information X5 of each note intuitively while checking the relationship among the notes in the score area 51. Furthermore, because the process selection images QC are displayed in the second display state, in which the display magnification Rt is high, they are easier to operate than in a configuration that displays the process selection images QC even in the first display state, in which the display magnification Rt is low. In the first display state, the process selection images QC are omitted and the score area 51 is simplified, so there is the further advantage that the user can easily check each note graphic V.

<Fourth Embodiment>
FIG. 9 is a schematic diagram of the editing screen 50 in the second display state in the fourth embodiment. In the first embodiment, the transition image QA is arranged around each note graphic V in the score area 51. As shown in FIG. 9, the display control unit 32 of the fourth embodiment arranges the transition image QA of each note inside the note graphic V of that note (within its outline).

As shown in FIG. 9, the editing screen 50 of the fourth embodiment includes an operator image (slider) 53 with which the user changes the display magnification Rp of the score area 51 in the pitch axis direction. The user can operate the operator image 53 as appropriate using the input device 24. The display control unit 32 stretches or shrinks the display in the score area 51 in the pitch axis direction so that it has the display magnification Rp that the user specified by operating the operator image 53.

FIG. 10 is a schematic diagram of the editing screen 50 when the display magnification Rp is increased from the state of FIG. 9. As shown in FIG. 10, the higher the display magnification Rp, the more the display control unit 32 widens the display width of one pitch in the pitch axis direction (the spacing of the horizontal broken lines in FIG. 10) and stretches each note graphic V in the pitch axis direction. The display control unit 32 also stretches the transition image QA in the pitch axis direction in step with the stretching of the note graphic V. As a result, the temporal change of the pitch indicated by the transition image QA is emphasized and becomes easier to see. The control information X5 is not changed when the transition image QA is stretched or shrunk in step with the display magnification Rp. The configuration in which the control information X5 is edited in response to an instruction from the user on the transition image QA is the same as in the first embodiment.
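That the stretch is purely a display transform can be shown with a short sketch that maps pitch values to screen coordinates. The base row height, the top pitch of 127, and the function names are hypothetical; only the scaling by Rp and the fact that the underlying data is left untouched come from the description.

```python
def pitch_to_y(pitch: float, rp: float,
               base_row_height: float = 10.0, top_pitch: float = 127.0) -> float:
    """Screen y of a pitch; one pitch row is base_row_height * Rp pixels tall."""
    return (top_pitch - pitch) * base_row_height * rp


def contour_to_screen(contour, rp: float):
    """Map (time, pitch) samples of image QA to (time, y) for drawing.

    The input list is not modified, mirroring the statement that changing
    the display magnification Rp does not alter the control information X5.
    """
    return [(t, pitch_to_y(p, rp)) for t, p in contour]
```

Because the note graphic V and the transition image QA would pass through the same `pitch_to_y` mapping, the curve stays aligned with the pitch axis at any magnification.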

The fourth embodiment achieves the same effects as the first embodiment. In addition, because the display magnification Rp in the pitch axis direction can be changed in the fourth embodiment, the user can examine in detail the temporal change of the pitch indicated by the transition image QA by increasing the display magnification Rp.

Moreover, in the fourth embodiment the pitch expressed by the transition image QA corresponds to the values on the pitch axis. Compared with the configuration of the first embodiment, in which the transition image QA is arranged around the note graphic V (that is, a configuration in which the pitch expressed by the transition image QA does not necessarily match the values on the pitch axis), the user can therefore grasp the pitch values indicated by the transition image QA accurately and easily, which in turn makes editing easier.

<Fifth Embodiment>
The control information X5 of the fifth embodiment includes, in addition to the same variables (Z1 to Z4) as in the first embodiment, variables (strength Z6 and attenuation Z7) that define fine changes in amplitude immediately after the sounding time X2 of each note. The strength Z6 defines the rate at which the amplitude of the voice increases immediately after the sounding time X2 (accent), and the attenuation Z7 defines the degree to which the amplitude of the voice decays after sounding starts (decay).

FIG. 11 is a schematic diagram of the editing screen 50 in the second display state in the fifth embodiment. In the second display state, the display control unit 32 arranges a transition image QD at a position corresponding to each note graphic V in the score area 51. The transition image QD is an image (a curve or a polyline) that expresses the temporal change of the amplitude of each note with the control information X5 (Z6, Z7) actually reflected. That is, as shown in FIG. 11, the amplitude indicated by the transition image QD increases from the sounding time X2 at a rate corresponding to the strength Z6, reaches a target value, and then decays over time to a degree corresponding to the attenuation Z7.
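The amplitude trajectory drawn as the transition image QD can be sketched in the same spirit. The linear attack, the exponential decay, and the interpretation of Z6 as a slope and Z7 as a decay rate are assumptions; the description only says that the amplitude rises at a rate given by Z6 to a target value and then decays according to Z7. The function name `amplitude_at` is hypothetical.

```python
import math


def amplitude_at(t: float, z6: float, z7: float, target: float = 1.0) -> float:
    """Amplitude at time t after the sounding time X2.

    z6: strength (accent), treated here as attack slope in amplitude per second.
    z7: attenuation (decay), treated here as an exponential decay rate per second.
    """
    if z6 <= 0.0:
        raise ValueError("strength Z6 must be positive in this sketch")
    attack_time = target / z6
    if t < attack_time:
        return z6 * t  # Rise toward the target value at a rate set by Z6.
    # After reaching the target, decay over time to a degree set by Z7.
    return target * math.exp(-z7 * (t - attack_time))
```

With `z7 = 0.0` the amplitude simply holds at the target value, which corresponds to a note with no decay.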

As with the transition image QA of the first embodiment, the user can instruct editing of the control information X5 by directly operating the transition image QD using the input device 24. That is, the display control unit 32 changes the shape of the transition image QD in accordance with the instruction from the user, and the edit processing unit 34 edits the control information X5 (strength Z6, attenuation Z7) of the note indicated by the transition image QD that the user operated so that it corresponds to the change in that transition image QD. The fifth embodiment therefore achieves the same effects as the first embodiment.

The variable instruction images QB of the second embodiment or the process selection images QC of the third embodiment may also be arranged in the score area 51 for each note together with the transition image QD (or in place of the transition image QD). The variable instruction images QB can be used, for example, for the user to specify the strength Z6 and the attenuation Z7. As in the fourth embodiment, a configuration in which the transition image QD is arranged inside the note graphic V, or a configuration in which the transition image QD is stretched or shrunk in the pitch axis direction together with the note graphic V according to the display magnification Rp in the pitch axis direction, may also be adopted.

<Modifications>
Each of the above embodiments can be modified in various ways. Specific modifications are illustrated below. Two or more aspects arbitrarily selected from the following examples may be combined as appropriate.

(1) In the second display state, in which the display magnification Rt in the time axis direction is high, fewer note graphics V are displayed in the score area 51 than in the first display state. A configuration is therefore preferable in which information corresponding to the notes before and after the display target section of the piece is arranged in the score area 51 together with the information within the display target section. For example, as shown in FIG. 12, the display control unit 32 arranges in the score area 51, together with the speech code X4_A of each note within the display target section (the character string "さいた"), the speech code X4_B (the character string "はなが") of a predetermined number of notes immediately following the display target section (that is, notes whose note graphics V are not displayed in the score area 51). With this configuration, information on each note (for example, the speech code X4) can be checked over a wide range of the piece even when the display magnification Rt is high.

(2) Any configuration may be used for the user to change the display magnification Rt or the display magnification Rp. For example, a configuration in which the user specifies the display magnification Rt or Rp numerically, or a configuration in which the display magnification Rt or Rp is set to a predetermined value (for example, the threshold TH) by a predetermined user operation (for example, pressing a button), may be adopted. It is also possible, when the user selects one note graphic V, to set the display magnification Rt so that the note graphics V in a predetermined range including the selected note graphic V are positioned in the score area 51. For example, when the user selects the second note graphic V from the beginning (speech code "い") in the first display state illustrated in FIG. 3, the display magnification Rt is set automatically so that three note graphics V, namely the selected note graphic V and the note graphics V immediately before and after it, span the entire score area 51 in the time axis direction (that is, so that the display changes from that of FIG. 3 to that of FIG. 5).
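The automatic setting of the display magnification Rt when a note is selected can be sketched as follows. Treating Rt as pixels per unit of time, representing notes as `(start, end)` pairs, and the function name `magnification_for_selection` are all assumptions made for illustration.

```python
def magnification_for_selection(notes, index: int, area_width: float,
                                neighbors: int = 1):
    """Return (Rt, view_start) so the selected note and its neighbors fill the area.

    notes: list of (start, end) times sorted by start time.
    Rt is expressed as pixels per time unit (an assumed convention).
    """
    first = max(0, index - neighbors)
    last = min(len(notes) - 1, index + neighbors)
    view_start = notes[first][0]
    view_end = notes[last][1]
    span = view_end - view_start
    if span <= 0:
        raise ValueError("selected range has no duration")
    return area_width / span, view_start
```

For the example in the text, selecting the second note with `neighbors=1` yields a view that starts at the first note and ends at the third, scaled to the full width of the score area.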

(3) The form of the image for receiving from the user an instruction to change each item of control information X5 (hereinafter referred to as an "editing image") is not limited to the above examples. For example, an image of an operator that rotates in response to an instruction from the user (a knob), or an image of an operator that moves linearly in response to an instruction from the user (a slider), may be adopted as, for example, the variable instruction image QB of the second embodiment. An editing image may also be formed by appropriately combining the transition image QA of the first embodiment, the variable instruction image QB of the second embodiment, and the process selection image QC of the third embodiment. As understood from the above description, the editing images (QA to QD) illustrated in the above embodiments are encompassed as images for receiving from the user an instruction to change the control information X5, and a change instruction on an editing image is reflected in the control information X5 directly (that is, without going through an operation on another image such as the setting screen 61 or the setting screen 62).

(4) In each of the above embodiments, the editing images (QA to QD) are arranged around each note graphic V in the second display state, in which the display magnification Rt exceeds the threshold TH, but the editing images (QA to QD) may also be arranged around each note graphic V in both the first display state and the second display state. When the editing images (QA to QD) are displayed in both display states, a configuration is preferably adopted in which more types of control information X5 can be edited by operating the editing image in the second display state (enlarged display) than can be edited by operating the editing image in the first display state.

For example, when the transition image QA of the first embodiment, which expresses the temporal change of the pitch of each note, is displayed, only changes to the vibrato duration Z3 are permitted in the first display state, whereas changes to both the vibrato duration Z3 and the type (pitch fluctuation range) Z4 are permitted in the second display state. It is also possible to permit only changes to the vibrato duration Z3 in the first display state and, in the second display state, to arrange inside each note graphic V a transition image QA that accepts instructions to change the vibrato duration Z3, the type Z4, and so on. A configuration may also be adopted in which a transition image QA that accepts instructions to change the temporal change of the pitch is displayed in the first display state, and a process selection image QC indicating whether the specific processes are executed is displayed in addition to the transition image QA in the second display state. Each of the aspects illustrated above is encompassed as a configuration in which the number of types of control information X5 that can be edited by operating the editing image changes according to the display magnification Rt (for example, a configuration in which the number of types of control information X5 editable in the second display state exceeds the number editable in the first display state). The specific content of the images displayed in each of the first display state and the second display state is not limited to the above examples and may be changed as appropriate. A configuration in which, for example, the display mode of the mouse cursor is changed when the display magnification Rt exceeds the threshold TH may also be adopted.

(5) The types of control information X5 that can be edited in response to user instructions on the edit image are not limited to the above examples. For example, a configuration may be adopted in which variables such as the volume of the synthesized sound (dynamics, velocity), its clarity (the degree to which high-frequency components are increased or decreased), and the degree of mouth opening during vocalization are treated as control information X5 and edited in response to instructions on the edit image. In other words, the control information X5 encompasses any variable applied to speech synthesis. A configuration in which the user can select which of the control information X5 is displayed in the score area 51 is also suitable.

(6) In each of the above embodiments, the notes of a single part of a musical piece are displayed in the score area 51, but the notes of each of a plurality of parts may also be displayed in the score area 51 simultaneously or selectively. The note graphics V are displayed in a different manner for each part (that is, in a manner in which differences in color or gradation make the note graphics V of each part visually distinguishable).
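
One possible way to give each part a visually distinguishable color is sketched below. The embodiments do not specify how colors are chosen; spacing the hues evenly around the color wheel is an assumption made for illustration, and the function name `part_colors` is hypothetical.

```python
import colorsys
from typing import List


def part_colors(num_parts: int) -> List[str]:
    """Return one hex color string per part, with evenly spaced hues.

    Assumption: fixed lightness (0.5) and saturation (0.8) are used so that
    only the hue distinguishes the parts.
    """
    colors = []
    for i in range(num_parts):
        hue = i / num_parts  # evenly spaced in [0, 1)
        r, g, b = colorsys.hls_to_rgb(hue, 0.5, 0.8)
        colors.append("#{:02x}{:02x}{:02x}".format(
            round(r * 255), round(g * 255), round(b * 255)))
    return colors
```

Each note graphic V would then be drawn in the color assigned to its part; a gradation-based scheme could vary lightness instead of hue.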

(7) The format of the process selection image QC in the third embodiment is not limited to the above examples. For example, as illustrated in FIG. 13, a process selection image QC3 signifying that a specific process such as the automatic segment determination process or the portamento process is executed (that is, the state in which the check box of the aforementioned process selection image QC1 or QC2 is set to ON) may be added to the note graphic V (for example, superimposed on it). In the example of FIG. 13, a graphic (a triangle) whose display mode differs from that of the note graphic V is placed at a corner (the lower right corner) of the note graphic V as the process selection image QC3. When the user operates the process selection image QC3 (for example, clicks it with the mouse), the process selection image QC3 is erased, which corresponds to the state in which the specific process is not executed. When the user operates the corner of the note graphic V while the process selection image QC3 is erased, the process selection image QC3 is displayed again. The user can therefore visually confirm whether the specific process is executed from whether the process selection image QC3 is shown or hidden. Besides controlling whether the process selection image QC is shown or hidden, a configuration that controls its display mode (color or gradation) according to whether the specific process is executed may also be adopted.
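
The show/hide behavior of the process selection image QC3 amounts to a per-note toggle, which can be sketched as follows. The class `Note`, its fields, and the two functions are hypothetical names introduced for illustration; the initial state (process enabled) is also an assumption.

```python
from dataclasses import dataclass


@dataclass
class Note:
    """Hypothetical per-note record; only the fields needed here are shown."""
    pitch: int
    specific_process_enabled: bool = True  # assumption: enabled by default


def on_corner_clicked(note: Note) -> None:
    """Toggle execution of the specific process when the corner is operated."""
    note.specific_process_enabled = not note.specific_process_enabled


def marker_visible(note: Note) -> bool:
    """The marker (QC3) is shown exactly when the specific process is executed."""
    return note.specific_process_enabled
```

In this sketch the visibility of the marker is derived from the stored flag rather than kept as separate state, so the display and the control information cannot disagree.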

(8) In each of the above embodiments, the storage device 14 that stores the speech element group DA and the music data DB is mounted in the speech synthesizer 100, but a configuration may also be adopted in which an external device independent of the speech synthesizer 100 (for example, a server device) stores one or both of the speech element group DA and the music data DB. In that case, the speech synthesizer 100 acquires the speech element group DA or the music data DB via, for example, a communication network, and then displays the editing screen 50, edits the music data DB, and synthesizes the speech signal S. As understood from the above description, the element that stores the speech element group DA and the music data DB (the storage device 14 in each of the above embodiments) is not an essential element of the speech synthesizer 100.

(9) The above embodiments exemplify the synthesis of Japanese speech, but the language of the speech to be synthesized is arbitrary and is not limited to Japanese. For example, the above embodiments can be applied in the same way to generating speech in any language, such as English, Spanish, Chinese, or Korean.

(10) The above embodiments exemplify the speech synthesizer 100 including the speech synthesis unit 36, but the present invention may also be realized as a device (a music data editing device) that displays the music data DB on the display device 22 and edits it in response to instructions from the user. The music data editing device has, for example, the configuration of the speech synthesizer 100 of FIG. 1 with the speech synthesis unit 36 omitted. Put another way, the speech synthesizer 100 is realized by adding the speech synthesis unit 36 to the music data editing device.

Furthermore, the above embodiments exemplify music data DB applied to speech synthesis, but the target of synthesis using the music data DB is not limited to speech (the human voice). For example, the music data DB may also be used to synthesize the performance sounds of various musical instruments (musical tone synthesis). In other words, the music data DB encompasses data applied to sound synthesis in general, including both speech synthesis and musical tone synthesis.

DESCRIPTION OF SYMBOLS: 100 ... speech synthesizer, 12 ... arithmetic processing device, 14 ... storage device, 22 ... display device, 24 ... input device, 26 ... sound emitting device, 32 ... display control unit, 34 ... editing processing unit, 36 ... speech synthesis unit.

Claims

A music data editing apparatus for editing music data that specifies, for each note, the pitch and sounding time of a synthesized sound and control information applied to sound synthesis, the apparatus comprising:
display control means for displaying, in a score area in which a pitch axis and a time axis are set, a note graphic for each note at a position corresponding to the pitch and sounding time specified by the music data, and for arranging in the score area, in correspondence with each note graphic, an edit image that accepts an instruction to change the control information of the note indicated by that note graphic; and
editing processing means for editing the control information of the note in accordance with an instruction from a user on the edit image of each note,
wherein, when the display magnification of the score area exceeds a threshold, the display control means arranges an edit image that accepts change instructions for a greater number of types of control information than the number of types for which the edit image accepts change instructions when the display magnification is below the threshold.

The music data editing apparatus according to claim 1, wherein the display control means arranges in the score area the edit image including a transition image representing a temporal change in a feature quantity of the synthesized sound to which the control information is applied, and changes the transition image in accordance with an instruction from the user on the transition image, and
the editing processing means edits the control information so as to correspond to the change made to the transition image in accordance with the instruction from the user.

The music data editing apparatus according to claim 2, wherein the display magnification of the score area is a display magnification in the time axis direction, and
the display control means expands and contracts the transition image in the time axis direction in accordance with the display magnification in the time axis direction, and expands and contracts the transition image in the pitch axis direction in accordance with a display magnification in the pitch axis direction that is changed independently of the display magnification in the time axis direction.

The music data editing apparatus according to any one of claims 1 to 3, wherein the display control means arranges in the score area the edit image including a variable instruction image indicating a numerical value of the control information, and changes the numerical value indicated by the variable instruction image in accordance with an instruction from the user, and
the editing processing means edits the control information so as to correspond to the change made to the numerical value of the variable instruction image in accordance with the instruction from the user.

The music data editing apparatus according to any one of claims 1 to 4, wherein the display control means arranges in the score area the edit image including a process selection image indicating whether a specific process is executed at the time of sound synthesis, and changes whether the specific process indicated by the process selection image is executed in accordance with an instruction from the user, and
the editing processing means edits the control information so as to correspond to whether the specific process indicated by the process selection image is executed.

A music data editing method for editing music data that specifies, for each note, the pitch and sounding time of a synthesized sound and control information applied to sound synthesis, wherein
a computer system:
displays, in a score area in which a pitch axis and a time axis are set, a note graphic for each note at a position corresponding to the pitch and sounding time specified by the music data, and arranges in the score area, in correspondence with each note graphic, an edit image that accepts an instruction to change the control information of the note indicated by that note graphic;
edits the control information of the note in accordance with an instruction from a user on the edit image of each note; and
in arranging the edit image, when the display magnification of the score area exceeds a threshold, arranges an edit image that accepts change instructions for a greater number of types of control information than the number of types for which the edit image accepts change instructions when the display magnification is below the threshold.