WO2019082320A1 - Solfege assistance device, method for controlling same, and program - Google Patents

Solfege assistance device, method for controlling same, and program

Info

Publication number
WO2019082320A1
Authority
WO
WIPO (PCT)
Prior art keywords
pitch
unit
solfege
utterance
name
Prior art date
Application number
PCT/JP2017/038592
Other languages
French (fr)
Japanese (ja)
Inventor
松本 秀一
Original Assignee
ヤマハ株式会社
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by ヤマハ株式会社
Priority to PCT/JP2017/038592
Publication of WO2019082320A1

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10G REPRESENTATION OF MUSIC; RECORDING MUSIC IN NOTATION FORM; ACCESSORIES FOR MUSIC OR MUSICAL INSTRUMENTS NOT OTHERWISE PROVIDED FOR, e.g. SUPPORTS
    • G10G7/00 Other auxiliary devices or accessories, e.g. conductors' batons or separate holders for resin or strings
    • G10G7/02 Tuning forks or like devices
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/90 Pitch determination of speech signals

Definitions

  • the present invention relates to a solfege auxiliary device, a control method thereof, and a program.
  • a tuner is known which displays a note name close to the pitch of the input sound and lets the user adjust the input sound to the pitch of the displayed note name (Non-Patent Document 1).
  • solfege (sight-singing)
  • An auxiliary device such as a tuner used for solfege compares the pitch of the voice input to the microphone with the target pitch.
  • note names indicate absolute pitches in each octave
  • step names indicate pitches relative to the tonic.
  • a conventional auxiliary device used for solfege judges whether the pitch, that is, the absolute height of the input sound, is correct or deviates, regardless of what is uttered. Therefore, even if the sound is produced by uttering the step names of the scale (for example, Do, Re, Mi, ...), only the sounded pitch is detected, and the step name is ignored.
  • if the sound of pitch C3 is uttered with the step name "Do", the pitch is judged to be correct; however, even if the sound of pitch C3 is uttered with the step name "So", the pitch is still judged to be correct.
  • when matching the pitch G3, the tonic, to check vocalization of the G scale, the pitch is judged to be correct whether the sound of pitch G3 is uttered with the step name "Do" or with the step name "So".
  • for the singer, when checking the uttered pitch, it is natural to assume the key to be sung, utter the pitch of the tonic of that key with the step name "Do", and utter each pitch of the scale with its corresponding step name. For example, if the set key is C, the pitches C3, D3 and E3 are uttered as "Do", "Re" and "Mi". If the set key is G, the pitches G3, A3 and B3 are uttered as "Do", "Re" and "Mi".
  • conventionally, a singer has usually used a musical instrument or the like to grasp the pitch to be uttered when vocalizing in solfege. It would be convenient to be able to recognize the pitch at which each step name should be uttered for the set key without having the target pitch sounded by an instrument or the like. Also, conventionally, the relationship between the set key and the uttered step name has not been taken into consideration in pitch determination.
  • An object of the present invention is to provide a solfege auxiliary device that can contribute to assisting with solfege.
  • a setting unit (10) for setting a key
  • a pitch acquisition unit (32, 34) that specifies the note name corresponding to the acquired utterance based on the key set by the setting unit and the utterance acquired by the utterance acquisition unit, and acquires the pitch of the specified note name; a solfege auxiliary device having these units is provided.
  • a control method of a solfege auxiliary device comprising: a setting step of setting a key; an utterance acquisition step of acquiring an utterance indicating a step name from a result of speech recognition; and a pitch acquisition step of specifying the note name corresponding to the acquired utterance based on the key set in the setting step and the utterance acquired in the utterance acquisition step, and acquiring the pitch of the specified note name.
  • a program that causes a computer to execute a control method of a solfege auxiliary device, the control method comprising: a setting step of setting a key; an utterance acquisition step of acquiring an utterance indicating a step name from a result of speech recognition; and a pitch acquisition step of specifying the note name corresponding to the acquired utterance based on the key set in the setting step and the utterance acquired in the utterance acquisition step, and acquiring the pitch of the specified note name.
  • the present invention can contribute to assisting with solfege.
  • FIG. 1 is a view showing the overall configuration of a pitch determination system including a solfege auxiliary device according to an embodiment of the present invention.
  • the solfege auxiliary device of the present invention is configured as a tuner 30 as an example.
  • the tuner 30 has an audio acquisition function by the microphone 23, a display function by the display unit 18, and a sound generation function.
  • the tuner 30 has a function of mainly informing by display the pitch of the sound acquired by the microphone 23 (or the deviation from the target pitch).
  • the information terminal device 40 such as a PC or a smartphone may be communicably connected to the tuner 30 wirelessly or by wire, and the system may be configured to realize a solfege auxiliary device mainly by the information terminal device 40.
  • the pitch determined by the information terminal device 40 may be reported by the tuner 30.
  • the device connected to the information terminal device 40 need only have at least a function of notifying the determined pitch by display, sound or the like.
  • an example in which the present invention is realized mainly by the tuner 30 alone will be described.
  • FIG. 2 is a block diagram of the tuner 30.
  • the tuner 30 includes a central processing unit (CPU) 10, a timer 11, a read only memory (ROM) 12, a random access memory (RAM) 13, a storage unit 14, an operator 17, a display unit 18, a sound source 19, an effect circuit 20, a sound system 21, a communication I/F (Interface) 16, a microphone 23, and a bus 22.
  • the CPU 10 is a central processing unit that controls the entire tuner 30.
  • the timer 11 is a module that measures time.
  • the ROM 12 is a non-volatile memory that stores control programs and various data.
  • the RAM 13 is a volatile memory used as a work area of the CPU 10 and various buffers.
  • the display unit 18 is configured of a liquid crystal display panel or the like, and displays an operation state of the tuner 30, various setting screens, a message for the user, and the like.
  • the operator 17 is an operation module such as an operation button or an operation knob.
  • the external storage device 15 is, for example, an external device connected to the tuner 30 that stores audio data.
  • the communication I/F 16 is a communication module for communicating with an external device wirelessly or by wire.
  • the communication I/F 16 includes a MIDI (Musical Instrument Digital Interface) interface.
  • the bus 22 transfers data between the units in the tuner 30.
  • the microphone 23 acquires surrounding sound.
  • the sound source 19 generates sound generation data under the control of the CPU 10 based on the data stored in the storage unit 14 and the RAM 13.
  • the effect circuit 20 applies the acoustic effect designated by the operator 17 to the sounding data generated by the sound source 19.
  • the sound system 21 converts the data processed by the effect circuit 20 into an analog signal by a digital / analog converter. Then, the sound system 21 amplifies the analog signal and outputs it from a speaker or the like.
  • the display unit 18 displays various information under the control of the CPU 10.
  • FIG. 3 is a block diagram of the main part of the tuner 30 for pitch determination.
  • the tuner 30 includes an utterance acquisition unit 31, a pitch detection unit 32, a note name identification unit 33, a pitch acquisition unit 34, a pitch comparison unit 35, and a notification unit 36.
  • the utterance acquisition unit 31 performs speech recognition on the sound acquired by the microphone 23 and acquires an utterance from the recognition result.
  • the utterance acquisition unit 31 extracts and acquires only utterances indicating the target step names, and ignores (discards) utterances that are not acquisition targets.
  • the pitch detection unit 32 detects the pitch of the utterance acquired by the utterance acquisition unit 31.
  • the note name identification unit 33 identifies a note name from the utterance acquired by the utterance acquisition unit 31.
  • the note name identification unit 33 identifies the note name taking the set key into account.
  • the pitch acquisition unit 34 acquires the pitch of the specified pitch name.
  • the pitch acquisition unit 34 acquires the pitch of the pitch name in consideration of the set calibration.
  • the pitch acquisition unit 34 outputs information indicating the acquired pitch to the notification unit 36.
  • the notification unit 36 presents the pitch output from the pitch acquisition unit 34 by sounding (presentation unit).
  • the pitch comparison unit 35 as an output unit compares the pitch detected by the pitch detection unit 32 with the pitch acquired by the pitch acquisition unit 34, and outputs the comparison result to the notification unit 36.
  • the notification unit 36 notifies the user of the comparison result of the pitch.
  • the functions of the utterance acquisition unit 31, pitch detection unit 32, note name identification unit 33, pitch acquisition unit 34, pitch comparison unit 35, and calibration unit are all realized mainly by the cooperation of the storage unit 14, ROM 12, RAM 13 and CPU 10.
  • the function of the notification unit 36 is realized by the cooperation of the storage unit 14, the display unit 18, the sound source 19, the effect circuit 20, the sound system 21, and the CPU 10. Details of the pitch determination process will be described later with reference to FIG.
  • FIG. 4 is a table showing the relationship between MIDI note numbers, keys and step names. This table is stored, for example, in the ROM 12.
  • the MIDI note number (MIDI #; MIDI code) of pitch C3 is 60.
  • the note number of pitch C4 one octave higher than pitch C3 is 72.
  • the diatonic scale is "Do", "Re", "Mi", "Fa", "So", "La", "Si".
  • the key is C (C major)
  • the step names adopted as the target of utterance acquisition may be one type determined in advance, or the user may be able to select from a plurality of types.
  • there are various kinds of step names, and any of them may be adopted.
  • the Nishizuka style, the Sato style, etc. may be adopted, and types other than Japanese ones (for example, See, Dee, E, Tse, Day, AE, etc.) may be adopted.
  • the note names and step names of the Nishizuka-style and Sato-style notations are disclosed at the following URL.
  • ttps //en.wikipedia.org/wiki/%E9%9F%B3%E5%90%8D%E3%83%BB%E9%9A%8A%8E%E5%90%8D%E8%A1%A1%A8%E8 % A8% 98
  • for simplicity of explanation, the step names targeted for utterance acquisition are assumed to be those of the diatonic scale (so Di, Me, etc. are not included), but the chromatic scale may also be targeted (including Di, Me, etc.).
  • FIG. 5 is a flowchart of the pitch determination process. This process is realized by the CPU 10 reading out the program stored in the ROM 12 to the RAM 13 and executing the program. This process is started, for example, when an instruction to start the pitch determination process is given via the operator 17 or the like.
  • the CPU 10 executes setting processing based on an instruction from the user via the operator 17 as a designation unit (step S101). For example, the CPU 10 sets the key (setting unit), such as C or D, the utterances indicating the step names to be acquired (hereinafter, acquisition target step names), that is, the utterances targeted for acquisition, and the calibration setting. The calibration, which shifts the pitch reference corresponding to each note name (adjusts the pitch of the note names), is applied to all pitches. For example, the frequency of the pitch A3 is normally 440 Hz, but a value shifted from 440 Hz (for example, 442 Hz) is set as the frequency of the pitch A3 by the calibration setting. In the description of FIG.
  • in step S102, the CPU 10 waits until there is a voice input.
  • the voice is input from the microphone 23; a voice input is judged to be present when any sound is input, not only an utterance targeted for acquisition.
  • the utterance acquisition unit 31 recognizes the input speech by a known method, and acquires an utterance targeted for acquisition from the recognition result (step S103).
  • the CPU 10 determines whether an utterance targeted for acquisition has been acquired (step S104). If no such utterance has been acquired, the process returns to step S102; if one has been acquired, the process proceeds to step S105.
  • in step S105, the pitch detection unit 32 detects the pitch of the utterance acquired this time.
  • the CPU 10 identifies the note name corresponding to the utterance acquired this time, and acquires the pitch of the identified note name (step S106). Specifically, first, the note name identification unit 33 identifies the note name corresponding to the utterance acquired this time, based on the set key and the utterance acquired this time. For example, when the set key is D (D major) and the utterance acquired this time is "Do", the note name D is identified. Alternatively, if the set key is E (E major) and the utterance acquired this time is "Do", the note name E is identified.
  • the pitch acquisition unit 34 then refers to the table (FIG. 4) to acquire the pitch of the identified note name. At that time, the pitch acquisition unit 34 acquires, as the pitch of the identified note name, the frequency corresponding to the MIDI code defined in the table, shifted according to the calibration setting. In step S106, the pitch acquisition unit 34 also sends (information on) the acquired pitch to the notification unit 36. The notification unit 36 then presents the received pitch by sounding it. Thereby, the singer can recognize the target pitch to be uttered for the key and the step name. Presenting the pitch here is not essential.
  • the pitch comparison unit 35 compares the pitch of the utterance acquired this time, as detected by the pitch detection unit 32 in step S105 (detected pitch), with the pitch of the identified note name acquired by the pitch acquisition unit 34 in step S106 (note name pitch), and outputs the comparison result (Δpitch) (step S107).
  • if the deviation between the detected pitch and the note name pitch is one octave or more, the pitch comparison unit 35 corrects the deviation in units of octaves (integer multiples of 1200 cents) so that it is less than one octave, and takes the corrected value as the Δpitch. Providing such octave-unit correction is not essential.
  • the notification unit 36 determines whether the Δpitch output from the pitch comparison unit 35 is within a predetermined range, that is, whether its absolute value is within a first predetermined value (for example, whether the Δpitch is within a range of -400 to +400 cents) (step S108). Then, when the Δpitch is within the predetermined range, the notification unit 36 performs a notification process (step S109).
  • a predetermined range (for example, the Δpitch is within a range of -400 to +400 cents).
  • notification may be performed by a value, a color, or a mark corresponding to the Δpitch using an LED or the like, or may be performed by voice.
  • the timbre and volume may be varied according to the Δpitch.
  • the Δpitch may be read out by voice.
  • after step S109, the process proceeds to step S110. If the absolute value of the Δpitch is out of the predetermined range, the notification unit 36 advances the process to step S110 without executing the notification process. Therefore, when the singer utters at a pitch extremely far from the target, the Δpitch, which is the comparison result, is not notified; notification is performed only when the Δpitch indicates a deviation less than the first predetermined value. Thereby, the troublesome alerting
  • the CPU 10 determines whether or not an instruction to end the pitch determination process has been issued via the operator 17 or the like (step S110). When there is no instruction to end the pitch determination process, the CPU 10 executes other processes (step S111) and returns the process to step S102. In the other processes, changes of setting contents, various guidance and the like are executed. On the other hand, when an instruction to end the pitch determination process is issued, the process of FIG. 5 ends.
  • FIG. 6A is a musical score example showing musical notes uttered by a singer.
  • FIG. 6B is a time chart showing the utterance pitch.
  • the set key is C (C major)
  • the acquisition target step names are the Japanese-style step names ("Do", "Re", "Mi", ..., "Si").
  • in the pointer display of the notification unit 36, a pointer pointing straight up indicates that the detected pitch does not deviate from the reference note name pitch. A pointer inclined to the right means that the detected singing pitch is higher than the note name pitch (Δpitch is positive). Display examples by the notification unit 36 are shown as displays t1 to t9.
  • the singer utters Do at the first note.
  • the singer intended to utter at pitch C3 but sang too high, close to pitch E3, so the pointer is inclined to the right (display t1).
  • the singer uttered Do
  • the uttered pitch was slightly high (display t2), but once the pitch was corrected, it became almost appropriate (display t3).
  • the uttered pitch is slightly higher than pitch C3 (display t4); the singer then utters Re, but the pitch is low.
  • the pitch was corrected to a nearly appropriate pitch (display t6).
  • the singer utters Mi at the third note. Since the singer scooped up into the third note, the pointer is inclined to the left at the beginning (display t7); the pitch then becomes almost appropriate once it is corrected (display t8), and after that the pitch is on the high side (display t9).
  • the CPU 10 identifies the note name corresponding to the utterance based on the set key and the utterance acquired from the speech recognition result, and acquires the pitch of the identified note name. The CPU 10 then presents the acquired pitch by sounding it. As a result, the singer can be informed in advance of the pitch to be uttered for the key and the step name being uttered, which contributes to assisting with solfege.
  • the CPU 10 outputs a comparison result (Δpitch) between the pitch of the acquired utterance and the pitch of the identified note name.
  • Δpitch: the comparison result between the pitch of the acquired utterance and the pitch of the identified note name.
  • when the comparison result indicates a deviation of one octave or more, the value corrected so that the deviation is less than one octave is taken as the Δpitch, so the uttered pitch can be judged regardless of the singer's vocal range.
  • since the utterances indicating the step names to be acquired can be designated, only the designated step names are used for pitch determination, and erroneous determination is less likely to occur.
  • to help the singer correct the pitch, the CPU 10 may perform further processing in addition to the above notification in the notification process of step S109 described above.
  • the target note name pitch may be sounded.
  • the CPU 10 may mute this sound (stop the presentation) when the singing pitch is corrected and the absolute value of the Δpitch falls below a second predetermined value.
  • the CPU 10 may sound the target note name pitch for a certain period of time when the note name corresponding to the utterance acquired this time is identified.
  • the meaning of a command may be interpreted, so that the user can instruct the setting of the key, the acquisition target step names, the calibration, and the like by command.
  • the device connected thereto may have a notification function, and is not limited to the tuner 30.
  • There is no limitation on the method of acquiring the application for realizing the present invention.
  • the application executed by the information terminal device 40 may not be originally installed in the information terminal device 40, and may be downloaded and installed after the fact.
  • the same effect may be achieved by reading into the present device a storage medium storing a control program represented by software for achieving the present invention, in which case the program code is read from the storage medium.
  • the program code itself implements the novel functions of the present invention, and the non-transitory computer readable recording medium storing the program code constitutes the present invention.
  • the program code may be supplied via a transmission medium or the like, in which case the program code itself constitutes the present invention.
  • ROMs, floppy disks, hard disks, optical disks, magneto-optical disks, CD-ROMs, CD-Rs, magnetic tapes, non-volatile memory cards, etc. can be used as storage media in these cases.
  • the "non-transitory computer readable recording medium" also includes one that holds a program for a fixed time, such as a volatile memory (for example, a dynamic random access memory (DRAM)) inside a computer system serving as a server or client when the program is transmitted via a network such as the Internet or a communication line such as a telephone line.
  • DRAM (dynamic random access memory)
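
The calibration, pitch comparison and notification check described in steps S106 to S108 above can be sketched roughly as follows. This is a minimal illustration and not the patented implementation: the function names are hypothetical, equal temperament is assumed, A3 is taken as MIDI note 69 to follow the text's convention that C3 is note 60, and the octave correction is implemented with a simple remainder because the text does not specify the exact correction rule.

```python
import math

# Assumption: follows the text's convention that pitch C3 is MIDI note 60,
# so A3 (the 440 Hz reference) is MIDI note 69.
A3_MIDI = 69


def note_frequency(midi_note, a3_hz=440.0):
    """Frequency of a MIDI note in equal temperament.

    a3_hz plays the role of the calibration setting (e.g. 442.0 instead of
    440.0); changing it shifts every note, as described for step S101.
    """
    return a3_hz * 2.0 ** ((midi_note - A3_MIDI) / 12.0)


def delta_pitch_cents(detected_hz, target_hz):
    """Signed deviation of the detected pitch from the target, in cents.

    Deviations of one octave (1200 cents) or more are reduced by whole
    octaves so the result is always less than one octave in magnitude.
    """
    cents = 1200.0 * math.log2(detected_hz / target_hz)
    return math.fmod(cents, 1200.0)  # keeps the sign of the deviation


def should_notify(delta_cents, limit_cents=400.0):
    """True when the deviation is small enough to be worth reporting."""
    return abs(delta_cents) < limit_cents


if __name__ == "__main__":
    target = note_frequency(60, a3_hz=442.0)       # C3 with 442 Hz calibration
    sung = note_frequency(61, a3_hz=442.0) * 2.0   # one octave plus a semitone high
    delta = delta_pitch_cents(sung, target)
    print(round(delta, 3), should_notify(delta))
```

Using a remainder keeps the direction of the deviation, so a singer who is an octave and a little sharp is still shown as sharp. Folding to the nearest octave instead would be an equally plausible reading of the text.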

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Signal Processing (AREA)
  • Electrophonic Musical Instruments (AREA)

Abstract

Provided is a solfege assistance device that can contribute to assisting with solfege. When a to-be-acquired utterance is acquired, a CPU 10 detects the pitch of the utterance, specifies the note name that corresponds to the currently acquired utterance on the basis of the set key and the currently acquired utterance, and acquires the pitch of the specified note name.

Description

Solfege auxiliary device, control method therefor, and program
 The present invention relates to a solfege auxiliary device, a control method thereof, and a program.
 Conventionally, a tuner has been known which displays a note name close to the pitch of an input sound and lets the user adjust the input sound to the pitch of the displayed note name (Non-Patent Document 1). In solfege (sight-singing), in which a score is sung at first sight, the singer wants to check the pitch of the sound produced with his or her own voice. An auxiliary device such as a tuner used for solfege compares the pitch of the voice input to the microphone with the target pitch.
 However, whereas note names indicate absolute pitches in each octave, step names indicate pitches relative to the tonic. A conventional auxiliary device used for solfege judges whether the pitch, that is, the absolute height of the input sound, is correct or deviates, regardless of what is uttered. Therefore, even if the sound is produced by uttering the step names of the scale (for example, Do, Re, Mi, ...), only the sounded pitch is detected, and the step name is ignored.
 For example, when a singer matches the pitch of the note name C, the tonic (for example, pitch C3), to check vocalization of the C scale, the pitch is judged to be correct if the sound of pitch C3 is uttered with the step name "Do"; however, even if the sound of pitch C3 is uttered with the step name "So", the pitch is still judged to be correct. Similarly, when matching the pitch G3, the tonic, to check vocalization of the G scale, the pitch is judged to be correct whether the sound of pitch G3 is uttered with the step name "Do" or with the step name "So". Conversely, when the singer matches the pitch D3, which is not the tonic, in checking vocalization of the C scale, the pitch is judged to be incorrect if the uttered pitch is other than D3, whether the uttered step name is "Do" or "Re".
 For the singer, when checking the uttered pitch, it is natural to assume the key to be sung, utter the pitch of the tonic of that key with the step name "Do", and utter each pitch of the scale with its corresponding step name. For example, if the set key is C, the pitches C3, D3 and E3 are uttered as "Do", "Re" and "Mi". If the set key is G, the pitches G3, A3 and B3 are uttered as "Do", "Re" and "Mi".
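
 The relationship between the set key, the uttered step name and the expected note can be sketched as below. This is an illustrative sketch only: the function and table names are hypothetical, only major keys are handled, and notes are spelled with sharps for simplicity (so a flat such as B-flat appears as A#).

```python
# Semitone offsets of the major-scale step names above the tonic ("Do").
MAJOR_SCALE_OFFSETS = {"do": 0, "re": 2, "mi": 4, "fa": 5, "so": 7, "la": 9, "si": 11}

# Note names for the twelve pitch classes, sharps only (a simplification).
NOTE_NAMES = ["C", "C#", "D", "D#", "E", "F", "F#", "G", "G#", "A", "A#", "B"]


def note_for_step(key, step_name, tonic_octave=3):
    """Return the note (e.g. "B3") expected for a step name in a major key.

    key is the tonic's note name (e.g. "G"); tonic_octave is the octave in
    which "Do" is placed, chosen here to match the C3/G3 examples in the text.
    """
    tonic = NOTE_NAMES.index(key)
    semitones = tonic + MAJOR_SCALE_OFFSETS[step_name.lower()]
    octave = tonic_octave + semitones // 12  # crossing B -> C raises the octave
    return f"{NOTE_NAMES[semitones % 12]}{octave}"


def step_matches_note(key, step_name, sung_note):
    """Key-aware check: the sung note must be the one the step name implies."""
    return note_for_step(key, step_name) == sung_note


if __name__ == "__main__":
    print([note_for_step("C", s) for s in ("Do", "Re", "Mi")])
    print([note_for_step("G", s) for s in ("Do", "Re", "Mi")])
    # A pitch-only tuner would accept C3 sung as "So" in the key of C;
    # the key-aware check does not.
    print(step_matches_note("C", "So", "C3"))
```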
 従来、歌唱者は、ソルフェージュにおいて発声をする際に、楽器等を用いて発声すべきピッチを把握するのが通常であった。設定された調に応じた各階名の発声すべきピッチを、楽器等による目標ピッチの発音を要することなく認識できれば便利である。また、従来は、ピッチの判定において、設定される調と発話する階名との関係が考慮されることはなかった。 Conventionally, a singer usually uses a musical instrument or the like to grasp the pitch to be uttered when uttering in the solfege. It is convenient if it is possible to recognize the pitch to be uttered of each floor name according to the set key without requiring the target pitch to be produced by an instrument or the like. Also, conventionally, in the determination of pitch, the relationship between the key to be set and the floor name to be uttered has not been taken into consideration.
 本発明の目的は、ソルフェージュの補助に寄与することができるソルフェージュの補助装置を提供することである。 An object of the present invention is to provide a solfege aid that can contribute to the solfege aid.
 上記目的を達成するために本発明によれば、調を設定する設定部(10)と、音声を認識した結果から、階名を示す発話を取得する発話取得部(31)と、前記設定部により設定された調と前記発話取得部により前記取得された発話とに基づいて前記取得された発話に対応する音名を特定し、該特定した音名のピッチを取得するピッチ取得部(32、34)と、を有するソルフェージュの補助装置が提供される。 According to the present invention to achieve the above object, according to the present invention, a setting unit (10) for setting a key, a speech acquisition unit (31) for acquiring a speech indicating a floor name from a result of speech recognition, and the setting unit A pitch acquisition unit (32, acquiring a pitch of the specified pitch name by specifying a pitch name corresponding to the acquired speech based on the tone set by the user and the utterance acquired by the speech acquisition unit; 34) and a solfege auxiliary device is provided.
 上記目的を達成するために本発明によれば、ソルフェージュの補助装置の制御方法であって、調を設定する設定ステップと、音声を認識した結果から、階名を示す発話を取得する発話取得ステップと、前記設定ステップにより設定された調と前記発話取得ステップにより取得された発話とに基づいて前記取得された発話に対応する音名を特定し、該特定した音名のピッチを取得するピッチ取得ステップと、を有するソルフェージュの補助装置の制御方法が提供される。 According to the present invention to achieve the above object, there is provided a control method of a solfege auxiliary device, comprising: a setting step of setting a key; and an utterance acquisition step of acquiring an utterance indicating a floor name from a result of speech recognition. Pitch acquisition for specifying a pitch name corresponding to the acquired speech based on the tone set in the setting step and the speech acquired in the speech acquisition step, and acquiring the pitch of the specified pitch name A control method of a solfege auxiliary device is provided.
 上記目的を達成するために本発明によれば、ソルフェージュの補助装置の制御方法をコンピュータに実行させるプログラムであって、前記制御方法は、調を設定する設定ステップと、音声を認識した結果から、階名を示す発話を取得する発話取得ステップと、前記設定ステップにより設定された調と前記発話取得ステップにより取得された発話とに基づいて前記取得された発話に対応する音名を特定し、該特定した音名のピッチを取得するピッチ取得ステップと、を有するプログラムが提供される。 According to the present invention to achieve the above object, the program causes a computer to execute a control method of a solfege auxiliary device, and the control method comprises a setting step of setting a key, and a result of recognizing speech. A pitch name corresponding to the acquired utterance is specified based on an utterance acquisition step for acquiring an utterance indicating a floor name, a tone set in the setting step, and the utterance acquired in the utterance acquisition step, And a pitch acquiring step of acquiring a pitch of the specified pitch name.
 なお、上記括弧内の符号は例示である。 In addition, the code in the said parenthesis is an illustration.
 本発明によれば、ソルフェージュの補助に寄与することができる。 According to the present invention, it can contribute to the solfege assistance.
ソルフェージュの補助装置を含むピッチ判定システムの全体構成を示す図である。It is a figure which shows the whole structure of the pitch determination system containing the auxiliary | assistance apparatus of a solfege. チューナのブロック図である。It is a block diagram of a tuner. ピッチ判定のためのチューナの主要部のブロック図である。It is a block diagram of the principal part of the tuner for pitch determination. MIDIのノートナンバと調と階名との関係を示すテーブルである。6 is a table showing the relationship between MIDI note numbers, tones, and floor names. ピッチ判定処理のフローチャートである。It is a flowchart of a pitch determination process. 歌唱者が発声する音符を示す楽譜例である。It is a musical score example which shows the note which a singer utters. 発声ピッチを示すタイムチャートである。It is a time chart which shows utterance pitch.
 以下、図面を参照して本発明の実施の形態を説明する。 Hereinafter, embodiments of the present invention will be described with reference to the drawings.
FIG. 1 is a diagram showing the overall configuration of a pitch determination system including a solfege assistance device according to an embodiment of the present invention. As one example, the solfege assistance device of the present invention is configured as a tuner 30. The tuner 30 has a sound acquisition function using a microphone 23, a display function using a display unit 18, and a sound generation function. The tuner 30 reports the pitch of the sound acquired by the microphone 23 (or its deviation from a target pitch), mainly by display. Alternatively, an information terminal device 40 such as a PC or smartphone may be connected to the tuner 30 for wireless or wired communication, so that the solfege assistance device is realized mainly by the information terminal device 40. In that case, the pitch determined by the information terminal device 40 may be reported by the tuner 30. When the present invention is applied to the information terminal device 40, the device connected to it need only have a function of reporting at least the determined pitch by display, sound, or the like. The following describes an example in which the present invention is realized mainly by the tuner 30 alone.
FIG. 2 is a block diagram of the tuner 30. The tuner 30 includes a CPU (Central Processing Unit) 10, a timer 11, a ROM (Read Only Memory) 12, a RAM (Random Access Memory) 13, a storage unit 14, an operator 17, a display unit 18, a sound source 19, an effect circuit 20, a sound system 21, a communication I/F (Interface) 16, a microphone 23, and a bus 22. The CPU 10 is a central processing unit that controls the tuner 30 as a whole. The timer 11 is a module that measures time. The ROM 12 is a non-volatile memory that stores a control program and various data. The RAM 13 is a volatile memory used as a work area for the CPU 10 and as various buffers. The display unit 18 is composed of a liquid crystal display panel or the like, and displays the operating state of the tuner 30, various setting screens, messages to the user, and so on.
The operator 17 is an operation module such as operation buttons and knobs. The external storage device 15 is, for example, an external device connected to the tuner 30, such as a device that stores audio data. The communication I/F 16 is a communication module for communicating with external devices wirelessly or by wire, and includes a MIDI (Musical Instrument Digital Interface) interface. The bus 22 transfers data between the units of the tuner 30. The microphone 23 picks up ambient sound. The sound source 19 generates sound data under the control of the CPU 10, based on data stored in the storage unit 14 or the RAM 13. The effect circuit 20 applies the acoustic effect specified with the operator 17 to the sound data generated by the sound source 19. The sound system 21 converts the data processed by the effect circuit 20 into an analog signal with a digital-to-analog converter, then amplifies the analog signal and outputs it from a speaker or the like. The display unit 18 displays various information under the control of the CPU 10.
Next, a method of determining the pitch of the user's utterance is described. FIG. 3 is a block diagram of the main part of the tuner 30 used for pitch determination. The tuner 30 has an utterance acquisition unit 31, a pitch detection unit 32, a note name identification unit 33, a pitch acquisition unit 34, a pitch comparison unit 35, and a notification unit 36. The utterance acquisition unit 31 performs speech recognition on the sound acquired by the microphone 23 and detects utterances from the recognition result. When target syllable names (described later) have been specified, the utterance acquisition unit 31 extracts and acquires only utterances indicating the target syllable names and ignores (discards) all other utterances.
The pitch detection unit 32 detects the pitch of the utterance acquired by the utterance acquisition unit 31. The note name identification unit 33 identifies a note name from the utterance acquired by the utterance acquisition unit 31. When the key is set to something other than the default, the note name identification unit 33 takes the set key into account when identifying the note name. The pitch acquisition unit 34 acquires the pitch of the identified note name. When calibration (described later) by a calibration unit has been set, the pitch acquisition unit 34 takes that calibration into account when acquiring the pitch of the note name. The pitch acquisition unit 34 outputs information indicating the acquired pitch to the notification unit 36. The notification unit 36 presents the pitch output from the pitch acquisition unit 34 by sounding it (presentation unit). The pitch comparison unit 35, serving as an output unit, compares the pitch detected by the pitch detection unit 32 with the pitch acquired by the pitch acquisition unit 34, and outputs the comparison result to the notification unit 36. The notification unit 36 reports the pitch comparison result to the user.
The functions of the utterance acquisition unit 31, pitch detection unit 32, note name identification unit 33, pitch acquisition unit 34, pitch comparison unit 35, and calibration unit are each realized mainly by cooperation among the storage unit 14, ROM 12, RAM 13, and CPU 10. The function of the notification unit 36 is realized by cooperation among the storage unit 14, display unit 18, sound source 19, effect circuit 20, sound system 21, and CPU 10. Details of the pitch determination process are described later with reference to FIG. 5.
FIG. 4 is a table showing the relationship among MIDI note numbers, keys, and syllable names. This table is stored, for example, in the ROM 12. The MIDI note number (MIDI#; MIDI code) of pitch C3 is 60. The note number of pitch C4, one octave above pitch C3, is 72. Taking the common Japanese syllable names as an example, the diatonic scale has seven syllables: "Do", "Re", "Mi", "Fa", "So", "La", and "Si". When the key is C (C major), the syllable names correspond to pitches as follows: "Do" is pitch C3 (MIDI# = 60), "Re" is pitch D3 (MIDI# = 62), "Mi" is pitch E3 (MIDI# = 64), and "Si" is pitch B3 (MIDI# = 71).
When the key is D (D major), on the other hand, "Do" corresponds to pitch D3 (MIDI# = 62), "Re" to pitch E3 (MIDI# = 64), "Mi" to pitch F3# (MIDI# = 66), and "Si" to pitch C4# (MIDI# = 73). When the key is E (E major), "Do" corresponds to pitch E3 (MIDI# = 64), "Re" to pitch F3# (MIDI# = 66), "Mi" to pitch G3# (MIDI# = 68), and "Si" to pitch D4# (MIDI# = 75).
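The correspondence described for FIG. 4 can be illustrated with a short sketch. The embodiment stores a lookup table in the ROM 12; the version below instead computes the same values from major-scale semitone offsets, which is an assumption made for brevity. The names `SYLLABLE_OFFSETS`, `TONIC_MIDI`, and `syllable_to_midi` are hypothetical and do not appear in the specification.

```python
# Semitone offsets of the major-scale syllables above "Do" (standard major scale).
SYLLABLE_OFFSETS = {"Do": 0, "Re": 2, "Mi": 4, "Fa": 5, "So": 7, "La": 9, "Si": 11}

# MIDI note number of the tonic for each key, using the convention stated
# in the description (pitch C3 = MIDI# 60). Only the three keys discussed
# in the text are listed; others would be added the same way.
TONIC_MIDI = {"C": 60, "D": 62, "E": 64}


def syllable_to_midi(key: str, syllable: str) -> int:
    """Return the MIDI note number for a syllable name in the given major key."""
    return TONIC_MIDI[key] + SYLLABLE_OFFSETS[syllable]


if __name__ == "__main__":
    for key in ("C", "D", "E"):
        print(key, [syllable_to_midi(key, s) for s in ("Do", "Re", "Mi", "Si")])
```

Computing from offsets keeps the sketch compact, while a stored table as in FIG. 4 would allow arbitrary, non-computed correspondences.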
A singer who wants to check his or her own vocal pitch against the scale of the key of D sings each pitch of the D diatonic scale using its syllable name. For example, to check the vocal pitch for pitch D (for example, MIDI# = 62), the singer sings the syllable name "Do" at pitch D3. Since the note name corresponding to "Do" in the key of D is D, the pitch comparison unit 35 compares the pitch of the singer's utterance of "Do" with the pitch of note name D.
The syllable names used as targets of utterance acquisition may be of one predetermined type, or the user may be allowed to select from multiple types. There are various syllable naming systems, and any of them may be adopted. For example, the Nishizuka system or the Sato system may be adopted, or a non-Japanese system may be adopted (for example, the English letter names C, D, E or their German readings). The Nishizuka and Sato note name and syllable name notations are disclosed at the following URL, among other places.
ttps://ja.wikipedia.org/wiki/%E9%9F%B3%E5%90%8D%E3%83%BB%E9%9A%8E%E5%90%8D%E8%A1%A8%E8%A8%98
For simplicity of explanation, the syllable names targeted for utterance acquisition are assumed to be those of the diatonic scale (so Di, Me, and the like are not included), but the chromatic scale may also be targeted (so that Di, Me, and the like are included).
FIG. 5 is a flowchart of the pitch determination process. This process is realized by the CPU 10 reading a program stored in the ROM 12 into the RAM 13 and executing it. The process starts, for example, when an instruction to start the pitch determination process is given with the operator 17 or the like.
First, the CPU 10 executes a setting process based on instructions from the user given with the operator 17, which serves as a designation unit (step S101). For example, the CPU 10 sets the key, such as C or D (setting unit), sets the utterances to be acquired, that is, the utterances indicating the syllable names to be acquired (hereinafter, target syllable names), and sets the calibration. Calibration shifts the pitch reference corresponding to each note name (adjusts the pitch of the note names), and the setting applies to all pitches. For example, the frequency of pitch A3 is normally 440 Hz, but with a calibration setting a value offset from 440 Hz (for example, 442 Hz) is used as the frequency of pitch A3. In the description of FIG. 5, it is assumed that the seven Japanese syllable names from "Do" to "Si" described above have been set as the utterances to be acquired. Default settings are used for whichever of the key, target syllable names, and calibration the user has not specified.
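As a minimal sketch of how such a calibration could be applied, the function below converts a MIDI note number to a frequency relative to a configurable reference for pitch A3. It assumes twelve-tone equal temperament and that calibration scales all pitches in proportion to the reference; the specification does not state either point explicitly. Under the document's convention (C3 = MIDI# 60), pitch A3 is MIDI# 69. The function name `midi_to_frequency` is hypothetical.

```python
def midi_to_frequency(midi_note: int, reference_hz: float = 440.0) -> float:
    """Frequency of a MIDI note in equal temperament (assumed tuning system).

    reference_hz is the calibrated frequency of pitch A3 (MIDI# 69 under
    the document's convention); 440.0 means no calibration offset.
    """
    return reference_hz * 2.0 ** ((midi_note - 69) / 12.0)


if __name__ == "__main__":
    print(midi_to_frequency(69))          # uncalibrated A3
    print(midi_to_frequency(69, 442.0))   # A3 calibrated to 442 Hz
    print(midi_to_frequency(62, 442.0))   # D3 under the same calibration
```

Because the reference is a single multiplier, changing it shifts every note by the same number of cents, which matches the statement that the setting applies to all pitches.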
Next, the CPU 10 waits until there is sound input (step S102). Sound is input from the microphone 23, and any sound input, not only a target utterance, is judged to be sound input. When there is sound input, the utterance acquisition unit 31 recognizes the input sound by a known technique and acquires a target utterance from the recognition result (step S103). The CPU 10 then determines whether a target utterance has been acquired (step S104). If no target utterance has been acquired, the CPU 10 returns the process to step S102; if a target utterance has been acquired, the CPU 10 advances the process to step S105. In step S105, the pitch detection unit 32 detects the pitch of the utterance just acquired.
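The filtering performed in steps S103 and S104 can be sketched as follows. The speech recognizer itself is outside the scope of this sketch; it is assumed, purely for illustration, that recognition yields a list of text tokens. The function name `acquire_target_utterance` and the token format are hypothetical.

```python
from typing import Iterable, Optional, Set

# Target syllable names set in step S101 (the seven Japanese-style syllables).
TARGET_SYLLABLES: Set[str] = {"Do", "Re", "Mi", "Fa", "So", "La", "Si"}


def acquire_target_utterance(
    recognized_tokens: Iterable[str],
    targets: Set[str] = TARGET_SYLLABLES,
) -> Optional[str]:
    """Return the first recognized token that is a target syllable name.

    Tokens that are not targets are ignored. None means no target
    utterance was acquired, so the caller would go back to waiting
    for sound input (step S102).
    """
    for token in recognized_tokens:
        if token in targets:
            return token
    return None


if __name__ == "__main__":
    print(acquire_target_utterance(["um", "Do", "Re"]))  # a target is found
    print(acquire_target_utterance(["hello"]))           # nothing acquired
```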
Next, the CPU 10 identifies the note name corresponding to the utterance just acquired and acquires the pitch of the identified note name (step S106). Specifically, the note name identification unit 33 first identifies the note name corresponding to the acquired utterance based on the set key and the acquired utterance. For example, when the set key is D (D major) and the acquired utterance is "Do", note name D is identified. When the set key is E (E major) and the acquired utterance is "Do", note name E is identified. Note names are assumed to be identified within the range from pitch C3 to B3, but they may instead be identified within a range that differs by whole octaves, for example from pitch C4 to B4. Also in step S106, the pitch acquisition unit 34 refers to the table (FIG. 4) and acquires the pitch of the identified note name. In doing so, the pitch acquisition unit 34 takes the frequency corresponding to the MIDI code defined in the table, shifts it according to the calibration setting, and acquires the resulting value as the pitch of the identified note name. Also in step S106, the pitch acquisition unit 34 sends the acquired pitch (information) to the notification unit 36, and the notification unit 36 presents that pitch by sounding it. This lets the singer recognize the target pitch to be sung for the given key and syllable name. Presenting the pitch at this point is not essential.
Next, the pitch comparison unit 35 compares the pitch of the acquired utterance detected by the pitch detection unit 32 in step S105 (detected pitch) with the pitch of the identified note name acquired by the pitch acquisition unit 34 in step S106 (note-name pitch), and outputs the comparison result (Δpitch) (step S107). The Δpitch is the deviation, in cents, of the detected pitch from the reference note-name pitch, calculated as Δpitch = detected pitch - note-name pitch. A positive Δpitch means that the detected pitch is higher than the note-name pitch. The sung pitch can be expected to differ by whole octaves depending on, for example, the singer's sex. Therefore, when the deviation between the detected pitch and the note-name pitch is one octave or more, the pitch comparison unit 35 corrects the deviation in units of octaves (by adding or subtracting an integer multiple of 1200 cents) so that it becomes less than one octave, and uses the corrected value as Δpitch. Providing this octave-unit correction is not essential.
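The comparison in step S107 can be sketched as follows, assuming both pitches are available as frequencies in hertz. The cent deviation between two frequencies is 1200 times the base-2 logarithm of their ratio, which is the standard definition of the cent. The octave correction below removes whole octaves by truncating toward zero, which is one way (an assumption, not stated in the specification) to bring the magnitude below 1200 cents. The function names are hypothetical.

```python
import math


def delta_pitch_cents(detected_hz: float, note_name_hz: float) -> float:
    """Deviation of the detected pitch from the note-name pitch, in cents.

    Positive values mean the detected pitch is higher than the target.
    """
    return 1200.0 * math.log2(detected_hz / note_name_hz)


def fold_octaves(delta_cents: float) -> float:
    """Remove whole octaves so that the magnitude is below 1200 cents.

    Truncation toward zero is an assumed policy; deviations already
    smaller than one octave are returned unchanged.
    """
    whole_octaves = math.trunc(delta_cents / 1200.0)
    return delta_cents - 1200.0 * whole_octaves


if __name__ == "__main__":
    raw = delta_pitch_cents(600.0, 293.66)  # sung roughly an octave above D3
    print(raw, fold_octaves(raw))
```

A design alternative would be to fold to the nearest octave, giving a result between -600 and +600 cents; the text only requires a deviation of less than one octave, so the simpler rule is used here.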
Next, the notification unit 36 determines whether the absolute value of the Δpitch output from the pitch comparison unit 35 is within a predetermined range (a first predetermined value), for example whether Δpitch is within the range of -400 to +400 cents (step S108). If the absolute value of Δpitch is within the predetermined range, the notification unit 36 executes a notification process (step S109). The form of notification is not limited. As one example, a needle like those used in commercially available tuners is displayed on the display unit 18 to show visually how far the sung pitch is from the note-name pitch that serves as the reference (target value). Alternatively, notification may use a value, color, or mark corresponding to Δpitch shown with LEDs or the like, or may use sound. With sound, the timbre or volume may be varied according to Δpitch, or Δpitch may be read aloud. The notification process lets the singer know how far the sung pitch deviates.
After step S109, the process advances to step S110. If the absolute value of Δpitch is outside the predetermined range, the notification unit 36 advances the process to step S110 without executing the notification process. Accordingly, when the singer sings at a pitch extremely far from the target, the comparison result Δpitch is not reported; it is reported only when Δpitch indicates a deviation smaller than the first predetermined value. This suppresses distracting notifications. Even when Δpitch is not reported, the device may report that the sung pitch is extremely far from the target. Step S108 is not essential; the notification process may be executed uniformly regardless of the value of Δpitch.
Next, the CPU 10 determines whether an instruction to end the pitch determination process has been given with the operator 17 or the like (step S110). If there is no end instruction, the CPU 10 executes other processing (step S111) and returns the process to step S102. The other processing includes changing settings, providing various kinds of guidance, and the like. If an end instruction has been given, the process of FIG. 5 ends.
An example of reporting the comparison result in the pitch determination process (FIG. 5) is described with reference to FIG. 6. FIG. 6A is an example score showing the notes sung by a singer. FIG. 6B is a time chart showing the sung pitch. As an example, the set key is C (C major) and the target syllable names are the Japanese syllable names ("Do", "Re", "Mi", ... "Si"). No calibration is set. In the needle display of the notification unit 36, a needle pointing straight up indicates that the detected pitch does not deviate from the reference note-name pitch. A needle leaning to the right means that the detected pitch of the singing is higher than the note-name pitch (Δpitch is positive). Display examples from the notification unit 36 are shown as displays t1 to t9.
First, the singer sings "Do" as the first note. The singer intended to sing at pitch C3 but sang too high, close to pitch E3, so the needle leans to the right (display t1). The singer then sings "Do" again; immediately afterward the sung pitch is slightly high (display t2), but once the singer corrects the intonation the pitch becomes nearly correct (display t3). Just before the singer moves on to the second note, "Re", the sung pitch rises slightly above pitch C3 (display t4). The singer then sings "Re", but the pitch is somewhat low, so the needle leans to the left (display t5); once the intonation is corrected the pitch becomes nearly correct (display t6). The singer sings "Mi" as the third note. Because the singer scoops up into the third note, the needle initially leans to the left (display t7); the pitch then becomes nearly correct once the intonation is corrected (display t8), but afterward drifts somewhat high (display t9).
According to the present embodiment, the CPU 10 identifies the note name corresponding to an utterance based on the set key and the utterance acquired from the speech recognition result, and acquires the pitch of the identified note name. The CPU 10 then presents the acquired pitch by sounding it. Because the pitch to be sung for the given key and syllable name can thus be conveyed in advance, the device can contribute to assisting solfege practice.
The CPU 10 also outputs the result (Δpitch) of comparing the pitch of the acquired utterance with the pitch of the identified note name. The sung pitch can therefore be judged in accordance with the key and the syllable name being sung. Because the output comparison result is reported, the singer can recognize the deviation of the sung pitch. The singer can thus carry out sight-singing training easily, and the device can contribute to assisting solfege practice.
When the comparison result indicates a deviation of one octave or more, the value corrected so that the deviation is less than one octave is used as Δpitch, so the sung pitch can be judged regardless of the singer's vocal register. In addition, because the utterances indicating syllable names to be acquired can be specified, only the designated syllable names are used for pitch determination, which makes misjudgment less likely.
When the absolute value of Δpitch is equal to or greater than a second predetermined value (for example, 50 cents), the CPU 10 may, during the notification process in step S109 described above, sound a tone at the correct pitch (the target note-name pitch) in addition to performing that notification. The CPU 10 may then silence this tone (stop presenting it) once the singer corrects the intonation and the absolute value of Δpitch falls below the second predetermined value.
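The two thresholds can be combined into a small decision function, sketched below under stated assumptions. The example values of 400 cents and 50 cents come from the description; treating the first threshold as a strict upper bound follows the wording "smaller than the first predetermined value", and the behavior exactly at that boundary is an assumption. The names `notification_actions`, `report`, and `play_reference` are hypothetical.

```python
from typing import NamedTuple


class Actions(NamedTuple):
    report: bool          # show the comparison result (e.g. needle display)
    play_reference: bool  # also sound the correct (note-name) pitch


def notification_actions(
    delta_cents: float,
    first_threshold: float = 400.0,
    second_threshold: float = 50.0,
) -> Actions:
    """Decide what to notify for a given deviation in cents.

    The result is reported only when the deviation is below the first
    threshold. The reference tone is added while the deviation is at or
    above the second threshold, and stops once it falls below it.
    """
    magnitude = abs(delta_cents)
    report = magnitude < first_threshold
    play_reference = report and magnitude >= second_threshold
    return Actions(report, play_reference)


if __name__ == "__main__":
    for delta in (10.0, -75.0, 450.0):
        print(delta, notification_actions(delta))
```

Evaluating this function on every new Δpitch value gives the described behavior without extra state: the reference tone starts when the deviation reaches the second threshold and is silenced as soon as the deviation drops below it.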
The CPU 10 may also sound the target note-name pitch for a fixed period at the point when the note name corresponding to the acquired utterance is identified.
Separately from the utterances to be detected, the device may also understand the meaning of commands, so that the user can use commands to give setting instructions for the key, the target syllable names, the calibration, and so on.
When the present invention is realized with an information terminal device 40 such as a smartphone, the device connected to it need only have a notification function and is not limited to the tuner 30. The application for realizing the present invention may be obtained in any manner. The application executed on the information terminal device 40 need not be installed on the information terminal device 40 from the outset, and may be downloaded and installed afterward.
The present invention has been described in detail above based on preferred embodiments, but the present invention is not limited to these specific embodiments, and various forms that do not depart from the gist of the invention are also included in the present invention.
The same effects may also be obtained by having the present instrument read a storage medium that stores a control program, expressed as software, for achieving the present invention. In that case, the program code read from the storage medium itself realizes the novel functions of the present invention, and the non-transitory computer-readable recording medium storing that program code constitutes the present invention. The program code may also be supplied via a transmission medium or the like, in which case the program code itself constitutes the present invention. In these cases, the storage medium may be a ROM, a floppy disk, a hard disk, an optical disk, a magneto-optical disk, a CD-ROM, a CD-R, a magnetic tape, a non-volatile memory card, or the like. The "non-transitory computer-readable recording medium" also includes media that hold a program for a certain period of time, such as volatile memory (for example, DRAM (Dynamic Random Access Memory)) inside a computer system that acts as a server or client when the program is transmitted via a network such as the Internet or a communication line such as a telephone line.
Reference Signs List
10 CPU (setting unit)
17 Operator (designation unit)
30 Tuner
31 Utterance acquisition unit
32 Pitch detection unit
33 Note name identification unit
34 Pitch acquisition unit
35 Pitch comparison unit (output unit)
36 Notification unit

Claims (12)

  1.  調を設定する設定部と、
     音声を認識した結果から、階名を示す発話を取得する発話取得部と、
     前記設定部により設定された調と前記発話取得部により取得された発話とに基づいて前記取得された発話に対応する音名を特定し、該特定した音名のピッチを取得するピッチ取得部と、
    を有するソルフェージュの補助装置。
    A setting unit for setting a key,
    A speech acquisition unit for acquiring a speech indicating a floor name from a result of speech recognition;
    A pitch acquisition unit for identifying a pitch name corresponding to the acquired utterance based on the tone set by the setting unit and the utterance acquired by the utterance acquisition unit; and acquiring a pitch of the identified pitch name ,
    Solfege aid device with.
  2.  前記ピッチ取得部により取得されたピッチを提示する提示部を有する請求項1に記載のソルフェージュの補助装置。 The solfege auxiliary device according to claim 1, further comprising a presentation unit that presents the pitch acquired by the pitch acquisition unit.
  3.  前記発話取得部により取得された発話のピッチを検知するピッチ検知部と、
     前記ピッチ検知部により検知されたピッチと前記ピッチ取得部により取得されたピッチとを比較し、その比較結果を出力する出力部と、を有する請求項1または2に記載のソルフェージュの補助装置。
    A pitch detection unit that detects a pitch of the utterance acquired by the utterance acquisition unit;
    The solfege auxiliary device according to claim 1, further comprising: an output unit that compares the pitch detected by the pitch detection unit with the pitch acquired by the pitch acquisition unit and outputs the comparison result.
  4.  前記出力部から出力された比較結果を報知する報知部を有する請求項3に記載のソルフェージュの補助装置。 The solfege auxiliary device according to claim 3, further comprising a notification unit that notifies the comparison result output from the output unit.
  5.  The solfege assistance device according to claim 4, wherein the notification unit reports the comparison result only when the comparison result indicates a deviation amount smaller than a first predetermined value.
  6.  The solfege assistance device according to claim 5, wherein, when the comparison result indicates a deviation amount that is less than the first predetermined value and exceeds a second predetermined value smaller than the first predetermined value, the notification unit sounds a tone of the correct pitch in addition to reporting the comparison result.
  7.  The solfege assistance device according to claim 6, wherein, after sounding the tone of the correct pitch, the notification unit stops sounding that tone when the comparison result no longer indicates a deviation amount exceeding the second predetermined value.
  8.  The solfege assistance device according to any one of claims 3 to 7, wherein, when the deviation amount between the pitch detected by the pitch detection unit and the pitch acquired by the pitch acquisition unit is one octave or more, the output unit outputs, as the comparison result, a value obtained by correcting the deviation amount in units of an octave so that the deviation amount becomes less than one octave.
  9.  The solfege assistance device according to any one of claims 1 to 8, further comprising a designation unit that designates the utterances indicating syllable names that are to be acquired by the utterance acquisition unit,
    wherein the utterance acquisition unit acquires, from the recognized speech, the utterance designated by the designation unit.
  10.  The solfege assistance device according to any one of claims 1 to 9, further comprising a calibration unit that adjusts the pitch of the pitch name,
    wherein the pitch acquisition unit acquires the pitch that corresponds to the identified pitch name and has been adjusted by the calibration unit.
  11.  A method for controlling a solfege assistance device, the method comprising:
    a setting step of setting a key;
    an utterance acquisition step of acquiring an utterance indicating a syllable name from a result of speech recognition; and
    a pitch acquisition step of identifying, based on the key set in the setting step and the utterance acquired in the utterance acquisition step, a pitch name corresponding to the acquired utterance, and acquiring a pitch of the identified pitch name.
  12.  A program that causes a computer to execute a method for controlling a solfege assistance device, the method comprising:
    a setting step of setting a key;
    an utterance acquisition step of acquiring an utterance indicating a syllable name from a result of speech recognition; and
    a pitch acquisition step of identifying, based on the key set in the setting step and the utterance acquired in the utterance acquisition step, a pitch name corresponding to the acquired utterance, and acquiring a pitch of the identified pitch name.
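
The following Python sketch is an illustration only and is not part of the claims. It shows one possible reading of the processing recited in claims 1, 3, 5, 6, 8 and 10: mapping a recognized solfege syllable (do, re, mi, ...) and a set key to a pitch name and pitch, comparing that pitch with a detected pitch, folding the deviation by whole octaves, and deciding what to report. Everything concrete in it is an assumption that the publication does not state: movable-do syllables over major keys only, twelve-tone equal temperament, a fixed octave 4, an A4 reference of 440 Hz standing in for the calibration of claim 10, deviation measured in cents, and the hypothetical threshold values `first_limit` and `second_limit`. All function names are hypothetical.

```python
import math
from typing import Tuple

# Semitone offsets of movable-do syllables above the tonic (major scale; assumed).
SYLLABLE_OFFSETS = {"do": 0, "re": 2, "mi": 4, "fa": 5, "sol": 7, "la": 9, "ti": 11}
NOTE_NAMES = ["C", "C#", "D", "D#", "E", "F", "F#", "G", "G#", "A", "A#", "B"]


def note_for_syllable(key_tonic: str, syllable: str) -> str:
    """Identify the pitch name for a syllable in the given major key (claim 1)."""
    tonic_index = NOTE_NAMES.index(key_tonic)
    return NOTE_NAMES[(tonic_index + SYLLABLE_OFFSETS[syllable]) % 12]


def pitch_of_note(note_name: str, octave: int = 4, a4_hz: float = 440.0) -> float:
    """Return the equal-tempered frequency in Hz of a pitch name.

    a4_hz is an assumed stand-in for the calibration of claim 10.
    """
    midi = 12 * (octave + 1) + NOTE_NAMES.index(note_name)
    return a4_hz * 2.0 ** ((midi - 69) / 12)


def deviation_cents(detected_hz: float, target_hz: float) -> float:
    """Signed deviation in cents, shifted by whole octaves into [-600, 600).

    This covers claims 3 and 8; folding to the nearest octave is a design
    choice here and is stricter than the "less than one octave" of claim 8.
    """
    cents = 1200.0 * math.log2(detected_hz / target_hz)
    return (cents + 600.0) % 1200.0 - 600.0


def feedback(cents: float, first_limit: float = 300.0,
             second_limit: float = 30.0) -> Tuple[bool, bool]:
    """Return (report_result, sound_reference_tone) per claims 5 and 6.

    Both limits are hypothetical values chosen only for illustration.
    """
    magnitude = abs(cents)
    report = magnitude < first_limit
    sound = report and magnitude > second_limit
    return report, sound


if __name__ == "__main__":
    # "ti" sung in G major, one octave above the target pitch.
    target = pitch_of_note(note_for_syllable("G", "ti"))
    cents = deviation_cents(2 * target, target)
    print(note_for_syllable("G", "ti"), round(cents), feedback(cents))
```

Under these assumptions, an utterance sung exactly one octave away from the target is treated as in tune, because the octave fold removes the whole-octave part of the deviation before the thresholds are applied. Calling `feedback` again on each new comparison result also gives the behavior of claim 7: once the deviation no longer exceeds `second_limit`, the second returned value becomes false and the reference tone can be stopped.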
PCT/JP2017/038592 2017-10-25 2017-10-25 Solfege assistance device, method for controlling same, and program WO2019082320A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
PCT/JP2017/038592 WO2019082320A1 (en) 2017-10-25 2017-10-25 Solfege assistance device, method for controlling same, and program

Publications (1)

Publication Number Publication Date
WO2019082320A1 true WO2019082320A1 (en) 2019-05-02

Family

ID=66246818

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/JP2017/038592 WO2019082320A1 (en) 2017-10-25 2017-10-25 Solfege assistance device, method for controlling same, and program

Country Status (1)

Country Link
WO (1) WO2019082320A1 (en)

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH06308878A (en) * 1993-04-26 1994-11-04 Matsushita Electric Ind Co Ltd Display method of musical scale name
JP2001242862A (en) * 2000-03-01 2001-09-07 Yamaha Corp Portable telephone and its musical score data forming method
JP2001265336A (en) * 2000-03-21 2001-09-28 Nec Corp Portable telephone system and arrival melody input method
JP2009031658A (en) * 2007-07-30 2009-02-12 Kawai Musical Instr Mfg Co Ltd Interval sense training device and interval sense training program
WO2017072754A2 (en) * 2015-10-25 2017-05-04 Koren Morel A system and method for computer-assisted instruction of a music language

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20200249633A1 (en) * 2017-10-25 2020-08-06 Yamaha Corporation Tempo setting device, control method thereof, and program
US11526134B2 (en) * 2017-10-25 2022-12-13 Yamaha Corporation Tempo setting device and control method thereof

Similar Documents

Publication Publication Date Title
US7698134B2 (en) Device in which selection is activated by voice and method in which selection is activated by voice
CN107430849B (en) Sound control device, sound control method, and computer-readable recording medium storing sound control program
JP6485185B2 (en) Singing sound synthesizer
US20160133246A1 (en) Voice synthesis device, voice synthesis method, and recording medium having a voice synthesis program recorded thereon
JP2014048472A (en) Voice synthesis system for karaoke and parameter extractor
WO2019082320A1 (en) Solfege assistance device, method for controlling same, and program
JP2021144238A (en) Pronunciation system, controller, control method thereof and program
US8249874B2 (en) Synthesizing speech from text
JP2008058724A (en) Lyrics telop display control system of karaoke device
JP2019132979A (en) Karaoke device
US11437016B2 (en) Information processing method, information processing device, and program
JP2016142967A (en) Accompaniment training apparatus and accompaniment training program
JP4779365B2 (en) Pronunciation correction support device
JP2007047486A (en) Karaoke device for vehicle
JP4180548B2 (en) Karaoke device with vocal range notification function
JP2009244790A (en) Karaoke system with singing teaching function
JP2007292922A (en) Telop display device
JP2005173256A (en) Karaoke apparatus
WO2019003350A1 (en) Singing sound generation device, method and program
JP2019117282A (en) Karaoke device
JP6944390B2 (en) Karaoke equipment
JP2019117284A (en) Karaoke device
JP2013003430A (en) Karaoke device
WO2024024629A1 (en) Audio processing assistance device, audio processing assistance method, audio processing assistance program, audio processing assistance system
US20210097975A1 (en) Information processing method, information processing device, and program

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 17929879

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 17929879

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: JP