WO2019082320A1

WO2019082320A1 - Solfege assistance device, method for controlling same, and program

Info

Publication number: WO2019082320A1
Application number: PCT/JP2017/038592
Authority: WO
Inventors: 松本　秀一
Original assignee: ヤマハ株式会社
Priority date: 2017-10-25
Filing date: 2017-10-25
Publication date: 2019-05-02

Abstract

Provided is a solfege assistance device that can contribute to assisting with solfege. When a to-be-acquired utterance is acquired, a CPU 10 detects the pitch of the utterance, specifies a pitch name that corresponds to a currently acquired utterance on the basis of a set pitch and the currently acquired utterance, and acquires the pitch of the specified pitch name.

Description

Solfege auxiliary device, control method therefor, program

The present invention relates to a solfege auxiliary device, a control method thereof, and a program.

Conventionally, a tuner has been known which displays a note name close to the pitch of the input sound and causes the user to adjust the input sound to the displayed pitch name (Non-Patent Document 1). In solfege (singular), where the score is sung at the first look, the singer wants to confirm the pitch of the sound emitted by the natural voice. An auxiliary device such as a tuner used for the solfege compares the pitch of the voice input to the microphone with the pitch to be adjusted.

However, note names indicate absolute pitches in each octave, while step names indicate pitches relative to the tonic. The conventional auxiliary device used for the solfege determines whether the pitch is correct or not, which is the absolute pitch of the input sound, regardless of the type of speech. Therefore, even if the pronunciation is performed with the floor name of the scale (for example, Do (Re), Mi (Mi) ...), the target of detection is only the pronunciation pitch. The floor name is ignored.

For example, if the singer matches the pitch C (eg, pitch C3) which is the tonic to tune the C scale, the voice of the pitch C3 is vocalized with the floor name "do". If this is done, it is determined that the pitch is correct, but even if the voice with pitch C3 is uttered with the floor name "Soo", it is determined that the pitch is correct. Similarly, when checking to the pitch G3 which is the tonic to check the vocalization of the G scale, it is possible to utter the sound of the pitch G3 with the floor name "Do" or with the floor name "Soo" , It is determined that the pitch is correct. On the contrary, when the singer matches the pitch D3 which is not the tonic in the vocal check of the scale of C, if the vocal pitch is other than the pitch D3, even if the vocalization floor name is "do" Even if “,” it is determined that the pitch is not correct.

For the singer, when checking the utterance pitch, assuming the key to be sung, utter the pitch of the tonic of the key with the floor name “do”, and correspond each pitch of the key scale It is natural to speak with the floor name. For example, if the setting tone is C tone, the pitches C3, D3 and E3 are uttered as "d", "re" and "mi". If the setting key is a G key, the pitches G3, A3 and B3 are uttered as "d", "re" and "mi".

Conventionally, a singer usually uses a musical instrument or the like to grasp the pitch to be uttered when uttering in the solfege. It is convenient if it is possible to recognize the pitch to be uttered of each floor name according to the set key without requiring the target pitch to be produced by an instrument or the like. Also, conventionally, in the determination of pitch, the relationship between the key to be set and the floor name to be uttered has not been taken into consideration.

An object of the present invention is to provide a solfege aid that can contribute to the solfege aid.

According to the present invention to achieve the above object, according to the present invention, a setting unit (10) for setting a key, a speech acquisition unit (31) for acquiring a speech indicating a floor name from a result of speech recognition, and the setting unit A pitch acquisition unit (32, acquiring a pitch of the specified pitch name by specifying a pitch name corresponding to the acquired speech based on the tone set by the user and the utterance acquired by the speech acquisition unit; 34) and a solfege auxiliary device is provided.

According to the present invention to achieve the above object, there is provided a control method of a solfege auxiliary device, comprising: a setting step of setting a key; and an utterance acquisition step of acquiring an utterance indicating a floor name from a result of speech recognition. Pitch acquisition for specifying a pitch name corresponding to the acquired speech based on the tone set in the setting step and the speech acquired in the speech acquisition step, and acquiring the pitch of the specified pitch name A control method of a solfege auxiliary device is provided.

According to the present invention to achieve the above object, the program causes a computer to execute a control method of a solfege auxiliary device, and the control method comprises a setting step of setting a key, and a result of recognizing speech. A pitch name corresponding to the acquired utterance is specified based on an utterance acquisition step for acquiring an utterance indicating a floor name, a tone set in the setting step, and the utterance acquired in the utterance acquisition step, And a pitch acquiring step of acquiring a pitch of the specified pitch name.

In addition, the code in the said parenthesis is an illustration.

According to the present invention, it can contribute to the solfege assistance.

It is a figure which shows the whole structure of the pitch determination system containing the auxiliary | assistance apparatus of a solfege. It is a block diagram of a tuner. It is a block diagram of the principal part of the tuner for pitch determination. 6 is a table showing the relationship between MIDI note numbers, tones, and floor names. It is a flowchart of a pitch determination process. It is a musical score example which shows the note which a singer utters. It is a time chart which shows utterance pitch.

Hereinafter, embodiments of the present invention will be described with reference to the drawings.

FIG. 1 is a view showing the overall configuration of a pitch determination system including a solfege auxiliary device according to an embodiment of the present invention. The solfege auxiliary device of the present invention is configured as a tuner 30 as an example. The tuner 30 has an audio acquisition function by the microphone 23, a display function by the display unit 18, and a sound generation function. The tuner 30 has a function of mainly informing by display the pitch of the sound acquired by the microphone 23 (or the deviation from the target pitch). Note that the information terminal device 40 such as a PC or a smartphone may be communicably connected to the tuner 30 wirelessly or by wire, and the system may be configured to realize a solfege auxiliary device mainly by the information terminal device 40. In that case, the pitch determined by the information terminal device 40 may be reported by the tuner 30. When the present invention is applied to the information terminal device 40, the device connected to the information terminal device 40 may have a function of at least notifying the determined pitch by display, voice or the like. Hereinafter, an example in which the present invention is realized mainly by the tuner 30 alone will be described.

FIG. 2 is a block diagram of the tuner 30. As shown in FIG. The tuner 30 includes a central processing unit (CPU) 10, a timer 11, a read only memory (ROM) 12, a random access memory (RAM) 13, a storage unit 14, an operation element 17, and a display unit 18. A sound source 19, an effect circuit 20, a sound system 21, a communication I / F (Interface) 16, a microphone 23, and a bus 22 are provided. The CPU 10 is a central processing unit that controls the entire tuner 30. The timer 11 is a module that measures time. The ROM 12 is a non-volatile memory that stores control programs and various data. The RAM 13 is a volatile memory used as a work area of the CPU 10 and various buffers. The display unit 18 is configured of a liquid crystal display panel or the like, and displays an operation state of the tuner 30, various setting screens, a message for the user, and the like.

The operator 17 is an operation module such as an operation button or an operation knob. The external storage device 15 is, for example, an external device connected to the tuner 30, and is, for example, a device that stores audio data. The communication I / F 16 is a communication module for communicating with an external device wirelessly or by wire. The communication I / F 16 includes a MIDI (Musical Instrument Digital Interface) interface. The bus 22 transfers data between the units in the tuner 30. The microphone 23 acquires surrounding sound. The sound source 19 generates sound generation data under the control of the CPU 10 based on the data stored in the storage unit 14 and the RAM 13. The effect circuit 20 applies the acoustic effect designated by the operator 17 to the sounding data generated by the sound source 19. The sound system 21 converts the data processed by the effect circuit 20 into an analog signal by a digital / analog converter. Then, the sound system 21 amplifies the analog signal and outputs it from a speaker or the like. The display unit 18 displays various information under the control of the CPU 10.

Next, a method of determining the pitch of the user's speech will be described. FIG. 3 is a block diagram of the main part of the tuner 30 for pitch determination. The tuner 30 includes an utterance acquisition unit 31, a pitch detection unit 32, a pitch name identification unit 33, a pitch acquisition unit 34, a pitch comparison unit 35, and a notification unit 36. The speech acquisition unit 31 recognizes the speech acquired by the microphone 23 and detects speech from the recognition result. When the acquisition target floor name (described later) is designated, the utterance acquisition unit 31 extracts and acquires only the utterance indicating the target floor name, and ignores (deletes) the utterance that is not the acquisition target.

The pitch detection unit 32 detects the pitch of the utterance acquired by the utterance acquisition unit 31. The pitch name identification unit 33 identifies a pitch name from the speech acquired by the speech acquisition unit 31. When the key is set to something other than the default, the note name specifying unit 33 specifies the note name in consideration of the setting key. The pitch acquisition unit 34 acquires the pitch of the specified pitch name. When calibration (described later) by the calibration unit is set, the pitch acquisition unit 34 acquires the pitch of the pitch name in consideration of the set calibration. The pitch acquisition unit 34 outputs information indicating the acquired pitch to the notification unit 36. The notification unit 36 presents the pitch output from the pitch acquisition unit 34 by sounding (presentation unit). Further, the pitch comparison unit 35 as an output unit compares the pitch detected by the pitch detection unit 32 with the pitch acquired by the pitch acquisition unit 34, and outputs the comparison result to the notification unit 36. The notification unit 36 notifies the user of the comparison result of the pitch.

The functions of the speech acquisition unit 31, pitch detection unit 32, pitch name identification unit 33, pitch acquisition unit 34, pitch comparison unit 35, and calibration unit are all realized mainly by the cooperation of the storage unit 14, ROM 12, RAM 13 and CPU 10. Be done. The function of the notification unit 36 is realized by the cooperation of the storage unit 14, the display unit 18, the sound source 19, the effect circuit 20, the sound system 21, and the CPU 10. Details of the pitch determination process will be described later with reference to FIG.

FIG. 4 is a table showing the relationship between MIDI note numbers, key tones and floor names. This table is stored, for example, in the ROM 12. The MIDI note number (MIDI #; MIDI code) of pitch C3 is 60. The note number of pitch C4 one octave higher than pitch C3 is 72. Taking a typical Japanese style floor name as an example, the diatonic scale is “Do” “Re” “Mi” “Mi” “Fa” “So” “La” (La), “Si (Si)”. When the key is C key (C major), the correspondence between the floor name and the pitch is that "do" is pitch C3 (MIDI # = 60) and "Le" is pitch D3 (MIDI # = 62). , “M” corresponds to the pitch E3 (MIDI # = 64), and “S” corresponds to the pitch B3 (MIDI # = 71).

On the other hand, when the key is D-tone (d major), the correspondence between the floor name and the pitch is that "do" is pitch D3 (MIDI # = 62) and "Le" is pitch E3 (MIDI # = 64) “M” corresponds to the pitch F3 # (MIDI # = 66), and “S” corresponds to the pitch C4 # (MIDI # = 73). In addition, when the key is E key (H major), the correspondence between the floor name and the pitch is that "do" is pitch E3 (MIDI # = 64) and "Le" is pitch F3 # (MIDI #). = 66), “mi” corresponds to the pitch G3 # (MIDI # = 68), and “shi” corresponds to the pitch D4 # (MIDI # = 75).

When trying to check the pitch of his / her vocalization with respect to the D-tone scale, the singer utters the pitch of the D-tone scale with each floor name. For example, when the singer wants to confirm the utterance pitch of the pitch D (for example, MIDI # = 62), the singer utters the floor name "do" at the pitch D3. Then, since the pitch name corresponding to "D" in D tone is D, the pitch comparison unit 35 compares the pitch of the speech of "D" by the singer with the pitch of the pitch D.

Note that the floor name adopted as the target of utterance acquisition may be one type determined in advance, or the user may be able to select from a plurality of types. There are various floor names, but any of them may be adopted. For example, Nishizuka formula, Sato formula, etc. may be adopted, and types other than Japanese ones (for example, See, Dee, E, Tse, Day, AE, etc.) may be adopted. The names and names of the Nishizuka-style and Sato-style notations are disclosed in the following URLs.
ttps: //en.wikipedia.org/wiki/%E9%9F%B3%E5%90%8D%E3%83%BB%E9%9A%8A%8E%E5%90%8D%E8%A1%A1%A8%E8 % A8% 98
In addition, although the floor name is assumed to be the diatonic scale (so that Di, Me, etc. are not included) as the target of speech acquisition for the sake of simplifying the explanation, the diatonic scale may also be targeted (Di And Me etc.).

FIG. 5 is a flowchart of the pitch determination process. This process is realized by the CPU 10 reading out the program stored in the ROM 12 to the ROM 12 and executing the program. This process is started, for example, when the operator 17 or the like instructs to start the pitch determination process.

First, the CPU 10 executes setting processing based on an instruction from the user by the operator 17 as a designation unit (step S101). For example, the CPU 10 sets a tone (setting unit) such as C tone or D tone, an utterance indicating a floor name to be acquired (hereinafter, an acquisition target floor name), that is, an utterance target for acquisition and a calibration setting. Etc. For calibration that shifts the pitch reference corresponding to each pitch name (adjusts the pitch of the pitch name), the setting is applied to all pitches. For example, the frequency of the pitch A3 is normally 440 Hz, but a value (for example, 442 Hz) shifted from 440 Hz is set as the frequency of the pitch A3 by the calibration setting. In the description of FIG. 5, it is assumed that, for example, seven utterances from “Do” to “Si (Si)” with Japanese-style floor names as described above are set as the utterances to be acquired. The default setting is adopted for any key, acquisition target floor name, and calibration that have not been designated by the user.

Next, the CPU 10 waits until there is an audio input (step S102). The voice is input from the microphone 23, and it is determined that there is a voice input when any voice is input, not limited to the utterance of the acquisition target. Then, when there is a voice input, the speech acquisition unit 31 recognizes the input speech by a known method, and acquires an utterance of an acquisition target from the result of recognizing the speech (step S103). Then, the CPU 10 determines whether an utterance to be acquired has been acquired (step S104). As a result of the determination, when the utterance of the acquisition target is not acquired, the process returns to step S102, while when the utterance of the acquisition target is acquired, the process proceeds to step S105. In step S105, the pitch detection unit 32 detects the pitch of the utterance acquired this time.

Next, the CPU 10 identifies a pitch name corresponding to the utterance acquired this time, and acquires the pitch of the identified pitch name (step S106). Specifically, first, the pitch name specifying unit 33 specifies the pitch name corresponding to the utterance acquired this time, based on the set tone and the utterance acquired this time. For example, when the setting tone is D tone (d major tone) and the utterance acquired this time is "do", the note name D is specified. Alternatively, if the setting tone is E tone (E major tone) and the utterance acquired this time is "do", the note name E is specified. Although the pitch names are specified in the range from pitch C3 to B3, the pitch names may be specified in a range different from one another in units of octaves, for example, from pitch C4 to B4. In step S106, the pitch acquisition unit 34 also refers to the table (FIG. 4) to acquire the pitch of the specified pitch name. At that time, the pitch acquiring unit 34 acquires a value obtained by shifting the frequency corresponding to the MIDI code defined in the table based on the calibration setting as the pitch of the specified pitch name. In step S106, the pitch acquisition unit 34 also sends (the information of) the acquired pitch to the notification unit 36. Then, the notification unit 36 presents the sent pitch by pronunciation. Thereby, the singer can recognize the target pitch to be uttered according to the key and the floor name. The presentation of the pitch here is not essential.

Next, the pitch comparison unit 35 detects the pitch of the utterance acquired this time (detection pitch) detected by the pitch detection unit 32 in step S106 and the identified sound acquired by the pitch acquisition unit 34 in step S106. The name pitch (pitch name pitch) is compared, and the comparison result (Δ pitch) is output (step S107). The Δ pitch as the comparison result is the deviation amount (cent) of the detection pitch with respect to the reference pitch pitch, and is calculated by Δ pitch = detection pitch−pitch pitch. If the sign of Δ pitch is positive, it means that the detection pitch is higher than the pitch name pitch. In addition, it is assumed that a pronunciation pitch changes with an octave unit by the sex etc. of a singer. Therefore, the pitch comparison unit 35 corrects the shift amount in units of octaves (integer multiple of 1200 cents) so that the shift amount is less than one octave if the shift amount between the detection pitch and the pitch name pitch is one octave or more. Is calculated as Δ pitch. It is not essential to provide such a process for correcting the deviation amount in octave units.

Next, the notification unit 36 determines whether the absolute value of the Δ pitch output from the pitch comparison unit 35 is within a predetermined range (first predetermined value) (for example, the Δ pitch is within a range of −400 to +400 cents). It is determined whether or not it is (step S108). Then, when the absolute value of the Δ pitch is within the predetermined range, the notification unit 36 performs a notification process (step S109). There is no limitation on the manner of notification, but as an example, a pointer used in a commercially available tuner is displayed on the display unit 18 to visually indicate the deviation from the pitch name as a reference (target value). In addition to this, notification may be performed by a value, a color, or a mark corresponding to the Δ pitch using an LED or the like, or may be notified by voice. Also in the case of voice, the timbre and volume may be varied according to the Δ pitch. Alternatively, the Δ pitch may be read out by voice. By the notification process, it is possible to notify the singer of the deviation of the utterance pitch.

After step S109, the process proceeds to step S110. If the absolute value of the Δ pitch is out of the predetermined range, the notification unit 36 proceeds with the process to step S110 without executing the notification process. Therefore, when speaking at a pitch extremely deviated from the target, notification of the Δ pitch, which is the comparison result, is not performed, and notification is performed only when the Δ pitch indicates a deviation amount less than the first predetermined value. Thereby, the troublesome alerting | reporting is suppressed. Even when the Δ pitch is not notified, it may be notified that the speech pitch is extremely deviated from the target. It is not essential to provide the process of step S108, and the notification process may be executed uniformly regardless of the value of Δ pitch.

Next, the CPU 10 determines whether or not an instruction to end the pitch determination process has been issued by the operation element 17 or the like (step S110). Then, when there is no instruction to end the pitch determination process, the CPU 10 executes other processes (step S111), and returns the process to step S102. In the other processing, change of setting contents, various guidances and the like are executed. On the other hand, when an instruction to end the pitch determination process is issued, the process of FIG. 5 ends.

An example of notification of the comparison result by the pitch determination process (FIG. 5) will be described with reference to FIG. FIG. 6A is a musical score example showing musical notes uttered by a singer. FIG. 6B is a time chart showing the utterance pitch. As an example, the setting tone is C tone (C major), and the acquisition target floor name is a Japanese style floor name ("Do" "Re" "Mi") "Si (Si (Si) ))). There is no calibration setting. The pointer display in the notification unit 36 indicates that when the pointer points straight up, there is no deviation of the detection pitch from the reference pitch pitch. When the pointer inclines to the right, it means that the detection pitch of singing is higher than the pitch pitch (Δ pitch is positive). Display examples by the notification unit 36 are shown by displays t1 to t9.

First, the singer utters Do at the first sound. The singer intended to utter at the pitch C3, but the pointer is inclined to the right because it is too high and utters close to the pitch E3 (display t1). Thereafter, the singer uttered Do (Do), and immediately after that, the utterance pitch was slightly high (display t2), but when the pitch was corrected, the pitch became almost appropriate (display t3). Just before the singer shifts the utterance to the second note (Re), the utterance pitch is slightly higher than the pitch C3 (display t4), and then, the user pronounces Re (Re), but the pitch is lower. Since the pointer was inclined to the left (display t5), the pitch was corrected to a nearly appropriate pitch (display t6). The singer utters Mi (Mi) at the third note. Since the singer uttered the third note by croaking, the pointer is inclined to the left at the beginning (display t7), and then the pitch becomes almost appropriate when the pitch is corrected (display t8), then the pitch Is on the high side (display t9).

According to the present embodiment, CPU 10 identifies the name of the note corresponding to the utterance based on the set tone and the utterance acquired from the result of recognizing the speech, and acquires the pitch of the identified note. . Then, the CPU 10 presents the acquired pitch by tone generation. As a result, since it is possible to inform in advance the pitch to be uttered according to the key and the step name to be uttered, it is possible to contribute to the solfege assistance.

Further, the CPU 10 outputs a comparison result (Δ pitch) between the acquired speech pitch and the specified pitch name pitch. As a result, the utterance pitch can be determined in accordance with the key and the floor name to be uttered. Moreover, since the output comparison result is notified, the singer can recognize the deviation of the utterance pitch. Therefore, the singer can perform visual training easily and can contribute to the solfege assistance.

Also, if the comparison result indicates a shift of one or more octaves, the value corrected so that the shift amount is less than one octave is taken as the Δ pitch, so the utterance pitch is determined regardless of the singer's voice key be able to. In addition, since an utterance indicating a floor name to be acquired can be specified, only a fixed floor name can be used for pitch determination, and erroneous determination is less likely to occur.

When the absolute value of the Δ pitch is equal to or greater than the second predetermined value (for example, 50 cents), the CPU 10 corrects the pitch (in addition to the above notification processing in the notification processing in step S109 described above). The target note name pitch) may be pronounced. Then, the CPU 10 may mute (stop presentation) this sound when the singing pitch is corrected and the absolute value of the Δ pitch falls below a second predetermined value.

The CPU 10 may sound a target pitch pitch sound for a certain period of time when the pitch name corresponding to the utterance acquired this time is specified.

In addition to the utterance of the detection target, the meaning of the command may be understood, and the user may be able to instruct setting of the key, the acquisition target floor name, the calibration, and the like by the command.

When the present invention is implemented by an information terminal device 40 such as a smartphone, the device connected thereto may have a notification function, and is not limited to the tuner 30. There is no limitation on the method of acquiring the application for realizing the present invention. The application executed by the information terminal device 40 may not be originally installed in the information terminal device 40, and may be downloaded and installed after the fact.

Although the present invention has been described in detail based on its preferred embodiments, the present invention is not limited to these specific embodiments, and various embodiments within the scope of the present invention are also included in the present invention. included.

Note that the storage medium storing the control program represented by the software for achieving the present invention may be read out to the present instrument to achieve the same effect, in which case, the storage medium is read from the storage medium. The program code itself implements the novel functions of the present invention, and the non-transitory computer readable recording medium storing the program code constitutes the present invention. Also, the program code may be supplied via a transmission medium or the like, in which case the program code itself constitutes the present invention. In addition to ROMs, floppy disks, hard disks, optical disks, magneto-optical disks, CD-ROMs, CD-Rs, magnetic tapes, non-volatile memory cards, etc. can be used as storage media in these cases. The “non-transitory computer readable recording medium” is a volatile memory (for example, a server or client internal to the computer system when the program is transmitted via a network such as the Internet or a communication line such as a telephone line) It also includes one that holds a program for a fixed time, such as a dynamic random access memory (DRAM).

10 CPU (setting unit)
17 Operator (designated part)
Reference Signs List 30 tuner 31 utterance acquisition unit 32 pitch detection unit 33 pitch name identification unit 34 pitch acquisition unit 35 pitch comparison unit (output unit)
36 Notification unit

Claims

A setting unit for setting a key,
A speech acquisition unit for acquiring a speech indicating a floor name from a result of speech recognition;
A pitch acquisition unit for identifying a pitch name corresponding to the acquired utterance based on the tone set by the setting unit and the utterance acquired by the utterance acquisition unit; and acquiring a pitch of the identified pitch name ,
Solfege aid device with.
The solfege auxiliary device according to claim 1, further comprising a presentation unit that presents the pitch acquired by the pitch acquisition unit.
A pitch detection unit that detects a pitch of the utterance acquired by the utterance acquisition unit;
The solfege auxiliary device according to claim 1, further comprising: an output unit that compares the pitch detected by the pitch detection unit with the pitch acquired by the pitch acquisition unit and outputs the comparison result.
The solfege auxiliary device according to claim 3, further comprising a notification unit that notifies the comparison result output from the output unit.
5. The solfege auxiliary device according to claim 4, wherein the notification unit reports the comparison result only when the comparison result indicates a deviation amount smaller than a first predetermined value.
The notification unit is correct in addition to notification of the comparison result when the comparison result indicates a deviation amount that is less than the first predetermined value and exceeds a second predetermined value that is smaller than the first predetermined value. The solfege auxiliary device according to claim 5, wherein the pitch sound is produced.
The notification unit stops the sound generation of the sound of the correct pitch when the comparison result does not indicate a deviation amount exceeding the second predetermined value after the sound of the sound of the correct pitch is generated. Solfege aid described.
The output unit sets an octave so that the shift amount is less than one octave when the shift amount between the pitch detected by the pitch detection unit and the pitch acquired by the pitch acquisition unit is one octave or more. The solfege auxiliary device according to any one of claims 3 to 7, wherein a value obtained by correcting the deviation amount as a unit is output as the comparison result.
And a designation unit that designates an utterance indicating a floor name to be acquired by the utterance acquisition unit.
The solfege assistance device according to any one of claims 1 to 8, wherein the speech acquisition unit acquires the speech designated by the designation unit from the recognized speech.
A calibration unit for adjusting the pitch of the note name;
The solfege auxiliary device according to any one of claims 1 to 9, wherein the pitch acquisition unit acquires the pitch corresponding to the specified pitch name and adjusted by the calibration unit.
It is a control method of a solfege auxiliary device, and
Setting steps to set the key,
An utterance acquisition step of acquiring an utterance indicating a floor name from a result of speech recognition;
A pitch acquisition step of specifying a pitch name corresponding to the acquired speech based on the tone set in the setting step and the speech acquired in the speech acquisition step, and acquiring a pitch of the specified pitch name; ,
A control method of a solfege auxiliary device having:
A program that causes a computer to execute a control method of a solfege auxiliary device,
The control method is
Setting steps to set the key,
An utterance acquisition step of acquiring an utterance indicating a floor name from a result of speech recognition;
A pitch acquisition step of specifying a pitch name corresponding to the acquired speech based on the tone set in the setting step and the speech acquired in the speech acquisition step, and acquiring a pitch of the specified pitch name; ,
A program with.