WO2015111256A1 - Speech adjustment system, server, and in-vehicle device - Google Patents

Speech adjustment system, server, and in-vehicle device

Info

Publication number
WO2015111256A1
Authority
WO
WIPO (PCT)
Prior art keywords
data
voice
processing unit
server
vehicle device
Prior art date
Application number
PCT/JP2014/077446
Other languages
French (fr)
Japanese (ja)
Inventor
古郡 弘滋
Original Assignee
Clarion Co., Ltd. (クラリオン株式会社)
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Clarion Co., Ltd.
Priority to JP2015558732A (JPWO2015111256A1)
Publication of WO2015111256A1

Classifications

    • G: PHYSICS
    • G10: MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L: SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L 13/00: Speech synthesis; Text to speech systems
    • G10L 13/08: Text analysis or generation of parameters for speech synthesis out of text, e.g. grapheme to phoneme translation, prosody generation or stress or intonation determination
    • G: PHYSICS
    • G10: MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L: SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L 13/00: Speech synthesis; Text to speech systems
    • G10L 13/02: Methods for producing synthetic speech; Speech synthesisers
    • G10L 13/033: Voice editing, e.g. manipulating the voice of the synthesiser

Definitions

  • The present invention relates to a voice adjustment system, a server, and an in-vehicle device.
  • As background art in this technical field, there is JP-A-2006-301059 (Patent Document 1).
  • This publication states that "a voice output request including a voice data ID or text data is converted using a conversion table corresponding to the voice quality of the narrator set by the user; if the converted data is a voice data ID, the corresponding voice data is output as voice, and in the case of text data, the voice synthesis unit synthesizes voice data and outputs it as voice."
  • an object of the present invention is to provide a voice adjustment system, a server, and an in-vehicle device that can diversify information to be provided to a user without depending on the storage capacity and processing capability of the in-vehicle device.
  • To achieve this object, a voice adjustment system of the present invention includes a mobile terminal that receives user operations, a server with which the mobile terminal can communicate, and an in-vehicle device that outputs voice based on voice data.
  • The mobile terminal has a terminal-side processing unit that accepts input of parameters for voice adjustment via user operation; the server has a storage unit that stores data indicating utterance content and a server-side processing unit that acquires the parameters and generates voice data from the stored data based on those parameters; and the in-vehicle device has a vehicle-side processing unit that acquires the voice data and outputs voice based on the acquired voice data.
  • According to the present invention, it is possible to diversify the information provided to the user without depending on the storage capacity and processing capability of the in-vehicle device.
  • FIG. 1 is a diagram showing an audio adjustment system according to an embodiment of the present invention.
  • FIG. 2 is a block diagram showing the configuration of the in-vehicle device.
  • FIG. 3 is a block diagram showing the configuration of the mobile terminal.
  • FIG. 4 is a diagram showing an operation when the mobile terminal executes the voice adjustment program.
  • FIG. 5 shows an example of the adjustment screen.
  • FIG. 6 is a diagram illustrating an operation at the time of starting the in-vehicle device.
  • FIG. 7 shows an example of the edit screen.
  • FIG. 8 is a diagram illustrating an operation when the user of the in-vehicle device performs destination setting using voice recognition.
  • FIG. 9 is a diagram showing an example of the operation when the mobile terminal executes the mail reading program.
  • FIG. 1 is a diagram showing an audio adjustment system according to an embodiment of the present invention.
  • The voice adjustment system 1 includes an in-vehicle device 100 mounted on a vehicle 10 that is a mobile body, a mobile terminal 200 carried by various users, and a service providing server 300 connected to a public communication line network N1 such as the Internet. As will be described later, the in-vehicle device 100 and the mobile terminal 200 can be connected by short-range wireless communication, and the mobile terminal 200 and the service providing server 300 can be connected to each other via the public communication line network N1.
  • FIG. 2 is a block diagram showing the configuration of the in-vehicle device 100.
  • the in-vehicle device 100 includes an information processing unit 101, a storage unit 102, a display unit 103, an input unit 104, a GPS receiving device 105, a near field communication unit 106, and a voice input / output unit 107. Each of these devices is electrically connected by a bus 108 and configured to be able to exchange data with each other.
  • the information processing unit 101 includes a CPU 109, a ROM 110, a RAM 111, and peripheral circuits (not shown) connected to each other via a bus 112, and functions as a computer (vehicle-mounted processing unit) that centrally controls the vehicle-mounted device 100.
  • By executing a control program stored in the storage unit 102, the information processing unit 101 of the in-vehicle device 100 executes various processes, such as the route guidance processing and voice processing performed by known car navigation and car audio devices.
  • For example, the information processing unit 101 identifies the current location of the vehicle 10 based on the GPS radio waves received by the GPS receiver 105, and searches for a route from a departure place (for example, the current location) to a destination using the map data stored in the storage unit 102.
  • The information processing unit 101 also generates graphics information such as a map image and traffic information and outputs it to the display unit 103, and reads voice data for route guidance from the storage unit 102 and emits a guidance voice from a speaker (not shown) installed in the vehicle.
  • The storage unit 102 stores the control program executed by the information processing unit 101 and various data.
  • For example, an HDD (Hard Disk Drive), a semiconductor memory, a CD-ROM, or a DVD-ROM can be used as the storage unit 102.
  • The storage unit 102 also stores a control program that performs navigation processing and a voice interactive application program (hereinafter referred to as a conversation program) that transmits voice uttered by the user to the service providing server 300 via the mobile terminal 200.
  • These programs may be programs acquired (downloaded) via the public communication network N1 (FIG. 1), or may be programs installed in the in-vehicle device 100 in advance.
  • the display unit 103 displays various images under the control of the information processing unit 101.
  • a liquid crystal display device is used for the display unit 103.
  • the input unit 104 is a device that detects a user operation and notifies the information processing unit 101, and includes an operation switch and a transmissive touch panel arranged on the display screen.
  • the GPS receiver 105 receives GPS radio waves transmitted from GPS satellites, calculates the position and direction of the vehicle based on the GPS radio waves, and outputs them to the information processing unit 101.
  • the short-range communication unit 106 is a wireless communication interface for performing short-range wireless communication under the control of the information processing unit 101. By using the short-range communication unit 106, wireless communication can be performed between the in-vehicle device 100 and the portable terminal 200 in the vehicle.
  • For the short-range wireless communication, Bluetooth (registered trademark) or Wi-Fi (registered trademark), for example, is used.
  • the voice input / output unit 107 includes a decoder, an amplifier, and the like, and outputs various voices from a speaker installed in the vehicle 10 under the control of the information processing unit 101.
  • the audio input / output unit 107 generates an audio signal from the audio data stored in the storage unit 102 and emits the sound from a speaker.
  • Specific examples of the voice include guidance voice for navigation, CD voice, radio voice, and various voices transmitted from the portable terminal 200.
  • the voice input / output unit 107 has a microphone and an AD conversion circuit, and has a function of converting voice uttered by the user into voice data.
  • the in-vehicle device 100 may further include other configurations included in a known in-vehicle device.
  • FIG. 3 is a block diagram illustrating a configuration of the mobile terminal 200.
  • The mobile terminal 200 is, for example, a smartphone or a PDA (Personal Digital Assistant).
  • The mobile terminal 200 includes an information processing unit 201, a storage unit 202, a display unit 203, an input unit 204, a GPS receiving device 205, a communication unit 206, a short-range communication unit 207, and a voice input / output unit 208.
  • Each of these devices is electrically connected by a bus 209 and configured to be able to exchange data with each other.
  • the information processing unit 201 includes a CPU 210, a ROM 211, a RAM 212, and peripheral circuits (not shown) connected to each other via a bus 213, and functions as a computer (terminal side processing unit) that centrally controls the mobile terminal 200.
  • By executing a control program stored in the storage unit 202, the information processing unit 201 of the mobile terminal 200 provides the functions of a known mobile terminal: a telephone function, a mail function for sending, receiving, and browsing e-mail, a web browsing function, and a function of executing various application programs acquired from the Internet or the like.
  • the storage unit 202 stores a control program executed by the information processing unit 201 and various data.
  • a semiconductor memory or an HDD is applied to the storage unit 202.
  • The storage unit 202 stores an application program for adjusting the voice (navigation guidance voice) used in the in-vehicle device 100 (hereinafter referred to as a voice adjustment program), an application program for communicating with the in-vehicle device 100 (hereinafter referred to as a communication program), and an application program for editing text data stored in the service providing server 300 (hereinafter referred to as an editing program).
  • These programs may be programs acquired (downloaded) via the public communication network N1 (FIG. 1), or may be programs installed in advance in the mobile terminal 200.
  • the display unit 203 is a device that displays various images under the control of the information processing unit 201.
  • For example, a liquid crystal display device is used as the display unit 203.
  • the input unit 204 is a device that detects a user operation and notifies the information processing unit 201, and includes an operation switch and a transmissive touch panel arranged on the display screen.
  • the GPS receiving device 205 receives GPS radio waves transmitted from GPS satellites, calculates the current location and orientation of the mobile terminal 200 based on the GPS radio waves, and outputs them to the information processing unit 201.
  • The communication unit 206 is a communication interface that accesses the public communication line network N1 (FIG. 1) via a wireless communication network (a mobile phone communication network in this embodiment) and communicates with devices connected to the public communication line network N1.
  • The communication unit 206 enables communication between the mobile terminal 200 and the service providing server 300 (FIG. 1). The communication unit 206 also enables calls and mail exchange with other mobile terminals.
  • the near field communication unit 207 is a wireless communication interface for performing near field wireless communication under the control of the information processing unit 201. By using the short-range communication unit 207, wireless communication can be performed between the mobile terminal 200 and the in-vehicle device 100.
  • The voice input / output unit 208 outputs various voices from a speaker included in the mobile terminal 200 under the control of the information processing unit 201, and inputs voice uttered by the user via a microphone and converts it into voice data. By using the voice input / output unit 208, it is possible to make calls with other mobile terminals and to reproduce voice data, such as music, stored in the storage unit 202 of the mobile terminal 200.
  • The mobile terminal 200 may further include other configurations included in a known mobile terminal.
  • the service providing server 300 is a server that provides a service such as creating voice data from text data, that is, a TTS (Text to speech) service.
  • The service providing server 300 includes an information processing unit 301, a storage unit 302, and a communication unit 303, and is configured with higher processing capability and larger storage capacity than the in-vehicle device 100.
  • the information processing unit 301 includes a CPU, a ROM, a RAM, and the like, and functions as a computer (server side processing unit) that centrally controls the service providing server 300.
  • the storage unit 302 stores a control program executed by the information processing unit 301 and various data.
  • the information processing unit 301 can function as a TTS engine that performs voice conversion processing for creating voice data from text data by executing a control program stored in the storage unit 302.
  • The storage unit 302 also stores a database of text data indicating the utterance content (such as route guidance voice) reproduced by the in-vehicle device 100, and a point-of-interest (POI) database used for route search and the like.
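As a rough illustration of the two server-side stores described above, the following sketch models the fixed-phrase text database and the POI database as plain Python structures. All contents and names here are hypothetical examples; the patent does not specify a schema.

```python
# Illustrative sketch of the databases held in storage unit 302:
# fixed-phrase text data and a POI database. Entries are invented examples.

PHRASE_DB = {
    "greeting": "Hello, Mr. ***",
    "arrival": "Thank you for your hard work",
}

POI_DB = [
    {"name": "Central Station", "lat": 35.68, "lon": 139.77},
    {"name": "Central Park", "lat": 35.67, "lon": 139.75},
]

def poi_search(keyword: str) -> list[dict]:
    """Search the POI database using a recognition result as the key (cf. step S1C)."""
    return [p for p in POI_DB if keyword.lower() in p["name"].lower()]

hits = poi_search("central")
```

A real service would back these with a full database and geospatial index; the dictionaries above only show the shape of the data the description implies.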
  • FIG. 4 is a diagram illustrating the operation when the mobile terminal 200 executes the voice adjustment program.
  • When the information processing unit 201 of the mobile terminal 200 executes the voice adjustment program in response to a user operation, it first performs a login process to the service providing server 300 (step S1A).
  • In this login process, the information processing unit 201 accesses the service providing server 300 through the communication unit 206, causes the display unit 203 to display a login screen, and transmits the login information entered by the user (for example, a user ID and a password) to the service providing server 300.
  • When the login is completed, the information processing unit 301 of the service providing server 300 displays a voice adjustment screen (adjustment screen) on the screen of the mobile terminal 200 (step S2A).
  • FIG. 5 is a diagram showing an example of the adjustment screen.
  • The adjustment screen is a screen that accepts input of parameters for voice adjustment; more specifically, it accepts input of the voice's pitch, speed, height, intonation, gender, and type.
  • the parameters are not limited to the above parameters, and may be added or changed as appropriate.
  • The information processing unit 301 of the service providing server 300 stores (saves) these parameter groups in the storage unit 302 as individual TTS parameters (step S3A).
  • The storage unit 302 stores the individual TTS parameters in association with identification information (a user ID or terminal ID) for identifying the mobile terminal 200.
  • As described above, the user of the mobile terminal 200 can set desired parameters by operating the mobile terminal 200 that he or she owns and store them in the service providing server 300.
  • When these parameters are set (when the voice adjustment program is executed), it is not necessary to connect the mobile terminal 200 to the in-vehicle device 100. The user can therefore set parameters not only in the vehicle 10 but also at any place outside the vehicle 10.
  • Note that the service providing server 300 may convert predetermined sample text data into voice data based on the individual TTS parameters and send the voice data to the mobile terminal 200 so that the user can reproduce and confirm the adjusted voice.
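The parameter-setting flow of steps S1A to S3A can be sketched with plain function calls standing in for the network exchange between the mobile terminal 200 and the service providing server 300. The session handling, token format, and parameter names are assumptions for illustration only.

```python
# Sketch of FIG. 4 (steps S1A-S3A). Real communication would go through the
# mobile terminal's communication unit 206; here simple dicts play the roles
# of the server's session store and storage unit 302.

SESSIONS = {}       # login sessions (step S1A)
TTS_PARAMS = {}     # individual TTS parameters, keyed by user ID (step S3A)

def login(user_id: str, password: str) -> str:
    """Step S1A: log the mobile terminal in and return a session token
    (stand-in for real authentication)."""
    token = f"session-{user_id}"
    SESSIONS[token] = user_id
    return token

def submit_parameters(token: str, params: dict) -> bool:
    """Steps S2A-S3A: accept adjustment-screen input and save it per user."""
    user_id = SESSIONS.get(token)
    if user_id is None:
        return False
    # Keep only the adjustment items named in the description.
    allowed = {"pitch", "speed", "height", "intonation", "gender", "type"}
    TTS_PARAMS[user_id] = {k: v for k, v in params.items() if k in allowed}
    return True

token = login("user-001", "secret")
ok = submit_parameters(token, {"pitch": 1.1, "speed": 0.9, "gender": "female"})
```

Because the terminal talks only to the server, this flow needs no connection to the in-vehicle device, matching the point made above.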
  • FIG. 6 is a diagram illustrating an operation when the in-vehicle device 100 is activated.
  • In this case, it is assumed that the in-vehicle device 100 and the mobile terminal 200 are connected for communication.
  • When activated, the information processing unit 101 of the in-vehicle device 100 causes the mobile terminal 200 to execute the communication program. As a result, the in-vehicle device 100 can access and communicate with the service providing server 300 via the mobile terminal 200.
  • The information processing unit 101 of the in-vehicle device 100 then downloads the voice data adjusted by the individual TTS parameters from the service providing server 300 (step S1B).
  • In this case, the service providing server 300 may acquire identification information (a user ID or terminal ID) from the mobile terminal 200 and identify the corresponding individual TTS parameters based on that information. The method is not limited to this; any known technique for identifying the individual TTS parameters corresponding to the mobile terminal 200 or the in-vehicle device 100 is widely applicable.
  • The voice data is obtained by voice-converting the group of fixed-phrase text data stored in the storage unit 302 of the service providing server 300. That is, the service providing server 300 converts each piece of text data into voice data with its TTS engine based on the individual TTS parameters. In this way, voice data corresponding to the text of each entry is generated with the voice adjusted by the individual TTS parameters.
  • This voice conversion process may be executed when a download instruction is issued from the in-vehicle device 100, or at a timing before the download instruction (for example, at an appropriate timing after the individual TTS parameters are stored).
  • The information processing unit 101 of the in-vehicle device 100 stores the downloaded voice data in the storage unit 102, and the stored voice data is reproduced by the voice input / output unit 107 at a timing specified in advance (step S2B).
  • For example, immediately after the download, phrases such as "Hello, Mr. ***", "Happy birthday", and "Let's drive safely today as well" can be played, and "Thank you for your hard work" can be played at the timing of arriving at a home location set in advance.
  • In this way, the service providing server 300 can convert text into the voice desired by the user of the mobile terminal 200, and that voice can be reproduced by the in-vehicle device 100.
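Steps S1B and S2B, in which the server converts the fixed phrases with the stored individual TTS parameters and the in-vehicle device caches and plays the result, could look roughly like this. The `synthesize()` function is a hypothetical stand-in for the server's TTS engine; a real implementation would return audio data rather than a tagged string.

```python
# Sketch of steps S1B-S2B. Phrase keys and the tagged-string "audio" format
# are invented for illustration.

FIXED_PHRASES = {
    "greeting": "Hello, Mr. ***",
    "arrival": "Thank you for your hard work",
}

def synthesize(text: str, params: dict) -> str:
    """Hypothetical TTS engine: tag the text with the adjustment parameters."""
    tag = ",".join(f"{k}={v}" for k, v in sorted(params.items()))
    return f"<audio {tag}>{text}</audio>"

def download_voice_data(user_params: dict) -> dict:
    """Step S1B: server converts every fixed phrase with the user's
    individual TTS parameters and returns the resulting voice-data set."""
    return {key: synthesize(text, user_params)
            for key, text in FIXED_PHRASES.items()}

# In-vehicle device side: cache the downloaded data in storage unit 102.
local_cache = download_voice_data({"pitch": 1.2, "speed": 0.9})

def play(phrase_key: str) -> str:
    """Step S2B: reproduce a cached phrase at a pre-specified timing."""
    return local_cache[phrase_key]
```

Pre-converting and caching in this way is what lets playback work regardless of the in-vehicle device's own processing capability, per the point of the embodiment.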
  • The text data stored in the service providing server 300 can be edited by the user operating the mobile terminal 200 and executing the editing program.
  • When the editing program is executed, a login process similar to step S1A shown in FIG. 4 is performed; when the login is completed, the information processing unit 301 of the service providing server 300 displays a text data editing screen (editing screen) on the screen of the mobile terminal 200.
  • FIG. 7 is a diagram showing an example of the editing screen.
  • This editing screen is a screen that accepts editing of text data of fixed phrases corresponding to the audio content reproduced by the in-vehicle device 100.
  • a text data group stored in the storage unit 302 of the service providing server 300 is displayed on the editing screen, and editing of each text data is accepted.
  • The edited text data is stored in the storage unit 302 in association with identification information (a user ID or terminal ID) for identifying the mobile terminal 200.
  • the user of the mobile terminal 200 can edit the text data by operating the mobile terminal 200 that the user owns, and store the text data in the service providing server 300.
  • the user can edit text data at an arbitrary location such as outside the vehicle.
  • FIG. 8 is a diagram illustrating an operation when the user of the in-vehicle device 100 performs destination setting using voice recognition. In this case as well, it is assumed that the in-vehicle device 100 and the mobile terminal 200 are connected for communication. The information processing unit 101 of the in-vehicle device 100 executes the conversation program in response to a user operation and waits for input of a voice specifying a destination via the voice input / output unit 107; when a voice is input, the corresponding voice data is transmitted to the service providing server 300 via the mobile terminal 200. The information processing unit 301 of the service providing server 300 recognizes the voice data received via the mobile terminal 200 and performs a POI search (POI database search) using the voice recognition result as a search key (step S1C).
  • Next, the information processing unit 301 of the service providing server 300 uses the TTS engine to generate voice data that reads out the search results using the individual TTS parameters, and transmits the voice data to the in-vehicle device 100 via the mobile terminal 200 (step S2C).
  • In other words, voice data adjusted by the individual TTS parameters, that is, voice data with the voice desired by the user, is generated and transmitted to the in-vehicle device 100.
  • the information processing unit 101 of the in-vehicle device 100 performs a process of reproducing the received audio data (step S3C).
  • As a result, the in-vehicle device 100 reproduces the search results with the voice desired by the user, for example, "The search results are XX" or "First, ... Second, ...".
  • Thereafter, the information processing unit 101 of the in-vehicle device 100 performs processing (destination setting) for searching for a recommended route from the current location to the destination (step S4C).
  • When route guidance is subsequently performed, the guidance voice is emitted as a voice adjusted by the service providing server 300 based on the individual TTS parameters. That is, by having the service providing server 300 convert the guidance voice text data stored in the storage unit 302 into voice data based on the individual TTS parameters in advance, transmitting that voice data from the service providing server 300 to the in-vehicle device 100 via the mobile terminal 200, and having the in-vehicle device 100 store it in the storage unit 102, the in-vehicle device 100 can reproduce the adjusted guidance voice. In this way, route guidance can be performed with the voice desired by the user. Note that the timing at which the service providing server 300 performs the voice conversion and the timing at which the voice data is transmitted to the in-vehicle device 100 can each be set to any timing before the guidance voice is reproduced.
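The voice-recognition destination search of FIG. 8 (steps S1C to S3C) can be sketched end to end as below. Here `recognize()` and the tagged-string synthesis are hypothetical stand-ins for the server's speech recognition and TTS engines, and the POI entries are invented examples.

```python
# Sketch of steps S1C-S3C: recognize spoken input, search the POI database,
# and synthesize an announcement of the results with the user's individual
# TTS parameters.

POI_DB = ["Central Station", "Central Park", "City Hall"]

def recognize(audio: str) -> str:
    """Hypothetical speech recognizer; for the sketch, 'audio' is already text."""
    return audio.strip().lower()

def poi_search_and_announce(audio: str, params: dict) -> str:
    keyword = recognize(audio)                          # step S1C: recognition
    hits = [p for p in POI_DB if keyword in p.lower()]  # step S1C: POI search
    text = f"The search results are {len(hits)} items: " + ", ".join(hits)
    tag = ",".join(f"{k}={v}" for k, v in sorted(params.items()))
    return f"<audio {tag}>{text}</audio>"               # steps S2C-S3C

announcement = poi_search_and_announce(" Central ", {"pitch": 1.1})
```

The heavy steps (recognition, database search, synthesis) all run on the server, which is the design point the description stresses: the in-vehicle device only transmits the utterance and plays the returned voice data.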
  • As described above, in the present embodiment, the mobile terminal 200 accepts input of individual TTS parameters (parameters for voice adjustment) through user operation; the service providing server 300, which has a storage unit 302 that stores text data indicating utterance content, acquires the individual TTS parameters and generates voice data from the text data based on those parameters; and the in-vehicle device 100 acquires the voice data and outputs voice based on the acquired voice data. The voice desired by the user can therefore be reproduced on the in-vehicle device 100 side without depending on the storage capacity or processing capability of the in-vehicle device 100.
  • Moreover, since the voice conversion is performed by the service providing server 300, which has higher processing capability than the in-vehicle device 100, the adjustment range of the voice conversion can be made wider than when the in-vehicle device 100 performs the conversion. As a result, the information provided to the user can be diversified.
  • Since the individual TTS parameters are set solely through communication between the mobile terminal 200 and the service providing server 300, they can be set without operating the in-vehicle device 100. Individual TTS parameters can therefore be set at any place and at any time, which improves convenience. Note that if individual TTS parameters were set on the in-vehicle device 100 side, a dedicated menu screen or program would be needed, making the setting dependent on the function and performance of the in-vehicle device 100. In other words, in the present embodiment, it is possible to set parameters and to acquire and reproduce voice data adjusted to the desired voice without depending on the function and performance of the in-vehicle device 100.
  • Since the individual TTS parameters include the pitch, speed, height, and the like of the voice, it is easy to adjust the voice to the one desired by the user.
  • Since the voice data is generated from text data, known voice conversion techniques can be widely applied, and the voice content can be easily edited and created.
  • In addition, the information processing unit (terminal-side processing unit) 201 of the mobile terminal 200 accepts editing of the text data stored in the service providing server 300 through user operation, and the information processing unit (server-side processing unit) 301 of the service providing server 300 edits the text data accordingly, so the voice content can easily be adjusted to what the user desires.
  • In this case as well, the text data can be edited without operating the in-vehicle device 100; it can be edited at any place and at any time, which improves convenience, and the editing does not depend on the function and performance of the in-vehicle device 100.
  • the above-described embodiment is merely one aspect of the present invention, and can be arbitrarily changed within the scope of the present invention.
  • For example, in the above embodiment, the case where the pitch, speed, height, intonation, gender, and the like of the voice are adjustable has been described; however, it is sufficient that any one or more of these adjustment items be adjustable, and the adjustment items may be increased or decreased as appropriate.
  • Also, the case where text data is converted into voice data based on the individual TTS parameters has been described; however, reference voice data may be stored in the service providing server 300 in advance, and its pitch, speed, height, and the like may be adjusted based on the individual TTS parameters.
  • Further, an application program for reading mail aloud (hereinafter referred to as a mail reading program) may be stored in the mobile terminal 200, and using this program, voice data adjusted by the individual TTS parameters may be generated by the service providing server 300 and played back by the in-vehicle device 100.
  • FIG. 9 is a diagram illustrating an example of an operation when the mobile terminal 200 executes the mail reading program.
  • When the mail reading program is executed, the text of the mail is transmitted to the service providing server 300 (step S1D).
  • Next, the information processing unit 301 of the service providing server 300 uses the TTS engine to generate voice data that reads out the text of the mail using the individual TTS parameters, and transmits the voice data to the in-vehicle device 100 via the mobile terminal 200 (step S2D).
  • the information processing unit 101 of the in-vehicle device 100 performs a process of reproducing the received audio data (step S3D).
  • As a result, a voice indicating the content of the mail, such as "Today's meeting: the start time has been changed", can be reproduced with the voice desired by the user.
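The mail-reading variation of FIG. 9 reuses the same server-side conversion: the mail body simply replaces the fixed-phrase text as the TTS input. A minimal sketch, with `synthesize()` again a hypothetical stand-in for the server's TTS engine:

```python
# Sketch of steps S1D-S3D: the mail text goes to the server, which generates
# voice data with the user's individual TTS parameters; the in-vehicle device
# then reproduces it. The tagged-string "audio" format is illustrative only.

def synthesize(text: str, params: dict) -> str:
    tag = ",".join(f"{k}={v}" for k, v in sorted(params.items()))
    return f"<audio {tag}>{text}</audio>"

def read_mail_aloud(mail_body: str, user_params: dict) -> str:
    """Steps S1D-S2D: server-side conversion of the mail text to voice data."""
    return synthesize(mail_body, user_params)

# Step S3D: the in-vehicle device reproduces the received voice data.
voice = read_mail_aloud("Today's meeting: the start time has been changed",
                        {"pitch": 1.0, "speed": 1.1})
```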
  • the in-vehicle device 100 may have various functions of the mobile terminal 200.
  • the in-vehicle device 100 may receive an input of an individual TTS parameter that is a parameter for voice adjustment through a user operation, and cause the service providing server 300 to acquire the individual TTS parameter.
  • Further, the in-vehicle device 100 may include a communication unit for wirelessly communicating with the service providing server 300 connected to the public communication line network N1 such as the Internet, and may communicate with the service providing server 300 directly rather than via the mobile terminal 200.
  • the in-vehicle device 100 is not limited to being mounted on a four-wheeled vehicle such as an automobile, and may be mounted on a two-wheeled vehicle such as a bicycle.

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Navigation (AREA)

Abstract

Provided are a speech adjustment system, a server and an in-vehicle device capable of diversifying information provided for a user without depending on the storage capacity and processing ability of the in-vehicle device. A mobile terminal (200) accepts the input of an individual-specific TTS parameter as a parameter for speech adjustment through user operation. A service provider server (300) with a storage unit (302) for storing text data representing utterance content acquires the individual-specific TTS parameter, and generates speech data from the text data on the basis of the parameter. An in-vehicle device (100) acquires the speech data and outputs a speech on the basis of the acquired speech data.

Description

音声調整システム、サーバ及び車載装置Audio adjustment system, server, and in-vehicle device
 本発明は、音声調整システム、サーバ及び車載装置に関する。 The present invention relates to a sound adjustment system, a server, and an in-vehicle device.
 本技術分野の背景技術として、特開2006-301059号公報(特許文献1)がある。この公報には、「音声データIDまたはテキストデータを含む音声出力要求を、ユーザが設定したナレータの音質に対応した変換テーブルを用いて変換し、変換後のデータが音声データIDの場合は音声データを使って音声出力し、テキストデータの場合は音声合成部で音声データを合成して音声出力する。」と記載されている。 As a background art in this technical field, there is JP-A-2006-301059 (Patent Document 1). This gazette states that “a voice output request including a voice data ID or text data is converted using a conversion table corresponding to the voice quality of the narrator set by the user, and if the converted data is a voice data ID, the voice data Is output using voice, and in the case of text data, the voice synthesis unit synthesizes the voice data and outputs the voice. "
特開2006-301059号公報JP 2006-301059 A
 特許文献1では、例えば、ユーザが選択できるメッセージの声質の種類を多くするためには、その特徴を示す特徴データをその分だけ記憶する必要があるが、車両用ナビゲーション装置における記憶装置の容量には限界がある。
 そこで、本発明は、車載装置の記憶容量や処理能力に依存せずに、ユーザに提供する情報の多様化を図ることができる音声調整システム、サーバ及び車載装置を提供することを目的とする。
In Patent Document 1, for example, in order to increase the types of voice quality of a message that can be selected by the user, it is necessary to store the feature data indicating the feature correspondingly, but the capacity of the storage device in the vehicle navigation device is increased. There are limits.
Therefore, an object of the present invention is to provide a voice adjustment system, a server, and an in-vehicle device that can diversify information to be provided to a user without depending on the storage capacity and processing capability of the in-vehicle device.
 上記目的を達成するために、本発明の音声調整システムは、ユーザ操作を受け付ける携帯端末と、前記携帯端末が通信可能なサーバと、音声データに基づいて音声を出力する車載装置とを備え、前記携帯端末は、前記ユーザ操作を介して音声調整用のパラメータの入力を受け付ける端末側処理部を有し、前記サーバは、発話内容を示すデータを記憶する記憶部と、前記パラメータを取得し、前記パラメータに基づいて前記データから音声データを生成するサーバ側処理部とを有し、前記車載装置は、前記音声データを取得し、取得した前記音声データに基づいて音声を出力する車載側処理部を有することを特徴とする。 In order to achieve the above object, an audio adjustment system of the present invention includes a mobile terminal that receives a user operation, a server with which the mobile terminal can communicate, and an in-vehicle device that outputs audio based on audio data, The mobile terminal has a terminal-side processing unit that accepts input of parameters for voice adjustment via the user operation, the server acquires the parameter, a storage unit that stores data indicating utterance content, A server-side processing unit that generates voice data from the data based on a parameter, and the in-vehicle device acquires the voice data and outputs a voice based on the acquired voice data. It is characterized by having.
According to the present invention, the information provided to the user can be diversified without depending on the storage capacity or processing capability of the in-vehicle device.
FIG. 1 is a diagram showing a voice adjustment system according to an embodiment of the present invention. FIG. 2 is a block diagram showing the configuration of the in-vehicle device. FIG. 3 is a block diagram showing the configuration of the mobile terminal. FIG. 4 is a diagram showing the operation when the mobile terminal executes the voice adjustment program. FIG. 5 is a diagram showing an example of the adjustment screen. FIG. 6 is a diagram showing the operation when the in-vehicle device starts up. FIG. 7 is a diagram showing an example of the editing screen. FIG. 8 is a diagram showing the operation when the user of the in-vehicle device sets a destination using voice recognition. FIG. 9 is a diagram showing an example of the operation when the mobile terminal executes a mail read-aloud program.
Embodiments of the present invention will be described below with reference to the drawings.
FIG. 1 is a diagram showing a voice adjustment system according to an embodiment of the present invention.
The voice adjustment system 1 includes an in-vehicle device 100 mounted on a vehicle 10, which is a mobile body, mobile terminals 200 carried by various users, and a service providing server 300 connected to a public communication network N1 such as the Internet.
As will be described later, the in-vehicle device 100 and the mobile terminal 200 can be communicatively connected by short-range wireless communication, and the mobile terminal 200 and the service providing server 300 can be communicatively connected via the public communication network N1.
FIG. 2 is a block diagram showing the configuration of the in-vehicle device 100.
The in-vehicle device 100 includes an information processing unit 101, a storage unit 102, a display unit 103, an input unit 104, a GPS receiver 105, a short-range communication unit 106, and a voice input/output unit 107. These devices are electrically connected by a bus 108 and configured to exchange data with one another.
The information processing unit 101 has a CPU 109, a ROM 110, a RAM 111, and peripheral circuits (not shown) interconnected by a bus 112, and functions as a computer (in-vehicle-side processing unit) that centrally controls the in-vehicle device 100. For example, by executing a control program stored in the storage unit 102, the information processing unit 101 of the in-vehicle device 100 performs various processes, such as route guidance processing and voice processing, performed by known car navigation devices and car audio devices.
In route guidance processing, the information processing unit 101 identifies the current location of the vehicle 10 based on GPS radio waves received by the GPS receiver 105 and, using map data stored in the storage unit 102, searches for a route from a departure point (for example, the current location) to a destination. The information processing unit 101 also generates graphics information such as map images and traffic information and outputs it to the display unit 103, and reads voice data for route guidance from the storage unit 102 to emit guidance voice from a speaker (not shown) installed in the vehicle.
The storage unit 102 stores the control program executed by the information processing unit 101 and various data. For the storage unit 102, for example, an HDD (Hard Disk Drive), a semiconductor memory, a CD-ROM, or a DVD-ROM is used. In the present embodiment, the storage unit 102 stores a control program that performs navigation processing, and a voice-interactive application program (hereinafter referred to as the conversation program) that transmits, to the service providing server 300 via the mobile terminal 200, voice uttered by the user of the mobile terminal 200 and the like.
These programs may be programs acquired (downloaded) via the public communication network N1 (FIG. 1), or programs installed in the in-vehicle device 100 in advance.
The display unit 103 displays various images under the control of the information processing unit 101. For the display unit 103, for example, a liquid crystal display device is used.
The input unit 104 is a device that detects user operations and notifies the information processing unit 101 of them, and includes operation switches and a transmissive touch panel arranged over the display screen.
The GPS receiver 105 receives GPS radio waves transmitted from GPS satellites, calculates the vehicle's position, heading, and the like based on the GPS radio waves, and outputs them to the information processing unit 101.
The short-range communication unit 106 is a wireless communication interface for performing short-range wireless communication under the control of the information processing unit 101. Using the short-range communication unit 106, wireless communication can be performed between the in-vehicle device 100 and a mobile terminal 200 in the vehicle. In the present embodiment, Bluetooth (registered trademark) is used for the short-range wireless communication, but Wi-Fi (registered trademark) or the like may be used instead.
The voice input/output unit 107 includes a decoder, an amplifier, and the like, and outputs various voices from a speaker installed in the vehicle 10 under the control of the information processing unit 101. For example, the voice input/output unit 107 generates a voice signal from voice data stored in the storage unit 102 and emits sound from the speaker. Specific examples of such voices include navigation guidance voice, CD audio, radio audio, and various voices transmitted from the mobile terminal 200. The voice input/output unit 107 also has a microphone and an AD conversion circuit, and has a function of converting voice uttered by the user into voice data. The in-vehicle device 100 may further include other components provided in known in-vehicle devices.
FIG. 3 is a block diagram showing the configuration of the mobile terminal 200.
The mobile terminal 200 is a smartphone or a PDA (Personal Digital Assistant). As shown in FIG. 3, the mobile terminal 200 includes an information processing unit 201, a storage unit 202, a display unit 203, an input unit 204, a GPS receiver 205, a communication unit 206, a short-range communication unit 207, and a voice input/output unit 208. These devices are electrically connected by a bus 209 and configured to exchange data with one another.
The information processing unit 201 has a CPU 210, a ROM 211, a RAM 212, and peripheral circuits (not shown) interconnected by a bus 213, and functions as a computer (terminal-side processing unit) that centrally controls the mobile terminal 200.
For example, by executing a control program stored in the storage unit 202, the information processing unit 201 of the mobile terminal 200 provides a telephone function as found in known mobile terminals, a mail function for sending, receiving, and viewing e-mail, a browsing function for browsing the Internet, and a function of executing various application programs acquired from the Internet or elsewhere.
The storage unit 202 stores the control program executed by the information processing unit 201 and various data. For the storage unit 202, for example, a semiconductor memory or an HDD is used.
In the present embodiment, the storage unit 202 stores an application program for adjusting the voice (navigation guidance voice) used by the in-vehicle device 100 (hereinafter referred to as the voice adjustment program), an application program for communicating with the in-vehicle device 100 (hereinafter referred to as the communication program), and an application program for editing text data stored in the service providing server 300 (hereinafter referred to as the editing program).
These programs may be programs acquired (downloaded) via the public communication network N1 (FIG. 1), or programs installed in the mobile terminal 200 in advance.
The display unit 203 is a device that displays various images under the control of the information processing unit 201; for example, a liquid crystal display device is used.
The input unit 204 is a device that detects user operations and notifies the information processing unit 201 of them, and includes operation switches and a transmissive touch panel arranged over the display screen.
The GPS receiver 205 receives GPS radio waves transmitted from GPS satellites, calculates the current location, heading, and the like of the mobile terminal 200 based on the GPS radio waves, and outputs them to the information processing unit 201.
The communication unit 206 is a communication interface that, under the control of the information processing unit 201, accesses the public communication network N1 (FIG. 1) and the like via a wireless communication network (in this embodiment, a mobile phone network) and communicates with devices connected to the public communication network N1 and the like. The communication unit 206 enables communication between the mobile terminal 200 and the service providing server 300 (FIG. 1), as well as telephone calls and mail exchange with other mobile terminals.
The short-range communication unit 207 is a wireless communication interface for performing short-range wireless communication under the control of the information processing unit 201. Using the short-range communication unit 207, wireless communication can be performed between the mobile terminal 200 and the in-vehicle device 100.
The voice input/output unit 208, under the control of the information processing unit 201, outputs various voices from a speaker provided in the mobile terminal 200, and inputs voice uttered by the user via a microphone and converts it into voice data. Using the voice input/output unit 208, it is possible to make telephone calls with other mobile terminals and to play back voice data such as music stored in the storage unit 202 of the mobile terminal 200. The mobile terminal 200 may further include other components provided in known mobile terminals.
The service providing server 300 is a server that provides services such as creating voice data from text data, that is, a TTS (Text To Speech) service. As shown in FIG. 1, the service providing server 300 includes an information processing unit 301, a storage unit 302, and a communication unit 303, and is configured as a device with higher processing capability and larger storage capacity than the in-vehicle device 100.
The information processing unit 301 includes a CPU, a ROM, a RAM, and the like, and functions as a computer (server-side processing unit) that centrally controls the service providing server 300.
The storage unit 302 stores the control program executed by the information processing unit 301 and various data. By executing the control program stored in the storage unit 302, the information processing unit 301 can function as a TTS engine that performs voice conversion processing for creating voice data from text data.
The storage unit 302 also stores a database of text data describing the utterance content to be played back by the in-vehicle device 100 (such as route guidance voice), and a Point of Interest (POI) database used for route searches and the like.
Next, processing executed in the voice adjustment system 1 will be described.
FIG. 4 is a diagram showing the operation when the mobile terminal 200 executes the voice adjustment program.
When the information processing unit 201 of the mobile terminal 200 executes the voice adjustment program in response to a user operation, it performs login processing to the service providing server 300 (step S1A). In this processing, the information processing unit 201 accesses the service providing server 300 via the communication unit 206 and displays a login screen on the display unit 203. The user of the mobile terminal 200 then enters login information (for example, a user ID and password) via the input unit 204, completing the login.
When the login is completed, the information processing unit 301 of the service providing server 300 displays a screen for voice adjustment (adjustment screen) on the screen of the mobile terminal 200 (step S2A).
FIG. 5 is a diagram showing an example of the adjustment screen.
The adjustment screen is a screen that accepts input of voice-adjustment parameters; more specifically, it accepts input of voice pitch, speed, height, intonation, gender, and voice type. The parameters are not limited to these, and may be added to or changed as appropriate.
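The parameter set accepted by the adjustment screen can be sketched as a small data structure. This is an illustrative assumption: the description names only the parameter kinds (pitch, speed, height, intonation, gender, type), so the field names, value ranges, and validation below are not part of the disclosure.

```python
from dataclasses import dataclass

# Hypothetical encoding of the voice-adjustment parameters entered on the
# adjustment screen (FIG. 5). Ranges are illustrative assumptions.
@dataclass
class TtsParams:
    pitch: float = 1.0       # relative pitch multiplier
    speed: float = 1.0       # speaking-rate multiplier
    height: float = 1.0      # overall voice height
    intonation: float = 1.0  # strength of intonation
    gender: str = "female"   # "female" or "male"
    voice_type: str = "narrator1"

    def validate(self) -> None:
        # Reject out-of-range values before they reach the TTS engine.
        for name in ("pitch", "speed", "height", "intonation"):
            value = getattr(self, name)
            if not 0.5 <= value <= 2.0:
                raise ValueError(f"{name} out of range: {value}")
        if self.gender not in ("female", "male"):
            raise ValueError(f"unknown gender: {self.gender}")
```

As the text notes, items could be added or removed from such a structure as appropriate.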
Returning to FIG. 4, when voice-adjustment parameters are input via the input unit 204 of the mobile terminal 200, the information processing unit 301 of the service providing server 300 stores (saves) this group of parameters in the storage unit 302 as individual TTS parameters (step S3A).
In this case, the individual TTS parameters are stored in the storage unit 302 in association with identification information (a user ID or terminal ID) that identifies the mobile terminal 200. This makes it possible to identify the individual TTS parameters based on the identification information.
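A minimal sketch of this per-user storage, assuming a simple in-memory mapping: one individual TTS parameter set is kept per identification key (user ID or terminal ID), and a later lookup by the same key retrieves it. The dict-based store is an assumption; the description requires only that the parameters be identifiable from the identification information.

```python
# Sketch of step S3A (save) and the lookup later used at download time:
# parameters are keyed by the identification information sent from the
# mobile terminal, and saving again overwrites the previous settings.
class ParamStore:
    def __init__(self):
        self._by_id = {}

    def save(self, ident: str, params: dict) -> None:
        self._by_id[ident] = dict(params)  # copy so callers can't mutate the store

    def lookup(self, ident: str) -> dict:
        # The server identifies a single individual parameter set from the
        # identification information supplied with the request.
        return self._by_id[ident]
```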
In this way, the user of the mobile terminal 200 can set desired parameters by operating his or her own mobile terminal 200 and save them on the service providing server 300. When setting these parameters (that is, while the voice adjustment program is running), the mobile terminal 200 does not need to be connected to the in-vehicle device 100. The user can therefore set the parameters not only inside the vehicle 10 but at any location outside it.
At the time of this parameter setting, the service providing server 300 may convert predetermined sample text data into voice data (voice conversion) based on the individual TTS parameters, send it to the mobile terminal 200, and have it played back on the mobile terminal 200 side. With this configuration, the user can quickly check the voice produced by the parameters he or she has set.
Furthermore, because the service providing server 300 has high processing capability, it can perform the voice conversion almost in real time, allowing the user to adjust the parameters while listening to the resulting voice.
FIG. 6 is a diagram showing the operation when the in-vehicle device 100 starts up. As a precondition, the in-vehicle device 100 and the mobile terminal 200 are assumed to be communicatively connected.
When the in-vehicle device 100 starts up, its information processing unit 101 causes the mobile terminal 200 to execute the communication program. The in-vehicle device 100 thereby accesses the service providing server 300 via the mobile terminal 200 and becomes able to communicate with the server 300.
In this case, the information processing unit 101 of the in-vehicle device 100 downloads, from the service providing server 300, voice data adjusted according to the individual TTS parameters (step S1B). The service providing server 300 may acquire the identification information (user ID or terminal ID) from the mobile terminal 200 and identify a single set of individual TTS parameters based on that identification information. The method is not limited to this; any known technique for identifying the individual TTS parameters corresponding to the mobile terminal 200 or the in-vehicle device 100 is broadly applicable.
Here, the above voice data is data obtained by voice-converting a group of fixed-phrase text data stored in the storage unit 302 of the service providing server 300. That is, the service providing server 300 uses its TTS engine to convert each piece of text data into voice data based on the individual TTS parameters, thereby generating voice data corresponding to the characters of each text in the voice adjusted by those parameters. This voice conversion processing may be executed when a download instruction is received from the in-vehicle device 100, or at a timing before the download instruction (for example, at a suitable timing after the individual TTS parameters are saved).
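The batch conversion of the fixed-phrase database can be sketched as below. `synthesize` is a stand-in for the real TTS engine, which the description does not specify; the phrase IDs and the byte encoding are likewise assumptions made for illustration.

```python
# Placeholder for the TTS engine: a real engine would return encoded audio
# for `text` rendered with the given pitch/speed/gender settings.
def synthesize(text: str, params: dict) -> bytes:
    return f"{params['voice_type']}|{text}".encode("utf-8")

# Sketch of the server-side conversion: every fixed-phrase text is
# synthesized once with the user's individual TTS parameters, producing
# the audio set the in-vehicle device downloads in step S1B.
def convert_fixed_phrases(phrases: dict, params: dict) -> dict:
    # phrases: phrase_id -> text; result: phrase_id -> audio bytes
    return {pid: synthesize(text, params) for pid, text in phrases.items()}
```

Because the result depends only on the texts and the saved parameters, this batch can run at any time after step S3A, matching the flexible timing described above.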
When the download is completed, the information processing unit 101 of the in-vehicle device 100 stores the voice data in the storage unit 102 and plays back the stored voice data via the voice input/output unit 107 at pre-specified timings (step S2B). Thus, for example, immediately after the download, phrases such as "Hello, ***", "Happy birthday", and "Let's drive safely today" can be played back, and at a timing such as arrival at a pre-registered home location, "Good work today" can be played back.
In this way, the service providing server 300 converts the text into the voice desired by the user of the mobile terminal 200, and that voice can be played back on the in-vehicle device 100.
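The "pre-specified timing" playback of step S2B can be pictured as an event-to-phrase mapping held by the in-vehicle device. The event names and the lookup scheme are assumptions for illustration only; the description says merely that stored audio is played at designated timings such as just after download or on arrival at home.

```python
# Sketch: downloaded audio is kept in storage, and each event (download
# finished, arrival at home, ...) triggers the phrase registered for it.
class PhrasePlayer:
    def __init__(self, audio_by_phrase: dict, phrase_for_event: dict):
        self.audio = audio_by_phrase      # phrase_id -> audio bytes
        self.schedule = phrase_for_event  # event name -> phrase_id
        self.played = []

    def on_event(self, event: str) -> None:
        pid = self.schedule.get(event)
        if pid is not None and pid in self.audio:
            self.played.append(pid)  # stand-in for sending audio to the speaker
```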
The user can also edit the text data stored in the service providing server 300 by operating the mobile terminal 200 and executing the editing program. In this case, login processing similar to step S1A shown in FIG. 4 is performed, and when the login is completed, the information processing unit 301 of the service providing server 300 displays a screen for editing text data (editing screen) on the screen of the mobile terminal 200.
FIG. 7 is a diagram showing an example of the editing screen.
The editing screen is a screen that accepts editing of the fixed-phrase text data corresponding to the voice content played back by the in-vehicle device 100. As shown in FIG. 7, the editing screen displays the group of text data stored in the storage unit 302 of the service providing server 300 and accepts edits to each piece of text data. The edited text data is stored in the storage unit 302 in association with the identification information (user ID or terminal ID) that identifies the mobile terminal 200.
In this way, the user of the mobile terminal 200 can edit the text data by operating his or her own mobile terminal 200 and save it on the service providing server 300. Since the mobile terminal 200 does not need to be connected to the in-vehicle device 100 during editing (while the editing program is running), the user can edit the text data at any location, such as outside the vehicle.
FIG. 8 is a diagram showing the operation when the user of the in-vehicle device 100 sets a destination using voice recognition. In this case as well, it is assumed as a precondition that the in-vehicle device 100 and the mobile terminal 200 are communicatively connected. The information processing unit 101 of the in-vehicle device 100 executes the conversation program in response to a user operation and waits, via the voice input/output unit 107, for input of voice specifying a destination; when voice is input, it transmits the corresponding voice data to the service providing server 300 via the mobile terminal 200.
The information processing unit 301 of the service providing server 300 performs voice recognition on the voice data received via the mobile terminal 200 and performs a POI search (a search of the POI database) using the voice recognition result as a search key (step S1C).
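Step S1C reduces to using the recognized text as a key over the POI database. A plain substring match stands in for whatever index the service providing server actually uses, which the description leaves open.

```python
# Sketch of the POI search in step S1C: the voice recognition result
# (`query`) is matched against POI names; each POI here is a
# (name, address) tuple, which is an illustrative assumption.
def poi_search(query: str, poi_db: list) -> list:
    return [poi for poi in poi_db if query in poi[0]]
```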
Next, the information processing unit 301 of the service providing server 300 uses the TTS engine to generate voice data that reads out the search results using the individual TTS parameters, and transmits it to the in-vehicle device 100 via the mobile terminal 200 (step S2C). Voice data in the voice adjusted by the individual TTS parameters, that is, in the voice desired by the user, is thereby generated and transmitted to the in-vehicle device 100.
Subsequently, the information processing unit 101 of the in-vehicle device 100 performs processing to play back the received voice data (step S3C). As a result, the in-vehicle device 100 plays back, in the voice desired by the user, announcements such as "There are XX search results", "First result ..., second result ...".
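Before synthesis in step S2C, the server must turn the POI hits into an announcement string. The wording below is modeled loosely on the example utterances in the text; the exact phrasing is an assumption.

```python
# Sketch of the readout generation in step S2C: the search results are
# flattened into one announcement string, which is then synthesized with
# the individual TTS parameters and sent to the in-vehicle device.
def build_readout(results: list) -> str:
    lines = [f"{len(results)} result(s) found."]
    for i, name in enumerate(results, start=1):
        lines.append(f"Result {i}: {name}")
    return " ".join(lines)
```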
Thereafter, when an appropriate candidate (destination) is selected by a user operation, the information processing unit 101 of the in-vehicle device 100 performs processing to search for a recommended route from the current location to the destination (destination setting) (step S4C).
When the recommended route is determined, the information processing unit 101 of the in-vehicle device 100 starts route guidance processing and plays back guidance voice such as "Starting route guidance ... turn right in 300 m ..." (step S5C).
In this case, the guidance voice is emitted in the voice adjusted by the service providing server 300 based on the individual TTS parameters. That is, the service providing server 300 converts the guidance-voice text data stored in the storage unit 302 into voice data in advance based on the individual TTS parameters and transmits it to the in-vehicle device 100 via the mobile terminal 200; the in-vehicle device 100 stores it in the storage unit 102, so that already-adjusted guidance voice can be played back on the in-vehicle device 100.
In this way, route guidance can be performed in the voice desired by the user. The timing at which the service providing server 300 performs the voice conversion and the timing at which the voice data is transmitted to the in-vehicle device 100 can each be set to any timing before the guidance voice is played back.
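The pre-conversion flow above amounts to a prefetch-then-play cache on the in-vehicle side: audio is synthesized and transferred at any convenient time, and guidance playback never waits on the server. The cache interface is an assumption sketched for illustration.

```python
# Sketch of pre-converted guidance audio: `prefetch` runs at any time
# before guidance (server synthesis + transfer via the mobile terminal),
# while `play` only reads local storage at guidance time.
class GuidanceCache:
    def __init__(self, synthesize):
        self.synthesize = synthesize  # injected stand-in for the server TTS
        self.cache = {}

    def prefetch(self, phrase_id: str, text: str, voice: str) -> None:
        self.cache[phrase_id] = self.synthesize(text, voice)

    def play(self, phrase_id: str):
        # Guidance must never block on synthesis here; only cached audio plays.
        return self.cache[phrase_id]
```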
As described above, according to the present embodiment, the mobile terminal 200 accepts input of individual TTS parameters, which are voice-adjustment parameters, via user operations; the service providing server 300 has the storage unit 302 that stores text data indicating utterance content, acquires the individual TTS parameters, and generates voice data from the text data based on those parameters; and the in-vehicle device 100 acquires the voice data and outputs voice based on it. The voice desired by the user can therefore be played back on the in-vehicle device 100 side without depending on the storage capacity or processing capability of the in-vehicle device 100. Moreover, because the voice conversion is performed by the service providing server 300, which has higher processing capability than the in-vehicle device 100, the adjustment range of the voice conversion can be made wider than when the conversion is performed by the in-vehicle device 100. These features make it possible to diversify the information provided to the user.
Furthermore, because the individual TTS parameters are set solely through communication between the mobile terminal 200 and the service providing server 300, they can be set without operating the in-vehicle device 100. The parameters can therefore be set at any location and at any timing, improving convenience.
If one were instead to set individual TTS parameters on the in-vehicle device 100 side, a dedicated menu screen and program would have to be prepared, making the setting dependent on the functions and performance of the in-vehicle device 100. In other words, in the present embodiment, parameter setting and the acquisition and playback of voice data adjusted to the desired voice can be performed without depending on the functions or performance of the in-vehicle device 100.
 Furthermore, since the individual TTS parameters include the pitch, speed, tone (height), and so on of the voice, the voice can easily be adjusted to the one the user desires.
 In addition, since the voice data is generated from text data, known voice conversion techniques can be widely applied, and the spoken content can easily be edited and created.
 Also, the information processing unit (terminal-side processing unit) 201 of the mobile terminal 200 accepts, through a user operation, edits to the text data stored in the service providing server 300, and the information processing unit (server-side processing unit) 301 of the service providing server 300 edits the text data to apply those edits, so the spoken content can easily be adjusted to what the user desires. In this case as well, the data can be edited without operating the in-vehicle device 100, so it can be edited at any place and at any time, which improves convenience and makes editing possible without depending on the functions and performance of the in-vehicle device 100.
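The edit path just described — the terminal accepts an edit through a user operation, and the server applies it to its stored text — can be sketched as follows. The class and method names are illustrative assumptions, not from the patent.

```python
class UtteranceStore:
    """Minimal stand-in for the server-side storage unit holding the
    utterance text (storage unit 302 in the embodiment); names are
    illustrative."""
    def __init__(self, text: str) -> None:
        self._text = text

    def apply_edit(self, new_text: str) -> None:
        # Invoked when the terminal-side processing unit forwards an
        # edit accepted through a user operation.
        self._text = new_text

    def current_text(self) -> str:
        return self._text

store = UtteranceStore("Good morning. Today's traffic is light.")
# The terminal sends the edited text; the in-vehicle device is not involved.
store.apply_edit("Good morning. Expect light traffic on your route.")
print(store.current_text())
```

Since the next synthesis request reads `current_text()`, the edited wording is what the in-vehicle device will eventually hear, without the device ever being operated.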
 The above-described embodiment is merely one aspect of the present invention and can be modified arbitrarily within the scope of the present invention. For example, in the above embodiment, the pitch, speed, tone (height), intonation, gender, and so on of the voice are adjustable, but the invention is not limited to this; it suffices that at least one of these is adjustable, and the adjustment items can be increased or decreased as appropriate.
 Also, in the above embodiment, text data is converted into voice data based on the individual TTS parameters. Alternatively, reference voice data may be stored in the service providing server 300 in advance, and the pitch, speed, height, and so on of this voice data may be adjusted based on the individual TTS parameters.
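The variant above starts from pre-stored reference audio rather than text. As one hedged illustration of adjusting such audio, the sketch below changes playback speed by naive linear-interpolation resampling of raw samples; a production server would use a pitch-preserving algorithm instead, and nothing here is taken from the patent.

```python
def change_speed(samples: list[float], speed: float) -> list[float]:
    """Naive speed adjustment of PCM samples via linear-interpolation
    resampling (also shifts pitch; shown only for illustration)."""
    if speed <= 0:
        raise ValueError("speed must be positive")
    n_out = max(1, int(len(samples) / speed))
    out = []
    for i in range(n_out):
        pos = i * speed            # position in the source signal
        lo = int(pos)
        hi = min(lo + 1, len(samples) - 1)
        frac = pos - lo
        out.append(samples[lo] * (1 - frac) + samples[hi] * frac)
    return out

# A tiny reference waveform standing in for stored server-side audio.
reference = [0.0, 0.5, 1.0, 0.5, 0.0, -0.5, -1.0, -0.5]
faster = change_speed(reference, 2.0)  # double speed: half as many samples
print(len(faster))  # 4
```

The same resampling idea, applied with a factor taken from the individual TTS parameters, is one way the server could realize the "adjust the reference voice data" variant.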
 Further, in the above-described embodiment, the mobile terminal 200 may additionally store an application program for reading e-mail aloud (hereinafter, a mail read-aloud program), and this program may be used to have the service providing server 300 generate voice data of an e-mail in a voice adjusted with the individual TTS parameters and have the in-vehicle device 100 reproduce it.
 FIG. 9 is a diagram illustrating an example of the operation when the mobile terminal 200 executes the mail read-aloud program.
 As illustrated in FIG. 9, when the information processing unit 201 of the mobile terminal 200 executes the program in response to a user operation, it transmits the text of the e-mail to the service providing server 300 (step S1D). Next, the information processing unit 301 of the service providing server 300 uses the TTS engine to generate voice data that reads the e-mail text aloud using the individual TTS parameters, and transmits the voice data to the in-vehicle device 100 via the mobile terminal 200 (step S2D). Subsequently, the information processing unit 101 of the in-vehicle device 100 reproduces the received voice data (step S3D). In this way, the in-vehicle device 100 can reproduce, in the voice the user desires, audio conveying the content of the e-mail, for example, "The start time of today's meeting has been changed."
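The three steps of FIG. 9 can be sketched as a simple pipeline. Each function stands in for one of steps S1D–S3D; the function names and the placeholder "TTS engine" are assumptions made for illustration only.

```python
def step_s1d_send_text(mail_text: str) -> str:
    """S1D: the mobile terminal forwards the mail text to the server."""
    return mail_text

def step_s2d_generate_voice(text: str, params: dict) -> dict:
    """S2D: the server renders the text with the user's individual TTS
    parameters (placeholder record, not a real TTS engine)."""
    return {"text": text, "params": params,
            "audio": f"<audio for {len(text)} chars>"}

def step_s3d_play(voice_data: dict) -> str:
    """S3D: the in-vehicle device reproduces the received voice data."""
    return f"playing: {voice_data['text']}"

mail = "The start time of today's meeting has been changed."
played = step_s3d_play(
    step_s2d_generate_voice(step_s1d_send_text(mail), {"pitch": 1.2}))
print(played)
```

The composition mirrors the figure: the terminal only relays text and audio, while synthesis stays on the server and playback stays in the vehicle.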
 In the above-described embodiment, the in-vehicle device 100 may have the various functions of the mobile terminal 200. For example, the in-vehicle device 100 may accept input of the individual TTS parameters, which are parameters for voice adjustment, through a user operation, and cause the service providing server 300 to acquire them. Further, the in-vehicle device 100 may include a communication unit for wirelessly communicating with the service providing server 300 and the like connected to a public communication network N1 such as the Internet, so that it can communicate wirelessly with the service providing server 300 and the like without going through the mobile terminal 200.
 Furthermore, the in-vehicle device 100 is not limited to one mounted on a four-wheeled vehicle such as an automobile, and may be mounted on a two-wheeled vehicle such as a bicycle.
DESCRIPTION OF SYMBOLS
1 Voice adjustment system
100 In-vehicle device
101 Information processing unit (in-vehicle side processing unit)
102, 202, 302 Storage unit
200 Mobile terminal
201 Information processing unit (terminal-side processing unit)
300 Service providing server
301 Information processing unit (server-side processing unit)

Claims (9)

  1.  A voice adjustment system comprising:
     a mobile terminal that accepts user operations;
     a server with which the mobile terminal can communicate; and
     an in-vehicle device that outputs voice based on voice data,
     wherein the mobile terminal has a terminal-side processing unit that accepts input of parameters for voice adjustment through the user operations,
     the server has a storage unit that stores data indicating utterance content, and a server-side processing unit that acquires the parameters and generates the voice data from the data based on the parameters, and
     the in-vehicle device has an in-vehicle side processing unit that acquires the voice data and outputs voice based on the acquired voice data.
  2.  The voice adjustment system according to claim 1, wherein the parameters include at least one of the pitch, speed, and height of the voice.
  3.  The voice adjustment system according to claim 1, wherein the data is text data.
  4.  The voice adjustment system according to claim 1, wherein the terminal-side processing unit accepts, through the user operations, edits to the data stored in the server, and
     the server-side processing unit edits the data to apply the edits accepted by the terminal-side processing unit.
  5.  A server capable of communicating with a mobile terminal, the server comprising:
     a storage unit that stores data indicating utterance content; and
     a server-side processing unit that acquires parameters for voice adjustment accepted by the mobile terminal through a user operation, and generates, from the data and based on the parameters, voice data of the utterance content to be reproduced by an in-vehicle device.
  6.  The server according to claim 5, wherein the parameters include at least one of the pitch, speed, and height of the voice.
  7.  The server according to claim 5, wherein the data is text data.
  8.  The server according to claim 5, wherein a terminal-side processing unit of the mobile terminal accepts, through the user operation, edits to the data stored in the server, and
     the server-side processing unit edits the data to apply the edits accepted by the terminal-side processing unit.
  9.  An in-vehicle device that outputs voice based on voice data, the in-vehicle device comprising:
     an in-vehicle side processing unit that acquires the voice data from a server, the server comprising a storage unit that stores data indicating utterance content and a server-side processing unit that acquires parameters for voice adjustment accepted by a mobile terminal through a user operation and generates the voice data from the data based on the parameters, and that outputs voice based on the acquired voice data.
PCT/JP2014/077446 2014-01-24 2014-10-15 Speech adjustment system, server, and in-vehicle device WO2015111256A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP2015558732A JPWO2015111256A1 (en) 2014-01-24 2014-10-15 Audio adjustment system, server, and in-vehicle device

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
JP2014011288 2014-01-24
JP2014-011288 2014-01-24

Publications (1)

Publication Number Publication Date
WO2015111256A1 true WO2015111256A1 (en) 2015-07-30

Family

ID=53681079

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/JP2014/077446 WO2015111256A1 (en) 2014-01-24 2014-10-15 Speech adjustment system, server, and in-vehicle device

Country Status (2)

Country Link
JP (1) JPWO2015111256A1 (en)
WO (1) WO2015111256A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2015155977A (en) * 2014-02-20 2015-08-27 シャープ株式会社 Voice synthesizer and control program

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH05233565A (en) * 1991-11-12 1993-09-10 Fujitsu Ltd Voice synthesization system
JPH1078792A (en) * 1996-07-12 1998-03-24 Konami Co Ltd Voice processing method, game system and recording medium
JP2004246129A (en) * 2003-02-14 2004-09-02 Arcadia:Kk Voice synthesis controller
JP2004295379A (en) * 2003-03-26 2004-10-21 Seiko Epson Corp Data providing system, data providing method, and data providing program
JP2005055607A (en) * 2003-08-01 2005-03-03 Toyota Motor Corp Server, information processing terminal and voice synthesis system
JP2006301059A (en) * 2005-04-18 2006-11-02 Denso Corp Voice output system
JP2006350091A (en) * 2005-06-17 2006-12-28 Nippon Telegr & Teleph Corp <Ntt> Voice synthesis method, voice synthesis information processing method, client terminal, voice synthesis information processing server, client terminal program, and voice synthesis information processing program
WO2012008437A1 (en) * 2010-07-13 2012-01-19 富士通テン株式会社 Information provision system and vehicle-mounted device

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP3974419B2 (en) * 2002-02-18 2007-09-12 株式会社日立製作所 Information acquisition method and information acquisition system using voice input



Also Published As

Publication number Publication date
JPWO2015111256A1 (en) 2017-03-23


Legal Events

Date Code Title Description
121 EP: the EPO has been informed by WIPO that EP was designated in this application

Ref document number: 14880258

Country of ref document: EP

Kind code of ref document: A1

ENP Entry into the national phase

Ref document number: 2015558732

Country of ref document: JP

Kind code of ref document: A

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 14880258

Country of ref document: EP

Kind code of ref document: A1