WO2018105064A1 - Voice data transfer method, voice data transfer system, and reception device - Google Patents

Voice data transfer method, voice data transfer system, and reception device Download PDF

Info

Publication number
WO2018105064A1
WO2018105064A1 PCT/JP2016/086410 JP2016086410W WO2018105064A1 WO 2018105064 A1 WO2018105064 A1 WO 2018105064A1 JP 2016086410 W JP2016086410 W JP 2016086410W WO 2018105064 A1 WO2018105064 A1 WO 2018105064A1
Authority
WO
WIPO (PCT)
Prior art keywords
data
restoration
divided
memory
predetermined
Prior art date
Application number
PCT/JP2016/086410
Other languages
French (fr)
Japanese (ja)
Inventor
浩平 真下
Original Assignee
株式会社日立製作所
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 株式会社日立製作所 filed Critical 株式会社日立製作所
Priority to PCT/JP2016/086410 priority Critical patent/WO2018105064A1/en
Priority to JP2018555387A priority patent/JP6668503B2/en
Publication of WO2018105064A1 publication Critical patent/WO2018105064A1/en

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/005Correction of errors induced by the transmission channel, if related to the coding algorithm

Definitions

  • the present invention relates to an audio data transfer method and system.
  • RTP / RTCP Real-time Transport Protocol / RTP Control Protocol
  • UDP User Datagram Protocol
  • the transport layer UDP protocol including the RTP protocol does not have a function of guaranteeing the integrity of data transfer by data retransmission or the like, unlike TCP, in order to improve data transfer efficiency.
  • Patent Document 1 is known as a technique for preventing data loss and realizing fault-tolerant data transfer even when a failure occurs on a communication path for voice data transfer.
  • transmission data is divided into a plurality of data by a transmission device, parity data is generated from actual data of transmission data, and is transmitted through different communication paths.
  • the receiving apparatus when the divided actual data or the actual data is lost on the communication path, the data restored from the parity data is combined and transferred to the subsequent apparatus.
  • the state of the communication path is not always bad, and it is conceivable that actual data communication is performed stably and parity data is not used. In this case, there is a problem that the calculation of parity data that is not actually used and the network bandwidth is compressed due to the transfer of parity data, and network resources are constantly consumed wastefully.
  • the present invention has been made in view of the above problems, and by controlling the necessity of transmission of parity data in accordance with the transfer state of audio data, the audio disturbance due to the deterioration of the communication state is minimized.
  • Another object of the present invention is to provide an audio data transfer system that can reduce the consumption of network resources.
  • the present invention includes a transmission device including a processor and a memory, a reception device including a processor and a memory, and a communication path connecting the transmission device and the reception device.
  • a voice data transfer method for transferring data wherein the transmitting device generates a plurality of divided data obtained by dividing the voice data, and transmits the divided data via different communication paths,
  • a third step a fourth step in which the receiving device determines whether or not the divided data stored in the memory is missing, and the receiving device However, when the divided data stored in the memory is free of data, a fifth step of combining the divided data to restore and output the audio data, and the receiving device stops the restoration data And when the divided data stored in the memory is missing, a sixth step of supplementing and outputting the audio data with predetermined substitute data, and the receiving device Monitoring a reception state, and when the reception state satisfies a predetermined start condition, a seventh step of shifting to the restoration data transmission mode and requesting the transmission device to start transmission of restoration data; In the first step, when the reception device requests the start of transmission of restoration data, the first step generates restoration data for the divided data and transmits it through a different communication path.
  • the second step includes storing the restoration data received from a different communication path in a memory, and the receiving device stores the restoration data transmission mode in the memory. If there is a deficiency in the divided data, the data includes restoration data stored in the memory and an eighth step of restoring and outputting audio data from the divided data.
  • the present invention it is not always necessary to ensure the integrity of the audio data in the transfer of the audio data for the purpose of finally recognizing the audio data by a person.
  • the restoration data parity data
  • FIG. 1 is a block diagram illustrating an example of an audio data transfer system according to an embodiment of the present invention.
  • the audio data transfer system includes a transmitting device 1 that transmits divided data obtained by dividing audio data, a receiving device 2 that combines and outputs divided audio data (divided data), and an independent network 3 that transfers the divided data. -1 and 3-2 and an independent network 3-3 for transferring the parity data DP generated by the transmission device 1 according to the communication state.
  • the networks 3-1 to 3-3 are configured with different communication paths. For all the network pairs, the symbol “3” is used in which “ ⁇ ” and subsequent characters are omitted. The same applies to the reference numerals of other components.
  • the transmission device 1 includes a processor 11 that performs arithmetic processing, a memory 12 that stores programs and data, a network I / F (interface) unit 13 connected to the network 3, and an I / F unit 14 that receives audio data. including.
  • a data dividing unit 110 that generates divided data D1 and D2 obtained by dividing audio data, and data generation for restoration that generates parity data DP of the divided data in response to a request from the receiving device 2
  • the unit 120 is loaded and executed by the processor 11.
  • Each functional unit of the data dividing unit 110 and the restoration data generating unit 120 is loaded into the memory 12 as a program.
  • the processor 11 operates as a functional unit that provides a predetermined function by performing processing according to a program of each functional unit.
  • the processor 11 functions as the data dividing unit 110 by performing processing according to the data dividing program. The same applies to other programs.
  • the processor 11 also operates as a functional unit that provides each function of a plurality of processes executed by each program.
  • a computer and a computer system are an apparatus and a system including these functional units.
  • Information such as programs and tables for realizing the functions of the data dividing unit 110 and the restoration data generating unit 120 are storage, nonvolatile semiconductor memory, hard disk drive, storage device such as SSD (Solid State Drive), or IC.
  • the data can be stored in a computer-readable non-transitory data storage medium such as a card, SD card, or DVD.
  • the receiving apparatus 2 includes a processor 21 that performs arithmetic processing, a memory 22 that stores programs and data, a network I / F (interface) unit 23 connected to the network 3, and an I / F unit 24 that outputs audio data. Including.
  • a reception buffer 210 for temporarily storing the received divided data D1 and D2 and the parity data DP is set. Further, in the memory 22, as will be described later, a data combining unit 220 that combines the divided data of the reception buffer 210, and data that generates compensation data when the divided data is missing and requests transmission start of parity data Compensation unit 230, path control unit 240 that detects the reception state and requests transmission or stop of parity data according to the reception state, and restoration data control that requests transmission device 1 to start or stop transmission of parity data The unit 250 is loaded and executed by the processor 21.
  • Each function unit of the data combination unit 220, the data compensation unit 230, the path control unit 240, and the restoration data control unit 250 is loaded into the memory 22 as a program.
  • the processor 21 operates as a functional unit that provides a predetermined function by performing processing according to a program of each functional unit.
  • the processor 21 functions as the data combining unit 220 by performing processing according to the data combining program. The same applies to other programs.
  • the processor 21 also operates as a function unit that provides each function of a plurality of processes executed by each program.
  • a computer and a computer system are an apparatus and a system including these functional units.
  • Information such as programs and tables for realizing the functions of the data combination unit 220 to the restoration data control unit 250 is a storage device such as a storage, nonvolatile semiconductor memory, hard disk drive, SSD (Solid State Drive), or an IC card. It can be stored in a computer-readable non-transitory data storage medium such as an SD card or a DVD.
  • a storage device such as a storage, nonvolatile semiconductor memory, hard disk drive, SSD (Solid State Drive), or an IC card. It can be stored in a computer-readable non-transitory data storage medium such as an SD card or a DVD.
  • the receiving buffer 210 receives the divided data received from the network 3-1, the receiving buffer 210-1 for storing the divided data received from the network 3-1, the receiving buffer 210-2 for storing the divided data received from the network 3-2, and the received from the network 3-3.
  • the I / F unit 14 of the transmission device 1 receives audio data from a device (not shown).
  • the data dividing unit 110 divides the audio data received by the I / F unit 14 into divided data D1 and divided data D2, and outputs the divided data to the network I / F unit 13 and the restoration data generating unit 120.
  • the data dividing unit 110 gives a sequence number obtained by adding a predetermined increment value to each received audio data, and gives the same sequence number (or ID) to the divided data D1 and D2 and the parity data DP.
  • the restoration data generation unit 120 When there is a transmission request for the parity data DP from the receiving device 2, the restoration data generation unit 120 generates the parity data DP of the divided data D ⁇ b> 1 and D ⁇ b> 2 and outputs it to the network I / F unit 13 as described later. To do.
  • the network I / F unit 13 transmits the divided data D1 and D2 and the parity data DP from different networks 3-1 to 3-3, respectively. However, the network I / F unit 13 transmits the divided data D1 and D2 from the networks 3-1 and 3-2, respectively, when the parity data DP is not generated. In the present embodiment, an example is shown in which the divided data D1 and D2 and the parity data DP are transmitted by UDP.
  • the network I / F unit 23 sends the divided data D1 and D2 and the parity data DP received from the plurality of communication paths (networks 3-1 to 3-3) to the reception buffers 210-1 to 210-3. Store each one.
  • the data combining unit 220 combines the divided data D1 and D2 having the same sequence number to restore the audio data, generates the audio data, and outputs the audio data to the I / F unit 24 and the data compensation unit 230.
  • the data combining unit 220 determines that the parity data DP and the divided data are The audio data is restored and output to the I / F unit 24 and the data compensation unit 230.
  • the data compensation unit 230 holds the restored audio data as the previous (previous) audio data.
  • the data compensation unit 230 generates compensation data when at least one of the divided data D1 and D2 is missing and there is no parity data DP.
  • the compensation data generated by the data compensation unit 230 can be used, for example, by replacing the previously restored audio data with the data restored this time. However, when the data compensation unit 230 generates compensation data for a predetermined number of times (for example, 100 times) or more within a predetermined time (for example, several tens of seconds), the reception state has deteriorated by satisfying a predetermined start condition. And the restoration data control unit 250 is requested to start transmission of parity data. Note that the restoration data to be substituted is not limited to the previous audio data, and a known or well-known method such as silent data can be used.
  • the path control unit 240 monitors the reception state of the reception buffer 210, and when the reception state deteriorates or detects a sign of failure, requests the restoration data control unit 250 to start transmission of parity data. Further, when the reception state is improved (recovered), the path control unit 240 requests the restoration data control unit 250 to stop transmission of parity data.
  • the path control unit 240 of this embodiment monitors the amount of data received within a predetermined time for each of the reception buffers 210-1 and 210-2, and the fluctuation of the data amount within a predetermined time (for example, 1 second) ( (Reception fluctuation) is calculated, and a reception state is determined by comparing the reception threshold with a preset threshold value.
  • the path control unit 240 compares the data amount within a predetermined time of the reception buffers 210-1 and 210-2 with two preset threshold values (where the first threshold value ⁇ the second threshold value), It is determined that the reception state has deteriorated when the amount of data in the data becomes less than the first threshold. In this case, it can be estimated that the amount of data within a predetermined time has decreased due to the occurrence of a failure in the network 3-1 or 3-2 of the communication path.
  • the path control unit 240 detects a sign of deterioration of the reception state or a failure when the amount of data within a predetermined time exceeds the second threshold. In this case, it can be estimated that the data temporarily stays due to the occurrence of a problem in the network 3-1 or 3-2 of the communication path, and the staying data is transferred at once.
  • the path control unit 240 can suppress deterioration of the voice data by transmitting a parity data transmission start request to the restoration data control unit 250.
  • the path control unit 240 can reduce resource consumption of the voice data transfer system by transmitting a request to stop transmission of parity data to the restoration data control unit 250.
  • the reception device 2 requests the transmission device 1 to start transmission of parity data when detecting a deterioration of the reception state (a sign of deterioration), and transmits the parity data to the transmission device 1 when the reception state is improved. Request a stop. Further, by transferring the divided data D1 and D2 and the parity data DP via different networks 3-1 to 3-3, it is possible to make the audio data transfer redundant.
  • FIG. 2 is a sequence diagram showing an example of voice data communication processing performed in the voice data transfer system.
  • the transmission apparatus 1 transmits the divided data D1 and D2 obtained by dividing the audio data to the reception apparatus 2 (S1).
  • the receiving device 2 shifts to the parity data transmission mode when the divided data D1 and D2 are lost and cannot be compensated, or when the reception state is deteriorated (increase in reception fluctuation) (S2) ( S3), a transmission start request for parity data is transmitted to the transmission apparatus 1 (S4).
  • the transmission device 1 Upon receiving the parity data transmission start request, the transmission device 1 shifts to the parity data transmission mode (S5), and starts generating and transmitting the parity data DP (S6).
  • reception apparatus 2 When there is no loss of the divided data D1 and D2 over a certain time and the reception apparatus 2 detects an improvement in reception state (decrease in fluctuation) (S7), the reception apparatus 2 shifts to the parity data stop mode (S8). ), A transmission stop request for parity data is transmitted to the transmission apparatus 1 (S9).
  • the transmission apparatus 1 Upon receiving the parity data transmission stop request (S10), the transmission apparatus 1 shifts to the parity data stop mode, stops generating and transmitting the parity data DP, and generates and transmits the divided data D1 and D2 ( S11).
  • the receiving device 2 has two determination criteria for determining the transition to the parity transmission mode.
  • the first is a case where the receiving device 2 cannot combine the divided data D1 and D2 and performs data compensation a predetermined number of times (frequency threshold, for example, 100 times) or more within a certain time (several tens of seconds).
  • the other case is when the receiving apparatus 2 detects an excessive fluctuation in the amount of data in the receiving buffer 210.
  • the probability that the receiving apparatus 2 cannot combine the divided data is the sum of the two communication paths.
  • the quality of the audio data can be maintained by starting the transmission of parity data.
  • FIG. 3 is a flowchart illustrating an example of the data combining process performed by the receiving device 2 data combining unit 220. This process is executed at a predetermined cycle (for example, several tens of milliseconds).
  • the data combining unit 220 identifies the divided data D1 and D2 to be combined next by the sequence number, and acquires data from the respective reception buffers 210-1 and 210-2 (S21).
  • the sequence number is calculated by adding a predetermined value to the previous sequence number.
  • the data combining unit 220 determines whether or not the divided data D1 and D2 to be combined are aligned in the reception buffers 210-1 and 210-2 (S22). If the divided data D1 and D2 are ready, the process proceeds to step S26, and the data combining unit 220 combines the divided data D1 and D2 to restore the audio data.
  • the restored audio data is recorded as the immediately preceding audio data by the data compensation unit 230 and transmitted from the I / F unit 24 to a subsequent device (not shown) (S27).
  • the data combining unit 220 determines whether or not the parity data DP is received (S23). The data combining unit 220 proceeds to step S25 if the parity data DP corresponding to the above sequence number exists in the reception buffer 210-3, and proceeds to step S24 if not received.
  • the data combining unit 220 restores the missing divided data D1 and D2 from the other received divided data and the parity data DP (S25). Thereafter, the data combining unit 220 restores the audio data using the divided data restored in step S25 (S26).
  • the restored audio data is recorded as the immediately preceding audio data by the data filling unit 230 and output from the I / F unit 24.
  • the data combining unit 220 determines that the data is missing and notifies the data supplementing unit 230 of the data missing. In this case, audio data is not output to the subsequent apparatus.
  • FIG. 4 is a flowchart showing an example of the data compensation process performed by the data compensation unit 230 of the receiving device 2. This process is executed after step S24 of FIG.
  • the data compensation unit 230 determines whether or not data loss is detected (S31). That is, if the data filling unit 230 has received a data loss notification from the data combining unit 220, the data filling unit 230 determines that the data combining unit 220 has detected a data loss, and proceeds to step S32. Proceed to S36.
  • the data compensation unit 230 replaces the audio data restored immediately before with the audio data of the current sequence number (S32), and transmits it to the I / F unit 24 (S33).
  • the data compensation unit 230 determines whether or not data compensation has been performed a predetermined number of times (for example, 100 times) or more within a predetermined time (for example, several tens of seconds) (S34).
  • the data compensation unit 230 determines that the state of the communication path (networks 3-1 and 3-2) has deteriorated when data compensation is performed a predetermined number of times (frequency threshold) within a certain time, and parity data
  • the restoration data control unit 250 is requested to shift to the transmission mode, the number of data compensation is reset, and the process is terminated (S35).
  • the data compensation unit 230 increments the number of data compensation and ends the process.
  • step S31 determines whether a data loss has been detected within a predetermined time (S36). If no data loss is detected within a certain time, the data compensation unit 230 determines that a predetermined stop condition is satisfied, and shifts to the parity data stop mode. Therefore, the data compensation unit 230 requests the restoration data control unit 250 to shift to the parity data stop mode, resets the number of data compensations, and ends the process (S37).
  • the data compensation unit 230 performs data compensation a predetermined number of times or more within a predetermined time, the frequency of data compensation becomes equal to or greater than a predetermined compensation threshold, and the reception state deteriorates, and a predetermined start condition is established. And the restoration data control unit 250 is requested to shift to the parity data transmission mode.
  • the data filling unit 230 determines that a predetermined stop condition has been satisfied, and requests the restoration data control unit 250 to shift to the parity stop mode. Note that, when data loss is detected within a certain time, the data compensation unit 230 does not shift to the parity data stop mode but maintains the parity data transmission mode.
  • the audio data does not necessarily need to be completely restored, and if it is restored to some extent, it has a characteristic that the user on the receiving side can estimate the audio data based on the contents of the previous and next audio data. That is, in audio streaming transfer, it is not always necessary to ensure the integrity of transferred audio data.
  • the receiving device 2 does not necessarily need to request a transmission request for parity data, which is restoration data, by performing a data compensation process using the immediately preceding audio data after detecting a data loss.
  • a transmission request for parity data which is restoration data
  • unnecessary communication can be reduced.
  • the receiving device 2 requests the restoration data control unit 250 to shift to the parity data stop mode when data loss is not detected within a certain time. Thereby, after the transmission quality of the communication path is restored, the resource consumption of the voice data transfer system can be reduced by stopping the transmission of the parity data.
  • FIG. 5 is a flowchart illustrating an example of a route control process performed by the route control unit 240 of the receiving device 2. This process is executed at a predetermined cycle (for example, 1 second).
  • the path control unit 240 calculates the data amount (reception fluctuation) within a predetermined time (for example, 1 second) for each of the reception buffers 210-1 and 210-2, and calculates two threshold values (first threshold value ⁇ first threshold value). 2 threshold). The path control unit 240 determines that the reception state is normal if the reception fluctuation is equal to or greater than the first threshold value and equal to or less than the second threshold value, and otherwise determines that an abnormality (a sign of failure) has been detected.
  • the path control unit 240 determines that a predetermined stop condition has been satisfied, proceeds to step S42, and requests the restoration data control unit 250 to stop transmission of parity data.
  • the path control unit 240 proceeds to step S43 and stores the parity data in the restoration data control unit 250. Request to start sending.
  • the reception state may be determined only with the first threshold value.
  • the path control unit 240 determines that the predetermined start condition is satisfied, and is greater than or equal to the first threshold value. If there is, it is determined that a predetermined stop condition is satisfied.
  • FIG. 6 is a flowchart illustrating an example of the mode control process performed by the restoration data control unit 250 of the receiving device 2. This process is executed at a predetermined cycle (for example, 1 second).
  • the restoration data control unit 250 determines whether or not the current mode is the parity data transmission mode. If the current mode is the parity data transmission mode, the process proceeds to step S52. If the current mode is the parity data stop mode, the process proceeds to step S55.
  • step S52 of the parity data transmission mode it is determined whether the restoration data control unit 250 has received a parity data stop request from both the path control unit 240 and the data compensation unit 230, and a parity data stop request is received from both sides. If YES in step S53, the process advances to step S53. If not, the parity data transmission mode is maintained and the process ends.
  • step S53 the restoration data control unit 250 transmits a parity data transmission stop request to the transmission device 1.
  • step S54 the restoration data control unit 250 shifts to the parity data stop mode and ends the process.
  • step S55 it is determined in step S55 whether or not a parity data transmission start request has been received from either the path control unit 240 or the data compensation unit 230. If a parity data transmission start request is accepted, the process proceeds to step S56. If not, the parity data stop mode is maintained and the process ends.
  • step S56 the restoration data control unit 250 transmits a parity data transmission start request to the transmission device 1.
  • step S57 the restoration data control unit 250 shifts to the parity data transmission mode and ends the process.
  • the restoration data control unit 250 accepts a request to start or stop transmission of parity data from the data compensation unit 230 and the path control unit 240, and based on a predetermined criterion, the mode of parity data To control. That is, the restoration data control unit 250 shifts to the parity transmission mode when it receives a shift to the parity transmission mode from either the path control unit 240 or the data compensation unit 230. On the other hand, the restoration data control unit 250 shifts to the parity data stop mode when it receives a shift to the parity data stop mode from both the path control unit 240 and the data compensation unit 230.
  • the mode when a deterioration of the reception state or a sign of failure is detected, the mode immediately shifts to the parity data transmission mode to suppress voice data loss. Then, the parity data transmission mode can be maintained until the reception state is stabilized by shifting to the parity data stop mode after the frequency of data compensation is reduced and the reception state returns to normal.
  • the audio data does not necessarily need to be completely restored, and if it is restored to some extent, it has a characteristic that the user on the receiving side can estimate the audio data based on the contents of the previous and next audio data. In transferring audio data, it is not always necessary to ensure the integrity of the audio data.
  • the data loss of the divided data D1 and D2 is detected, and when there is no restoration data (parity data), the immediately preceding audio data is substituted as the restored data, so that the restoration data is not necessarily obtained. There is no need to request transmission of parity. Thereby, since the transmission frequency of the parity data can be reduced for the transmission device 1, unnecessary communication can be reduced.
  • the restoration data is transmitted.
  • the transmission apparatus 1 is requested to start and shift to the parity data transmission mode, and the audio data is restored from the parity data to ensure the quality of the audio data. Then, the frequency of data compensation is reduced, and the resource consumption of the voice data transfer system can be reduced by shifting to the parity data stop mode after the reception state returns to normal.
  • this invention is not limited to the above-mentioned Example, Various modifications are included.
  • the above-described embodiments are described in detail for easy understanding of the present invention, and are not necessarily limited to those having all the configurations described.
  • a part of the configuration of one embodiment can be replaced with the configuration of another embodiment, and the configuration of another embodiment can be added to the configuration of one embodiment.
  • any of the additions, deletions, or substitutions of other configurations can be applied to a part of the configuration of each embodiment, either alone or in combination.
  • each of the above-described configurations, functions, processing units, processing means, and the like may be realized by hardware by designing a part or all of them with, for example, an integrated circuit.
  • each of the above-described configurations, functions, and the like may be realized by software by the processor interpreting and executing a program that realizes each function.
  • Information such as programs, tables, and files for realizing each function can be stored in a memory, a hard disk, a recording device such as an SSD (Solid State Drive), or a recording medium such as an IC card, an SD card, or a DVD.
  • control lines and information lines indicate what is considered necessary for the explanation, and not all the control lines and information lines on the product are necessarily shown. Actually, it may be considered that almost all the components are connected to each other.

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Communication Control (AREA)
  • Detection And Prevention Of Errors In Transmission (AREA)
  • Data Exchanges In Wide-Area Networks (AREA)

Abstract

This reception device for receiving voice data includes: a network interface unit that stores, in a memory, divided data and restoration data received from different communication paths; a data coupling unit that determines whether there are defects in the divided data, couples the divided data if there are no defects in the divided data to restore the voice data, and, if there are defects in the divided data and if in a restoration data transmission mode, restores the voice data from the restoration data and the divided data, and outputs the voice data; a data compensation unit that compensates the voice data using substitute data, if there are defects in the divided data and if in a restoration data stopping mode; a path control unit that monitors the reception state of the divided data, and determines whether the reception state satisfies a prescribed start condition; and a restoration data control unit that, when the prescribed start condition is satisfied, switches to the restoration data transmission mode, and requests that transmission of the restoration data is started.

Description

音声データ転送方法、音声データ転送システム及び受信装置Audio data transfer method, audio data transfer system, and receiver
 本発明は、音声データの転送方法及びシステムに関する。 The present invention relates to an audio data transfer method and system.
 音声データを転送するプロトコルとしては、UDP(User Datagram Protocol)上で利用されるRTP/RTCP(Real-time Transport Protocol/RTP Control Protocol)プロトコルが知られている。RTPプロトコルを含むトランスポート層のUDPプロトコルでは、データの転送効率を向上させるために、TCPのようにデータ再送等によりデータ転送の完全性を保証する機能はない。 As a protocol for transferring voice data, RTP / RTCP (Real-time Transport Protocol / RTP Control Protocol) protocol used on UDP (User Datagram Protocol) is known. The transport layer UDP protocol including the RTP protocol does not have a function of guaranteeing the integrity of data transfer by data retransmission or the like, unlike TCP, in order to improve data transfer efficiency.
 音声のデータ転送に対して、通信経路上に障害が発生した場合においてもデータ損失を防止し、フォールトトレラントなデータ転送を実現する技術として、例えば特許文献1が知られている。特許文献1では、送信装置で送信データを複数に分割し、送信データの実データからパリティデータを生成してそれぞれ異なる通信経路で送信する。そして、受信装置では分割された実データ、または実データが通信経路上で損失した場合にはパリティデータから復元したデータを結合して後段の装置へ転送している。 For example, Patent Document 1 is known as a technique for preventing data loss and realizing fault-tolerant data transfer even when a failure occurs on a communication path for voice data transfer. In Patent Document 1, transmission data is divided into a plurality of data by a transmission device, parity data is generated from actual data of transmission data, and is transmitted through different communication paths. In the receiving apparatus, when the divided actual data or the actual data is lost on the communication path, the data restored from the parity data is combined and transferred to the subsequent apparatus.
特開2015-149647号公報JP2015-149647A
 しかしながら、通信経路の状態は常に悪いわけではなく、実データの通信が安定的に行われて、パリティデータが使用されない状態が続くということが考えられる。この場合、実際には使用されないパリティデータの算出処理や、パリティデータの転送によるネットワーク帯域の圧迫が発生し、定常的にネットワークのリソースを無駄に消費する、という問題があった。 However, the state of the communication path is not always bad, and it is conceivable that actual data communication is performed stably and parity data is not used. In this case, there is a problem that the calculation of parity data that is not actually used and the network bandwidth is compressed due to the transfer of parity data, and network resources are constantly consumed wastefully.
 そこで本発明は、上記問題点に鑑みてなされたもので、音声データの転送状態に応じてパリティデータの送信の要否を制御することで、通信状態の悪化による音声の乱れを最小限に抑えて、かつネットワークのリソースの消費を低減可能な音声データの転送システムを提供することを目的とする。 Therefore, the present invention has been made in view of the above problems, and by controlling the necessity of transmission of parity data in accordance with the transfer state of audio data, the audio disturbance due to the deterioration of the communication state is minimized. Another object of the present invention is to provide an audio data transfer system that can reduce the consumption of network resources.
 本発明は、プロセッサとメモリを含む送信装置と、プロセッサとメモリを含む受信装置と、前記送信装置と前記受信装置を接続する通信経路とを含んで、前記送信装置と前記受信装置の間で音声データを転送する音声データ転送方法であって、前記送信装置が、音声データを分割した複数の分割データを生成し、異なる通信経路を介して前記分割データをそれぞれ送信する第1のステップと、前記受信装置が、前記異なる通信経路から受信した前記分割データをそれぞれ前記メモリに格納する第2のステップと、前記受信装置が、復元用データ送信モードと復元用データ停止モードのいずれであるかを判定する第3のステップと、前記受信装置が、前記メモリに格納された分割データの欠損の有無を判定する第4のステップと、前記受信装置が、前記メモリに格納された分割データに欠損がない場合には、前記分割データを結合して前記音声データを復元して出力する第5のステップと、前記受信装置が、前記復元用データ停止モードで、かつ、前記メモリに格納された分割データに欠損がある場合には、所定の代用データで前記音声データを補填して出力する第6のステップと、前記受信装置が、前記分割データの受信状態を監視して、前記受信状態が所定の開始条件を満たしたときには、前記復元用データ送信モードに移行して前記送信装置へ復元用データの送信開始を要求する第7のステップと、を含み、前記第1のステップは、前記受信装置から復元用データの送信開始を要求されたときには、前記分割データの復元用データを生成して、異なる通信経路から送信するステップを含み、前記第2のステップは、異なる通信経路から受信した前記復元用データをメモリに格納するステップを含み、前記受信装置が、前記復元用データ送信モードで、かつ、前記メモリに格納された分割データに欠損がある場合には、前記メモリに格納された復元用データと前記分割データから音声データを復元して出力する第8のステップと、を含む。 The present invention includes a transmission device including a processor and a memory, a reception device including a processor and a memory, and a communication path connecting the transmission device and the reception device. A voice data transfer method for transferring data, wherein the transmitting device generates a plurality of divided data obtained by dividing the voice data, and transmits the divided data via different communication paths, A second step in which the receiving device stores the divided data received from the different communication paths in the memory, and determines whether the receiving device is in a restoration data transmission mode or a restoration data stop mode. A third step, a fourth step in which the receiving device determines whether or not the divided data stored in the memory is missing, and the receiving device However, when the divided data stored in the memory is free of data, a fifth step of combining the divided data to restore and output the audio data, and the receiving device stops the restoration data And when the divided data stored in the memory is missing, a sixth step of supplementing and outputting the audio data with predetermined substitute data, and the receiving device Monitoring a reception state, and when the reception state satisfies a predetermined start condition, a seventh step of shifting to the restoration data transmission mode and requesting the transmission device to start transmission of restoration data; In the first step, when the reception device requests the start of transmission of restoration data, the first step generates restoration data for the divided data and transmits it through a different communication path. The second step includes storing the restoration data received from a different communication path in a memory, and the receiving device stores the restoration data transmission mode in the memory. If there is a deficiency in the divided data, the data includes restoration data stored in the memory and an eighth step of restoring and outputting audio data from the divided data.
 本発明によれば、最終的には人が音声データを認識することを目的とした音声データの転送においては必ずしも音声データの完全性が担保される必要がないため、音声データの受信状況に応じて復元用データ(パリティデータ)の送信要否を制御することで、通信品質の悪化による音声の乱れを最小限に抑えて、かつネットワークのリソースの消費を低減することが可能となる。 According to the present invention, it is not always necessary to ensure the integrity of the audio data in the transfer of the audio data for the purpose of finally recognizing the audio data by a person. By controlling whether or not the restoration data (parity data) needs to be transmitted, it is possible to minimize the voice disturbance due to the deterioration of the communication quality and reduce the consumption of network resources.
本発明の実施例を示し、音声データ転送システムの一例を示すブロック図である。It is a block diagram which shows the Example of this invention and shows an example of an audio | voice data transfer system. 本発明の実施例を示し、音声データ転送システムで行われる音声データの通信処理の一例を示すシーケンス図である。It is a sequence diagram which shows the Example of this invention and shows an example of the communication process of the audio | voice data performed with the audio | voice data transfer system. 本発明の実施例を示し、受信装置で行われるデータ結合処理の一例を示すフローチャートである。It is a flowchart which shows the Example of this invention and shows an example of the data combination process performed with a receiver. 本発明の実施例を示し、受信装置で行われるデータ補填処理の一例を示すフローチャートである。It is a flowchart which shows the Example of this invention and shows an example of the data compensation process performed with a receiver. 本発明の実施例を示し、受信装置で行われる経路制御処理の一例を示すフローチャートである。It is a flowchart which shows the Example of this invention and shows an example of the path control process performed with a receiver. 本発明の実施例を示し、受信装置で行われるモード制御処理の一例を示すフローチャートである。It is a flowchart which shows the Example of this invention and shows an example of the mode control process performed with a receiver.
 以下、本発明の一実施例について添付図面を用いて説明する。 Hereinafter, an embodiment of the present invention will be described with reference to the accompanying drawings.
 図1は、本発明の実施例を示し、音声データ転送システムの一例を示すブロック図である。音声データ転送システムは、音声データを分割した分割データを送信する送信装置1と、分割された音声データ(分割データ)を結合して出力する受信装置2と、分割データを転送する独立したネットワーク3-1、3-2と、通信状態に応じて送信装置1が生成したパリティデータDPを転送する独立したネットワーク3-3を含む。ネットワーク3-1~3-3はそれぞれ異なる通信経路で構成される。なお、ネットワーク全対については「-」以降を省略した符号「3」を用いる。なお、他の構成要素の符号についても同様とする。 FIG. 1 is a block diagram illustrating an example of an audio data transfer system according to an embodiment of the present invention. The audio data transfer system includes a transmitting device 1 that transmits divided data obtained by dividing audio data, a receiving device 2 that combines and outputs divided audio data (divided data), and an independent network 3 that transfers the divided data. -1 and 3-2 and an independent network 3-3 for transferring the parity data DP generated by the transmission device 1 according to the communication state. The networks 3-1 to 3-3 are configured with different communication paths. For all the network pairs, the symbol “3” is used in which “−” and subsequent characters are omitted. The same applies to the reference numerals of other components.
 送信装置1は、演算処理を行うプロセッサ11と、プログラムやデータを保持するメモリ12と、ネットワーク3に接続されるネットワークI/F(インタフェース)部13と、音声データを受け付けるI/F部14とを含む。 The transmission device 1 includes a processor 11 that performs arithmetic processing, a memory 12 that stores programs and data, a network I / F (interface) unit 13 connected to the network 3, and an I / F unit 14 that receives audio data. including.
 メモリ12には、後述するように音声データを分割した分割データD1、D2を生成するデータ分割部110と、受信装置2からの要求に応じて分割データのパリティデータDPを生成する復元用データ生成部120がロードされ、プロセッサ11によって実行される。 In the memory 12, as will be described later, a data dividing unit 110 that generates divided data D1 and D2 obtained by dividing audio data, and data generation for restoration that generates parity data DP of the divided data in response to a request from the receiving device 2 The unit 120 is loaded and executed by the processor 11.
 データ分割部110と、復元用データ生成部120の各機能部はプログラムとしてメモリ12にロードされる。プロセッサ11は、各機能部のプログラムに従って処理することによって、所定の機能を提供する機能部として稼働する。例えば、プロセッサ11は、データ分割プログラムに従って処理することでデータ分割部110として機能する。他のプログラムについても同様である。さらに、プロセッサ11は、各プログラムが実行する複数の処理のそれぞれの機能を提供する機能部としても稼働する。計算機及び計算機システムは、これらの機能部を含む装置及びシステムである。 Each functional unit of the data dividing unit 110 and the restoration data generating unit 120 is loaded into the memory 12 as a program. The processor 11 operates as a functional unit that provides a predetermined function by performing processing according to a program of each functional unit. For example, the processor 11 functions as the data dividing unit 110 by performing processing according to the data dividing program. The same applies to other programs. Furthermore, the processor 11 also operates as a functional unit that provides each function of a plurality of processes executed by each program. A computer and a computer system are an apparatus and a system including these functional units.
 データ分割部110や、復元用データ生成部120の各機能を実現するプログラム、テーブル等の情報は、ストレージや不揮発性半導体メモリ、ハードディスクドライブ、SSD(Solid State Drive)等の記憶デバイス、または、ICカード、SDカード、DVD等の計算機読み取り可能な非一時的データ記憶媒体に格納することができる。 Information such as programs and tables for realizing the functions of the data dividing unit 110 and the restoration data generating unit 120 are storage, nonvolatile semiconductor memory, hard disk drive, storage device such as SSD (Solid State Drive), or IC. The data can be stored in a computer-readable non-transitory data storage medium such as a card, SD card, or DVD.
 受信装置2は、演算処理を行うプロセッサ21と、プログラムやデータを保持するメモリ22と、ネットワーク3に接続されるネットワークI/F(インタフェース)部23と、音声データを出力するI/F部24とを含む。 The receiving apparatus 2 includes a processor 21 that performs arithmetic processing, a memory 22 that stores programs and data, a network I / F (interface) unit 23 connected to the network 3, and an I / F unit 24 that outputs audio data. Including.
 メモリ22には、受信した分割データD1、D2とパリティデータDPを一時的に格納する受信バッファ210が設定される。また、メモリ22には、後述するように受信バッファ210の分割データを結合するデータ結合部220と、分割データに欠損がある場合には補填データを生成し、パリティデータの送信開始を要求するデータ補填部230と、受信状態を検出して受信状態に応じてパリティデータの送信または停止を要求する経路制御部240と、送信装置1にパリティデータの送信開始または送信停止を要求する復元用データ制御部250がロードされてプロセッサ21によって実行される。 In the memory 22, a reception buffer 210 for temporarily storing the received divided data D1 and D2 and the parity data DP is set. Further, in the memory 22, as will be described later, a data combining unit 220 that combines the divided data of the reception buffer 210, and data that generates compensation data when the divided data is missing and requests transmission start of parity data Compensation unit 230, path control unit 240 that detects the reception state and requests transmission or stop of parity data according to the reception state, and restoration data control that requests transmission device 1 to start or stop transmission of parity data The unit 250 is loaded and executed by the processor 21.
 データ結合部220と、データ補填部230と、経路制御部240と、復元用データ制御部250の各機能部はプログラムとしてメモリ22にロードされる。プロセッサ21は、各機能部のプログラムに従って処理することによって、所定の機能を提供する機能部として稼働する。例えば、プロセッサ21は、データ結合プログラムに従って処理することでデータ結合部220として機能する。他のプログラムについても同様である。さらに、プロセッサ21は、各プログラムが実行する複数の処理のそれぞれの機能を提供する機能部としても稼働する。計算機及び計算機システムは、これらの機能部を含む装置及びシステムである。 Each function unit of the data combination unit 220, the data compensation unit 230, the path control unit 240, and the restoration data control unit 250 is loaded into the memory 22 as a program. The processor 21 operates as a functional unit that provides a predetermined function by performing processing according to a program of each functional unit. For example, the processor 21 functions as the data combining unit 220 by performing processing according to the data combining program. The same applies to other programs. Furthermore, the processor 21 also operates as a function unit that provides each function of a plurality of processes executed by each program. A computer and a computer system are an apparatus and a system including these functional units.
 データ結合部220~復元用データ制御部250の各機能を実現するプログラム、テーブル等の情報は、ストレージや不揮発性半導体メモリ、ハードディスクドライブ、SSD(Solid State Drive)等の記憶デバイス、または、ICカード、SDカード、DVD等の計算機読み取り可能な非一時的データ記憶媒体に格納することができる。 Information such as programs and tables for realizing the functions of the data combination unit 220 to the restoration data control unit 250 is a storage device such as a storage, nonvolatile semiconductor memory, hard disk drive, SSD (Solid State Drive), or an IC card. It can be stored in a computer-readable non-transitory data storage medium such as an SD card or a DVD.
 受信バッファ210は、ネットワーク3-1から受信した分割データを格納する受信バッファ210-1と、ネットワーク3-2から受信した分割データを格納する受信バッファ210-2と、ネットワーク3-3から受信したパリティデータを格納する受信バッファ210-3とを含む。 The receiving buffer 210 receives the divided data received from the network 3-1, the receiving buffer 210-1 for storing the divided data received from the network 3-1, the receiving buffer 210-2 for storing the divided data received from the network 3-2, and the received from the network 3-3. A reception buffer 210-3 for storing parity data.
 送信装置1のI/F部14が音声データを図示しない装置から受け付ける。データ分割部110は、I/F部14が受け付けた音声データを分割データD1と分割データD2に分割し、ネットワークI/F部13と復元用データ生成部120へ出力する。 The I / F unit 14 of the transmission device 1 receives audio data from a device (not shown). The data dividing unit 110 divides the audio data received by the I / F unit 14 into divided data D1 and divided data D2, and outputs the divided data to the network I / F unit 13 and the restoration data generating unit 120.
 データ分割部110は、受け付けた音声データ毎に所定の増分値を加算したシーケンス番号を付与し、分割データD1、D2とパリティデータDPに同一のシーケンス番号(またはID)を付与する。 The data dividing unit 110 gives a sequence number obtained by adding a predetermined increment value to each received audio data, and gives the same sequence number (or ID) to the divided data D1 and D2 and the parity data DP.
 復元用データ生成部120は、受信装置2からパリティデータDPの送信要求がある場合には、後述するように、分割データD1、D2のパリティデータDPを生成してネットワークI/F部13に出力する。 When there is a transmission request for the parity data DP from the receiving device 2, the restoration data generation unit 120 generates the parity data DP of the divided data D <b> 1 and D <b> 2 and outputs it to the network I / F unit 13 as described later. To do.
 ネットワークI/F部13は、分割データD1、D2とパリティデータDPを異なるネットワーク3-1~3-3からそれぞれ送信する。ただし、ネットワークI/F部13は、パリティデータDPが生成されていないときには分割データD1、D2をネットワーク3-1、3-2からそれぞれ送信する。なお、本実施例では、分割データD1、D2及びパリティデータDPをUDPで送信する例を示す。 The network I / F unit 13 transmits the divided data D1 and D2 and the parity data DP from different networks 3-1 to 3-3, respectively. However, the network I / F unit 13 transmits the divided data D1 and D2 from the networks 3-1 and 3-2, respectively, when the parity data DP is not generated. In the present embodiment, an example is shown in which the divided data D1 and D2 and the parity data DP are transmitted by UDP.
 受信装置2では、ネットワークI/F部23が、複数の通信経路(ネットワーク3-1~3-3)から受信した分割データD1、D2とパリティデータDPを受信バッファ210-1~210-3へそれぞれ格納する。 In the receiving apparatus 2, the network I / F unit 23 sends the divided data D1 and D2 and the parity data DP received from the plurality of communication paths (networks 3-1 to 3-3) to the reception buffers 210-1 to 210-3. Store each one.
 データ結合部220は、同一のシーケンス番号の分割データD1、D2を結合して音声データを復元して音声データを生成してI/F部24とデータ補填部230へ出力する。 The data combining unit 220 combines the divided data D1 and D2 having the same sequence number to restore the audio data, generates the audio data, and outputs the audio data to the I / F unit 24 and the data compensation unit 230.
 データ結合部220は、分割データD1、D2の少なくとも一方が欠損して、当該音声データと同一のシーケンス番号のパリティデータDPが受信バッファ210-3に存在する場合、当該パリティデータDPと分割データから音声データを復元しI/F部24とデータ補填部230へ出力する。 If at least one of the divided data D1 and D2 is missing and the parity data DP having the same sequence number as the audio data is present in the reception buffer 210-3, the data combining unit 220 determines that the parity data DP and the divided data are The audio data is restored and output to the I / F unit 24 and the data compensation unit 230.
 データ補填部230は、復元された音声データを直前(前回の)音声データとして保持する。データ補填部230は、分割データD1、D2の少なくとも一方が欠損して、パリティデータDPがない場合、補填データを生成する。 The data compensation unit 230 holds the restored audio data as the previous (previous) audio data. The data compensation unit 230 generates compensation data when at least one of the divided data D1 and D2 is missing and there is no parity data DP.
 データ補填部230が生成する補填データは、例えば、前回復元した音声データを今回復元したデータとして代用することができる。ただし、データ補填部230は、一定時間(例えば、数十秒間)内で所定回数(例えば、100回)以上、補填データを生成した場合には、受信状態が所定の開始条件を満たして悪化したと判定して、復元用データ制御部250へパリティデータの送信開始を要求する。なお、代用する復元用データとしては、前回の音声データに限定されるものではなく、無音のデータなど公知または周知の手法を用いることができる。 The compensation data generated by the data compensation unit 230 can be used, for example, by replacing the previously restored audio data with the data restored this time. However, when the data compensation unit 230 generates compensation data for a predetermined number of times (for example, 100 times) or more within a predetermined time (for example, several tens of seconds), the reception state has deteriorated by satisfying a predetermined start condition. And the restoration data control unit 250 is requested to start transmission of parity data. Note that the restoration data to be substituted is not limited to the previous audio data, and a known or well-known method such as silent data can be used.
 経路制御部240は、受信バッファ210の受信状態を監視して、受信状態が悪化または障害の予兆を検出すると、復元用データ制御部250にパリティデータの送信開始を要求する。また、経路制御部240は、受信状態が改善(復旧)すると、復元用データ制御部250にパリティデータの送信停止を要求する。 The path control unit 240 monitors the reception state of the reception buffer 210, and when the reception state deteriorates or detects a sign of failure, requests the restoration data control unit 250 to start transmission of parity data. Further, when the reception state is improved (recovered), the path control unit 240 requests the restoration data control unit 250 to stop transmission of parity data.
 本実施例の経路制御部240は、受信バッファ210-1、210-2のそれぞれについて、所定時間内に受信したデータ量を監視して所定時間(例えば、1秒)内のデータ量の揺らぎ(受信揺らぎ)を算出し、予め設定された閾値と受信揺らぎを比較して受信状態を判定する。 The path control unit 240 of this embodiment monitors the amount of data received within a predetermined time for each of the reception buffers 210-1 and 210-2, and the fluctuation of the data amount within a predetermined time (for example, 1 second) ( (Reception fluctuation) is calculated, and a reception state is determined by comparing the reception threshold with a preset threshold value.
 経路制御部240は、受信バッファ210-1、210-2の所定時間内のデータ量について、予め設定された2つの閾値(ただし、第1の閾値<第2の閾値)と比較し、所定時間内のデータ量が第1の閾値未満となったときに受信状態が悪化したことを判定する。この場合、通信経路のネットワーク3-1または3-2で障害が発生したことにより、所定時間内のデータ量が低下したと推定できる。 The path control unit 240 compares the data amount within a predetermined time of the reception buffers 210-1 and 210-2 with two preset threshold values (where the first threshold value <the second threshold value), It is determined that the reception state has deteriorated when the amount of data in the data becomes less than the first threshold. In this case, it can be estimated that the amount of data within a predetermined time has decreased due to the occurrence of a failure in the network 3-1 or 3-2 of the communication path.
 また、経路制御部240は、所定時間内のデータ量が第2の閾値を超えたときに受信状態が悪化または障害の予兆を検出する。この場合、通信経路のネットワーク3-1または3-2で不具合が発生したことによりデータが一時的に滞留し、その後滞留していたデータが一気に転送されたと推定できる。 In addition, the path control unit 240 detects a sign of deterioration of the reception state or a failure when the amount of data within a predetermined time exceeds the second threshold. In this case, it can be estimated that the data temporarily stays due to the occurrence of a problem in the network 3-1 or 3-2 of the communication path, and the staying data is transferred at once.
 このように第1の閾値と第2の閾値の範囲を超えて所定時間内のデータ量の揺らぎが過大になると、ネットワーク3-1、3-2の伝送品質が悪化したこと(または悪化の兆候)を検出でき、分割データD1、D2がバーストロス(連続してパケットが欠損する状態)となる可能性が増大する。このため、経路制御部240は、パリティデータの送信開始要求を復元用データ制御部250へ送信することで、音声データの悪化を抑制することができる。 As described above, when the fluctuation of the data amount within a predetermined time exceeding the range of the first threshold value and the second threshold value becomes excessive, the transmission quality of the networks 3-1 and 3-2 has deteriorated (or a sign of deterioration). ) Can be detected, and the possibility that the divided data D1 and D2 become burst loss (a state in which packets are continuously lost) increases. For this reason, the path control unit 240 can suppress deterioration of the voice data by transmitting a parity data transmission start request to the restoration data control unit 250.
 なお、受信状態が一旦悪化した後、再び所定時間内のデータ量が第1の閾値以上、かつ第2の閾値以下の受信状態が所定期間(例えば、数十秒)経過すると経路制御部240は、受信状態が復旧(改善)したと判定することができる。この場合、経路制御部240は、復元用データ制御部250へパリティデータの送信停止の要求を送信することで、音声データ転送システムのリソースの消費を低減することができる。 Note that after the reception state deteriorates once, the path control unit 240 once again, when a predetermined period (for example, several tens of seconds) elapses in the reception state where the data amount within the predetermined time is equal to or greater than the first threshold and equal to or less than the second threshold. It can be determined that the reception state has been recovered (improved). In this case, the path control unit 240 can reduce resource consumption of the voice data transfer system by transmitting a request to stop transmission of parity data to the restoration data control unit 250.
 以上の処理によって、受信装置2は、受信状態の悪化(悪化の予兆)を検出すると送信装置1へパリティデータの送信開始を要求し、また、受信状態が改善すると送信装置1へパリティデータの送信停止を要求する。さらに、分割データD1、D2とパリティデータDPをそれぞれ異なるネットワーク3-1~3-3を介して転送することで、音声データ転送の冗長化を図ることができる。 Through the above processing, the reception device 2 requests the transmission device 1 to start transmission of parity data when detecting a deterioration of the reception state (a sign of deterioration), and transmits the parity data to the transmission device 1 when the reception state is improved. Request a stop. Further, by transferring the divided data D1 and D2 and the parity data DP via different networks 3-1 to 3-3, it is possible to make the audio data transfer redundant.
 図2は、音声データ転送システムで行われる音声データの通信処理の一例を示すシーケンス図である。まず、送信装置1は音声データを分割した分割データD1、D2を受信装置2へ送信する(S1)。 FIG. 2 is a sequence diagram showing an example of voice data communication processing performed in the voice data transfer system. First, the transmission apparatus 1 transmits the divided data D1 and D2 obtained by dividing the audio data to the reception apparatus 2 (S1).
 受信装置2は、分割データD1、D2が欠損して補填も不可である場合、もしくは受信状態の悪化(受信揺らぎの増大)を検出した場合に(S2)、パリティデータ送信モードに移行して(S3)、送信装置1にパリティデータの送信開始要求を送信する(S4)。 The receiving device 2 shifts to the parity data transmission mode when the divided data D1 and D2 are lost and cannot be compensated, or when the reception state is deteriorated (increase in reception fluctuation) (S2) ( S3), a transmission start request for parity data is transmitted to the transmission apparatus 1 (S4).
 送信装置1は、パリティデータの送信開始要求を受け付けると、パリティデータ送信モードへ移行し(S5)、パリティデータDPの生成及び送信を開始する(S6)。 Upon receiving the parity data transmission start request, the transmission device 1 shifts to the parity data transmission mode (S5), and starts generating and transmitting the parity data DP (S6).
 受信装置2は、一定時間に渡って分割データD1、D2の欠損が無く、かつ受信状態の改善(揺らぎの減少)を検出した場合(S7)には、パリティデータ停止モードへ移行して(S8)、送信装置1に対してパリティデータの送信停止要求を送信する(S9)。 When there is no loss of the divided data D1 and D2 over a certain time and the reception apparatus 2 detects an improvement in reception state (decrease in fluctuation) (S7), the reception apparatus 2 shifts to the parity data stop mode (S8). ), A transmission stop request for parity data is transmitted to the transmission apparatus 1 (S9).
 送信装置1は、パリティデータの送信停止要求を受け付けると(S10)は、パリティデータ停止モードへ移行してパリティデータDPの生成及び送信を停止し、分割データD1、D2の生成と送信を行う(S11)。 Upon receiving the parity data transmission stop request (S10), the transmission apparatus 1 shifts to the parity data stop mode, stops generating and transmitting the parity data DP, and generates and transmits the divided data D1 and D2 ( S11).
 受信装置2は、パリティ送信モードへの移行判定には2つの判断基準を有する。まずひとつは、受信装置2で分割データD1、D2の結合ができず、かつデータ補填を一定時間(数十秒間)内に所定回数(頻度の閾値、例えば、100回)以上行った場合である。そして、もうひとつは、受信装置2が受信バッファ210内のデータ量の過大な変動を検出した場合である。 The receiving device 2 has two determination criteria for determining the transition to the parity transmission mode. The first is a case where the receiving device 2 cannot combine the divided data D1 and D2 and performs data compensation a predetermined number of times (frequency threshold, for example, 100 times) or more within a certain time (several tens of seconds). . The other case is when the receiving apparatus 2 detects an excessive fluctuation in the amount of data in the receiving buffer 210.
 通信経路のネットワーク3-1、3-2で、ある確率でパケットロスが発生している場合、受信装置2で分割データを結合できない確率は2つの通信経路の合算となるため、送信装置1でパリティデータの送信を開始することで音声データの品質を維持することが可能となる。 If there is a packet loss with a certain probability in the communication path networks 3-1 and 3-2, the probability that the receiving apparatus 2 cannot combine the divided data is the sum of the two communication paths. The quality of the audio data can be maintained by starting the transmission of parity data.
 また、一方の通信経路(ネットワーク3-1、3-2)でデータ受信量の揺らぎが増大した場合、通信経路の品質悪化の兆候であり、パケットがバーストロスする可能性などがあるため、予めパリティデータの送信を開始することで音声品質の急激な悪化を抑止することが可能である。 In addition, if fluctuations in the amount of received data increase in one communication path (networks 3-1 and 3-2), this is a sign of deterioration in the quality of the communication path, and there is a possibility that the packet may be lost in burst. By starting the transmission of parity data, it is possible to suppress a rapid deterioration in voice quality.
 図3は、受信装置2データ結合部220で行われるデータ結合処理の一例を示すフローチャートである。この処理は、所定の周期(例えば、数十msec)で実行される。 FIG. 3 is a flowchart illustrating an example of the data combining process performed by the receiving device 2 data combining unit 220. This process is executed at a predetermined cycle (for example, several tens of milliseconds).
 データ結合部220は、シーケンス番号によって次に結合する分割データD1、D2を識別し、それぞれの受信バッファ210-1、210-2からデータを取得する(S21)。シーケンス番号は、前回のシーケンス番号に所定値を加算して算出される。 The data combining unit 220 identifies the divided data D1 and D2 to be combined next by the sequence number, and acquires data from the respective reception buffers 210-1 and 210-2 (S21). The sequence number is calculated by adding a predetermined value to the previous sequence number.
 データ結合部220は、結合する分割データD1、D2が受信バッファ210-1、210-2に揃っているか否かを判定する(S22)。分割データD1、D2が揃っている場合はステップS26へ進み、データ結合部220は分割データD1、D2を結合して音声データを復元する。復元された音声データは、データ補填部230で直前の音声データとして記録され、I/F部24から後段の装置(図示省略)へ送信される(S27)。 The data combining unit 220 determines whether or not the divided data D1 and D2 to be combined are aligned in the reception buffers 210-1 and 210-2 (S22). If the divided data D1 and D2 are ready, the process proceeds to step S26, and the data combining unit 220 combines the divided data D1 and D2 to restore the audio data. The restored audio data is recorded as the immediately preceding audio data by the data compensation unit 230 and transmitted from the I / F unit 24 to a subsequent device (not shown) (S27).
 一方、受信バッファ210に分割データD1、D2が揃っていなかった場合、データ結合部220は、パリティデータDPを受信しているか否かを判定する(S23)。データ結合部220は、上述のシーケンス番号に対応するパリティデータDPが受信バッファ210-3に存在すればステップS25へ進み、受信していない場合にはステップS24へ進む。 On the other hand, when the divided data D1 and D2 are not prepared in the reception buffer 210, the data combining unit 220 determines whether or not the parity data DP is received (S23). The data combining unit 220 proceeds to step S25 if the parity data DP corresponding to the above sequence number exists in the reception buffer 210-3, and proceeds to step S24 if not received.
 データ結合部220は、パリティデータDPを受信していた場合は、欠損した分割データD1、D2を、受信している他方の分割データとパリティデータDPから復元する(S25)。その後、データ結合部220は、ステップS25で復元した分割データを用いて音声データを復元する(S26)。 If the data combining unit 220 has received the parity data DP, the data combining unit 220 restores the missing divided data D1 and D2 from the other received divided data and the parity data DP (S25). Thereafter, the data combining unit 220 restores the audio data using the divided data restored in step S25 (S26).
 そして、上述のように、復元された音声データは、データ補填部230で直前の音声データとして記録され、I/F部24から出力される。 As described above, the restored audio data is recorded as the immediately preceding audio data by the data filling unit 230 and output from the I / F unit 24.
 一方、データ結合部220は、分割データD1、D2が揃っていなかった場合に、パリティデータDPも受信していなかった場合では、データ欠損と判断し、データ補填部230へデータ欠損を通知する。この場合、後段の装置への音声データの出力は行われない。 On the other hand, when the divided data D1 and D2 are not prepared and the parity data DP is not received, the data combining unit 220 determines that the data is missing and notifies the data supplementing unit 230 of the data missing. In this case, audio data is not output to the subsequent apparatus.
 図4は、受信装置2のデータ補填部230で行われるデータ補填処理の一例を示すフローチャートである。この処理は、図3のステップS24の後に実行される。 FIG. 4 is a flowchart showing an example of the data compensation process performed by the data compensation unit 230 of the receiving device 2. This process is executed after step S24 of FIG.
 データ補填部230は、データ欠損を検出したか否かを判定する(S31)。すなわち、データ補填部230は、データ結合部220からデータ欠損の通知を受信していれば、データ結合部220がデータ欠損を検出したと判定してステップS32へ進み、そうでない場合には、ステップS36に進む。 The data compensation unit 230 determines whether or not data loss is detected (S31). That is, if the data filling unit 230 has received a data loss notification from the data combining unit 220, the data filling unit 230 determines that the data combining unit 220 has detected a data loss, and proceeds to step S32. Proceed to S36.
 データ欠損を検出した場合、データ補填部230は、直前に復元した音声データを今回のシーケンス番号の音声データとして置き換えて(S32)、I/F部24へ送信する(S33)。次に、データ補填部230は、一定時間(例えば、数十秒間)内に所定回数(例えば、100回)以上のデータ補填を実施したか否を判定する(S34)。 When the data loss is detected, the data compensation unit 230 replaces the audio data restored immediately before with the audio data of the current sequence number (S32), and transmits it to the I / F unit 24 (S33). Next, the data compensation unit 230 determines whether or not data compensation has been performed a predetermined number of times (for example, 100 times) or more within a predetermined time (for example, several tens of seconds) (S34).
 データ補填部230は、一定時間内に所定の回数(頻度の閾値)以上データ補填を実施した場合、通信経路(ネットワーク3-1、3-2)の状態が悪化したと判定して、パリティデータ送信モードに移行するよう復元用データ制御部250へ依頼し、データ補填の回数をリセットして処理を終了する(S35)。 The data compensation unit 230 determines that the state of the communication path (networks 3-1 and 3-2) has deteriorated when data compensation is performed a predetermined number of times (frequency threshold) within a certain time, and parity data The restoration data control unit 250 is requested to shift to the transmission mode, the number of data compensation is reset, and the process is terminated (S35).
 一方、一定時間内に所定の回数以上のデータ補填を実施していない場合、データ補填部230は、データ補填回数をインクリメントして処理を終了する。 On the other hand, when data compensation is not performed more than a predetermined number of times within a certain time, the data compensation unit 230 increments the number of data compensation and ends the process.
 上記ステップS31の判定で、データ欠損を検出していない場合、データ補填部230は、一定時間内でデータ欠損を検出したか否を判定する(S36)。一定時間内でデータ欠損を検出していない場合、データ補填部230は、所定の停止条件を満たしたと判定して、パリティデータ停止モードへ移行する。このため、データ補填部230は、パリティデータ停止モードに移行するよう復元用データ制御部250へ依頼して、データ補填の回数をリセットして処理を終了する(S37)。 If it is determined in step S31 that no data loss has been detected, the data compensation unit 230 determines whether a data loss has been detected within a predetermined time (S36). If no data loss is detected within a certain time, the data compensation unit 230 determines that a predetermined stop condition is satisfied, and shifts to the parity data stop mode. Therefore, the data compensation unit 230 requests the restoration data control unit 250 to shift to the parity data stop mode, resets the number of data compensations, and ends the process (S37).
 以上の処理によって、データ補填部230は、一定時間内で所定回数以上のデータ補填を行っていれば、データ補填の頻度が所定の補填閾値以上になって受信状態の悪化し、所定の開始条件を満たしたと判定して、パリティデータ送信モードへ移行するよう復元用データ制御部250へ依頼する。 As a result of the above processing, if the data compensation unit 230 performs data compensation a predetermined number of times or more within a predetermined time, the frequency of data compensation becomes equal to or greater than a predetermined compensation threshold, and the reception state deteriorates, and a predetermined start condition is established. And the restoration data control unit 250 is requested to shift to the parity data transmission mode.
 そして、一定時間の間データ欠損を検出しなかった場合、データ補填部230は、所定の停止条件を満足したと判定して、パリティ停止モードに移行するよう復元用データ制御部250へ依頼する。なお、データ補填部230は、一定時間内にデータ欠損を検出していた場合、パリティデータ停止モードには移行せず、パリティデータ送信モードを維持する。 If no data loss is detected for a certain period of time, the data filling unit 230 determines that a predetermined stop condition has been satisfied, and requests the restoration data control unit 250 to shift to the parity stop mode. Note that, when data loss is detected within a certain time, the data compensation unit 230 does not shift to the parity data stop mode but maintains the parity data transmission mode.
 音声データには、必ずしも完全に復元される必要はなく、ある程度の復元がなされれば、受信側のユーザが前後の音声データの内容に基づき当該音声データを推測できる、という特性がある。即ち、音声ストリーミング転送においては、必ずしも転送される音声データの完全性が担保される必要はない。 The audio data does not necessarily need to be completely restored, and if it is restored to some extent, it has a characteristic that the user on the receiving side can estimate the audio data based on the contents of the previous and next audio data. That is, in audio streaming transfer, it is not always necessary to ensure the integrity of transferred audio data.
 そこで、受信装置2では、データ欠損を検出した後、直前の音声データによってデータ補填を行う処理を実施することにより、必ずしも復元用データであるパリティデータの送信依頼を要求する必要がなくなる。この結果、送信装置1にとっては、パリティデータの送信回数を削減できるので、不要な通信を減らすことが可能となる。 Therefore, the receiving device 2 does not necessarily need to request a transmission request for parity data, which is restoration data, by performing a data compensation process using the immediately preceding audio data after detecting a data loss. As a result, since the number of times of transmission of parity data can be reduced for the transmission apparatus 1, unnecessary communication can be reduced.
 なお、受信装置2は、一定時間内にデータ欠損を検出しなかった場合は、パリティデータ停止モードに移行するよう復元用データ制御部250へ依頼する。これにより、通信経路の伝送品質が回復した後には、パリティデータの送信を停止することで、音声データ転送システムのリソースの消費を低減することができる。 Note that the receiving device 2 requests the restoration data control unit 250 to shift to the parity data stop mode when data loss is not detected within a certain time. Thereby, after the transmission quality of the communication path is restored, the resource consumption of the voice data transfer system can be reduced by stopping the transmission of the parity data.
 図5は、受信装置2の経路制御部240で行われる経路制御処理の一例を示すフローチャートである。この処理は、所定の周期(例えば、1秒)で実行される。 FIG. 5 is a flowchart illustrating an example of a route control process performed by the route control unit 240 of the receiving device 2. This process is executed at a predetermined cycle (for example, 1 second).
 経路制御部240は、受信バッファ210-1、210-2のそれぞれについて、所定時間(例えば、1秒)内のデータ量(受信ゆらぎ)を算出して、2つの閾値(第1の閾値<第2の閾値)と比較する。経路制御部240は、受信ゆらぎが第1の閾値以上、かつ、第2の閾値以下であれば受信状態は正常と判定し、そうでない場合には異常(障害の予兆)の検出と判定する。 The path control unit 240 calculates the data amount (reception fluctuation) within a predetermined time (for example, 1 second) for each of the reception buffers 210-1 and 210-2, and calculates two threshold values (first threshold value <first threshold value). 2 threshold). The path control unit 240 determines that the reception state is normal if the reception fluctuation is equal to or greater than the first threshold value and equal to or less than the second threshold value, and otherwise determines that an abnormality (a sign of failure) has been detected.
 経路制御部240は、受信状態が正常の場合には所定の停止時条件を満足したと判定してステップS42へ進み、復元用データ制御部250にパリティデータの送信停止を依頼する。一方、経路制御部240は、受信状態が異常(または障害の予兆)の場合(所定の開始条件を満たして悪化した場合)には、ステップS43へ進み、復元用データ制御部250にパリティデータの送信開始を依頼する。 If the reception state is normal, the path control unit 240 determines that a predetermined stop condition has been satisfied, proceeds to step S42, and requests the restoration data control unit 250 to stop transmission of parity data. On the other hand, when the reception state is abnormal (or a sign of failure) (when the reception condition is deteriorated by satisfying a predetermined start condition), the path control unit 240 proceeds to step S43 and stores the parity data in the restoration data control unit 250. Request to start sending.
 なお、上記では受信揺らぎを2つの閾値によって比較して受信状態を判定する例を示したが、第1の閾値のみで受信状態を判定してもよい。この場合、経路制御部240は、所定時間内のデータ量(受信ゆらぎ)が第1の閾値(揺らぎ閾値)未満であれば、所定の開始条件を満足したと判定し、第1の閾値以上であれば、所定の停止条件を満足したと判定する。 In the above description, an example in which the reception fluctuation is determined by comparing the reception fluctuations with two threshold values has been described. However, the reception state may be determined only with the first threshold value. In this case, if the data amount (reception fluctuation) within a predetermined time is less than the first threshold value (fluctuation threshold value), the path control unit 240 determines that the predetermined start condition is satisfied, and is greater than or equal to the first threshold value. If there is, it is determined that a predetermined stop condition is satisfied.
 図6は、受信装置2の復元用データ制御部250で行われるモード制御処理の一例を示すフローチャートである。この処理は、所定の周期(例えば、1秒)で実行される。 FIG. 6 is a flowchart illustrating an example of the mode control process performed by the restoration data control unit 250 of the receiving device 2. This process is executed at a predetermined cycle (for example, 1 second).
 復元用データ制御部250は、現在のモードが、パリティデータ送信モードであるか否かを判定し、パリティデータ送信モードであればステップS52へ進み、パリティデータ停止モードであればステップS55へ進む。 The restoration data control unit 250 determines whether or not the current mode is the parity data transmission mode. If the current mode is the parity data transmission mode, the process proceeds to step S52. If the current mode is the parity data stop mode, the process proceeds to step S55.
 パリティデータ送信モードのステップS52では、復元用データ制御部250が、経路制御部240とデータ補填部230の双方からパリティデータの停止要求を受け付けたか否かを判定し、双方からパリティデータの停止要求を受け付けていた場合にはステップS53へ進み、そうでない場合にはパリティデータ送信モードを維持して処理を終了する。 In step S52 of the parity data transmission mode, it is determined whether the restoration data control unit 250 has received a parity data stop request from both the path control unit 240 and the data compensation unit 230, and a parity data stop request is received from both sides. If YES in step S53, the process advances to step S53. If not, the parity data transmission mode is maintained and the process ends.
 ステップS53では、復元用データ制御部250が送信装置1へパリティデータの送信停止要求を送信する。そして、ステップS54で、復元用データ制御部250は、パリティデータ停止モードに移行して処理を終了する。 In step S53, the restoration data control unit 250 transmits a parity data transmission stop request to the transmission device 1. In step S54, the restoration data control unit 250 shifts to the parity data stop mode and ends the process.
 一方、現在のモードがパリティデータ停止モードの場合は、ステップS55で、経路制御部240とデータ補填部230のいずれか一方からパリティデータの送信開始要求を受け付けたか否かを判定する。パリティデータの送信開始要求を受け付けた場合にはステップS56へ進み、そうでない場合には、パリティデータ停止モードを維持して処理を終了する。 On the other hand, if the current mode is the parity data stop mode, it is determined in step S55 whether or not a parity data transmission start request has been received from either the path control unit 240 or the data compensation unit 230. If a parity data transmission start request is accepted, the process proceeds to step S56. If not, the parity data stop mode is maintained and the process ends.
 ステップS56では、復元用データ制御部250が送信装置1へパリティデータの送信開始要求を送信する。そして、ステップS57で、復元用データ制御部250は、パリティデータ送信モードに移行して処理を終了する。 In step S56, the restoration data control unit 250 transmits a parity data transmission start request to the transmission device 1. In step S57, the restoration data control unit 250 shifts to the parity data transmission mode and ends the process.
 以上の処理により、復元用データ制御部250は、データ補填部230と経路制御部240からのパリティデータの送信開始または送信停止の要求を受け付けて、予め設定した判定基準に基づいてパリティデータのモードを制御する。すなわち、復元用データ制御部250は、経路制御部240またはデータ補填部230のいずれか一方からパリティ送信モードへの移行を受け付けたとき、パリティ送信モードに移行する。一方、復元用データ制御部250は、経路制御部240及びデータ補填部230の双方からパリティデータ停止モードへの移行を受け付けたときにパリティデータ停止モードに移行する。 With the above processing, the restoration data control unit 250 accepts a request to start or stop transmission of parity data from the data compensation unit 230 and the path control unit 240, and based on a predetermined criterion, the mode of parity data To control. That is, the restoration data control unit 250 shifts to the parity transmission mode when it receives a shift to the parity transmission mode from either the path control unit 240 or the data compensation unit 230. On the other hand, the restoration data control unit 250 shifts to the parity data stop mode when it receives a shift to the parity data stop mode from both the path control unit 240 and the data compensation unit 230.
 上記制御によって、受信状態の悪化または障害の予兆が検出されると速やかにパリティデータ送信モードへ移行して音声データの欠損を抑制する。そして、データ補填の頻度が低減し、かつ、受信状態が正常に復帰してからパリティデータ停止モードに移行することで、受信状態が安定するまでパリティデータ送信モードを維持することができる。 上 記 By the above control, when a deterioration of the reception state or a sign of failure is detected, the mode immediately shifts to the parity data transmission mode to suppress voice data loss. Then, the parity data transmission mode can be maintained until the reception state is stabilized by shifting to the parity data stop mode after the frequency of data compensation is reduced and the reception state returns to normal.
 音声データには、必ずしも完全に復元される必要はなく、ある程度の復元がなされれば、受信側のユーザが前後の音声データの内容に基づき当該音声データを推測できる、という特性がある。音声データの転送においては、必ずしも音声データの完全性が担保される必要はない。 The audio data does not necessarily need to be completely restored, and if it is restored to some extent, it has a characteristic that the user on the receiving side can estimate the audio data based on the contents of the previous and next audio data. In transferring audio data, it is not always necessary to ensure the integrity of the audio data.
 そこで、本実施例では、分割データD1、D2のデータ欠損を検出し、復元用データ(パリティデータ)がない場合には、直前の音声データを復元したデータとして代用することにより、必ずしも復元用データであるパリティの送信依頼をする必要がなくなる。これにより、送信装置1にとっては、パリティデータの送信回数を削減できるので、不要な通信を減らすことが可能となる。 Therefore, in this embodiment, the data loss of the divided data D1 and D2 is detected, and when there is no restoration data (parity data), the immediately preceding audio data is substituted as the restored data, so that the restoration data is not necessarily obtained. There is no need to request transmission of parity. Thereby, since the transmission frequency of the parity data can be reduced for the transmission device 1, unnecessary communication can be reduced.
 そして、受信状態を監視して受信状態が所定の条件を満たして悪化したとき(データ欠損の頻度が増大したり、受信状態の悪化または障害の予兆を検出、)には、復元用データの送信開始を送信装置1に要求してパリティデータ送信モードに移行し、パリティデータから音声データを復元して、音声データの品質を確保する。そして、データ補填の頻度が低減し、かつ、受信状態が正常に復帰してからパリティデータ停止モードに移行することで音声データ転送システムのリソースの消費を低減することが可能となる。 When the reception state is monitored and the reception state deteriorates by satisfying a predetermined condition (when the frequency of data loss increases, or the reception state deteriorates or a sign of failure is detected), the restoration data is transmitted. The transmission apparatus 1 is requested to start and shift to the parity data transmission mode, and the audio data is restored from the parity data to ensure the quality of the audio data. Then, the frequency of data compensation is reduced, and the resource consumption of the voice data transfer system can be reduced by shifting to the parity data stop mode after the reception state returns to normal.
 なお、本発明は上記した実施例に限定されるものではなく、様々な変形例が含まれる。例えば、上記した実施例は本発明を分かりやすく説明するために詳細に記載したものであり、必ずしも説明した全ての構成を備えるものに限定されるものではない。また、ある実施例の構成の一部を他の実施例の構成に置き換えることが可能であり、また、ある実施例の構成に他の実施例の構成を加えることも可能である。また、各実施例の構成の一部について、他の構成の追加、削除、又は置換のいずれもが、単独で、又は組み合わせても適用可能である。 In addition, this invention is not limited to the above-mentioned Example, Various modifications are included. For example, the above-described embodiments are described in detail for easy understanding of the present invention, and are not necessarily limited to those having all the configurations described. Further, a part of the configuration of one embodiment can be replaced with the configuration of another embodiment, and the configuration of another embodiment can be added to the configuration of one embodiment. In addition, any of the additions, deletions, or substitutions of other configurations can be applied to a part of the configuration of each embodiment, either alone or in combination.
 また、上記の各構成、機能、処理部、及び処理手段等は、それらの一部又は全部を、例えば集積回路で設計する等によりハードウェアで実現してもよい。また、上記の各構成、及び機能等は、プロセッサがそれぞれの機能を実現するプログラムを解釈し、実行することによりソフトウェアで実現してもよい。各機能を実現するプログラム、テーブル、ファイル等の情報は、メモリや、ハードディスク、SSD(Solid State Drive)等の記録装置、または、ICカード、SDカード、DVD等の記録媒体に置くことができる。 In addition, each of the above-described configurations, functions, processing units, processing means, and the like may be realized by hardware by designing a part or all of them with, for example, an integrated circuit. In addition, each of the above-described configurations, functions, and the like may be realized by software by the processor interpreting and executing a program that realizes each function. Information such as programs, tables, and files for realizing each function can be stored in a memory, a hard disk, a recording device such as an SSD (Solid State Drive), or a recording medium such as an IC card, an SD card, or a DVD.
 また、制御線や情報線は説明上必要と考えられるものを示しており、製品上必ずしも全ての制御線や情報線を示しているとは限らない。実際には殆ど全ての構成が相互に接続されていると考えてもよい。 Also, the control lines and information lines indicate what is considered necessary for the explanation, and not all the control lines and information lines on the product are necessarily shown. Actually, it may be considered that almost all the components are connected to each other.

Claims (14)

  1.  プロセッサとメモリを含む送信装置と、プロセッサとメモリを含む受信装置と、前記送信装置と前記受信装置を接続する通信経路とを含んで、前記送信装置と前記受信装置の間で音声データを転送する音声データ転送方法であって、
     前記送信装置が、音声データを分割した複数の分割データを生成し、異なる通信経路を介して前記分割データをそれぞれ送信する第1のステップと、
     前記受信装置が、前記異なる通信経路から受信した前記分割データをそれぞれ前記メモリに格納する第2のステップと、
     前記受信装置が、復元用データ送信モードと復元用データ停止モードのいずれであるかを判定する第3のステップと、
     前記受信装置が、前記メモリに格納された分割データの欠損の有無を判定する第4のステップと、
     前記受信装置が、前記メモリに格納された分割データに欠損がない場合には、前記分割データを結合して前記音声データを復元して出力する第5のステップと、
     前記受信装置が、前記復元用データ停止モードで、かつ、前記メモリに格納された分割データに欠損がある場合には、所定の代用データで前記音声データを補填して出力する第6のステップと、
     前記受信装置が、前記分割データの受信状態を監視して、前記受信状態が所定の開始条件を満たすときには、前記復元用データ送信モードに移行して前記送信装置へ復元用データの送信開始を要求する第7のステップと、を含み、
     前記第1のステップは、
     前記受信装置から復元用データの送信開始を要求されたときには、前記分割データの復元用データを生成して、異なる通信経路から送信するステップを含み、
     前記第2のステップは、
     異なる通信経路から受信した前記復元用データをメモリに格納するステップを含み、
     前記受信装置が、前記復元用データ送信モードで、かつ、前記メモリに格納された分割データに欠損がある場合には、前記メモリに格納された復元用データと前記分割データから音声データを復元して出力する第8のステップと、
    を含むことを特徴とする音声データ転送方法。
    Audio data is transferred between the transmission device and the reception device, including a transmission device including a processor and a memory, a reception device including a processor and a memory, and a communication path connecting the transmission device and the reception device. An audio data transfer method,
    A first step of generating a plurality of divided data obtained by dividing the audio data and transmitting the divided data via different communication paths;
    A second step in which the receiving device stores the divided data received from the different communication paths in the memory;
    A third step of determining whether the receiving device is in a restoration data transmission mode or a restoration data stop mode;
    A fourth step in which the receiving device determines whether or not the divided data stored in the memory is missing;
    The receiving device, when there is no loss in the divided data stored in the memory, a fifth step of combining the divided data to restore and output the audio data;
    A sixth step in which the receiving device is in the restoration data stop mode, and if the divided data stored in the memory is missing, the audio data is supplemented with predetermined substitute data and output. ,
    The reception device monitors the reception state of the divided data, and when the reception state satisfies a predetermined start condition, shifts to the restoration data transmission mode and requests the transmission device to start transmission of restoration data. And a seventh step of
    The first step includes
    When the receiving device is requested to start transmission of restoration data, it includes the step of generating restoration data of the divided data and transmitting it from a different communication path,
    The second step includes
    Storing the restoration data received from a different communication path in a memory;
    When the receiving device is in the restoration data transmission mode and the divided data stored in the memory is defective, the receiving device restores audio data from the restoration data and the divided data stored in the memory. And an eighth step of outputting
    An audio data transfer method comprising:
  2.  請求項1に記載の音声データ転送方法であって、
     前記受信装置が、前記復元用データ送信モードで、かつ、前記受信状態が所定の停止条件を満たしたときには、前記復元用データ停止モードに移行して前記送信装置へ復元用データの送信停止を要求する第9のステップを、さらに含むことを特徴とする音声データ転送方法。
    The audio data transfer method according to claim 1,
    When the reception device is in the restoration data transmission mode and the reception state satisfies a predetermined stop condition, the reception device makes a transition to the restoration data stop mode and requests the transmission device to stop transmission of restoration data. The audio data transfer method further comprising a ninth step of:
  3.  請求項1に記載の音声データ転送方法であって、
     前記第7のステップは、
     前記代用データで音声データを補填した頻度が所定の頻度閾値以上の場合には、前記受信状態が所定の開始条件を満たしたと判定することを特徴とする音声データ転送方法。
    The audio data transfer method according to claim 1,
    The seventh step includes
    An audio data transfer method comprising: determining that the reception state satisfies a predetermined start condition when the frequency of the audio data supplemented with the substitute data is equal to or greater than a predetermined frequency threshold.
  4.  請求項2に記載の音声データ転送方法であって、
     前記第9のステップは、
     所定時間内に前記代用データで音声データを補填しなかった場合には、前記受信状態が所定の停止条件を満たしたと判定することを特徴とする音声データ転送方法。
    The audio data transfer method according to claim 2,
    The ninth step includes
    An audio data transfer method comprising: determining that the reception state satisfies a predetermined stop condition when audio data is not supplemented with the substitute data within a predetermined time.
  5.  請求項1に記載の音声データ転送方法であって、
     前記第7のステップは、
     所定の時間内に前記メモリに格納された分割データのデータ量が所定の揺らぎ閾値未満の場合には、前記受信状態が所定の開始条件を満たしたと判定することを特徴とする音声データ転送方法。
    The audio data transfer method according to claim 1,
    The seventh step includes
    An audio data transfer method comprising: determining that the reception state satisfies a predetermined start condition when a data amount of the divided data stored in the memory within a predetermined time is less than a predetermined fluctuation threshold.
  6.  請求項2に記載の音声データ転送方法であって、
     前記第9のステップは、
     所定の時間内に前記メモリに格納された分割データのデータ量が所定の揺らぎ閾値以上の場合には、前記受信状態が所定の停止条件を満たしたと判定することを特徴とする音声データ転送方法。
    The audio data transfer method according to claim 2,
    The ninth step includes
    An audio data transfer method comprising: determining that the reception state satisfies a predetermined stop condition when a data amount of divided data stored in the memory within a predetermined time is equal to or greater than a predetermined fluctuation threshold.
  7.  請求項1に記載の音声データ転送方法であって、
     前記第7のステップは、
     前記代用データで音声データを補填した頻度が所定の頻度閾値以上の場合、または所定の時間内に前記メモリに格納された分割データのデータ量が所定の揺らぎ閾値未満の場合には、前記受信状態が所定の開始条件を満たしたと判定することを特徴とする音声データ転送方法。
    The audio data transfer method according to claim 1,
    The seventh step includes
    When the frequency of supplementing the voice data with the substitute data is equal to or higher than a predetermined frequency threshold, or when the data amount of the divided data stored in the memory within a predetermined time is less than a predetermined fluctuation threshold, the reception state Is determined to satisfy a predetermined start condition.
  8.  請求項2に記載の音声データ転送方法であって、
     前記第9のステップは、
     所定時間内に前記代用データで音声データを補填せずに、かつ、所定の時間内に前記メモリに格納された分割データのデータ量が所定の揺らぎ閾値以上の場合には、前記受信状態が所定の停止条件を満たしたと判定することを特徴とする音声データ転送方法。
    The audio data transfer method according to claim 2,
    The ninth step includes
    If the voice data is not supplemented with the substitute data within a predetermined time, and the data amount of the divided data stored in the memory within the predetermined time is greater than or equal to a predetermined fluctuation threshold, the reception state is predetermined. A voice data transfer method characterized in that it is determined that the stop condition is satisfied.
  9.  プロセッサとメモリを含む送信装置と、プロセッサとメモリを含む受信装置と、前記送信装置と前記受信装置を接続する通信経路とを含んで、前記送信装置と前記受信装置の間で音声データを転送する音声データ転送システムであって、
     前記送信装置は、
     音声データを分割した複数の分割データを生成するデータ分割部と、
     前記受信装置から復元用データの送信開始を要求されたときには、前記分割データの復元用データを生成する復元用データ生成部と、
     異なる通信経路を介して前記分割データと前記復元用データとをそれぞれ送信するネットワークインタフェース部と、を有し、
     前記受信装置は、
     前記異なる通信経路から受信した前記分割データと前記復元用データをそれぞれ前記メモリに格納するネットワークインタフェース部と、
     前記メモリに格納された分割データの欠損の有無を判定し、前記メモリに格納された分割データに欠損がない場合には、前記分割データを結合して前記音声データを復元して出力し、復元用データ送信モードと復元用データ停止モードのいずれであるかを判定して、前記復元用データ送信モードで、かつ、前記メモリに格納された分割データに欠損がある場合には、前記メモリに格納された復元用データと前記分割データから音声データを復元して出力するデータ結合部と、
     前記復元用データ停止モードで、かつ、前記メモリに格納された分割データに欠損がある場合には、所定の代用データで前記音声データを補填して出力するデータ補填部と、
     前記分割データの受信状態を監視して前記受信状態が所定の開始条件を満たすか否かを判定する経路制御部と、
     前記受信状態が所定の開始条件を満たしたときには、前記復元用データ送信モードに移行して前記送信装置へ復元用データの送信開始を要求する復元用データ制御部と、
    を有することを特徴とする音声データ転送システム。
    Audio data is transferred between the transmission device and the reception device, including a transmission device including a processor and a memory, a reception device including a processor and a memory, and a communication path connecting the transmission device and the reception device. An audio data transfer system,
    The transmitter is
    A data dividing unit for generating a plurality of divided data obtained by dividing the audio data;
    When requested to start transmission of restoration data from the receiving device, a restoration data generation unit that generates restoration data of the divided data;
    A network interface unit that transmits the divided data and the restoration data via different communication paths, and
    The receiving device is:
    A network interface unit for storing the divided data and the restoration data received from the different communication paths in the memory;
    It is determined whether or not the divided data stored in the memory is missing. If the divided data stored in the memory is not missing, the audio data is restored by combining the divided data and output. It is determined whether the data transmission mode for restoration or the data stop mode for restoration, and if the divided data stored in the memory is missing in the restoration data transmission mode, the data is stored in the memory. A data combining unit for restoring and outputting audio data from the restored data and the divided data;
    In the restoration data stop mode, and when there is a loss in the divided data stored in the memory, a data compensation unit that supplements and outputs the audio data with predetermined substitute data; and
    A path control unit that monitors a reception state of the divided data and determines whether the reception state satisfies a predetermined start condition; and
    When the reception state satisfies a predetermined start condition, a restoration data control unit that shifts to the restoration data transmission mode and requests the transmission apparatus to start transmission of restoration data;
    An audio data transfer system comprising:
  10.  請求項9に記載の音声データ転送システムであって、
     前記復元用データ制御部は、
     前記復元用データ送信モードで、かつ、前記受信状態が所定の停止条件を満たしたときには、前記復元用データ停止モードに移行して前記送信装置へ復元用データの送信停止を要求することを特徴とする音声データ転送システム。
    The voice data transfer system according to claim 9,
    The restoration data control unit
    In the restoration data transmission mode, and when the reception state satisfies a predetermined stop condition, the restoration data stop mode is entered and the transmission apparatus is requested to stop transmission of restoration data. Voice data transfer system.
  11.  請求項9に記載の音声データ転送システムであって、
     前記復元用データ制御部は、
     前記代用データで音声データを補填した頻度が所定の頻度閾値以上の場合、または所定の時間内に前記メモリに格納された分割データのデータ量が所定の揺らぎ閾値未満の場合には、前記受信状態が所定の開始条件を満たしたと判定することを特徴とする音声データ転送システム。
    The voice data transfer system according to claim 9,
    The restoration data control unit
    When the frequency of supplementing the voice data with the substitute data is equal to or higher than a predetermined frequency threshold, or when the data amount of the divided data stored in the memory within a predetermined time is less than a predetermined fluctuation threshold, the reception state Is determined to satisfy a predetermined start condition.
  12.  請求項10に記載の音声データ転送システムであって、
     前記復元用データ制御部は、
     所定時間内に前記代用データで音声データを補填せずに、かつ、所定の時間内に前記メモリに格納された分割データのデータ量が所定の揺らぎ閾値以上の場合には、前記受信状態が所定の停止条件を満たしたと判定することを特徴とする音声データ転送システム。
    The voice data transfer system according to claim 10,
    The restoration data control unit
    If the voice data is not supplemented with the substitute data within a predetermined time, and the data amount of the divided data stored in the memory within the predetermined time is greater than or equal to a predetermined fluctuation threshold, the reception state is predetermined. An audio data transfer system that determines that the stop condition is satisfied.
  13.  プロセッサとメモリを含んで音声データを受信する受信装置であって、
     異なる通信経路から受信した分割データと復元用データを受信してそれぞれ前記メモリに格納するネットワークインタフェース部と、
     前記メモリに格納された分割データの欠損の有無を判定し、前記メモリに格納された分割データに欠損がない場合には、前記分割データを結合して前記音声データを復元して出力し、復元用データ送信モードと復元用データ停止モードのいずれであるかを判定して、前記復元用データ送信モードで、かつ、前記メモリに格納された分割データに欠損がある場合には、前記メモリに格納された復元用データと前記分割データから音声データを復元して出力するデータ結合部と、
     前記復元用データ停止モードで、かつ、前記メモリに格納された分割データに欠損がある場合には、所定の代用データで前記音声データを補填して出力するデータ補填部と、
     前記分割データの受信状態を監視して、前記受信状態が所定の開始条件を満たすか否かを判定する経路制御部と、
     前記受信状態が所定の開始条件を満たすときには、前記復元用データ送信モードに移行して復元用データの送信開始を要求する復元用データ制御部と、
    を有することを特徴とする受信装置。
    A receiving device that includes a processor and a memory to receive audio data,
    A network interface unit for receiving divided data and restoration data received from different communication paths and storing them in the memory, respectively;
    It is determined whether or not the divided data stored in the memory is missing. If the divided data stored in the memory is not missing, the audio data is restored by combining the divided data and output. It is determined whether the data transmission mode for restoration or the data stop mode for restoration, and if the divided data stored in the memory is missing in the restoration data transmission mode, the data is stored in the memory. A data combining unit for restoring and outputting audio data from the restored data and the divided data;
    In the restoration data stop mode, and when there is a loss in the divided data stored in the memory, a data compensation unit that supplements and outputs the audio data with predetermined substitute data; and
    A path control unit that monitors a reception state of the divided data and determines whether the reception state satisfies a predetermined start condition;
    When the reception state satisfies a predetermined start condition, a restoration data control unit that shifts to the restoration data transmission mode and requests the restoration data transmission start,
    A receiving apparatus comprising:
  14.  請求項13に記載の受信装置であって、
     前記復元用データ制御部は、
     前記復元用データ送信モードで、かつ、前記受信状態が所定の停止条件を満たしたときには、前記復元用データ停止モードに移行して前記送信装置へ復元用データの送信停止を要求することを特徴とする受信装置。
    The receiving device according to claim 13,
    The restoration data control unit
    In the restoration data transmission mode, and when the reception state satisfies a predetermined stop condition, the restoration data stop mode is entered and the transmission apparatus is requested to stop transmission of restoration data. Receiving device.
PCT/JP2016/086410 2016-12-07 2016-12-07 Voice data transfer method, voice data transfer system, and reception device WO2018105064A1 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
PCT/JP2016/086410 WO2018105064A1 (en) 2016-12-07 2016-12-07 Voice data transfer method, voice data transfer system, and reception device
JP2018555387A JP6668503B2 (en) 2016-12-07 2016-12-07 Voice data transfer method, voice data transfer system and receiving device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/JP2016/086410 WO2018105064A1 (en) 2016-12-07 2016-12-07 Voice data transfer method, voice data transfer system, and reception device

Publications (1)

Publication Number Publication Date
WO2018105064A1 true WO2018105064A1 (en) 2018-06-14

Family

ID=62491789

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/JP2016/086410 WO2018105064A1 (en) 2016-12-07 2016-12-07 Voice data transfer method, voice data transfer system, and reception device

Country Status (2)

Country Link
JP (1) JP6668503B2 (en)
WO (1) WO2018105064A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPWO2022039095A1 (en) * 2020-08-17 2022-02-24

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2005252665A (en) * 2004-03-04 2005-09-15 Nec Corp Voice packet transferring method and terminal used for the same
JP2011087091A (en) * 2009-10-14 2011-04-28 Sony Corp Transmission device and operation mode control method of the same
JP2011172162A (en) * 2010-02-22 2011-09-01 Nec Access Technica Ltd Receiving apparatus, transmitting apparatus, communication system, communication method and program

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2005252665A (en) * 2004-03-04 2005-09-15 Nec Corp Voice packet transferring method and terminal used for the same
JP2011087091A (en) * 2009-10-14 2011-04-28 Sony Corp Transmission device and operation mode control method of the same
JP2011172162A (en) * 2010-02-22 2011-09-01 Nec Access Technica Ltd Receiving apparatus, transmitting apparatus, communication system, communication method and program

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPWO2022039095A1 (en) * 2020-08-17 2022-02-24
WO2022039095A1 (en) * 2020-08-17 2022-02-24 株式会社Gdi Restoration terminal, communication system, restoration method, communication method, and program
JP7126303B2 (en) 2020-08-17 2022-08-26 株式会社ポリテック Restoration terminal, communication system, restoration method, communication method, and program

Also Published As

Publication number Publication date
JP6668503B2 (en) 2020-03-18
JPWO2018105064A1 (en) 2019-07-25

Similar Documents

Publication Publication Date Title
JP4405875B2 (en) Method and apparatus for generating data for error correction, generation program, and computer-readable recording medium storing the program
JP5191826B2 (en) Stream communication apparatus, stream communication method, and stream communication system
US7904781B2 (en) Data transmitting device, data receiving device, and data distribution system
US20080239953A1 (en) Method and apparatus for minimizing congestion in gateways
US20170346601A1 (en) Data transmission method and computing apparatus having data transmission function
JP5084362B2 (en) Data transmission apparatus and data transmission / reception system
JP2001230809A (en) Communication system and communication method and transmission terminal and reception terminal
JP5142828B2 (en) Transmission device, transmission method, and program
JP2006287284A (en) Wireless communication system and header compression control method thereof
JP2016092640A (en) Generation device, generation method and program
WO2018105064A1 (en) Voice data transfer method, voice data transfer system, and reception device
US7724755B2 (en) Communications apparatus
US11444723B2 (en) Signal protection
JP2013225761A (en) Coding device, decoding device, communication system and communication control method
JP2017536052A (en) Tag exchange path connectivity detection method and apparatus
US8948252B2 (en) Moving picture transmission apparatus, moving picture transmission system, moving picture transmission method, and program
US8472310B2 (en) Packet distribution band controlling method, distributing apparatus, and video distributing system
CN116506409A (en) Data transmission method and device, electronic equipment and storage medium
JP2012216964A (en) Client device, communication system, existence confirmation method and program
JP4909590B2 (en) Media signal receiving device, transmitting device, and transmitting / receiving system
JP6358430B2 (en) Communication control device and communication control method
JP2016005220A (en) Transmission apparatus, transmission method, and program
JP5813835B2 (en) Client device and communication system
JP2008219551A (en) Transmission system, communication device, transmission control method, and program
WO2014174801A1 (en) Wireless communication device, wireless communication system, and wireless communication method

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 16923319

Country of ref document: EP

Kind code of ref document: A1

ENP Entry into the national phase

Ref document number: 2018555387

Country of ref document: JP

Kind code of ref document: A

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 16923319

Country of ref document: EP

Kind code of ref document: A1