CN113689884A - Multi-channel speech signal evaluation system and method - Google Patents
Multi-channel speech signal evaluation system and method Download PDFInfo
- Publication number
- CN113689884A CN113689884A CN202111094740.2A CN202111094740A CN113689884A CN 113689884 A CN113689884 A CN 113689884A CN 202111094740 A CN202111094740 A CN 202111094740A CN 113689884 A CN113689884 A CN 113689884A
- Authority
- CN
- China
- Prior art keywords
- signal
- upper computer
- processor
- state machine
- data
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 238000000034 method Methods 0.000 title claims abstract description 32
- 238000011156 evaluation Methods 0.000 title claims abstract description 27
- 238000004891 communication Methods 0.000 claims abstract description 75
- 238000012545 processing Methods 0.000 claims abstract description 60
- 230000005236 sound signal Effects 0.000 claims abstract description 55
- 238000006243 chemical reaction Methods 0.000 claims description 25
- 230000001360 synchronised effect Effects 0.000 claims description 6
- 230000003044 adaptive effect Effects 0.000 claims description 5
- 238000005070 sampling Methods 0.000 claims description 5
- 230000000875 corresponding effect Effects 0.000 description 9
- 238000010586 diagram Methods 0.000 description 4
- 230000005540 biological transmission Effects 0.000 description 2
- 206010071299 Slow speech Diseases 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 230000001960 triggered effect Effects 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/48—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
- G10L25/51—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination
- G10L25/60—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination for measuring the quality of voice signals
Landscapes
- Engineering & Computer Science (AREA)
- Quality & Reliability (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Communication Control (AREA)
- Analogue/Digital Conversion (AREA)
Abstract
The invention relates to a multi-channel voice signal evaluation system and a method. The device comprises an analog-to-digital converter, a signal processing device and an upper computer; when each path of digital audio signal is transmitted to an upper computer, the signal processing device can determine the communication rate with the upper computer according to the frequency of the current audio digital signal and transmits the current audio digital signal to the upper computer at the determined communication rate; the upper computer carries out fast Fourier transform processing on the received audio digital signals to determine signal frequency domain values of corresponding audio digital signals, estimates and judges the signal quality of the current audio signals by utilizing a spurious-free dynamic range according to the signal frequency domain values of each audio signal, and outputs the estimated voice signal quality. The invention can realize the evaluation of the multi-path voice signals and improve the efficiency and the reliability of the evaluation of the voice signals.
Description
Technical Field
The present invention relates to an evaluation system and method, and more particularly, to a multi-channel speech signal evaluation system and method.
Background
The voice audio signal sent by human being is generally 500-4000Hz, the voice used by people in daily communication comprises the frequency band of the fast speech speed, the normal speech speed and the slow speech speed are both 500-4000Hz, the audio signal in the frequency band contains a great deal of valuable information, and the research on the audio quality in the frequency band has very important practical significance. The speech is usually mixed with noise in various frequency bands, so how to evaluate the speech signal is very important, but the necessary technical means is not available at present.
Disclosure of Invention
The invention aims to overcome the defects in the prior art and provides a multi-channel speech signal evaluation system and a multi-channel speech signal evaluation method, which can realize the evaluation of multi-channel speech signals and improve the efficiency and reliability of speech signal evaluation.
According to the technical scheme provided by the invention, the multichannel voice signal evaluation system comprises an analog-to-digital converter, a signal processing device and an upper computer, wherein the analog-to-digital converter can receive a plurality of channels of audio signals generated by a signal source, the signal processing device is in adaptive connection with the analog-to-digital converter, and the upper computer is electrically connected with the signal processing device;
after the analog-to-digital converter performs analog-to-digital conversion on the received multiple channels of audio signals, the signal processing device can sample and process the audio digital signals of each channel converted by the analog-to-digital converter, and the signal processing device can sequentially transmit the multiple channels of sampled and processed audio digital signals to an upper computer; when each path of digital audio signal is transmitted to an upper computer, the signal processing device can determine the communication rate with the upper computer according to the frequency of the current audio digital signal and transmits the current audio digital signal to the upper computer at the determined communication rate;
the upper computer carries out fast Fourier transform processing on the received audio digital signals to determine signal frequency domain values of corresponding audio digital signals, estimates and judges the signal quality of the current audio signals by utilizing a spurious-free dynamic range according to the signal frequency domain values of each audio signal, and outputs the estimated voice signal quality.
The signal processing device comprises a first signal processor and a second signal processor, wherein the first signal processor is connected with the upper computer through USB communication, and the second signal processor is connected with the first signal processor;
the first signal processor can sample and process each channel of audio digital signals after being converted by the analog-digital converter, and can transmit the multiple channels of sampled and processed audio digital signals to the upper computer in sequence through the signal processing device;
and after the first signal processor and the upper computer are adjusted to the determined communication rate, the first signal processor transmits the current audio digital signals to the upper computer in a USB communication mode at the determined communication rate.
The first signal processor comprises an FPGA, and the second signal processor comprises an ARM;
when the signal second processor adopts ARM and the signal first processor adopts FPGA, the signal second processor and the signal first processor communicate through the FSMC protocol.
When the second signal processor determines the communication rate between the first signal processor and the upper computer, the communication rate v is
Wherein, FinFor the frequency of the current audio signal, FsThe sampling frequency of the analog-to-digital converter is m, the conversion bit number of the analog-to-digital converter is n, the channel number of the analog-to-digital converter is n, g is the effective data bit number when the first signal processor and the upper computer adopt RS232 protocol before conversion into USB protocol for communication, and q is the total data bit number when the RS232 protocol before conversion into USB protocol for communication is adopted.
Setting a three-layer state machine in the first signal processor, wherein the three-layer state machine comprises a first layer state machine, a second layer state machine and a third layer state machine; the first signal processor can process each path of audio digital signals converted by the analog-digital converter after sequentially passing through the first layer state machine, the second layer state machine and the third layer state machine, and finally sends the processed audio digital signals to the upper computer;
the first layer state machine comprises a first layer state machine idle state, a write address state, a write data state and an ADC data receiving state;
the second layer state machine comprises a second layer state machine idle state, a SPORT data reading state, a FIFO data writing state, a FIFO data reading state, an RS232-USB protocol conversion state and a synchronous state of SPORT received data and FIFO read data;
the third layer state machine comprises an idle state of the third layer state machine, an ADC data receiving and sending state, a data state of RS232 sending to GUI, a data state of RS232 receiving data from GUI, a data state of FSMC sending, a data state of FSMC receiving and a communication rate synchronization state.
The frequency of the multi-channel audio signal generated by the signal source is 500 Hz-4000 Hz.
A multi-channel voice signal evaluation method comprises the steps of providing an analog-to-digital converter capable of receiving multi-channel audio signals generated by a signal source, a signal processing device in adaptive connection with the analog-to-digital converter, and an upper computer electrically connected with the signal processing device;
after the analog-to-digital converter performs analog-to-digital conversion on the received multiple channels of audio signals, the signal processing device can sample and process the audio digital signals of each channel converted by the analog-to-digital converter, and the signal processing device can sequentially transmit the multiple channels of sampled and processed audio digital signals to an upper computer; when each path of digital audio signal is transmitted to an upper computer, the signal processing device can determine the communication rate with the upper computer according to the frequency of the current audio digital signal and transmits the current audio digital signal to the upper computer at the determined communication rate;
the upper computer carries out fast Fourier transform processing on the received audio digital signals to determine signal frequency domain values of corresponding audio digital signals, estimates and judges the signal quality of the current audio signals by utilizing a spurious-free dynamic range according to the signal frequency domain values of each audio signal, and outputs the estimated voice signal quality.
The signal processing device comprises a first signal processor and a second signal processor, wherein the first signal processor is connected with the upper computer through USB communication, and the second signal processor is connected with the first signal processor;
the first signal processor can sample and process each channel of audio digital signals after being converted by the analog-digital converter, and can transmit the multiple channels of sampled and processed audio digital signals to the upper computer in sequence through the signal processing device;
and after the first signal processor and the upper computer are adjusted to the determined communication rate, the first signal processor transmits the current audio digital signals to the upper computer in a USB communication mode at the determined communication rate.
When the second signal processor determines the communication rate between the first signal processor and the upper computer, the communication rate v is
Wherein, FinFor the frequency of the current audio signal, FsThe sampling frequency of the analog-to-digital converter is m, the conversion bit number of the analog-to-digital converter is n, the channel number of the analog-to-digital converter is n, g is the effective data bit number when the first signal processor and the upper computer adopt RS232 protocol before conversion into USB protocol for communication, and q is the total data bit number when the RS232 protocol before conversion into USB protocol for communication is adopted.
Setting a three-layer state machine in the first signal processor, wherein the three-layer state machine comprises a first layer state machine, a second layer state machine and a third layer state machine; the first signal processor can process each path of audio digital signals converted by the analog-digital converter after sequentially passing through the first layer state machine, the second layer state machine and the third layer state machine, and finally sends the processed audio digital signals to the upper computer;
the first layer state machine comprises a first layer state machine idle state, a write address state, a write data state and an ADC data receiving state;
the second layer state machine comprises a second layer state machine idle state, a SPORT data reading state, a FIFO data writing state, a FIFO data reading state, an RS232-USB protocol conversion state and a synchronous state of SPORT received data and FIFO read data;
the third layer state machine comprises an idle state of the third layer state machine, an ADC data receiving and sending state, a data state of RS232 sending to GUI, a data state of RS232 receiving data from GUI, a data state of FSMC sending, a data state of FSMC receiving and a communication rate synchronization state.
The invention has the advantages that: after the analog-to-digital converter performs analog-to-digital conversion on the received multiple channels of audio signals, the signal processing device can sample and process the audio digital signals of each channel converted by the analog-to-digital converter, and the signal processing device can sequentially transmit the multiple channels of sampled and processed audio digital signals to an upper computer; when each path of digital audio signal is transmitted to an upper computer, the signal processing device can determine the communication rate with the upper computer according to the frequency of the current audio digital signal and transmits the current audio digital signal to the upper computer at the determined communication rate;
the upper computer carries out fast Fourier transform processing on the received audio digital signals to determine signal frequency domain values of corresponding audio digital signals, estimates and judges the signal quality of the current audio signals by utilizing a spurious-free dynamic range according to the signal frequency domain values of each audio signal, and outputs the voice signal quality which is estimated and judged, namely, the estimation of multiple paths of voice signals can be realized, and the efficiency and the reliability of the voice signal estimation are improved.
Drawings
FIG. 1 is a block diagram of the present invention.
FIG. 2 is a diagram of a first level state machine according to the present invention.
FIG. 3 is a diagram of a second level state machine according to the present invention.
FIG. 4 is a diagram of a third level state machine according to the present invention.
FIG. 5 is a flow chart of the operation of the upper computer of the present invention.
Description of reference numerals: the device comprises a 1-analog-to-digital converter, a 2-signal first processor, a 3-USB driver, a 4-USB interface, a 5-signal second processor and a 6-upper computer.
Detailed Description
The invention is further illustrated by the following specific figures and examples.
As shown in fig. 1: in order to realize the evaluation of multi-channel voice signals and improve the efficiency and reliability of the evaluation of the voice signals, the invention comprises an analog-to-digital converter 1 which can receive multi-channel audio signals generated by a signal source, a signal processing device which is in adaptive connection with the analog-to-digital converter 1 and an upper computer 6 which is electrically connected with the signal processing device;
after the analog-to-digital converter 1 performs analog-to-digital conversion on the received multiple channels of audio signals, a signal processing device can sample and process each channel of audio digital signals converted by the analog-to-digital converter 1, and the multiple channels of sampled and processed audio digital signals can be sequentially transmitted into an upper computer 6; when each path of digital audio signal is transmitted to the upper computer 6, the signal processing device can determine the communication rate with the upper computer 6 according to the frequency of the current audio digital signal, and transmits the current audio digital signal into the upper computer 6 at the determined communication rate;
the upper computer 6 performs fast fourier transform processing on the received audio digital signal to determine a signal frequency domain value of a corresponding path of audio digital signal, estimates and judges the signal quality of the current path of audio signal by utilizing a spurious-free dynamic range according to the signal frequency domain value of each path of audio signal, and outputs the estimated voice signal quality.
Specifically, the signal source may adopt an existing commonly used signal generator form, and may be specifically selected according to needs, where a corresponding frequency of the multiple audio signals generated by the signal source is 500Hz to 4000 Hz. The analog-to-digital converter 1 is selected to receive multiple audio signals simultaneously, the specific type of the analog-to-digital converter 1 can be selected according to actual needs, and the analog-to-digital converter 1 can be used to perform analog-to-digital conversion on the multiple audio signals, and the specific form and process of the analog-to-digital conversion are consistent with those in the prior art, which are known to those skilled in the art and will not be described herein again. Generally, the multiple audio signals refer to at least one audio signal, the number of the audio signals, the signal source, and the specific type of the line pipe of the analog-to-digital converter 1, which are well known in the art and will not be described herein.
The signal processing device can sample and process the audio digital signal obtained after the analog-to-digital converter 1 is converted, generally, the signal processing device can simultaneously sample and process multiple paths of digital audio signals, or sample and process one path of audio digital signal at each time, which can be specifically selected according to actual needs and is not described herein again. When multiple paths of audio digital signals exist, the signal processing device sequentially transmits the audio digital signals of each path to the upper computer 6.
In order to prevent data loss in the transmission process of the audio digital signals, before the audio digital signals of each channel are transmitted to the upper computer 6, the communication rate between the signal processing device and the upper computer 6 needs to be determined according to the frequency of the current audio digital signals to be transmitted, after the communication rate is determined, the signal processing device communicates with the upper computer 6 at the current communication rate to transmit the current audio digital signals to the upper computer 6, and the working flow of the upper computer 6 is as shown in fig. 5.
The upper computer 6 carries out fast fourier transform processing to the audio digital signal that receives, after carrying out fast fourier transform, can confirm the signal frequency domain value of corresponding way audio digital signal, and the process that the upper computer 6 specifically carries out fast fourier transform all is unanimous with current, specifically is known for this technical field personnel, and it is no longer repeated here.
In the embodiment of the invention, after the signal frequency domain value is obtained, the signal quality of the current path of audio signal is estimated and judged by utilizing the spurious-free dynamic range according to the signal frequency domain value of each path of audio signal, and the estimated voice signal quality is output. Spurious-free dynamic range is an important indicator for evaluating the performance of an analog-to-digital converter. In particular, Spurious Free Dynamic Range (SFDR) is used to evaluate the quality of the incoming signal. For one acknowledgementA fixed analog-to-digital converter 1, wherein the ideal SFDR value can be determined to be SFDR according to the working manual of the analog-to-digital converter 1ADCSFDR can be obtained by calculating the signal frequency domain value obtained by the upper computer 6OUTIn case of no line loss, no interference and no attenuation of the input signal, | SFDRADC|=|SFDROUTIf considering signal attenuation, it can be considered as SFDRADC|-|SFDROUTWhen | < 4dbm, the detected input signal quality is high; when 4dbm < | SFDRADC|-|SFDROUTWhen the | is less than 7dbm, the detected quality of the input signal is general; when SFDRADC|-|SFDROUTIf | is greater than 7dbm, the detected input signal quality is low, so that the speech signal can be evaluated.
In specific implementation, the upper computer 6 performs fast fourier transform processing on the received audio digital signal to determine a signal frequency domain value of the corresponding audio digital signal, wherein, according to the definition of SFDR, the signal frequency domain value obtained by the upper computer 6 is calculated to obtain SFDROUTThe methods and calculation processes are well known in the art and will not be described herein.
Further, the signal processing device comprises a first signal processor 2 and a second signal processor 5, wherein the first signal processor 2 is connected with an upper computer 6 through a USB communication mode, and the second signal processor 5 is connected with the first signal processor 2;
the first signal processor 2 can sample and process each channel of audio digital signals after being converted by the analog-digital converter 1, and can sequentially transmit the multiple channels of sampled and processed audio digital signals into the upper computer 6 through the signal processing device;
for any audio digital signal transmitted to the upper computer 6, the second signal processor 5 determines the frequency of the current audio digital signal to determine the communication rate between the first signal processor 2 and the upper computer 6, the first signal processor 2 transmits the determined communication rate to the upper computer 6 through USB communication, and after the first signal processor 2 and the upper computer 6 are adjusted to the determined communication rate, the first signal processor 2 transmits the current audio digital signal into the upper computer 6 through a USB communication mode at the determined communication rate.
In the embodiment of the invention, the first signal processor 2 comprises an FPGA, and the second signal processor 5 comprises an ARM; of course, the upper computer 6 may be in the form of a commonly used computer or the like, and may be specifically selected according to actual needs.
When the second signal processor 5 adopts an ARM and the first signal processor 2 adopts an FPGA, the second signal processor 5 is in communication connection with the first signal processor 2 through an RS 232. In specific implementation, the first signal processor 2 can realize USB connection and cooperation with the upper computer 6 through the USB driver 3 and the USB interface 4.
Further, when the second signal processor 5 determines the communication rate between the first signal processor 2 and the upper computer 6, the communication rate v is
Wherein, FinFor the frequency of the current audio signal, FsThe sampling frequency of the analog-to-digital converter 1 is m, the conversion bit number of the analog-to-digital converter 1 is n, the channel number of the analog-to-digital converter 1 is n, g is the effective data bit number when the signal first processor 2 and the upper computer 6 adopt USB communication, and q is the total data bit number when the USB communication is adopted.
Further, a three-layer state machine is arranged in the first signal processor 2, and the three-layer state machine comprises a first layer state machine, a second layer state machine and a third layer state machine; the first signal processor 2 can process each path of audio digital signals converted by the analog-digital converter 1 after sequentially passing through the first layer state machine, the second layer state machine and the third layer state machine, and finally sends the processed audio digital signals to the upper computer 6;
the first layer state machine comprises a first layer state machine idle state, a write address state, a write data state and an ADC data receiving state;
the second layer state machine comprises a second layer state machine idle state, a SPORT data reading state, a FIFO data writing state, a FIFO data reading state, an RS232-USB protocol conversion state and a synchronous state of SPORT received data and FIFO read data;
the third layer state machine comprises an idle state of the third layer state machine, an ADC data receiving and sending state, a data state of RS232 sending to GUI, a data state of RS232 receiving data from GUI, a data state of FSMC sending, a data state of FSMC receiving and a communication rate synchronization state.
In the embodiment of the invention, for the first layer state machine, the first layer state, namely the idle state, mainly has the main function of waiting for the issuance of a subsequent instruction after one ADC read-write operation is finished, and carrying out corresponding action; the write address state is a state in which address encoding is performed; the data writing state is a state for data encoding; the ADC data reception state is a state in which ADC data reception is performed, as shown in fig. 2.
For the second layer state machine, the idle state of the second layer state machine has the function that after data are sent to the upper computer 6, a subsequent new round of instruction is issued, and the subsequent action of the subsequent round of instruction is triggered; the SPORT read data state is a state in which data is sent using the SPORT protocol; the FIFO write data state is a state in which data is written to the FIFO; the FIFO read data state is a state in which data is read from the FIFO; the RS232-USB protocol conversion state is a state in which data transmission is converted from RS232 to the USB protocol; the synchronous state of the received data of the SPORT and the read data of the FIFO is a state of finding balance between read data and write data, and the specific working process settings of the second-layer state machine and the second-layer state machine in the first processor 2 are well known in the art and will not be described herein again, as shown in fig. 3.
For the third-layer state machine, the idle state of the third-layer state machine is used for waiting for the upper computer to send a starting instruction after power-on and reset; the ADC data receiving and sending state is an ADC data receiving and sending state and is used for realizing the receiving and sending of ADC data; the data state sent by the RS232 to the GUI has the function of sending data to the GUI by converting the RS232 protocol to the USB protocol in the last state; the FSMC sends a data state to enable the FPGA to transmit data information to the signal second processor 5 for calculation through the FSMC protocol; the FSMC receiving data has the function of receiving the calculated value from the signal second processor 5 through the FSMC protocol; the effect of the communication rate synchronization state is to match the optimal communication rate between the upper computer 6 and the first processor 2 according to the obtained synchronization rate value, so as to achieve the purpose of not losing codes, as shown in fig. 4.
In summary, the multi-channel speech signal evaluation method of the present invention can be obtained by providing an analog-to-digital converter 1 capable of receiving multiple audio signals generated by a signal source, a signal processing device adaptively connected to the analog-to-digital converter 1, and an upper computer 6 electrically connected to the signal processing device;
after the analog-to-digital converter 1 performs analog-to-digital conversion on the received multiple channels of audio signals, a signal processing device can sample and process each channel of audio digital signals converted by the analog-to-digital converter 1, and the multiple channels of sampled and processed audio digital signals can be sequentially transmitted into an upper computer 6; when each path of digital audio signal is transmitted to the upper computer 6, the signal processing device can determine the communication rate with the upper computer 6 according to the frequency of the current audio digital signal, and transmits the current audio digital signal into the upper computer 6 at the determined communication rate;
the upper computer 6 performs fast fourier transform processing on the received audio digital signal to determine a signal frequency domain value of a corresponding path of audio digital signal, estimates and judges the signal quality of the current path of audio signal by utilizing a spurious-free dynamic range according to the signal frequency domain value of each path of audio signal, and outputs the estimated voice signal quality.
Specifically, the process of performing the speech signal evaluation by the specific coordination among the analog-to-digital converter 1, the upper computer 6 and the signal processing device can refer to the above description, and is not described herein again.
The above description is only for the purpose of describing the preferred embodiments of the present invention, and is not intended to limit the scope of the present invention, and any variations and modifications made by those skilled in the art based on the above disclosure are within the scope of the appended claims.
Claims (10)
1. A multi-channel speech signal evaluation system, characterized by: the device comprises an analog-to-digital converter (1) capable of receiving multi-channel audio signals generated by a signal source, a signal processing device in adaptive connection with the analog-to-digital converter (1), and an upper computer (6) electrically connected with the signal processing device;
after the analog-to-digital converter (1) performs analog-to-digital conversion on the received multi-channel audio signals, the signal processing device can sample and process each channel of audio digital signals converted by the analog-to-digital converter (1), and the signal processing device can sequentially transmit the multi-channel sampled and processed audio digital signals into an upper computer (6); when each path of digital audio signal is transmitted to the upper computer (6), the signal processing device can determine the communication rate with the upper computer (6) according to the frequency of the current audio digital signal, and transmits the current audio digital signal into the upper computer (6) at the determined communication rate;
the upper computer (6) carries out fast Fourier transform processing on the received audio digital signals to determine the signal frequency domain values of the corresponding audio digital signals, estimates and judges the signal quality of the current audio signals by utilizing a spurious-free dynamic range according to the signal frequency domain values of each audio signal, and outputs the voice signal quality which is estimated and judged.
2. The multi-channel speech signal evaluation system of claim 1, wherein: the signal processing device comprises a first signal processor (2) and a second signal processor (5), wherein the first signal processor (2) is connected with an upper computer (6) through USB communication, and the second signal processor (5) is connected with the first signal processor (2);
the first signal processor (2) can sample and process each channel of audio digital signals converted by the analog-digital converter (1), and can sequentially transmit the multiple channels of sampled and processed audio digital signals into the upper computer (6) through the signal processing device;
for any audio digital signal transmitted to the upper computer (6), the frequency of the current audio digital signal is determined through the signal second processor (5), the communication speed between the signal first processor (2) and the upper computer (6) can be determined, the determined communication speed is transmitted to the upper computer (6) through the signal first processor (2) through USB communication, the signal first processor (2) and the upper computer (6) are adjusted to the determined communication speed, and then the signal first processor (2) transmits the current audio digital signal into the upper computer (6) through a USB communication mode at the determined communication speed.
3. The multi-channel speech signal evaluation system of claim 2, wherein: the first signal processor (2) comprises an FPGA, and the second signal processor (5) comprises an ARM;
when the signal second processor (5) adopts ARM and the signal first processor (2) adopts FPGA, the signal second processor (5) and the signal first processor (2) communicate through RS 232.
4. The multi-channel speech signal evaluation system of claim 2, wherein: when the second signal processor (5) determines the communication rate between the first signal processor (2) and the upper computer (6), the communication rate v is
Wherein, FinFor the frequency of the current audio signal, FsThe sampling frequency of the analog-to-digital converter (1) is shown as m, the conversion bit number of the analog-to-digital converter (1) is shown as n, the channel number of the analog-to-digital converter (1) is shown as g, the effective data bit number of the signal first processor (2) and the upper computer (6) is shown as q, and the effective data bit number is shown as q.
5. A multi-channel speech signal evaluation system according to any one of claims 2 to 4, characterized by: setting a three-layer state machine in the first signal processor (2), wherein the three-layer state machine comprises a first layer state machine, a second layer state machine and a third layer state machine; the first signal processor (2) can process each path of audio digital signals converted by the analog-digital converter (1) after sequentially passing through the first layer state machine, the second layer state machine and the third layer state machine, and finally sends the processed audio digital signals to the upper computer (6);
the first layer state machine comprises a first layer state machine idle state, a write address state, a write data state and an ADC data receiving state;
the second layer state machine comprises a second layer state machine idle state, a SPORT data reading state, a FIFO data writing state, a FIFO data reading state, an RS232-USB protocol conversion state and a synchronous state of SPORT received data and FIFO read data;
the third layer state machine comprises an idle state of the third layer state machine, an ADC data receiving and sending state, a data state of RS232 sending to GUI, a data state of RS232 receiving data from GUI, a data state of FSMC sending, a data state of FSMC receiving and a communication rate synchronization state.
6. The multi-channel speech signal evaluation system of any of claims 1 to 4, wherein: the frequency of the multi-channel audio signal generated by the signal source is 500 Hz-4000 Hz.
7. A multi-channel speech signal evaluation method is characterized by: providing an analog-to-digital converter (1) capable of receiving a plurality of paths of audio signals generated by a signal source, a signal processing device which is in adaptive connection with the analog-to-digital converter (1), and an upper computer (6) which is electrically connected with the signal processing device;
after the analog-to-digital converter (1) performs analog-to-digital conversion on the received multi-channel audio signals, the signal processing device can sample and process each channel of audio digital signals converted by the analog-to-digital converter (1), and the signal processing device can sequentially transmit the multi-channel sampled and processed audio digital signals into an upper computer (6); when each path of digital audio signal is transmitted to the upper computer (6), the signal processing device can determine the communication rate with the upper computer (6) according to the frequency of the current audio digital signal, and transmits the current audio digital signal into the upper computer (6) at the determined communication rate;
the upper computer (6) carries out fast Fourier transform processing on the received audio digital signals to determine the signal frequency domain values of the corresponding audio digital signals, estimates and judges the signal quality of the current audio signals by utilizing a spurious-free dynamic range according to the signal frequency domain values of each audio signal, and outputs the voice signal quality which is estimated and judged.
8. The multi-channel speech signal evaluation method of claim 7, wherein: the signal processing device comprises a first signal processor (2) and a second signal processor (5), wherein the first signal processor (2) is connected with an upper computer (6) through USB communication, and the second signal processor (5) is connected with the first signal processor (2);
the first signal processor (2) can sample and process each channel of audio digital signals converted by the analog-digital converter (1), and can sequentially transmit the multiple channels of sampled and processed audio digital signals into the upper computer (6) through the signal processing device;
for any audio digital signal transmitted to the upper computer (6), the frequency of the current audio digital signal is determined through the signal second processor (5), the communication speed between the signal first processor (2) and the upper computer (6) can be determined, the determined communication speed is transmitted to the upper computer (6) through the signal first processor (2) through USB communication, the signal first processor (2) and the upper computer (6) are adjusted to the determined communication speed, and then the signal first processor (2) transmits the current audio digital signal into the upper computer (6) through a USB communication mode at the determined communication speed.
9. The multi-channel speech signal evaluation method of claim 8, wherein: when the second signal processor (5) determines the communication rate between the first signal processor (2) and the upper computer (6), the communication rate v is
Wherein, FinFor the frequency of the current audio signal, FsIs the sampling frequency of the analog-to-digital converter (1), m is the conversion bit number of the analog-to-digital converter (1), n is the channel number of the analog-to-digital converter (1), g is the effective data bit number when the signal first processor (2) and the upper computer (6) adopt the RS232 protocol before being converted into the USB protocol for communication,and q is the total data digit when the RS232 protocol before conversion into the USB protocol is adopted for communication.
10. A method for multi-channel speech signal evaluation according to claim 8 or 9, characterized by: setting a three-layer state machine in the first signal processor (2), wherein the three-layer state machine comprises a first layer state machine, a second layer state machine and a third layer state machine; the first signal processor (2) can process each path of audio digital signals converted by the analog-digital converter (1) after sequentially passing through the first layer state machine, the second layer state machine and the third layer state machine, and finally sends the processed audio digital signals to the upper computer (6);
the first layer state machine comprises a first layer state machine idle state, a write address state, a write data state and an ADC data receiving state;
the second layer state machine comprises a second layer state machine idle state, a SPORT data reading state, a FIFO data writing state, a FIFO data reading state, an RS232-USB protocol conversion state and a synchronous state of SPORT received data and FIFO read data;
the third layer state machine comprises an idle state of the third layer state machine, an ADC data receiving and sending state, a data state of RS232 sending to GUI, a data state of RS232 receiving data from GUI, a data state of FSMC sending, a data state of FSMC receiving and a communication rate synchronization state.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202111094740.2A CN113689884B (en) | 2021-09-17 | 2021-09-17 | Multichannel voice signal evaluation system and method |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202111094740.2A CN113689884B (en) | 2021-09-17 | 2021-09-17 | Multichannel voice signal evaluation system and method |
Publications (2)
Publication Number | Publication Date |
---|---|
CN113689884A true CN113689884A (en) | 2021-11-23 |
CN113689884B CN113689884B (en) | 2024-01-26 |
Family
ID=78586602
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202111094740.2A Active CN113689884B (en) | 2021-09-17 | 2021-09-17 | Multichannel voice signal evaluation system and method |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN113689884B (en) |
Citations (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6281819B1 (en) * | 2000-04-03 | 2001-08-28 | Industrial Technology Research Institute | Device for ENOB estimation for ADC's based on dynamic deviation and method therefor |
CN102868402A (en) * | 2012-09-18 | 2013-01-09 | 上海航天测控通信研究所 | Test method for testing main performance indices of analog-to-digital converter (ADC) |
CN102917141A (en) * | 2012-10-26 | 2013-02-06 | 上海大唐移动通信设备有限公司 | Test method, test device and test system for evaluating voice quality |
US20150073785A1 (en) * | 2013-09-06 | 2015-03-12 | Nuance Communications, Inc. | Method for voicemail quality detection |
CN105657592A (en) * | 2014-11-10 | 2016-06-08 | 北京握奇智能科技有限公司 | Audio communication method and device |
US20180255413A1 (en) * | 2015-01-09 | 2018-09-06 | Setuo ANIYA | Method and apparatus for evaluating audio device, audio device and speaker device |
CN111508528A (en) * | 2020-03-12 | 2020-08-07 | 上海交通大学 | No-reference audio quality evaluation method and device based on natural audio statistical characteristics |
CN111542810A (en) * | 2017-12-08 | 2020-08-14 | 谷歌有限责任公司 | Signal processing coordination among digital voice assistant computing devices |
-
2021
- 2021-09-17 CN CN202111094740.2A patent/CN113689884B/en active Active
Patent Citations (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6281819B1 (en) * | 2000-04-03 | 2001-08-28 | Industrial Technology Research Institute | Device for ENOB estimation for ADC's based on dynamic deviation and method therefor |
CN102868402A (en) * | 2012-09-18 | 2013-01-09 | 上海航天测控通信研究所 | Test method for testing main performance indices of analog-to-digital converter (ADC) |
CN102917141A (en) * | 2012-10-26 | 2013-02-06 | 上海大唐移动通信设备有限公司 | Test method, test device and test system for evaluating voice quality |
US20150073785A1 (en) * | 2013-09-06 | 2015-03-12 | Nuance Communications, Inc. | Method for voicemail quality detection |
CN105657592A (en) * | 2014-11-10 | 2016-06-08 | 北京握奇智能科技有限公司 | Audio communication method and device |
US20180255413A1 (en) * | 2015-01-09 | 2018-09-06 | Setuo ANIYA | Method and apparatus for evaluating audio device, audio device and speaker device |
CN111542810A (en) * | 2017-12-08 | 2020-08-14 | 谷歌有限责任公司 | Signal processing coordination among digital voice assistant computing devices |
CN111508528A (en) * | 2020-03-12 | 2020-08-07 | 上海交通大学 | No-reference audio quality evaluation method and device based on natural audio statistical characteristics |
Also Published As
Publication number | Publication date |
---|---|
CN113689884B (en) | 2024-01-26 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN109412763B (en) | Digital signal existence detection method based on signal energy-entropy ratio | |
CN205864457U (en) | A kind of frequency diversity underwater sound communication modem | |
JPH09510855A (en) | Base station receiver at pipelined cell locations for multi-spread spectrum signals | |
JPH06502057A (en) | Data transfer method and apparatus for communication between peripheral and master | |
RU2009107097A (en) | METHOD AND DEVICE FOR SENDING AN ALARM FOR TRANSMISSION OF DATA IN A WIRELESS COMMUNICATION SYSTEM | |
CN102404099A (en) | Underwater multi-user voice communication method and device capable of distributing frequency spectrum dynamically | |
CN113794521B (en) | Underwater acoustic communicator awakening method based on multi-tone signal | |
CN102832963A (en) | Rate adaptive system and method for power line carrier spread spectrum communication | |
US4041442A (en) | Acoustic data acquisition system | |
JP2007519328A5 (en) | ||
US20050036573A1 (en) | Baseband receiver for binary symbol data | |
CN103033705A (en) | Test method and test system of electronic equipment | |
CN107004418B (en) | Digital communication method and device based on audio channel | |
CN114640902B (en) | Multi-interface high-speed optical fiber transmission device and method | |
CN113689884A (en) | Multi-channel speech signal evaluation system and method | |
US20100220773A1 (en) | Indicating and detecting the start of signal transmission employing frequency division multiplexing | |
CN106817200B (en) | Dynamic signal detection method based on multi-channel communication receiving system | |
CN110286392B (en) | Frequency domain narrow-band interference resistance realization method based on down-sampling | |
CN101867543B (en) | A kind of frame arrival detection method based on signal-to-noise ratio (SNR) estimation | |
RU167430U1 (en) | A device for estimating the probability of error per bit for signals with eight-position phase modulation by four-position signals | |
CN101931594A (en) | Signal processing method for base band receiver of base station and base band receiver | |
CN117714254A (en) | Underwater sound modem based on STM32 and raspberry group | |
CN208445563U (en) | A kind of high speed underwater sound voice communication system | |
CN101277281A (en) | Channel equalization for receiver | |
CN103647736A (en) | Double communication system underwater acoustic Modem and implementation method thereof |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |