CN106504762A

CN106504762A - Bird community quantity survey system and method

Info

Publication number: CN106504762A
Application number: CN201610969673.7A
Authority: CN
Inventors: 杨春勇; 吴迪; 刘恒; 彭焱; 陈彬
Original assignee: South Central University for Nationalities
Current assignee: South Central Minzu University
Priority date: 2016-11-04
Filing date: 2016-11-04
Publication date: 2017-03-15
Anticipated expiration: 2036-11-04
Also published as: CN106504762B

Abstract

The invention discloses a kind of bird community quantity survey system and method, are related to bird community protection and voice de-noising and separation technology field.The system includes apparatus for processing audio（10）, data transmission device（20）, terminal data processing device（30）And power supply（40）；Apparatus for processing audio（10）, data transmission device（20）With terminal data processing device（30）It is sequentially connected；Power supply（40）Respectively with apparatus for processing audio（10）And data transmission device（20）Connection.This method is：1. voice de-noising；2. speech Separation.The present invention is applied to the quantity of zoo, agricultural department and conservation of wildlife department more easily observability estimate birds；Also important reference is provided to environmental administration to evaluating and testing somewhere environmental rating.

Description

Bird community quantity survey system and method

Technical field

The present invention relates to bird community protection and voice de-noising and separation technology field, more particularly to a kind of bird community Quantity survey system and method.

Background technology

Currently, bird community protection is cut down in a large number because of vegetation, the continuous deterioration reason of ecological environment and interesting, effectively Group's bird quantity statistics method is most important for bird community protection.Traditional method is more to count animal kind using marked recapture The quantity of group, the method easily produce impact to the normal activities for affecting animal, and statistical accuracy is not enough.In recent years, use Before application well is presented in terms of the speech Separation algorithm for estimating of field of voice signal is in voice identification, noise reduction Scape.

Traditional voice de-noising algorithm has adaptive filter algorithm, spectrum-subtraction and Wiener Filter Method etc., using characteristic parameter End-point detection is carried out, judge voice signal has words section and without words section, by without words section estimated noise spectrum, reaching speech enhan-cement Purpose；In terms of speech Separation, Independent Component Analysis Algorithm（Independent Component Analysis, ICA）Including pole Maximum-likelihood method of estimation, maximum entropy method, Minimum mutual information method, Higher-Order Cumulants etc., using signal after separation each Between component, maximum independence to be setting up contrast function, by maximum or minimize contrast function and obtain separation matrix, reaches point Purpose from source signal.Document points out that ICA algorithm is only used for the number of source signal less than or equal to received by sensor In the case of the number of mixed signal, the estimation of bird community quantity is not suitable for.

Content of the invention

The purpose of the present invention is rested in the artificial traditional method using bird appreciation instrument for observation birds activity at present, is damaged Consumption manpower and the again low problem of efficiency, there is provided a kind of bird community quantity survey system and method, so as to realize using right The collection of bird community birds mixing voice and noise reduction, based on the birds mixing voice isolation technics of peak estimation, separate and mix Voice can estimate birds number.

The mentality of designing of the present invention：

As the collection of voice in truth has the interference of noise, birds voice signal there may be error.In order to improve The accuracy of speech Separation, the first mixing voice to gathering carry out noise reduction process；Human speech isolation technics are analogous to, in conjunction with Birds phonetic feature, designs a kind of DUET Speech separation algorithms clustered based on PDTA；Sound attenuating is simultaneously taken account of, will be adopted High-fidelity mike, strengthens voice signal.

For the needs of speech Separation, speech signal collection and processing system is initially set up, is realized to birds voice collecting, A/D is changed；Secondly, the Transmission system of Speech processing is designed, it is contemplated that the verity of wild environment, we adopt channel radio Letter technology, realizes Long-range Data Transmission；Next the operation such as the pretreatment to data, end-point detection is carried out at PC ends；Pre- place Reason includes aliasing filtering, analog to digital conversion, framing adding window；End-point detection carries out end-point detection using the good Parameter Spectrum entropy of noise immunity, Difference whether there is words section, estimates noise spectrum, carries out noise reduction using Wiener filtering；Finally, speech Separation is carried out to reducing noise of voice, we Using a kind of DUET Speech separation algorithms clustered based on PDTA, separate voice number and be and estimate birds quantity number and extensive Multiple source voice signal.

Specifically：

First, bird community quantity survey system（Abbreviation system）

The system includes apparatus for processing audio, data transmission device, terminal data processing device and power supply；

Its annexation is：

Apparatus for processing audio, data transmission device and terminal data processing device are sequentially connected；

Power supply is connected with apparatus for processing audio and data transmission device respectively.

2nd, bird community quantity survey method

This method includes two parts of voice de-noising and speech Separation.

1. voice de-noising

Birds mixing voice Y is gathered in the field environment using microphone array, framing adding window is carried out to the voice for gathering and is located in advance Manage, extracting the good characteristic parameter spectrum entropy of noise immunity carries out end-point detection, judges there is words frame and without words frame noise, using Wiener filtering Noise reduction algorithm carries out noise reduction process to the voice for gathering, and obtains pure birds voice；

2. speech Separation

Speech Separation is divided into peak estimation and source voice recovers two stages：

In the peak estimation stage, adding window, framing, Fourier transformation is carried out to clean speech, realize the rarefaction representation to gathering voice, Pretreatment is carried out to rarefaction speech data again, is allowed on the upper semicircumference of speech data point chorologic unit circle, is asked each point to arrive（1, 0）Radian, region division is carried out to radian value, the probability density in each region is calculated, is obtained using local optimum resolving Algorithm close Degree peak value, the number of peak value are the number for separating voice；

Source voice Restoration stage, finds the corresponding radian value of peak value, calculates hybrid matrix A, that is, need to solve Y=AX nonhomogeneous equations Group, using DUET algorithms, is projected in each for mixing voice data point in all directions of hybrid matrix A, is sentenced according to projector distance Which source voice class is fixed each data point belong to, you can recover each source voice X.

The present invention has the advantages that：

1. a kind of technical scheme of the voice de-noising based on improved Wiener filtering algorithm is provided

Framing adding window pretreatment is carried out to the voice for gathering, extracting the good characteristic parameter spectrum entropy of noise immunity carries out end-point detection, Judge to have words frame with without words frame noise, noise reduction process is carried out to the voice for gathering using Wiener filtering noise reduction algorithm, is obtained pure Birds voice.

2. the technical scheme of the Speech separation algorithm that a kind of combination DUET and PDTA is clustered is provided

Speech Separation is divided into peak estimation and recovers two stages with source voice.The peak estimation stage, clean speech is carried out adding window, Framing, Fourier transformation, realize, to gathering the rarefaction representation of voice, then carrying out pretreatment to rarefaction speech data, allow voice On the upper semicircumference of data point distribution unit circle, each point is asked to arrive（1,0）Radian, carry out region division to radian value, calculate The probability density in each region, obtains density peaks using local optimum resolving Algorithm, and the number of peak value is and separates the individual of voice Number.Source voice Restoration stage, finds the corresponding radian value of peak value, can calculate hybrid matrix A, that is, need to solve y=Ax nonhomogeneous equations Group, using DUET algorithms, is projected in each for mixing voice data point in all directions of hybrid matrix A, is sentenced according to projector distance Which source voice class is fixed each data point belong to, you can recover each source voice.

3. the technical scheme that a kind of bird community number based on speech Separation is estimated is provided

The method for breaking away from the bird community of original manual observation, using separating to mixing voice, the method for estimating source voice number Reach to birds quantity survey.

4. it is applied to the quantity of zoo, agricultural department and conservation of wildlife department more easily observability estimate birds； Also important reference is provided to environmental administration to evaluating and testing somewhere environmental rating.

Description of the drawings

Fig. 1 is the block diagram of the system；

Fig. 2 is the fundamental diagram of the present invention；

Fig. 3 is PC ends voice de-noising algorithm flow chart；

Fig. 4 is PC ends Speech separation algorithm flow chart.

In figure：

10 apparatus for processing audio,

11 microphone arrays, 12 amplifiers, 13 audio decoders；

20 data transmission devices,

21 communication interfaces, 22 controllers, 23 external expansion interfaces, 24 radiofrequency emitting modules,

25 radiating antennas, 26 reception antennas, 27 Receiver Modules；

30 terminal data processing devices,

31 communication interfaces, 32 PC ends；

40 power supplys.

Specific embodiment

Describe in detail with reference to the accompanying drawings and examples：

First, system

1st, overall

Such as Fig. 1, the system include apparatus for processing audio 10, data transmission device 20, terminal data processing device 30 and power supply 40；

Its annexation is：

Apparatus for processing audio 10, data transmission device 20 and terminal data processing device 30 are sequentially connected；

Power supply 40 is connected with apparatus for processing audio 10 and data transmission device 20 respectively.

Its working mechanism is：

Management and control of the apparatus for processing audio 10 by data transmission device 20, are that data transmission device 20 provides basic data money Material；

Data processing equipment 20 is managed to apparatus for processing audio 10 and controls, and is that terminal data processing device 30 provides and waits to locate Reason data；

The data of 30 processing data transmitting device of terminal data processing device transmission；

Apparatus for processing audio 10, data link is powered by power supply 40.

Such as Fig. 2, its working-flow is：

Collection birds voice believed signal before this；

Then the transmission of speech data is carried out using radio-frequency module；

Finally data are processed at PC ends, first the voice to gathering utilizes a kind of improved Wiener filtering algorithm noise reduction；Connect Get off and mixing voice is separated based on the DUET Speech separation algorithms that PDTA is clustered to clean speech using a kind of, obtain source voice Number is birds quantity survey number.

2nd, functional part

1）Apparatus for processing audio 10

Apparatus for processing audio 10 includes microphone array 11, amplifier 12 and the audio decoder 13 being sequentially connected；

（1）Microphone array 11

Microphone array 11 adopts multiple high-fidelity type mikes；

Birds voice is acquired, is amplified through 12 signal of amplifier in the birds voice that will be collected.

（2）Amplifier 12

Amplifier of the amplifier 12 using model OPA684 of TI companies；

The voice signal for gathering is amplified, the voice transfer for amplifying signal is decoded to audio decoder 13.

（3）Audio decoder 13

Audio processing chip TLV320AIC34 of the audio decoder 13 using the production of TI companies；

Modulation is decoded to audio signal, and the signal of collection is sent to data transmission device 20.

2）Data transmission device

Communication interface 21 that data transmission device 20 includes being sequentially communicated, controller 22, external expansion interface 23, radio-frequency transmissions mould Block 24, transmitting antenna 25, reception antenna 26, Receiver Module 27.

（1）Communication interface 21

Communication interface 21 is a kind of multi-functional synchronous serial interface, with very strong programmability, is configurable to multiple Synchronous serial interface standard, directly with various device high-speed interfaces；

Communication between responsible external audio processing meanss 10 and controller 22.

（2）Controller 22

TMS320C6713 of the controller 22 using TI companies（225 MHz of dominant frequency）Type dsp chip；

Mainly it is responsible for being managed apparatus for processing audio 10 and radiofrequency emitting module 24 and controlling.

（3）External expansion interface 23

External expansion interface 23 is a kind of interface provided by controller 22；

It is mainly used in connecting radiofrequency emitting module 24, realizes control and management of the controller 22 to radio communication.

（4）Radiofrequency emitting module 24

Wireless microwave equipment of the radiofrequency irradiation module 24 using the external antenna of model VS-5854, transmission power is 28dBm, High receiving sensitivity 802.11a/n agreement, in 5.150-5.850GHz, transmission range is up to 30 kilometers for working frequency range.

（5）Transmitting antenna 25

Transmitting antenna 25 adopts MIMO double antenna patterns, V+H polarity, and the gain of antenna is 28Bi, lobe angle：V：40 degree, H： 60 degree.

（6）Reception antenna（26）

Transmitting antenna（26）Using MIMO double antenna patterns, V+H polarity, the gain of antenna is 28Bi, lobe angle：V：40 degree, H：60 degree.

（7）Receiver Module 27

Receiver Module 27 using model VS-5854 external antenna Wireless microwave equipment, working method be point-to-point, The mode of point-to-multipoint, frequency range passage adjust automatically, eliminates the trouble of manual modification.

3）Terminal data processing device 30

Terminal data processing device 30 includes front latter linked communication interface 31 and PC ends 32；

The data that collection is processed by PC ends 32, based on MATLAB platform processes mixing voices, reach voice point using respective algorithms From purpose.

（1）Communication interface 31

Communication interface 31 is a kind of multi-functional synchronous serial interface, with very strong programmability, is configurable to multiple Synchronous serial interface standard, directly with various device high-speed interfaces；

Communication between responsible Receiver Module 27 and PC ends 32.

（2）PC ends 32

PC ends 32 are computer, and which is embedded with the working software of this method；

The data that sends back are processed by PC ends 32, estimate the number of source voice.

2nd, method

1st, the workflow of voice de-noising

Such as Fig. 3, the workflow of voice de-noising are as follows：

A, system initialization -301；

B, to gather voice pre-removal is carried out to noise using spectrum-subtraction, to improve the signal to noise ratio -302 of input speech signal；

C, the noise power spectrum for calculating front 5 frame signal, used as the initial value -303 of dynamic estimation；

D, carry out end-point detection to Noisy Speech Signal using subband spectrum entropy, and record the starting point of sound section of voice signal with Terminating point -304；

E, judge whether the frame signal is sound section -305, be then entrance step G, otherwise enter step F；

F, renewal noise power spectrum -306；

G, Wiener filtering noise reduction -307.

2nd, the workflow of speech Separation

Such as Fig. 4, the workflow of speech Separation are as follows：

A, to noise reduction after clean speech carry out adding window and framing pretreatment -401；

B, Fourier's change, carry out rarefaction representation -402 to voice signal；

C, normalization are made on the upper semicircumference of speech data point chorologic unit circle, ask each point to arrive（1,0）Radian, to radian value Carry out M region division -403；

D, the probability density -404 for calculating each region；

E, density peaks -405 are obtained using local optimum resolving Algorithm；

F, the corresponding radian value of searching peak value, can calculate hybrid matrix A-406；

G, using DUET algorithms, each for mixing voice data point is projected in all directions of hybrid matrix A, according to projector distance Judge which source voice class is each data point belong to, you can recover each source voice -407.

Claims

1. a kind of bird community quantity survey system, it is characterised in that：

Including apparatus for processing audio（10）, data transmission device（20）, terminal data processing device（30）And power supply（40）；

Apparatus for processing audio（10）, data transmission device（20）With terminal data processing device（30）It is sequentially connected；

Power supply（40）Respectively with apparatus for processing audio（10）And data transmission device（20）Connection.

2. a kind of bird community quantity survey system according to claim 1, it is characterised in that：

Described apparatus for processing audio（10）Including the microphone array being sequentially connected（11）, amplifier（12）And audio decoder （13）.

3. a kind of bird community quantity survey system according to claim 1, it is characterised in that：

Described data transmission device（20）Including the communication interface being sequentially communicated（21）, controller（22）, external expansion interface （23）, radiofrequency irradiation module（24）, radiating antenna（25）, reception antenna（26）And Receiver Module（27）.

4. the bird community quantity survey method based on system described in claim 1,2,3, it is characterised in that：

1. voice de-noising

2. speech Separation

5. the bird community quantity survey method as described in claim 4, it is characterised in that the workflow of voice de-noising is as follows：

A, system initialization（301）；

B, to gather voice pre-removal is carried out to noise using spectrum-subtraction, to improve the signal to noise ratio of input speech signal（302）；

C, the noise power spectrum for calculating front 5 frame signal, used as the initial value of dynamic estimation（303）；

D, carry out end-point detection to Noisy Speech Signal using subband spectrum entropy, and record the starting point of sound section of voice signal with Terminating point（304）；

E, judge whether the frame signal is sound section（305）, it is then to enter step G, otherwise enters step F；

F, renewal noise power spectrum（306）；

G, Wiener filtering noise reduction（307）.

6. the bird community quantity survey method as described in claim 4, its feature are as follows in the workflow of speech Separation：

A, to noise reduction after clean speech carry out adding window and framing pretreatment（401）；

B, Fourier's change, carry out rarefaction representation to voice signal（402）；

C, normalization are made on the upper semicircumference of speech data point chorologic unit circle, ask each point to arrive（1,0）Radian, to radian value Carry out M region division（403）；

D, the probability density for calculating each region（404）；

E, density peaks are obtained using local optimum resolving Algorithm（405）；

F, the corresponding radian value of searching peak value, can calculate hybrid matrix A（406）；

G, using DUET algorithms, each for mixing voice data point is projected in all directions of hybrid matrix A, according to projector distance Judge which source voice class is each data point belong to, you can recover each source voice（407）.