CN104539904A

CN104539904A - Visual intercom echo processing system for building

Info

Publication number: CN104539904A
Application number: CN201410847190.0A
Authority: CN
Inventors: 黄伟雄
Original assignee: ZHUHAI TAICHUAN CLOUD TECHNOLOGY Co Ltd
Current assignee: ZHUHAI TAICHUAN CLOUD TECHNOLOGY Co Ltd
Priority date: 2014-12-31
Filing date: 2014-12-31
Publication date: 2015-04-22

Abstract

The invention provides a visual intercom echo processing system for a building. According to the visual intercom echo processing system for the building, manual operation is not needed, and voice communication quality is high. The visual intercom echo processing system for the building comprises a plurality of visual intercom terminals, an Ethernet networking mode is adopted, and each visual intercom terminal comprises a central processor, a power unit, an Ethernet control unit, a camera, a display screen, an input unit, an audio coder and decoder, a microphone and a horn, wherein the power unit, the Ethernet control unit, the camera, the display screen, the input unit, the audio coder and decoder, the microphone and the horn are electrically connected with the central processor, and the microphone and the horn are electrically connected with the audio coder and decoder. The system is characterized in that an embedded system is arranged in the central processor and comprises an audio receiving module, a synchronization module, an echo eliminating module, an audio coding module and a transmitting module. The visual intercom echo processing system for the building can be widely applied to the field of visual intercoms of buildings.

Description

Building visual intercommunication echo processing system

Technical field

The present invention relates to a kind of building visual intercommunication echo processing system.

Background technology

Current storied building visible intercommunication system roughly has 3 kinds to the processing method of the echo of sound: 1. adopt earphone type sound reproduction structure, avoids echo by reducing sound reproduction volume; 2. adopt the mode of operation pressing intercommunication, determined the direction of voice transfer in the single time by resident family by button, avoid echogenicity with semiduplex transmission means; 3. determined the direction of voice transfer in the single time by the height of speech level, select direction the opposing party transmission that speech level is relatively high, avoid echogenicity with semiduplex transmission means.There is following problem and defect in above-mentioned building talkback echo processing method: 1. needs hand-held intercommunication inconvenient respectively.2. need manual operation voice transfer direction inconvenient.3. side's voice that sound is large are just transmitted, and side's voice that sound is little will be left in the basket, and the side that sounding is little feels passive in intercommunication.

Summary of the invention

Technical problem to be solved by this invention overcomes the deficiencies in the prior art, provide a kind of need not manual operation and the high building visual intercommunication echo processing system of voice call quality.

The technical solution adopted in the present invention is: the present invention includes multiple video interphone terminal and adopt Ethernet networking mode, the power subsystem that described video interphone terminal comprises central processing unit and is electrically connected with described central processing unit, Ethernet control unit, camera, display screen, input unit, audio codec and the microphone be electrically connected with described audio codec, loudspeaker, embedded system is provided with in described central processing unit, described embedded system comprises audio frequency receiver module, synchronization module, echo cancellation module, audio coding module and sending module, wherein:

Audio frequency receiver module, for the audio stream after received code compression;

Synchronization module, the audio stream sent for the audio stream recorded this locality and opposite end carries out data syn-chronization;

Echo cancellation module, the audio stream sent described opposite end filters as with reference to audio stream the echo in the audio stream of described this locality recording;

Audio coding module, for carrying out compression coding to the audio stream after filtration;

Sending module, for sending the audio stream of compression coding by network.

Further, described synchronization module comprises records buffering area, reference buffer district and data synchronisation unit, wherein:

Record buffering area, carry out buffer memory for the audio stream recorded described this locality;

Reference buffer district, for flowing to row cache to described reference audio;

Data synchronisation unit, for when described reference buffer district receives data, the audio stream record described this locality and described reference audio stream carry out data syn-chronization.

Further, described echo cancellation module is used for the time period according to the fixed intervals preset, check the frame data of audio stream that opposite end sends, when frame data are less than the predetermined buffer value of speex algorithm, the echo in the audio stream recorded described this locality by speex algorithm is filtered; When the frame data of audio stream sent when opposite end are greater than speex algorithm predetermined buffer value, then discarded part downlink data upon handover.

Further, described embedded system also comprises audio decoder module and audio playing module, wherein:

Described audio decoder module, for reducing the audio stream after the compression coding that receives;

Described audio playing module, for playing the audio stream of reduction.

The invention has the beneficial effects as follows: multiple video interphone terminal can be comprised due to the present invention and adopt Ethernet networking mode, the power subsystem that described video interphone terminal comprises central processing unit and is electrically connected with described central processing unit, Ethernet control unit, camera, display screen, input unit, audio codec and the microphone be electrically connected with described audio codec, loudspeaker, embedded system is provided with in described central processing unit, described embedded system comprises audio frequency receiver module, synchronization module, echo cancellation module, audio coding module and sending module, wherein: audio frequency receiver module, for the audio stream after received code compression, synchronization module, the audio stream sent for the audio stream recorded this locality and opposite end carries out data syn-chronization, echo cancellation module, the audio stream sent described opposite end filters as with reference to audio stream the echo in the audio stream of described this locality recording, audio coding module, for carrying out compression coding to the audio stream after filtration, sending module, for sending the audio stream of compression coding by network.The audio stream sent by the audio stream recorded this locality and opposite end carries out data syn-chronization, the audio stream sent opposite end is as the reference audio of filtering echo, optimize encoding and decoding and eliminate echo, improve the intercommunication speech quality of voice-over-net, intercommunication both sides voice can transmit by real time full duplex, and period is without any need for manual operation, can hands-free way intercommunication, liberation both hands, improve intercommunication and experience, so the present invention need not manual operation and voice call quality is high.

Accompanying drawing explanation

Fig. 1 is system principle diagram of the present invention;

Fig. 2 is the function structure chart of central processing unit of the present invention.

Embodiment

As Fig. 1, shown in Fig. 2, the present invention can comprise multiple video interphone terminal and adopt Ethernet networking mode, the power subsystem 2 that described video interphone terminal comprises central processing unit 1 and is electrically connected with described central processing unit 1, Ethernet control unit 3, camera 4, display screen 5, input unit 6, audio codec 7 and the microphone 8 be electrically connected with described audio codec 7, loudspeaker 9, embedded system is provided with in described central processing unit 1, described embedded system comprises audio frequency receiver module 11, synchronization module 12, echo cancellation module 13, audio coding module 14, audio decoder module 16, audio playing module 17 and sending module 15, wherein:

Audio frequency receiver module 11, for the audio stream after received code compression;

Synchronization module 12, the audio stream sent for the audio stream recorded this locality and opposite end carries out data syn-chronization, described synchronization module comprises records buffering area 1201, reference buffer district 1202 and data synchronisation unit 1203, wherein: record buffering area 1201, the audio stream for recording described this locality carries out buffer memory; Reference buffer district 1202, for flowing to row cache to described reference audio; Data synchronisation unit 1203, for when described reference buffer district 1202 receives data, the audio stream record described this locality and described reference audio stream carry out data syn-chronization;

Echo cancellation module 13, the audio stream sent described opposite end filters as with reference to audio stream the echo in the audio stream of described this locality recording, described echo cancellation module is used for the time period according to the fixed intervals preset, check the frame data of the audio stream that opposite end sends, when frame data are less than the predetermined buffer value of speex algorithm, the echo in the audio stream recorded described this locality by speex algorithm is filtered; When the frame data of audio stream sent when opposite end are greater than speex algorithm predetermined buffer value, then discarded part downlink data upon handover;

Audio coding module 14, for carrying out compression coding to the audio stream after filtration;

Sending module 15, for sending the audio stream of compression coding by network;

Audio decoder module 16, for reducing the audio stream after the compression coding that receives;

Audio playing module 17, for playing the audio stream of reduction.

Intercommunication product receives by Ethernet the speech data that the opposing party sends.Embedded system is decoded to the speech data received by universal audio codec, buffer memory, D/A switch are play by loudspeaker after also amplifying, due to the relation of acoustic propagation medium and operating system process time delay, data voice playback with original buffer memory contrasts by the data through analog/digital conversion that microphone collects after a specific time delay, offset the part identical with the speech data of original buffer memory, reach echo cancellor effect.Contrast computing is pressed voice frame time sheet and is performed, and each speech frame is 20 milliseconds.Data after contrast computing is filtered are real user's communication data, by sending the other side to by Ethernet after compressed encoding, reaching in intercommunication and eliminating echo.

Claims

1. a building visual intercommunication echo processing system, it can comprise multiple video interphone terminal and adopt Ethernet networking mode, the power subsystem that described video interphone terminal comprises central processing unit and is electrically connected with described central processing unit, Ethernet control unit, camera, display screen, input unit, audio codec and the microphone be electrically connected with described audio codec, loudspeaker, it is characterized in that: in described central processing unit, be provided with embedded system, described embedded system comprises audio frequency receiver module, synchronization module, echo cancellation module, audio coding module and sending module, wherein:

Sending module, for sending the audio stream of compression coding by network.

2. according to the building visual intercommunication echo processing system described in claim 1, it is characterized in that: described synchronization module comprises records buffering area, reference buffer district and data synchronisation unit, wherein:

3. according to a kind of building visual intercommunication echo processing system described in claim 1, it is characterized in that: described echo cancellation module is used for the time period according to the fixed intervals preset, check the frame data of the audio stream that opposite end sends, when frame data are less than the predetermined buffer value of speex algorithm, the echo in the audio stream recorded described this locality by speex algorithm is filtered; When the frame data of audio stream sent when opposite end are greater than speex algorithm predetermined buffer value, then discarded part downlink data upon handover.

4., according to the building visual intercommunication echo processing system described in claim 1, it is characterized in that: described embedded system also comprises audio decoder module and audio playing module, wherein:

Described audio playing module, for playing the audio stream of reduction.