CN104539904A - Visual intercom echo processing system for building - Google Patents

Visual intercom echo processing system for building Download PDF

Info

Publication number
CN104539904A
CN104539904A CN201410847190.0A CN201410847190A CN104539904A CN 104539904 A CN104539904 A CN 104539904A CN 201410847190 A CN201410847190 A CN 201410847190A CN 104539904 A CN104539904 A CN 104539904A
Authority
CN
China
Prior art keywords
audio
module
audio stream
echo
processing system
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201410847190.0A
Other languages
Chinese (zh)
Inventor
黄伟雄
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
ZHUHAI TAICHUAN CLOUD TECHNOLOGY Co Ltd
Original Assignee
ZHUHAI TAICHUAN CLOUD TECHNOLOGY Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by ZHUHAI TAICHUAN CLOUD TECHNOLOGY Co Ltd filed Critical ZHUHAI TAICHUAN CLOUD TECHNOLOGY Co Ltd
Priority to CN201410847190.0A priority Critical patent/CN104539904A/en
Publication of CN104539904A publication Critical patent/CN104539904A/en
Pending legal-status Critical Current

Links

Abstract

The invention provides a visual intercom echo processing system for a building. According to the visual intercom echo processing system for the building, manual operation is not needed, and voice communication quality is high. The visual intercom echo processing system for the building comprises a plurality of visual intercom terminals, an Ethernet networking mode is adopted, and each visual intercom terminal comprises a central processor, a power unit, an Ethernet control unit, a camera, a display screen, an input unit, an audio coder and decoder, a microphone and a horn, wherein the power unit, the Ethernet control unit, the camera, the display screen, the input unit, the audio coder and decoder, the microphone and the horn are electrically connected with the central processor, and the microphone and the horn are electrically connected with the audio coder and decoder. The system is characterized in that an embedded system is arranged in the central processor and comprises an audio receiving module, a synchronization module, an echo eliminating module, an audio coding module and a transmitting module. The visual intercom echo processing system for the building can be widely applied to the field of visual intercoms of buildings.

Description

Building visual intercommunication echo processing system
Technical field
The present invention relates to a kind of building visual intercommunication echo processing system.
Background technology
Current storied building visible intercommunication system roughly has 3 kinds to the processing method of the echo of sound: 1. adopt earphone type sound reproduction structure, avoids echo by reducing sound reproduction volume; 2. adopt the mode of operation pressing intercommunication, determined the direction of voice transfer in the single time by resident family by button, avoid echogenicity with semiduplex transmission means; 3. determined the direction of voice transfer in the single time by the height of speech level, select direction the opposing party transmission that speech level is relatively high, avoid echogenicity with semiduplex transmission means.There is following problem and defect in above-mentioned building talkback echo processing method: 1. needs hand-held intercommunication inconvenient respectively.2. need manual operation voice transfer direction inconvenient.3. side's voice that sound is large are just transmitted, and side's voice that sound is little will be left in the basket, and the side that sounding is little feels passive in intercommunication.
Summary of the invention
Technical problem to be solved by this invention overcomes the deficiencies in the prior art, provide a kind of need not manual operation and the high building visual intercommunication echo processing system of voice call quality.
The technical solution adopted in the present invention is: the present invention includes multiple video interphone terminal and adopt Ethernet networking mode, the power subsystem that described video interphone terminal comprises central processing unit and is electrically connected with described central processing unit, Ethernet control unit, camera, display screen, input unit, audio codec and the microphone be electrically connected with described audio codec, loudspeaker, embedded system is provided with in described central processing unit, described embedded system comprises audio frequency receiver module, synchronization module, echo cancellation module, audio coding module and sending module, wherein:
Audio frequency receiver module, for the audio stream after received code compression;
Synchronization module, the audio stream sent for the audio stream recorded this locality and opposite end carries out data syn-chronization;
Echo cancellation module, the audio stream sent described opposite end filters as with reference to audio stream the echo in the audio stream of described this locality recording;
Audio coding module, for carrying out compression coding to the audio stream after filtration;
Sending module, for sending the audio stream of compression coding by network.
Further, described synchronization module comprises records buffering area, reference buffer district and data synchronisation unit, wherein:
Record buffering area, carry out buffer memory for the audio stream recorded described this locality;
Reference buffer district, for flowing to row cache to described reference audio;
Data synchronisation unit, for when described reference buffer district receives data, the audio stream record described this locality and described reference audio stream carry out data syn-chronization.
Further, described echo cancellation module is used for the time period according to the fixed intervals preset, check the frame data of audio stream that opposite end sends, when frame data are less than the predetermined buffer value of speex algorithm, the echo in the audio stream recorded described this locality by speex algorithm is filtered; When the frame data of audio stream sent when opposite end are greater than speex algorithm predetermined buffer value, then discarded part downlink data upon handover.
Further, described embedded system also comprises audio decoder module and audio playing module, wherein:
Described audio decoder module, for reducing the audio stream after the compression coding that receives;
Described audio playing module, for playing the audio stream of reduction.
The invention has the beneficial effects as follows: multiple video interphone terminal can be comprised due to the present invention and adopt Ethernet networking mode, the power subsystem that described video interphone terminal comprises central processing unit and is electrically connected with described central processing unit, Ethernet control unit, camera, display screen, input unit, audio codec and the microphone be electrically connected with described audio codec, loudspeaker, embedded system is provided with in described central processing unit, described embedded system comprises audio frequency receiver module, synchronization module, echo cancellation module, audio coding module and sending module, wherein: audio frequency receiver module, for the audio stream after received code compression, synchronization module, the audio stream sent for the audio stream recorded this locality and opposite end carries out data syn-chronization, echo cancellation module, the audio stream sent described opposite end filters as with reference to audio stream the echo in the audio stream of described this locality recording, audio coding module, for carrying out compression coding to the audio stream after filtration, sending module, for sending the audio stream of compression coding by network.The audio stream sent by the audio stream recorded this locality and opposite end carries out data syn-chronization, the audio stream sent opposite end is as the reference audio of filtering echo, optimize encoding and decoding and eliminate echo, improve the intercommunication speech quality of voice-over-net, intercommunication both sides voice can transmit by real time full duplex, and period is without any need for manual operation, can hands-free way intercommunication, liberation both hands, improve intercommunication and experience, so the present invention need not manual operation and voice call quality is high.
Accompanying drawing explanation
Fig. 1 is system principle diagram of the present invention;
Fig. 2 is the function structure chart of central processing unit of the present invention.
Embodiment
As Fig. 1, shown in Fig. 2, the present invention can comprise multiple video interphone terminal and adopt Ethernet networking mode, the power subsystem 2 that described video interphone terminal comprises central processing unit 1 and is electrically connected with described central processing unit 1, Ethernet control unit 3, camera 4, display screen 5, input unit 6, audio codec 7 and the microphone 8 be electrically connected with described audio codec 7, loudspeaker 9, embedded system is provided with in described central processing unit 1, described embedded system comprises audio frequency receiver module 11, synchronization module 12, echo cancellation module 13, audio coding module 14, audio decoder module 16, audio playing module 17 and sending module 15, wherein:
Audio frequency receiver module 11, for the audio stream after received code compression;
Synchronization module 12, the audio stream sent for the audio stream recorded this locality and opposite end carries out data syn-chronization, described synchronization module comprises records buffering area 1201, reference buffer district 1202 and data synchronisation unit 1203, wherein: record buffering area 1201, the audio stream for recording described this locality carries out buffer memory; Reference buffer district 1202, for flowing to row cache to described reference audio; Data synchronisation unit 1203, for when described reference buffer district 1202 receives data, the audio stream record described this locality and described reference audio stream carry out data syn-chronization;
Echo cancellation module 13, the audio stream sent described opposite end filters as with reference to audio stream the echo in the audio stream of described this locality recording, described echo cancellation module is used for the time period according to the fixed intervals preset, check the frame data of the audio stream that opposite end sends, when frame data are less than the predetermined buffer value of speex algorithm, the echo in the audio stream recorded described this locality by speex algorithm is filtered; When the frame data of audio stream sent when opposite end are greater than speex algorithm predetermined buffer value, then discarded part downlink data upon handover;
Audio coding module 14, for carrying out compression coding to the audio stream after filtration;
Sending module 15, for sending the audio stream of compression coding by network;
Audio decoder module 16, for reducing the audio stream after the compression coding that receives;
Audio playing module 17, for playing the audio stream of reduction.
Intercommunication product receives by Ethernet the speech data that the opposing party sends.Embedded system is decoded to the speech data received by universal audio codec, buffer memory, D/A switch are play by loudspeaker after also amplifying, due to the relation of acoustic propagation medium and operating system process time delay, data voice playback with original buffer memory contrasts by the data through analog/digital conversion that microphone collects after a specific time delay, offset the part identical with the speech data of original buffer memory, reach echo cancellor effect.Contrast computing is pressed voice frame time sheet and is performed, and each speech frame is 20 milliseconds.Data after contrast computing is filtered are real user's communication data, by sending the other side to by Ethernet after compressed encoding, reaching in intercommunication and eliminating echo.

Claims (4)

1. a building visual intercommunication echo processing system, it can comprise multiple video interphone terminal and adopt Ethernet networking mode, the power subsystem that described video interphone terminal comprises central processing unit and is electrically connected with described central processing unit, Ethernet control unit, camera, display screen, input unit, audio codec and the microphone be electrically connected with described audio codec, loudspeaker, it is characterized in that: in described central processing unit, be provided with embedded system, described embedded system comprises audio frequency receiver module, synchronization module, echo cancellation module, audio coding module and sending module, wherein:
Audio frequency receiver module, for the audio stream after received code compression;
Synchronization module, the audio stream sent for the audio stream recorded this locality and opposite end carries out data syn-chronization;
Echo cancellation module, the audio stream sent described opposite end filters as with reference to audio stream the echo in the audio stream of described this locality recording;
Audio coding module, for carrying out compression coding to the audio stream after filtration;
Sending module, for sending the audio stream of compression coding by network.
2. according to the building visual intercommunication echo processing system described in claim 1, it is characterized in that: described synchronization module comprises records buffering area, reference buffer district and data synchronisation unit, wherein:
Record buffering area, carry out buffer memory for the audio stream recorded described this locality;
Reference buffer district, for flowing to row cache to described reference audio;
Data synchronisation unit, for when described reference buffer district receives data, the audio stream record described this locality and described reference audio stream carry out data syn-chronization.
3. according to a kind of building visual intercommunication echo processing system described in claim 1, it is characterized in that: described echo cancellation module is used for the time period according to the fixed intervals preset, check the frame data of the audio stream that opposite end sends, when frame data are less than the predetermined buffer value of speex algorithm, the echo in the audio stream recorded described this locality by speex algorithm is filtered; When the frame data of audio stream sent when opposite end are greater than speex algorithm predetermined buffer value, then discarded part downlink data upon handover.
4., according to the building visual intercommunication echo processing system described in claim 1, it is characterized in that: described embedded system also comprises audio decoder module and audio playing module, wherein:
Described audio decoder module, for reducing the audio stream after the compression coding that receives;
Described audio playing module, for playing the audio stream of reduction.
CN201410847190.0A 2014-12-31 2014-12-31 Visual intercom echo processing system for building Pending CN104539904A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201410847190.0A CN104539904A (en) 2014-12-31 2014-12-31 Visual intercom echo processing system for building

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201410847190.0A CN104539904A (en) 2014-12-31 2014-12-31 Visual intercom echo processing system for building

Publications (1)

Publication Number Publication Date
CN104539904A true CN104539904A (en) 2015-04-22

Family

ID=52855363

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201410847190.0A Pending CN104539904A (en) 2014-12-31 2014-12-31 Visual intercom echo processing system for building

Country Status (1)

Country Link
CN (1) CN104539904A (en)

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2011508990A (en) * 2007-11-29 2011-03-17 テレフオンアクチーボラゲット エル エム エリクソン(パブル) Method and apparatus for echo cancellation of audio signals
CN103607669A (en) * 2013-10-12 2014-02-26 公安部第三研究所 Detection method and detection system for audio frequency transmission characteristics of building intercom system
US20160261994A1 (en) * 2013-10-17 2016-09-08 Zte Corporation Method and Device for Realizing Terminal WIFI Talkback

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2011508990A (en) * 2007-11-29 2011-03-17 テレフオンアクチーボラゲット エル エム エリクソン(パブル) Method and apparatus for echo cancellation of audio signals
CN103607669A (en) * 2013-10-12 2014-02-26 公安部第三研究所 Detection method and detection system for audio frequency transmission characteristics of building intercom system
US20160261994A1 (en) * 2013-10-17 2016-09-08 Zte Corporation Method and Device for Realizing Terminal WIFI Talkback

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
孙克辉等: "基于GM8126的数字可视对讲系统设计与实现", 《计算机工程与设计》 *
杨果等: "Speex编码器中回声消除算法的分析与评估", 《电声技术》 *

Similar Documents

Publication Publication Date Title
KR102569374B1 (en) How to operate a Bluetooth device
CN110010139B (en) Audio input/output method, system and computer readable storage medium
CN112135285B (en) Real-time audio interaction method for multi-Bluetooth audio equipment
JP2010081220A5 (en) Digital information signal transmitting / receiving apparatus and digital information signal transmitting / receiving method
CN103905928A (en) Network voice intercom method, device and system
CN101583009A (en) Video terminal and method thereof for realizing interface content sharing
CN112770212B (en) Wireless earphone, video recording system and method, and storage medium
JP2013141246A (en) Video apparatus and control method therefor
CN112261633B (en) Audio recording and converting method for intelligent earphone
CN104469587A (en) Earphones
CN101931816A (en) Method and system for transferring video from 3G mobile phone to TV
CN103442427B (en) Method of data synchronization, Apparatus and system, echo cancel method and system
CN213906675U (en) Portable wireless bluetooth recording equipment
CN111246331A (en) Wireless panorama sound mixing sound earphone
CN103873714A (en) Communication method, call initiating end device and call receiving end device
CN200990664Y (en) Television set capable of realizing long-distance video frequency conversational function
CN102402851A (en) Remote controller, receiver and sound remote control method
WO2020232631A1 (en) Voice frequency division transmission method, source terminal, playback terminal, source terminal circuit and playback terminal circuit
JP5447034B2 (en) Remote conference apparatus and remote conference method
WO2016045233A1 (en) Communication device capable of collecting acoustic field information and communication method
CN104539904A (en) Visual intercom echo processing system for building
CN203368633U (en) Video monitoring system
CN114584659A (en) Audio processing unit of conference system
CN102348007B (en) Method and mobile terminal of realizing bidirectional call recording in packet switched domain
CN218941188U (en) Wireless live broadcast system with guide broadcasting function

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20150422

RJ01 Rejection of invention patent application after publication