CN103475633A - Voice and video communication engine and extensible communication service framework based such engine - Google Patents
- Publication number
- CN103475633A (application number CN2012102766317A)
- Authority
- CN
- China
- Prior art keywords
- engine
- server
- voice
- video
- communication
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Landscapes
- Data Exchanges In Wide-Area Networks (AREA)
- Telephonic Communication Services (AREA)
Abstract
A voice and video communication engine and an extensible communication service framework based on the engine are provided. The engine is an improved solution that combines digital signal encoding and decoding, network transmission and related processing. It has good adaptability and robustness, can deliver high-quality voice and video communication over an IP network, and addresses network problems such as packet loss and continuous transmission. The engine supports mainstream mobile platforms such as iOS and Android and PC platforms such as Windows, Mac and Linux, and can be widely applied in Internet of Things scenarios such as smart homes. The extensible communication service framework based on the engine has six parts: a client, an authentication server, an application server, a NAT server, a database and a push server. The client is connected to the database via the authentication server, the application server and the NAT server. Because the server cluster places no particular restrictions or requirements on the client, a service platform can be built on the basis of the engine and the framework.
Description
Technical field
The present invention relates to the field of communication engines, and specifically to a voice and video communication engine and an extensible communication service framework based on that engine.
Background art
With the continuing penetration of information technology into daily life and the steady improvement of network infrastructure, demand for real-time voice and video communication keeps growing. VoIP and video chat/conferencing are spreading rapidly in business and personal use worldwide and have become an indispensable part of life in the Internet age. Traditional VoIP technology faces many challenges in voice quality: traditional error-correction components make the sound unnatural; generic jitter buffering cannot address delay and quality at the same time; general-purpose codecs are poorly suited to network voice; client software is affected by the shortcomings of the operating system itself; and traditional echo cancellation is inefficient. In video chat/conferencing, many users report that video delay and freezing are currently the biggest problems.
Summary of the invention
To address these shortcomings, the invention provides a powerful voice and video communication engine and an extensible communication service framework based on that engine.
A voice and video communication engine, characterized in that it embeds an intelligent voice I/O module that can interact with the I/O subsystem of any device, and tightly integrates the following voice enhancement components: acoustic echo cancellation, an echo limiter, automatic gain control, a nonlinear processor, voice activity detection, and anti-howling.
Preferably, the engine uses audio codecs such as G.72x, G.711, GSM, AMR NB/WB, Speex, SILK and iLBC, and the video codecs MPEG-4, H.264 AVC and VP8.
Preferably, the engine includes a dedicated module for IP network transmission and compensation. This module uses the RTP and RTCP protocols, integrates an adaptive jitter buffer submodule and a packet loss controller submodule, and compensates for delay, jitter and packet loss by means of custom algorithms.
Preferably, the engine provides two automatic rate selection algorithms, based on coupling and on bandwidth, together with CPU load control.
An extensible communication service framework based on the voice and video engine has six parts: a client, an authentication server, an application server, a NAT server, a database and a push server. The client is connected to the database through the authentication server, the application server and the NAT server. The authentication server handles user login and verification, the application server implements basic functions such as communication, the NAT server performs NAT traversal, and the push server pushes information to the client.
Compared with the prior art, the advantages of the invention are:
The voice and video communication engine of the invention outperforms existing technology in basic voice transmission, speech encoding and decoding, and video communication. The engine supports more complex network conditions such as 2G and 3G, and also supports more systems, such as iOS and Android.
In practice, application developers have had to rely on multiple suppliers and integrate different technologies to build an IP voice or IP video application, which brings risks such as interoperability problems and unstable quality. The engine comes with a complete development interface and offers secondary developers a complete, tested and mature digital signal and network processing solution. The invention simplifies the development of real-time voice and video communication applications and provides a true one-stop solution.
Brief description of the drawings
Fig. 1 is a structural diagram of a voice and video engine of the invention.
Fig. 2 is a schematic diagram of the extensible communication service framework based on the voice and video engine shown in Fig. 1.
Embodiment
As shown in Fig. 1, the voice and video communication engine of the invention embeds an intelligent voice I/O module that can interact with the I/O subsystem of any device, and tightly integrates the following voice enhancement components: acoustic echo cancellation, an echo limiter, automatic gain control, a nonlinear processor, voice activity detection, and anti-howling.
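The patent names the enhancement components but does not disclose how they work. As a minimal sketch of two of them, the following shows an energy-based voice activity detector gating a simple automatic gain control stage. The function names, the RMS threshold, the target level and the gain cap are all illustrative assumptions, not values from the patent.

```python
import math

def frame_rms(frame):
    """Root-mean-square level of one frame of float samples in [-1.0, 1.0]."""
    if not frame:
        return 0.0
    return math.sqrt(sum(s * s for s in frame) / len(frame))

def is_speech(frame, threshold=0.02):
    """Energy-based VAD: the frame counts as speech when its RMS exceeds
    the threshold (the threshold value is an assumption)."""
    return frame_rms(frame) > threshold

def apply_agc(frame, target_rms=0.1, max_gain=8.0):
    """Scale a frame toward target_rms, capping the gain and clipping
    the result to [-1.0, 1.0]."""
    level = frame_rms(frame)
    if level == 0.0:
        return list(frame)
    gain = min(target_rms / level, max_gain)
    return [max(-1.0, min(1.0, s * gain)) for s in frame]

def enhance(frame):
    """Apply gain only to frames the VAD classifies as speech, so that
    background noise is not amplified."""
    return apply_agc(frame) if is_speech(frame) else list(frame)
```

Gating the gain stage on the detector is one common design choice; a production engine would also smooth the gain across frames to avoid audible pumping.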
In a preferred implementation, the engine uses audio codecs such as G.72x, G.711, GSM, AMR NB/WB, Speex, SILK and iLBC, and the video codecs MPEG-4, H.264 AVC and VP8.
In a preferred implementation, the engine includes a dedicated module for IP network transmission and compensation. This module uses the RTP and RTCP protocols, integrates an adaptive jitter buffer submodule and a packet loss controller submodule, and compensates for delay, jitter and packet loss by means of custom algorithms.
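The custom compensation algorithms are not disclosed in the patent. To illustrate the kind of measurement an adaptive jitter buffer typically relies on, the sketch below uses the standard interarrival jitter estimator from RFC 3550 (section 6.4.1), a hypothetical rule that derives a playout delay from it, and a helper that lists RTP sequence numbers skipped between two packets. The class and function names, the delay bounds and the multiplier are assumptions.

```python
class JitterEstimator:
    """Interarrival jitter estimator as defined in RFC 3550, section 6.4.1.
    Arrival time and RTP timestamp must be given in the same unit
    (milliseconds here, as a simplifying assumption)."""

    def __init__(self):
        self.jitter = 0.0
        self._prev_transit = None

    def update(self, arrival_ms, rtp_timestamp_ms):
        # Transit time is only meaningful as a difference between packets.
        transit = arrival_ms - rtp_timestamp_ms
        if self._prev_transit is not None:
            d = abs(transit - self._prev_transit)
            # Exponential smoothing with gain 1/16, per RFC 3550.
            self.jitter += (d - self.jitter) / 16.0
        self._prev_transit = transit
        return self.jitter

def target_delay_ms(jitter_ms, min_ms=20.0, max_ms=200.0, factor=4.0):
    """Hypothetical adaptation rule: buffer a multiple of the measured
    jitter, clamped to a fixed range."""
    return max(min_ms, min(max_ms, factor * jitter_ms))

def missing_sequence_numbers(prev_seq, seq):
    """Sequence numbers skipped between two packets received in order,
    honouring the 16-bit wraparound of RTP sequence numbers."""
    gap = (seq - prev_seq) % 65536
    return [(prev_seq + i) % 65536 for i in range(1, gap)]
```

A packet loss controller could feed the missing sequence numbers into concealment or retransmission logic; that part is left out because the patent gives no detail on it.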
In a preferred implementation, the engine provides two automatic rate selection algorithms, based on coupling and on bandwidth, together with CPU load control.
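The patent does not describe either rate selection algorithm. The sketch below illustrates only the bandwidth-based idea combined with CPU load control: pick the highest bitrate from a ladder that fits within the estimated bandwidth, then step down one level when the CPU is overloaded. The ladder values, the headroom factor and the CPU limit are hypothetical.

```python
# Hypothetical codec bitrate ladder in kbit/s, sorted in ascending order.
BITRATE_LADDER_KBPS = [8, 16, 24, 32, 64]

def select_bitrate(bandwidth_kbps, cpu_load, ladder=BITRATE_LADDER_KBPS,
                   headroom=0.8, cpu_limit=0.85):
    """Return the highest ladder bitrate that fits in the bandwidth budget,
    stepping down one level when cpu_load (0.0 to 1.0) exceeds cpu_limit."""
    budget = bandwidth_kbps * headroom
    # The ladder is sorted, so the fitting rates form a prefix of it.
    fitting = [rate for rate in ladder if rate <= budget]
    index = len(fitting) - 1 if fitting else 0  # fall back to the lowest rate
    if cpu_load > cpu_limit and index > 0:
        index -= 1
    return ladder[index]
```

For example, with an estimated 100 kbit/s and a lightly loaded CPU the function picks 64 kbit/s; with the same bandwidth and a CPU load of 0.95 it drops to 32 kbit/s.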
As shown in Fig. 2, the extensible communication service framework of the invention, based on the voice and video engine, has six parts: a client, an authentication server, an application server, a NAT server, a database and a push server. The client is connected to the database through the authentication server, the application server and the NAT server. The authentication server handles user login and verification, the application server implements basic functions such as communication, the NAT server performs NAT traversal, and the push server pushes information to the client. The voice and video engine of the invention can be widely used to develop voice and video communication applications. Applications currently using the engine include world-famous VoIP software such as WowTalk and Ringit.
The basic communication flow of an application built on the engine is as follows. Each application has its own independent database. The authentication server, the application server and the NAT server are each connected to the database separately. Suppose client A wants to communicate with client B. Client A first registers and logs in through the authentication server, then completes routing and sends chat messages through the application server, and then completes NAT traversal through the NAT server. Finally, the application server notifies client B through the push server in order to set up the actual communication link, after which A and B can carry out voice or video communication.
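The order of these steps can be sketched as a small in-memory simulation. It models only the sequence described above; the class name, method names and event labels are hypothetical, and no real networking, NAT traversal or push delivery is performed.

```python
class CallSession:
    """Toy model of the call setup sequence: authentication, routing,
    NAT traversal, push notification, then the media link."""

    def __init__(self):
        self.logged_in = set()
        self.events = []

    def login(self, client):
        # Step 1: the authentication server registers and logs in the client.
        self.logged_in.add(client)
        self.events.append(("auth", client))

    def start_call(self, caller, callee):
        if caller not in self.logged_in:
            raise PermissionError(f"{caller} must log in first")
        # Step 2: the application server routes the request.
        self.events.append(("route", caller, callee))
        # Step 3: the NAT server completes NAT traversal.
        self.events.append(("nat", caller, callee))
        # Step 4: the application server notifies the callee via the push server.
        self.events.append(("push", callee))
        # Step 5: the voice or video link is established.
        self.events.append(("connected", caller, callee))
        return True
```

A typical use would be `session = CallSession(); session.login("A"); session.start_call("A", "B")`, after which `session.events` lists the five steps in order.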
The secondary development programming interface of the engine is designed for simplicity of use. Whether a developer is experienced or inexperienced in building VoIP and Video over IP applications, the engine helps them write their own application smoothly.
Innovative point:
1. The engine suits a variety of network environments and ensures smooth voice transmission even on narrowband networks.
2. A speech encoding and decoding module with low hardware requirements has been designed and implemented.
3. The engine supports mainstream mobile platforms such as iOS and Android and PC platforms such as Windows, Mac and Linux.
4. The engine greatly simplifies secondary development of voice and video communication applications: developers no longer need to deal with the underlying communication protocols, and the maintainability and performance of the whole application can be ensured.
The embodiments above merely illustrate the technical concept and features of the invention. Their purpose is to enable a person of ordinary skill in the art to understand the invention and implement it accordingly; they do not limit the scope of protection of the invention. Any equivalent change or modification made according to the essence of the invention shall fall within its scope of protection.
Claims (5)
1. A voice and video communication engine, characterized in that it embeds an intelligent voice I/O module that can interact with the I/O subsystem of any device, and tightly integrates the following voice enhancement components: acoustic echo cancellation, an echo limiter, automatic gain control, a nonlinear processor, voice activity detection, and anti-howling.
2. The voice and video communication engine according to claim 1, characterized in that it uses audio codecs such as G.72x, G.711, GSM, AMR NB/WB, Speex, SILK and iLBC, and the video codecs MPEG-4, H.264 AVC and VP8.
3. The voice and video communication engine according to claim 1, characterized in that it includes a dedicated module for IP network transmission and compensation, wherein the module uses the RTP and RTCP protocols, integrates an adaptive jitter buffer submodule and a packet loss controller submodule, and compensates for delay, jitter and packet loss by means of custom algorithms.
4. The voice and video engine according to claim 1, characterized in that it implements audio and video synchronization, two automatic rate selection algorithms based on coupling and on bandwidth, and CPU load control.
5. An extensible communication service framework based on the voice and video engine, characterized in that the framework has six parts: a client, an authentication server, an application server, a NAT server, a database and a push server; the client is connected to the database through the authentication server, the application server and the NAT server; the authentication server handles user login and verification, the application server implements basic functions such as communication, the NAT server performs NAT traversal, and the push server pushes information to the client.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN2012102766317A CN103475633A (en) | 2012-08-06 | 2012-08-06 | Voice and video communication engine and extensible communication service framework based such engine |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN2012102766317A CN103475633A (en) | 2012-08-06 | 2012-08-06 | Voice and video communication engine and extensible communication service framework based such engine |
Publications (1)
Publication Number | Publication Date |
---|---|
CN103475633A (en) | 2013-12-25 |
Family
ID=49800331
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN2012102766317A Pending CN103475633A (en) | 2012-08-06 | 2012-08-06 | Voice and video communication engine and extensible communication service framework based such engine |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN103475633A (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108282685A (en) * | 2018-01-04 | 2018-07-13 | 华南师范大学 | A kind of method and monitoring system of audio-visual synchronization |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20100260074A1 (en) * | 2009-04-09 | 2010-10-14 | Nortel Networks Limited | Enhanced communication bridge |
CN102461141A (en) * | 2009-04-14 | 2012-05-16 | 思杰系统有限公司 | Systems and methods for computer and voice conference audio transmission during conference call via pstn phone |
- 2012-08-06: CN application CN2012102766317A, published as CN103475633A (en), status: active, Pending
Non-Patent Citations (2)
Title |
---|
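王文亮 (Wang Wenliang): "Research and Implementation of Multimedia Technology for a P2P Multimedia Group Communication Platform", China Master's Theses Full-text Database, Information Science and Technology * |
王立伟 (Wang Liwei): "Research and Implementation of Video and Audio Engine Technology for Multimedia Clients", China Master's Theses Full-text Database, Information Science and Technology * |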
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US10819757B2 (en) | System and method for real-time communication by using a client application communication protocol | |
US7804954B2 (en) | Infrastructure for enabling high quality real-time audio | |
CN103475793B (en) | Attaching terminal is used to call out | |
US9544340B2 (en) | Application programming interface enabling communication features for different communication protocols | |
US20140348044A1 (en) | Real-Time Rich Communications Client Architecture | |
CN107005589A (en) | service ability in heterogeneous network | |
JP2016508357A (en) | Wireless real-time media communication using multiple media streams | |
WO2016184001A1 (en) | Video monitoring processing method and apparatus | |
CN105554029A (en) | Method for realizing media intercommunication between WebRTC terminal and SIP terminal and media gateway | |
Xue et al. | A WebRTC-based video conferencing system with screen sharing | |
US20070115949A1 (en) | Infrastructure for enabling high quality real-time audio | |
WO2021073155A1 (en) | Video conference method, apparatus and device, and storage medium | |
US20160149984A1 (en) | Method and system for providing remote transcoding of media data on a voip system | |
US9961209B2 (en) | Codec selection optimization | |
US10469667B2 (en) | Conferencing system including a remote microphone and method of using the same | |
WO2012174908A1 (en) | Method, device and system for realizing audio transcoding of text to speech | |
WO2021017807A1 (en) | Call connection establishment method, first terminal, server, and storage medium | |
CN103475633A (en) | Voice and video communication engine and extensible communication service framework based such engine | |
US20150237524A1 (en) | Method for transmitting audio information and packet communication system | |
CN106973300A (en) | A kind of mobile Internet net cast platform | |
JP2024529655A (en) | Supporting quality of service for media communications | |
Kaul et al. | Opus and session initiation protocol security in voice over IP (VOIP) | |
WO2020106541A1 (en) | Interface and authorization for cross-network communications | |
Hsu et al. | Improving the efficiency of presence service in IMS by JSON | |
JP6183881B2 (en) | Codec conversion gateway, codec conversion method, and codec conversion program |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C02 | Deemed withdrawal of patent application after publication (patent law 2001) | ||
WD01 | Invention patent application deemed withdrawn after publication |
Application publication date: 20131225 |