CN103475633A - Voice and video communication engine and extensible communication service framework based such engine - Google Patents

Voice and video communication engine and extensible communication service framework based such engine

Publication number
CN103475633A
Authority
CN
China
Prior art keywords
engine
server
voice
video
communication
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN2012102766317A
Other languages
Chinese (zh)
Inventor
陈奕
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
WOWTECH Inc
Original Assignee
WOWTECH Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by WOWTECH Inc
Priority to CN2012102766317A
Publication of CN103475633A
Legal status: Pending

Landscapes

  • Data Exchanges In Wide-Area Networks (AREA)
  • Telephonic Communication Services (AREA)

Abstract

A voice and video communication engine and an extensible communication service framework based on this engine are provided. The engine is an improved solution that combines digital signal encoding and decoding, network transmission, and related processing. It is adaptable and robust, achieves high-quality voice and video communication over an IP network, and addresses network problems such as packet loss and continuous transmission. The engine supports mainstream mobile platforms such as iOS and Android and PC platforms such as Windows, Mac and Linux, and can be widely applied to the Internet of Things, for example in smart homes. The extensible communication service framework based on the engine has six parts: a client, an authentication server, an application server, a NAT server, a database and a push server. The client is connected to the database through the authentication server, the application server and the NAT server. Because the server cluster places no particular restrictions or requirements on the client, a service platform can be built on the basis of the engine and the framework.

Description

Voice and video communication engine and extensible communication service framework based on this engine
Technical field
The present invention relates to the field of communication engines, and in particular to a voice and video communication engine and an extensible communication service framework based on this engine.
Background technology
As information technology continues to permeate daily life and network infrastructure continues to improve, demand for real-time voice and video communication keeps growing. VoIP and video chat/conferencing are spreading rapidly in business and personal applications worldwide and have become an indispensable part of life in the Internet age. Traditional VoIP technology faces many challenges in voice quality: traditional error-correction components make the sound unnatural; generic jitter-buffer techniques cannot solve the delay problem and the quality problem at the same time; general-purpose codecs are not suited to network voice; client software is affected by the shortcomings of the operating system itself; and traditional echo cancellation is inefficient. In video chat/conferencing, many users report that video delay and freezing are currently the biggest problems.
 
Summary of the invention
To address these shortcomings, the invention provides a powerful voice and video communication engine and an extensible communication service framework based on this engine.
A voice and video communication engine, characterized in that it embeds an intelligent voice I/O module that can interact with the I/O subsystem of any device, and tightly integrates the following voice-enhancement components: acoustic echo cancellation, an echo limiter, automatic gain control, a nonlinear processor, voice activity detection, and anti-howling.
Preferably, the engine uses audio codecs such as G.72x, G.711, GSM, AMR NB/WB, Speex, SILK and iLBC, and the video codecs MPEG-4, H.264 AVC and VP8.
Preferably, the engine includes a dedicated module for IP network transmission and compensation. This module uses the RTP and RTCP protocols and integrates an adaptive jitter buffer submodule and a packet-loss controller submodule; custom new algorithms compensate for delay, jitter and packet loss.
Preferably, the engine provides two automatic rate-selection algorithms based on matching and bandwidth, together with CPU load control.
An extensible communication service framework based on the voice and video engine described above. The framework has six parts: a client, an authentication server, an application server, a NAT server, a database and a push server. The client is connected to the database through the authentication server, the application server and the NAT server. The authentication server handles user login and verification, the application server implements basic functions such as communication, the NAT server performs NAT traversal, and the push server pushes information to the client.
Compared with the prior art, the advantages of the invention are as follows.
The voice and video communication engine of the invention performs better than existing technology in basic voice transmission, speech encoding and decoding, and video communication. The engine supports more complex network conditions such as 2G and 3G, and more systems such as iOS and Android.
From a practical standpoint, application developers previously had to rely on multiple suppliers and integrate different technologies to build an IP voice or IP video application, which brought risks such as interoperability problems and unstable quality. The engine comes with a complete development interface and gives secondary developers a fully tested, mature solution for digital signal processing and network processing. The invention simplifies the development of real-time voice and video communication applications and provides a true one-stop solution.
Description of the drawings
Fig. 1 is a structural diagram of the voice and video engine of the invention.
Fig. 2 is a schematic diagram of the extensible communication service framework based on the voice and video engine shown in Fig. 1.
 
Embodiment
As shown in Fig. 1, the voice and video communication engine of the invention embeds an intelligent voice I/O module that can interact with the I/O subsystem of any device, and tightly integrates the following voice-enhancement components: acoustic echo cancellation, an echo limiter, automatic gain control, a nonlinear processor, voice activity detection, and anti-howling.
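The patent names the voice-enhancement components but does not describe their algorithms. As a rough illustration of how two of them might be chained, the following Python sketch runs a simple energy-based voice activity detector in front of a basic automatic gain control. The function names, the thresholds and the energy-based approach are all assumptions made for this example, not the engine's actual method.

```python
import math
from typing import List


def rms(frame: List[float]) -> float:
    """Root-mean-square level of one audio frame (samples in [-1.0, 1.0])."""
    if not frame:
        return 0.0
    return math.sqrt(sum(s * s for s in frame) / len(frame))


def is_speech(frame: List[float], threshold: float = 0.02) -> bool:
    """Hypothetical energy-based voice activity detector."""
    return rms(frame) >= threshold


def apply_agc(frame: List[float], target_rms: float = 0.1,
              max_gain: float = 8.0) -> List[float]:
    """Hypothetical automatic gain control: scale the frame toward target_rms."""
    level = rms(frame)
    if level == 0.0:
        return list(frame)
    gain = min(target_rms / level, max_gain)
    # Clip to the valid sample range after applying the gain.
    return [max(-1.0, min(1.0, s * gain)) for s in frame]


def enhance(frame: List[float]) -> List[float]:
    """Apply AGC only to frames the detector classifies as speech."""
    return apply_agc(frame) if is_speech(frame) else list(frame)
```

Gating the gain control on the detector keeps background noise from being amplified during silence, which is one plausible reason to integrate the two components closely.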
In a preferred embodiment, the engine uses audio codecs such as G.72x, G.711, GSM, AMR NB/WB, Speex, SILK and iLBC, and the video codecs MPEG-4, H.264 AVC and VP8.
In a preferred embodiment, the engine includes a dedicated module for IP network transmission and compensation. This module uses the RTP and RTCP protocols and integrates an adaptive jitter buffer submodule and a packet-loss controller submodule; custom new algorithms compensate for delay, jitter and packet loss.
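The patent does not disclose its custom compensation algorithms. To show the general idea of an adaptive jitter buffer, the sketch below estimates interarrival jitter with the smoothing rule from the RTP specification (RFC 3550, J += (|D| - J) / 16) and derives a playout delay from it. The class name, the delay bounds and the multiplier `factor` are assumptions for illustration, and sequence-number wraparound is deliberately ignored.

```python
class AdaptiveJitterBuffer:
    """Sketch of a jitter buffer whose playout delay follows measured jitter."""

    def __init__(self, min_delay_ms=20.0, max_delay_ms=200.0, factor=4.0):
        self.min_delay_ms = min_delay_ms
        self.max_delay_ms = max_delay_ms
        self.factor = factor          # hypothetical safety multiplier
        self.jitter_ms = 0.0          # smoothed interarrival jitter
        self._prev_transit = None
        self._packets = {}            # sequence number -> payload
        self._next_seq = None
        self._started = False

    def on_packet(self, seq, send_ts_ms, recv_ts_ms, payload):
        # Jitter estimate as in RFC 3550: J += (|D| - J) / 16.
        transit = recv_ts_ms - send_ts_ms
        if self._prev_transit is not None:
            d = abs(transit - self._prev_transit)
            self.jitter_ms += (d - self.jitter_ms) / 16.0
        self._prev_transit = transit
        if self._started and seq < self._next_seq:
            return                    # too late to play; drop it
        self._packets[seq] = payload
        if self._next_seq is None or (not self._started and seq < self._next_seq):
            self._next_seq = seq

    @property
    def target_delay_ms(self):
        """Playout delay: a multiple of the jitter, clamped to the bounds."""
        wanted = self.factor * self.jitter_ms
        return max(self.min_delay_ms, min(self.max_delay_ms, wanted))

    def pop(self):
        """Next payload in sequence order, or None if it is missing.

        A None result is where a packet-loss controller would conceal the gap.
        """
        if self._next_seq is None:
            return None
        self._started = True
        payload = self._packets.pop(self._next_seq, None)
        self._next_seq += 1
        return payload
```

In use, the receiver would wait `target_delay_ms` before calling `pop()` for the first frame, trading a little latency for fewer gaps when packets arrive out of order.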
In a preferred embodiment, the engine provides two automatic rate-selection algorithms based on matching and bandwidth, together with CPU load control.
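The text gives no detail on either rate-selection algorithm. The Python sketch below illustrates only the bandwidth-based idea combined with CPU load control: pick the highest bitrate that fits within a fraction of the estimated bandwidth, then step down one level when the CPU is busy. The bitrate ladder, the `headroom` fraction and the `cpu_limit` threshold are hypothetical values, and the matching-based algorithm is not modelled because the source does not describe it.

```python
from typing import Sequence

# Hypothetical bitrate ladder in kbit/s, lowest to highest.
BITRATES_KBPS = (8, 16, 24, 32, 64)


def select_bitrate(estimated_bandwidth_kbps: float,
                   cpu_load: float,
                   ladder: Sequence[int] = BITRATES_KBPS,
                   headroom: float = 0.8,
                   cpu_limit: float = 0.85) -> int:
    """Choose an encoder bitrate from bandwidth and CPU load (sketch).

    cpu_load is a fraction between 0.0 and 1.0.
    """
    # Leave some bandwidth unused so bursts do not cause loss.
    budget = estimated_bandwidth_kbps * headroom
    candidates = [rate for rate in ladder if rate <= budget]
    if not candidates:
        return ladder[0]              # fall back to the lowest rate
    index = len(candidates) - 1
    # Under heavy CPU load, back off one step to a cheaper rate.
    if cpu_load > cpu_limit and index > 0:
        index -= 1
    return candidates[index]
```

For example, `select_bitrate(50, 0.3)` picks 32 kbit/s from this ladder, while the same bandwidth at a CPU load of 0.95 yields 24 kbit/s.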
As shown in Fig. 2, the extensible communication service framework of the invention, based on the voice and video engine described above, has six parts: a client, an authentication server, an application server, a NAT server, a database and a push server. The client is connected to the database through the authentication server, the application server and the NAT server. The authentication server handles user login and verification, the application server implements basic functions such as communication, the NAT server performs NAT traversal, and the push server pushes information to the client. The voice and video engine of the invention can be widely used to develop voice and video communication applications. Applications currently built on this engine include well-known VoIP software such as WowTalk and Ringit.
The basic communication flow of an application built on this engine is as follows. Each application sets up its own independent database. The authentication server, the application server and the NAT server each connect to the database separately. Suppose client A wants to communicate with client B. Client A first registers and logs in through the authentication server, then uses the application server to complete routing and send chat messages, and then completes NAT traversal through the NAT server. Finally, the application server notifies client B through the push server so that the actual communication link can be set up, and A and B can then communicate by voice or video.
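The flow above can be sketched as a small in-memory simulation in Python. All class names, method names and message formats here are hypothetical, chosen only to mirror the roles the patent assigns to each part; real NAT traversal and push delivery are far more involved, and passwords are stored in plain text purely to keep the example short.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Set, Tuple

Address = Tuple[str, int]


@dataclass
class Database:
    """Per-application store that each server connects to separately."""
    passwords: Dict[str, str] = field(default_factory=dict)
    sessions: Set[str] = field(default_factory=set)
    endpoints: Dict[str, Address] = field(default_factory=dict)


class AuthServer:
    """Handles user registration, login and verification."""

    def __init__(self, db: Database) -> None:
        self.db = db

    def register(self, user: str, password: str) -> None:
        self.db.passwords[user] = password   # illustration only, not secure

    def login(self, user: str, password: str) -> bool:
        ok = self.db.passwords.get(user) == password
        if ok:
            self.db.sessions.add(user)
        return ok


class NatServer:
    """Records the public address seen for each client behind NAT."""

    def __init__(self, db: Database) -> None:
        self.db = db

    def report(self, user: str, public_addr: Address) -> None:
        self.db.endpoints[user] = public_addr

    def lookup(self, user: str) -> Address:
        return self.db.endpoints[user]


class PushServer:
    """Delivers notifications to clients."""

    def __init__(self) -> None:
        self.inbox: Dict[str, List[str]] = {}

    def push(self, user: str, message: str) -> None:
        self.inbox.setdefault(user, []).append(message)


class AppServer:
    """Routes a call request and notifies the callee through the push server."""

    def __init__(self, db: Database, nat: NatServer, push: PushServer) -> None:
        self.db, self.nat, self.push = db, nat, push

    def start_call(self, caller: str, callee: str) -> Address:
        if caller not in self.db.sessions:
            raise PermissionError(f"{caller} is not logged in")
        addr = self.nat.lookup(caller)
        self.push.push(callee, f"incoming call from {caller} at {addr[0]}:{addr[1]}")
        return addr
```

With these pieces, client A would call `register` and `login` on the authentication server, `report` its address to the NAT server, and then ask the application server to `start_call`, after which client B finds the notification in its push inbox.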
The secondary-development programming interface of this engine is designed around simplicity. Whether a developer is experienced or inexperienced in building VoIP and video-over-IP applications, the engine can help them write their own application smoothly.
Innovative points:
1. The engine suits a variety of network environments and keeps voice transmission smooth even on narrowband networks.
2. A speech encoding and decoding module with low hardware requirements has been designed and implemented.
3. The engine supports mainstream mobile platforms such as iOS and Android and PC platforms such as Windows, Mac and Linux.
4. Secondary development of voice and video communication applications is greatly simplified: developers no longer need to deal with the underlying communication protocols, and the maintainability and performance of the whole application can be guaranteed.
The embodiments above only illustrate the technical concept and features of the invention. Their purpose is to enable a person of ordinary skill in the art to understand and implement the invention; they do not limit its scope of protection. Any equivalent change or modification made according to the essence of the invention shall fall within its scope of protection.

Claims (5)

1. A voice and video communication engine, characterized in that it embeds an intelligent voice I/O module that can interact with the I/O subsystem of any device, and tightly integrates the following voice-enhancement components: acoustic echo cancellation, an echo limiter, automatic gain control, a nonlinear processor, voice activity detection, and anti-howling.
2. The voice and video communication engine according to claim 1, characterized in that it uses audio codecs such as G.72x, G.711, GSM, AMR NB/WB, Speex, SILK and iLBC, and the video codecs MPEG-4, H.264 AVC and VP8.
3. The voice and video communication engine according to claim 1, characterized in that it includes a dedicated module for IP network transmission and compensation, wherein the module uses the RTP and RTCP protocols and integrates an adaptive jitter buffer submodule and a packet-loss controller submodule, and custom new algorithms compensate for delay, jitter and packet loss.
4. The voice and video engine according to claim 1, characterized in that it implements audio and video synchronization, two automatic rate-selection algorithms based on matching and bandwidth, and CPU load control.
5. An extensible communication service framework based on said voice and video engine, characterized in that the framework has six parts: a client, an authentication server, an application server, a NAT server, a database and a push server; the client is connected to the database through the authentication server, the application server and the NAT server; the authentication server handles user login and verification, the application server implements basic functions such as communication, the NAT server performs NAT traversal, and the push server pushes information to the client.
CN2012102766317A 2012-08-06 2012-08-06 Voice and video communication engine and extensible communication service framework based such engine Pending CN103475633A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN2012102766317A CN103475633A (en) 2012-08-06 2012-08-06 Voice and video communication engine and extensible communication service framework based such engine

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN2012102766317A CN103475633A (en) 2012-08-06 2012-08-06 Voice and video communication engine and extensible communication service framework based such engine

Publications (1)

Publication Number Publication Date
CN103475633A true CN103475633A (en) 2013-12-25

Family

ID=49800331

Family Applications (1)

Application Number Title Priority Date Filing Date
CN2012102766317A Pending CN103475633A (en) 2012-08-06 2012-08-06 Voice and video communication engine and extensible communication service framework based such engine

Country Status (1)

Country Link
CN (1) CN103475633A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108282685A (en) * 2018-01-04 2018-07-13 华南师范大学 A kind of method and monitoring system of audio-visual synchronization

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20100260074A1 (en) * 2009-04-09 2010-10-14 Nortel Networks Limited Enhanced communication bridge
CN102461141A (en) * 2009-04-14 2012-05-16 思杰系统有限公司 Systems and methods for computer and voice conference audio transmission during conference call via pstn phone

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
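王文亮 (Wang Wenliang): "Research and Implementation of Multimedia Technology for a P2P Multimedia Group Communication Platform", China Master's Theses Full-text Database, Information Science and Technology *
王立伟 (Wang Liwei): "Research and Implementation of Video and Audio Engine Technology for Multimedia Clients", China Master's Theses Full-text Database, Information Science and Technology *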

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C02 Deemed withdrawal of patent application after publication (patent law 2001)
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20131225