CN117201468A - VOIP self-adaptive voice coding system - Google Patents
VOIP self-adaptive voice coding system Download PDFInfo
- Publication number
- CN117201468A CN117201468A CN202311329523.6A CN202311329523A CN117201468A CN 117201468 A CN117201468 A CN 117201468A CN 202311329523 A CN202311329523 A CN 202311329523A CN 117201468 A CN117201468 A CN 117201468A
- Authority
- CN
- China
- Prior art keywords
- client
- voice
- coding
- opposite
- server
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 108091026890 Coding region Proteins 0.000 claims abstract description 32
- 238000006243 chemical reaction Methods 0.000 claims abstract description 12
- 230000005540 biological transmission Effects 0.000 claims abstract description 11
- 238000012163 sequencing technique Methods 0.000 claims abstract description 5
- 239000012634 fragment Substances 0.000 claims description 18
- 230000002159 abnormal effect Effects 0.000 claims description 12
- 230000003044 adaptive effect Effects 0.000 claims description 10
- 238000012545 processing Methods 0.000 claims description 10
- 230000004048 modification Effects 0.000 claims description 9
- 238000012986 modification Methods 0.000 claims description 9
- 238000012544 monitoring process Methods 0.000 claims description 9
- 238000000034 method Methods 0.000 claims description 7
- 230000003993 interaction Effects 0.000 claims description 6
- 230000008569 process Effects 0.000 claims description 6
- 230000008707 rearrangement Effects 0.000 claims description 5
- 238000013524 data verification Methods 0.000 description 5
- 230000008676 import Effects 0.000 description 4
- 238000004806 packaging method and process Methods 0.000 description 4
- 238000004891 communication Methods 0.000 description 2
- 238000001514 detection method Methods 0.000 description 2
- 230000009286 beneficial effect Effects 0.000 description 1
- 230000008901 benefit Effects 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 238000009432 framing Methods 0.000 description 1
- 238000010348 incorporation Methods 0.000 description 1
- 230000000977 initiatory effect Effects 0.000 description 1
Landscapes
- Telephonic Communication Services (AREA)
Abstract
The application relates to the technical field of language coding, and discloses a VOIP self-adaptive voice coding system, which comprises: the SIP server side, the client side and the opposite-end client side; the SIP server side is used for receiving a registration request from the client side, detecting the network available bandwidth of the client side, sequencing the voice coding sequence according to the detected network available bandwidth of the client side and sending the voice coding sequence to the client side of the opposite side; the client is used for selecting a self-supported coding format according to the received voice coding sequence to carry out voice coding on the real-time transmission protocol RTP stream; the SIP server compares the voice coding formats of the two parties of the conversation of the client and the client of the opposite terminal, and if the coding formats of the two parties are the same, the media relay server is controlled to carry out transparent transmission on the RTP stream; if the coding formats are different, the SIP server end carries out voice coding conversion on the RTP stream, converts the voice coding format into the coding format of the client end of the opposite end and then transmits the voice coding format to the opposite end.
Description
Technical Field
The application relates to the technical field of language coding, in particular to a VOIP self-adaptive language coding system.
Background
VoIP is a technology for achieving the purpose of voice communication over an IP network by digitally encoding, compressing and framing voice signals, and then converting the voice signals into IP packets for transmission over the IP network. VOIP has the greatest advantage of being able to widely utilize the Internet and global IP interconnection environments, and is very inexpensive to provide voice, fax, video, and data services.
Session Initiation Protocol (SIP) is an application-layer control protocol for multimedia communications over an IP network. SIP is used to set up, change and terminate calls between users of IP-based networks, and in order to provide telephony services, SIP also requires the incorporation of different standards and protocols, particularly the real-time transport protocol (RTP).
The existing VOIP phone system cannot make dynamic adjustment according to the requirements of different encoding formats for different bandwidths, that is, cannot select an appropriate encoding format according to the condition of the SIP client, so that high-quality voice call service cannot be provided for the user.
Disclosure of Invention
The present application is directed to a VOIP adaptive speech coding system that provides high quality voice call services.
To achieve the above object, the basic scheme of the present application is as follows:
a VOIP adaptive speech coding system, comprising: the SIP server side, the client side and the opposite-end client side;
the SIP server side is used for receiving a registration request from the client side, detecting the network available bandwidth of the client side, sequencing the voice coding sequence according to the detected network available bandwidth of the client side and sending the voice coding sequence to the client side of the opposite side;
the client is used for selecting a self-supported coding format according to the received voice coding sequence to carry out voice coding on the real-time transmission protocol RTP stream;
the SIP server compares the voice coding formats of the two parties of the conversation of the client and the client of the opposite terminal, and if the coding formats of the two parties are the same, the media relay server is controlled to carry out transparent transmission on the RTP stream; if the coding formats are different, the SIP server end carries out voice coding conversion on the RTP stream, converts the voice coding format into the coding format of the client end of the opposite end and then transmits the voice coding format to the opposite end.
Further, the SIP server side is configured with a session information interaction module, a bandwidth monitoring module and a data packet control module,
in the call process, the network fluctuation condition monitors the bandwidth condition to be sent to the SIP server terminal at regular time through the bandwidth monitoring module, the SIP server terminal adjusts the voice coding sequence to rearrange according to the bandwidth condition of the client terminal, the data packet after the modification is sent to the client terminal of the opposite terminal, and the coding sequence of the client terminal of the opposite terminal is readjusted to the coding format.
Further, the SIP server is configured with a processing module, where when speech coding formats of two parties of the call of the client and the opposite-end client are different, the processing module screens abnormal features, and when the abnormal features are abnormal in the bandwidth condition of the client and/or abnormal in the bandwidth condition of the opposite-end client, the processing module performs priority configuration on available bandwidths of the client and the opposite-end client according to a pre-configured priority policy.
Further, the SIP server is configured with a first server and a second server, the first server transmits unidirectional data to the second server, the first server is connected to the client, and the second server is connected to the client;
the first server is configured with an encryption unit, the encryption unit is configured with an encryption strategy, and the encryption unit encrypts the voice code according to the preset encryption strategy and forms a plurality of encrypted voice data packets to be transmitted to the second server;
the second server is configured with a decryption unit, the decryption unit is configured with a decryption strategy, and the decryption unit decrypts the encrypted data packet according to the preset decryption strategy to form a decrypted voice data packet and transmits the decrypted voice data packet to the opposite-end client.
Further, the encryption unit is configured with an encryption policy and an encryption algorithm, the encryption algorithm is configured with a plurality of dynamic encryption factors, the dynamic encryption factors comprise network available bandwidth of the client, voice coding sequence, voice coding format and device characteristic factors of the client, and the device characteristic factors of the client are sequence numbers configured when the client leaves the factory.
Further, the encrypted voice data packet comprises a plurality of encrypted voice data fragments, the encrypted voice data fragments comprise a voice data fragment set, and the voice data fragment set is associated with available bandwidth of a client network;
each encrypted voice data segment is configured with feature index information including the available bandwidth of the client network and the device feature factor of the client, the feature index information being interrelated with the set of voice data segments.
Further, the decryption strategy is configured with a decryption algorithm, and the decryption algorithm decrypts the encrypted voice data segment to form a decrypted voice data segment;
the decryption unit configures a voice coding format conversion strategy, the voice coding format conversion strategy adjusts the rearrangement of the voice coding sequence according to the bandwidth condition of the client corresponding to the decrypted voice data segment, the data packet after the modification is sent to the opposite client, and the coding format of the opposite client is readjusted.
Compared with the prior art, the beneficial effect of this scheme is:
1. the voice coding system of the scheme utilizes the SIP server side to compare the voice coding formats of the two parties of the conversation of the client side and the client side of the opposite end, and if the coding formats of the two parties are the same, the media relay server is controlled to carry out transparent transmission on RTP streams; if the coding formats are different, the SIP server end carries out voice coding conversion on the RTP stream, converts the voice coding format into the coding format of the client end of the opposite end and transmits the opposite end to the opposite end, thereby realizing the purpose of providing high-quality voice call service for users;
2. in the call process, the network fluctuation condition monitors the bandwidth condition to be sent to the SIP server terminal at regular time through the bandwidth monitoring module, the SIP server terminal adjusts the voice coding sequence to rearrange according to the bandwidth condition of the client terminal, the data packet after the modification is sent to the client terminal of the opposite terminal, and the coding sequence of the client terminal of the opposite terminal is readjusted to the coding format.
3. The SIP server side is configured with a first server and a second server, and the first server and the second server are respectively configured with an encryption unit and a decryption unit, so that high call security in the call process of the client side and the opposite-end client side is realized.
Drawings
FIG. 1 is a schematic diagram of an embodiment of the present application.
Detailed Description
The application is described in further detail below with reference to the drawings in the specification and by way of specific embodiments:
examples:
a VOIP adaptive speech coding system, as shown in fig. 1, comprising: the SIP server side, the client side and the opposite-end client side;
the SIP server side is used for receiving a registration request from the client side, detecting the network available bandwidth of the client side, sequencing the voice coding sequence according to the detected network available bandwidth of the client side and sending the voice coding sequence to the client side of the opposite side;
the client is used for selecting a self-supported coding format according to the received voice coding sequence to carry out voice coding on the real-time transmission protocol RTP stream;
the SIP server compares the voice coding formats of the two parties of the conversation of the client and the client of the opposite terminal, and if the coding formats of the two parties are the same, the media relay server is controlled to carry out transparent transmission on the RTP stream; if the coding formats are different, the SIP server end carries out voice coding conversion on the RTP stream, converts the voice coding format into the coding format of the client end of the opposite end and then transmits the voice coding format to the opposite end.
The SIP server is configured with a session information interaction module, a bandwidth monitoring module and a data packet control module,
session information interaction module: for processing interaction information from the client;
and a bandwidth monitoring module: the method comprises the steps of sending a negotiation request to a client when the registration of the client is detected; receiving a network available bandwidth detection request from a client, detecting the current network available bandwidth of the client network by using a detection algorithm carried in the negotiation request, and storing the obtained network available bandwidth of the client;
in the call process, the network fluctuation condition sends bandwidth condition to the SIP server terminal at regular time through network monitoring, the SIP server terminal adjusts the rearrangement of the voice coding sequence according to the bandwidth condition of the client terminal, the data packet after the modification is sent to the client terminal of the opposite terminal, and the coding sequence of the client terminal of the opposite terminal is readjusted to the coding format.
And the data packet control module: and the voice coding sequence is ordered according to the detected network available bandwidth of the opposite terminal when the client terminal is detected to initiate the invitation, and the voice coding sequence is sent to the opposite terminal client terminal through the session information interaction module.
The SIP server side is configured with a first server and a second server, the first server transmits unidirectional data to the second server, the first server is connected with the client side, and the second server is connected with the opposite-end client side;
the first server is configured with an encryption unit, the encryption unit is configured with an encryption strategy, and the encryption unit encrypts the voice code according to the preset encryption strategy and forms a plurality of encrypted voice data packets to be transmitted to the second server;
the encryption unit is configured with an encryption strategy and an encryption algorithm, the encryption algorithm is configured with a plurality of dynamic encryption factors, the dynamic encryption factors comprise network available bandwidth of a client, voice coding sequence, voice coding format and device characteristic factors of the client, and the device characteristic factors of the client are sequence numbers configured when the client leaves a factory.
The encrypted voice data packet comprises a plurality of encrypted voice data fragments, the encrypted voice data fragments comprise a voice data fragment set, and the voice data fragment set is associated with available bandwidth of a client network;
each encrypted voice data segment is configured with feature index information including the available bandwidth of the client network and the device feature factor of the client, the feature index information being interrelated with the set of voice data segments.
The second server is configured with a decryption unit, the decryption unit is configured with a decryption strategy, and the decryption unit decrypts the encrypted data packet according to the preset decryption strategy to form a decrypted voice data packet and transmits the decrypted voice data packet to the opposite-end client;
the second server is configured with a voice data calling unit, a voice data packaging unit and a voice data verification unit, the voice data calling unit is configured with a voice data calling strategy, the voice data calling strategy takes the network available bandwidth of the client as an index template, an encrypted voice data fragment corresponding to characteristic index information matched with the index template is called as a voice data fragment to be decrypted, and the decryption unit decrypts the target data fragment to be decrypted according to a preset decryption strategy to form a decrypted voice data fragment;
the voice data packaging unit is configured with an import module and a packaging module, the import module is configured with a plurality of data templates, each data template corresponds to the network available bandwidth of the client, the import module imports the corresponding voice data template according to the network available bandwidth of the client corresponding to the decrypted data fragment, and the packaging module packages the voice data template imported with the decrypted voice data fragment into a comprehensive data packet;
the voice data verification unit is configured with a voice data verification strategy, and the voice data verification strategy is used for verifying the consistency of the network available bandwidth of the client corresponding to the decrypted voice data segment in the comprehensive data packet and the characteristic index information configured on the encrypted data segment;
when the data verification strategy verifies that the feature index information corresponding to the decrypted voice data segment in the comprehensive data packet is consistent with the feature index information configured with the encrypted voice data segment, the corresponding voice encrypted data packet in the first server is automatically deleted.
The decryption strategy is configured with a decryption algorithm, and the decryption algorithm decrypts the encrypted voice data segment to form a decrypted voice data segment;
the decryption unit configures a voice coding format conversion strategy, the voice coding format conversion strategy adjusts the rearrangement of the voice coding sequence according to the bandwidth condition of the client corresponding to the decrypted voice data segment, the data packet after the modification is sent to the opposite client, and the coding format of the opposite client is readjusted.
The SIP server is configured with a processing module, when the voice coding formats of the two parties of the conversation of the client and the opposite terminal client are different, the processing module screens abnormal characteristics, and when the abnormal characteristics are abnormal in the bandwidth condition of the client and/or abnormal in the bandwidth condition of the opposite terminal client, the processing module carries out priority configuration on the available bandwidths of the client and the opposite terminal client according to a pre-configured priority strategy; when the client priority is higher than the opposite client priority, executing the client speech coding format; when the client-side priority is higher than the client-side priority, the client-side speech coding format is executed.
When a user registers to a server through a client, the SIP server is used for receiving a registration request from the client, detecting the network available bandwidth of the client, sequencing the voice coding sequence according to the detected network available bandwidth of the client and sending the voice coding sequence to the client;
the SIP server compares the voice coding formats of the two parties of the conversation of the client and the client of the opposite terminal, and if the coding formats of the two parties are the same, the media relay server is controlled to carry out transparent transmission on the RTP stream; if the coding formats are different, the SIP server end carries out voice coding conversion on the RTP stream, converts the voice coding format into the coding format of the client end of the opposite end and then transmits the voice coding format to the opposite end;
in the call process, the network fluctuation condition sends bandwidth condition to the SIP server terminal at regular time through network monitoring, the SIP server terminal adjusts the rearrangement of the voice coding sequence according to the bandwidth condition of the client terminal, the data packet after the modification is sent to the client terminal of the opposite terminal, and the coding sequence of the client terminal of the opposite terminal is readjusted to the coding format.
The voice coding system of the application realizes that the voice coding format matched with the available network bandwidth is always kept on the client and the opposite terminal client network, thereby better ensuring that high-quality voice call service is provided for users.
The foregoing is merely exemplary embodiments of the present application, and specific structures and features that are well known in the art are not described in detail herein. It should be noted that modifications and improvements can be made by those skilled in the art without departing from the structure of the present application, and these should also be considered as the scope of the present application, which does not affect the effect of the implementation of the present application and the utility of the patent. The protection scope of the present application is subject to the content of the claims, and the description of the specific embodiments and the like in the specification can be used for explaining the content of the claims.
Claims (7)
1. A VOIP adaptive speech coding system, characterized by: comprising the following steps: the SIP server side, the client side and the opposite-end client side;
the SIP server side is used for receiving a registration request from the client side, detecting the network available bandwidth of the client side, sequencing the voice coding sequence according to the detected network available bandwidth of the client side and sending the voice coding sequence to the client side of the opposite side;
the client is used for selecting a self-supported coding format according to the received voice coding sequence to carry out voice coding on the real-time transmission protocol RTP stream;
the SIP server compares the voice coding formats of the two parties of the conversation of the client and the client of the opposite terminal, and if the coding formats of the two parties are the same, the media relay server is controlled to carry out transparent transmission on the RTP stream; if the coding formats are different, the SIP server end carries out voice coding conversion on the RTP stream, converts the voice coding format into the coding format of the client end of the opposite end and then transmits the voice coding format to the opposite end.
2. The VOIP adaptive speech coding system according to claim 1, wherein: the SIP server is configured with a session information interaction module, a bandwidth monitoring module and a data packet control module,
in the call process, the network fluctuation condition monitors the bandwidth condition to be sent to the SIP server terminal at regular time through the bandwidth monitoring module, the SIP server terminal adjusts the voice coding sequence to rearrange according to the bandwidth condition of the client terminal, the data packet after the modification is sent to the client terminal of the opposite terminal, and the coding sequence of the client terminal of the opposite terminal is readjusted to the coding format.
3. The VOIP adaptive speech coding system according to claim 1, wherein: the SIP server is configured with a processing module, when the voice coding formats of the two parties of the conversation of the client and the opposite terminal client are different, the processing module screens abnormal characteristics, and when the abnormal characteristics are abnormal in the bandwidth condition of the client and/or abnormal in the bandwidth condition of the opposite terminal client, the processing module carries out priority configuration on the available bandwidths of the client and the opposite terminal client according to a pre-configured priority strategy.
4. The VOIP adaptive speech coding system according to claim 1, wherein: the SIP server side is configured with a first server and a second server, the first server transmits unidirectional data to the second server, the first server is connected with the client side, and the second server is connected with the opposite-end client side;
the first server is configured with an encryption unit, the encryption unit is configured with an encryption strategy, and the encryption unit encrypts the voice code according to the preset encryption strategy and forms a plurality of encrypted voice data packets to be transmitted to the second server;
the second server is configured with a decryption unit, the decryption unit is configured with a decryption strategy, and the decryption unit decrypts the encrypted data packet according to the preset decryption strategy to form a decrypted voice data packet and transmits the decrypted voice data packet to the opposite-end client.
5. The VOIP adaptive speech coding system according to claim 4, wherein: the encryption unit is configured with an encryption strategy and an encryption algorithm, the encryption algorithm is configured with a plurality of dynamic encryption factors, the dynamic encryption factors comprise network available bandwidth of a client, voice coding sequence, voice coding format and device characteristic factors of the client, and the device characteristic factors of the client are sequence numbers configured when the client leaves a factory.
6. The VOIP adaptive speech coding system according to claim 5, wherein: the encrypted voice data packet comprises a plurality of encrypted voice data fragments, the encrypted voice data fragments comprise a voice data fragment set, and the voice data fragment set is associated with available bandwidth of a client network;
each encrypted voice data segment is configured with feature index information including the available bandwidth of the client network and the device feature factor of the client, the feature index information being interrelated with the set of voice data segments.
7. The VOIP adaptive speech coding system according to claim 4, wherein: the decryption strategy is configured with a decryption algorithm, and the decryption algorithm decrypts the encrypted voice data segment to form a decrypted voice data segment;
the decryption unit configures a voice coding format conversion strategy, the voice coding format conversion strategy adjusts the rearrangement of the voice coding sequence according to the bandwidth condition of the client corresponding to the decrypted voice data segment, the data packet after the modification is sent to the opposite client, and the coding format of the opposite client is readjusted.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202311329523.6A CN117201468A (en) | 2023-10-13 | 2023-10-13 | VOIP self-adaptive voice coding system |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202311329523.6A CN117201468A (en) | 2023-10-13 | 2023-10-13 | VOIP self-adaptive voice coding system |
Publications (1)
Publication Number | Publication Date |
---|---|
CN117201468A true CN117201468A (en) | 2023-12-08 |
Family
ID=89001742
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202311329523.6A Pending CN117201468A (en) | 2023-10-13 | 2023-10-13 | VOIP self-adaptive voice coding system |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN117201468A (en) |
-
2023
- 2023-10-13 CN CN202311329523.6A patent/CN117201468A/en active Pending
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US8855086B2 (en) | Method and apparatus for efficient multimedia delivery in a wireless packet network | |
US8031728B2 (en) | Method of controlling audio communication on a network | |
US20050157708A1 (en) | System and method for providing unified messaging system service using voice over Internet protocol | |
US20050175032A1 (en) | Processing session initiation protocol signaling in voice/data integrated switching system | |
EP1495612B1 (en) | Method and apparatus for efficient transmission of voip traffic | |
CN101909192B (en) | Television terminal and communication method thereof | |
US10630656B2 (en) | System and method of encrypted media encapsulation | |
EP2186290B1 (en) | System and method for identifying encrypted conference media traffic | |
JP4832959B2 (en) | Voice communication terminal device, voice communication control method, and voice communication terminal program | |
EP2071771B1 (en) | A method, system and gateway for negotiating about the ability of the data signal detector | |
CN100525476C (en) | Content transmitting method in media gate control protocol calling | |
CN117201468A (en) | VOIP self-adaptive voice coding system | |
JP2002503069A (en) | Communication network, signal transmission method, network connection unit, and method for adjusting bit rate of scaled data flow | |
EP2359562B1 (en) | Digital telecommunications system, program product for, and method of managing such a system | |
JP2004343417A (en) | VoIP MULTIPLEXER | |
KR20060038296A (en) | Apparatus and method for multiplexing the packet in mobile communication network | |
WO2012057703A1 (en) | Reduced size real time protocol header | |
KR101384125B1 (en) | Apparatus and method for generating quality of service parameter about mac layer in communication system | |
JP5831095B2 (en) | Voice communication system, voice communication apparatus and program | |
KR100666956B1 (en) | Apparatus and method for transmitting of media on network | |
Lyimo | Implementing a campus VoIP intercom VLAN: A technology review, system requirements and architecture | |
KR100330140B1 (en) | Internet phone system | |
Coinchon et al. | Ebu tech. doc. 3326 for interoperability between audio over ip units |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination |