CN114257526A - In-band telemetry system, method and device - Google Patents

In-band telemetry system, method and device Download PDF

Info

Publication number
CN114257526A
CN114257526A CN202010955292.XA CN202010955292A CN114257526A CN 114257526 A CN114257526 A CN 114257526A CN 202010955292 A CN202010955292 A CN 202010955292A CN 114257526 A CN114257526 A CN 114257526A
Authority
CN
China
Prior art keywords
int
report
node
traffic
flow
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202010955292.XA
Other languages
Chinese (zh)
Inventor
毛修斌
潘恬
贾晨昊
祝晓平
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
XFusion Digital Technologies Co Ltd
Original Assignee
XFusion Digital Technologies Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by XFusion Digital Technologies Co Ltd filed Critical XFusion Digital Technologies Co Ltd
Priority to CN202010955292.XA priority Critical patent/CN114257526A/en
Publication of CN114257526A publication Critical patent/CN114257526A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L43/00Arrangements for monitoring or testing data switching networks
    • H04L43/06Generation of reports
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L43/00Arrangements for monitoring or testing data switching networks
    • H04L43/08Monitoring or testing based on specific metrics, e.g. QoS, energy consumption or environmental parameters
    • H04L43/0852Delays
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L43/00Arrangements for monitoring or testing data switching networks
    • H04L43/16Threshold monitoring

Abstract

The embodiment of the application provides an in-band telemetry system, method and device, which are used for making full use of the flexibility and high efficiency of network edge nodes and the capacity of flow characteristic perception, unloading INT information acquisition decisions to the edge nodes, and enabling an intermediate switch to act only according to specific INT identifications, so that the flexible and self-adaptive INT acquisition is realized. The in-band telemetry system includes a first node, a switching device, and a second node; the first node acquires the traffic characteristics of the service traffic; determining an INT strategy of the service flow according to the flow characteristics; then the first node sends the business message carrying the INT strategy corresponding to the business flow to the switching equipment; the switching equipment acquires INT information corresponding to the service flow according to the INT strategy; the switching device sending the INT information to the second node; and finally, the second node generates an INT report according to the INT information and reports the INT report.

Description

In-band telemetry system, method and device
Technical Field
The present application relates to the field of communications, and in particular, to an in-band telemetry system, method, and apparatus.
Background
The network scale of the current data center is huge, the service flow is complex, and great challenges are brought to network operation and maintenance. A Network system of a traditional Data Center adopts an out-of-band remote monitoring mechanism, the Network state of a Data Center Network (DCN) cannot be collected in real time, and a Software Defined Network (SDN) controller cannot perform optimization and management actions according to the Network real-time state in time; an out-of-band information collector adopted by network operation and maintenance needs to collect telemetering (telemeasurement) information from each network node (network card and switch), so that the collector is overloaded and has poor real-time performance; currently, the industry provides an In-band Network Telemetry (INT) technology, which can support embedding of telemetrology information In a Network message, such as delay, port number, queue usage, timestamp and the like, inserting corresponding Metadata (Metadata) into each hop of Network equipment supporting the characteristic, and finally extracting and reporting the telemetrology information from the message to a collector by a sink node, thereby realizing real-time collection of Network states and tenant-level traffic state perception, and improving Network operation and maintenance efficiency.
The existing INT technology is mainly based on intermediate equipment such as a switch and the like to decide acquisition of telemetering information and generate a telemetering report, so that the design of the switch is complicated, and more refined INT acquisition control is difficult to realize because the switch cannot distinguish the characteristics of specific flow.
Disclosure of Invention
The embodiment of the application provides an in-band telemetry system, method and device, which are used for making full use of the flexibility and high efficiency of network edge nodes and the capacity of flow characteristic perception, unloading INT information acquisition decisions to the edge nodes, and enabling an intermediate switch to act only according to specific INT identifications, so that the flexible and self-adaptive INT acquisition is realized.
In a first aspect, an embodiment of the present application provides an in-band telemetry system, which includes a first node, a switching device, and a second node; wherein the first node acts as a source end node, the switching device may be a switch, and the second node acts as a sink end node. In this embodiment, the first node obtains traffic characteristics of a service traffic; determining an INT strategy of the service flow according to the flow characteristics; then the first node sends the service message of the service flow to the switching equipment, and the INT strategy is carried in the service message; after receiving the service message, the switching equipment analyzes the service message to obtain the INT strategy and acquires INT information corresponding to the service flow according to the INT strategy; the switching equipment sends a service message corresponding to the service flow to the second node, and the INT information is carried in the service message; and finally, after receiving the service message, the second node analyzes the service message to obtain the INT information, generates an INT report according to the INT information and reports the INT report.
In this embodiment, the first node and the second node both include a host and an intelligent network card, and the host runs a management process of a control plane, and is mainly responsible for managing the traffic flow, and generates a corresponding filtering table entry according to traffic characteristics of the traffic flow, and issues the filtering table entry to the intelligent network card. And the intelligent network card performs identification of part of the service flow, for example, the service flow is identified as elephant flow or the service flow is rat flow, and reports the identification result to the host control plane. In addition, an INT unit is also operated on the intelligent network card. And in the source end node, the INT unit is mainly responsible for carrying out INT detection identification on the matched and hit service flow according to the INT filtering rule unloaded by the control surface. On the receiving end node, the INT unit is responsible for finishing the extraction and the report of INT information, and undertakes the pretreatment of partial INT information, and reduces the load of the collector. The special INT acquisition unit is operated on the switching equipment, and corresponding INT information is mainly inserted into specific service flow according to an INT strategy in a service message.
In the embodiment, the network edge nodes (such as the first node and the second node) are fully utilized, the flexibility and the high efficiency are realized, the capacity of sensing the flow characteristics is realized, the acquisition decision of the INT information is unloaded to the edge nodes, and the intermediate switch only acts according to a specific INT strategy, so that the flexible and self-adaptive INT acquisition is realized.
Optionally, a specific manner of determining, by the first node, the INT policy of the service traffic according to the traffic characteristic may be as follows: and the first node determines an INT strategy of the service flow from an INT acquisition rule table according to the flow characteristics, wherein the INT acquisition rule table is used for indicating the corresponding relation between the flow characteristics and the INT strategy, and the INT strategy comprises INT options and flow type identifications. It will be appreciated that the INT option is used to indicate which information the switching device collects, and the stream type identifier is used to indicate the traffic characteristics of the traffic flow. Therefore, the edge node identifies the traffic characteristics of each service traffic, and then determines whether to perform INT acquisition and how to perform INT acquisition according to the traffic characteristics, so that the INT strategy can be flexibly and efficiently adjusted, and the load of a collector receiving an INT report is reduced.
Optionally, the INT option includes at least one of: hop-by-hop forwarding delay, cumulative delay, queue depth, total buffer occupancy, port utilization, ingress and egress timestamp, port cumulative number of forwarding bytes, and port cumulative number of forwarding messages.
Optionally, the flow type identifier includes at least one of: the flow type identifier is a mouse flow or a elephant flow; the stream type identification is a delay sensitive stream or a non-delay sensitive stream; the flow type is identified as east-west traffic or north-south traffic. In the embodiment, the accuracy of INT information acquisition can be effectively ensured by matching according to the stream type identifier and the INT option, so that the acquisition amount of INT information by the switch is reduced, and the load of a collector receiving an INT report is reduced.
Optionally, based on the above scheme, when the INT policy is carried by the first node through the service packet, the INT policy is inserted into the service packet by the first node in the form of an INT header.
Optionally, a specific manner of the switching device acquiring the INT information corresponding to the service traffic according to the INT policy may be as follows: the switching equipment acquires self state information, wherein the self state information comprises information such as port occupancy rate, buffer occupancy condition, time delay of current information sending or accumulated time delay accumulated on the switching equipment; and then the switching equipment acquires INT information corresponding to the service flow according to the self state information and the INT strategy. The switch realizes the self-adaptive acquisition of the INT information according to the combination of the state information of the switch and the INT strategy, and effectively reduces the load of a collector for receiving the INT report.
Optionally, a specific manner of generating an INT report according to the INT information and reporting the INT report by the second node may be as follows: the second node preprocesses the INT information according to a preset rule to generate an INT report; the INT report is then reported.
In an exemplary aspect, the preset rule includes at least one of: generating an INT report when the accumulated time delay is greater than a first threshold; generating an INT report when the link utilization is greater than a second threshold; generating an INT report when the link jitter condition exceeds a preset range; generating an INT report when the buffer occupancy rate of the switching equipment is greater than a third threshold value; and generating an INT report when the path of the service message is switched.
In an exemplary scheme, in order to reduce the load, the second node may report the INT report by using a Committed Access Rate (CAR) Rate limit when reporting the INT report. In an exemplary scenario, when the INT report marker is a non-sensitive report, the second node reports the INT report when the CAR is stained green, and discards the INT report when the CAR is stained yellow or red; when the INT report marker is a sensitive report, the second node reports the INT report when the CAR is stained green or yellow and discards the INT report when the CAR is stained red.
It can be understood that when the CAR speed limiting system is adopted by the second node, two layers can be designed, one layer is for the whole network system, and the other layer is for a single tenant (i.e. a single user).
In a second aspect, an embodiment of the present application provides an in-band telemetry method, including: the first node acquires the traffic characteristics of the service traffic; determining an INT strategy of the service flow according to the flow characteristics; then the first node sends the service message of the service flow to the switching equipment, and the INT strategy is carried in the service message; and after receiving the service message, the switching equipment analyzes the service message to obtain the INT strategy and acquires INT information corresponding to the service flow according to the INT strategy.
In this embodiment, the first node includes a host and an intelligent network card, where the host runs a management process of a control plane, and is mainly responsible for managing the traffic flow, and generates a corresponding filtering table entry according to traffic characteristics of the traffic flow, and issues the filtering table entry to the intelligent network card. And the intelligent network card performs identification of part of the service flow, for example, the service flow is identified as elephant flow or the service flow is rat flow, and reports the identification result to the host control plane. In addition, an INT unit is also operated on the intelligent network card. And in the source end node, the INT unit is mainly responsible for carrying out INT detection identification on the matched and hit service flow according to the INT filtering rule unloaded by the control surface. If the first node is a receiving end node, the INT unit is responsible for finishing the extraction and the report of INT information, and undertakes the pretreatment of part of INT information, thereby reducing the load of the collector. The special INT acquisition unit is operated on the switching equipment, and corresponding INT information is mainly inserted into specific service flow according to an INT strategy in a service message.
In the embodiment, the network edge node (such as the first node) is fully utilized, is flexible and efficient, has the capacity of sensing the flow characteristics, and unloads the INT information acquisition decision to the edge node, and the intermediate switch only acts according to a specific INT strategy, so that flexible and self-adaptive INT acquisition is realized.
Optionally, a specific manner of determining, by the first node, the INT policy of the service traffic according to the traffic characteristic may be as follows: and the first node determines an INT strategy of the service flow from an INT acquisition rule table according to the flow characteristics, wherein the INT acquisition rule table is used for indicating the corresponding relation between the flow characteristics and the INT strategy, and the INT strategy comprises INT options and flow type identifications. It will be appreciated that the INT option is used to indicate which information the switching device collects, and the stream type identifier is used to indicate the traffic characteristics of the traffic flow. Therefore, the edge node identifies the traffic characteristics of each service traffic, and then determines whether to perform INT acquisition and how to perform INT acquisition according to the traffic characteristics, so that the INT strategy can be flexibly and efficiently adjusted, and the load of a collector receiving an INT report is reduced.
Optionally, at least one of the INT options: hop-by-hop forwarding delay, cumulative delay, queue depth, total buffer occupancy, port utilization, ingress and egress timestamp, port cumulative number of forwarding bytes, and port cumulative number of forwarding messages.
Optionally, the flow type identifier includes at least one of: the stream type is identified as a rat stream or a elephant stream; the stream type identification is a delay sensitive stream or a non-delay sensitive stream; the flow type is identified as east-west traffic or north-south traffic. In the embodiment, the accuracy of INT information acquisition can be effectively ensured by matching according to the stream type identifier and the INT option, so that the acquisition amount of INT information by the switch is reduced, and the load of a collector receiving an INT report is reduced.
Optionally, based on the above scheme, when the INT policy is carried by the first node through the service packet, the INT policy is inserted into the service packet by the first node in the form of an INT header.
Optionally, when the first node serves as a receiving end node, the first node may further receive INT information sent by an INT exchange device; and generating an INT report according to the INT information, and reporting the INT report.
In an exemplary scheme, a specific manner for the first node to generate an INT report according to the INT information and report the INT report may be as follows: the first node preprocesses the INT information according to a preset rule to generate an INT report; the INT report is then reported.
In an exemplary aspect, the preset rule includes at least one of: generating an INT report when the accumulated time delay is greater than a first threshold; generating an INT report when the link utilization is greater than a second threshold; generating an INT report when the link jitter condition exceeds a preset range; generating an INT report when the buffer occupancy rate of the switching equipment is greater than a third threshold value; and generating an INT report when the path of the service message is switched.
In an exemplary scheme, in order to reduce the load, the first node may report the INT report by using a CAR rate limiting system when reporting the INT report. In an exemplary scenario, when the INT report marker is a non-sensitive report, the first node reports the INT report when the CAR is stained green, and discards the INT report when the CAR is stained yellow or red; when the INT report marker is a sensitive report, the first node reports the INT report when CAR stains green or yellow and discards the INT report when CAR stains red.
It can be understood that when the CAR speed limiting system is adopted by the first node, two layers can be designed, one layer is for the whole network system, and the other layer is for a single tenant (i.e. a single user).
In a third aspect, the present application provides a communication apparatus having a function of implementing the behavior of the first node in the second aspect. The function can be realized by hardware, and can also be realized by executing corresponding software by hardware. The hardware or software includes one or more modules corresponding to the functions described above.
In one possible implementation, the apparatus includes means or modules for performing the steps of the second aspect above. For example, the apparatus includes: the acquisition module is used for acquiring the traffic characteristics of the service traffic; the processing module is used for determining an in-band telemetry INT strategy of the service flow according to the flow characteristics; and the sending module is used for sending the service message of the service flow to the switching equipment, and the INT strategy is carried by the service message, so that the switching equipment acquires the INT information of the service flow according to the INT strategy.
Optionally, the device further comprises a storage module for storing necessary program instructions and data of the communication device.
In one possible implementation, the apparatus includes: a processor and a transceiver, the processor being configured to enable the communication device to perform the respective functions of the method provided by the second aspect described above. The transceiver is used for indicating the communication between the communication device and the switching equipment and the network equipment thereof, and sending the service message related in the method to the switching equipment or receiving the service message sent by the switching equipment. Optionally, the device may further include a memory for coupling to the processor that retains program instructions and data necessary for the communication device.
In one possible implementation, when the apparatus is a chip within a communication apparatus, the chip includes: the system comprises a processing module and a transceiver module, wherein the processing module can be a processor, and the processor is used for acquiring traffic characteristics of traffic; and determining an in-band telemetry INT strategy of the service flow according to the flow characteristics, wherein the transceiver module can be an input/output interface, a pin or a circuit on the chip, and the like, and transmits the service message carrying the INT strategy generated by the processor to other chips or modules coupled with the chip. The processing module can execute the computer executable instructions stored by the storage unit to support the communication device to execute the method provided by the first aspect. Alternatively, the storage unit may be a storage unit in the chip, such as a register, a cache, and the like, and the storage unit may also be a storage unit located outside the chip, such as a read-only memory (ROM) or another type of static storage device that can store static information and instructions, a Random Access Memory (RAM), and the like.
In one possible implementation, the apparatus includes: a processor, baseband circuitry, radio frequency circuitry, and an antenna. The processor is used for realizing control of functions of each circuit part, the baseband circuit is used for generating a service message containing an INT strategy, and the service message is subjected to analog conversion, filtering, amplification, up-conversion and the like through the radio frequency circuit and then sent to the switching equipment through the antenna. Optionally, the device further comprises a memory that holds program instructions and data necessary for the communication device.
In one possible implementation, the apparatus includes a communication interface and logic circuitry, the communication interface to obtain traffic characteristics of a traffic flow; the logic circuit is used for determining an in-band telemetry INT strategy of the service flow according to the flow characteristics; the communication interface is used for sending the service message of the service flow to the switching equipment, and the INT strategy is carried by the service message, so that the switching equipment acquires the INT of the service flow according to the INT strategy.
The processor mentioned in any of the above may be a general Processing Unit (CPU), a microprocessor, an application-specific integrated circuit (ASIC), or one or more integrated circuits for controlling the execution of programs of the above-mentioned data transmission methods.
In a fourth aspect, the present application provides a computer-readable storage medium storing computer instructions for executing the method according to any possible implementation manner of any one of the above aspects.
In a fifth aspect, embodiments of the present application provide a computer program product comprising instructions which, when run on a computer, cause the computer to perform the method of any one of the above aspects.
In a sixth aspect, the present application provides a chip system comprising a processor for enabling a communication device to implement the functions referred to in the above aspects, such as generating or processing data and/or information referred to in the above methods. In one possible design, the system-on-chip further includes a memory for storing program instructions and data necessary for the communication device to implement the functionality of any of the above aspects. The chip system may be formed by a chip, and may also include a chip and other discrete devices.
Drawings
FIG. 1 is an exemplary diagram of a system architecture in an embodiment of the present application;
fig. 2 is an exemplary structural diagram of an edge node in the embodiment of the present application;
fig. 3 is an exemplary structural diagram of a switching device in an embodiment of the present application;
FIG. 4 is a schematic diagram of one embodiment of an in-band telemetry method in an embodiment of the present application;
fig. 5 is a schematic diagram of a service packet format with an INT header in an embodiment of the present application;
fig. 6 is a schematic diagram of a format of a service packet with an INT header and INT information in an embodiment of the present application;
fig. 7 is another schematic diagram of a service packet format with an INT header and INT information in the embodiment of the present application;
FIG. 8 is a schematic workflow diagram of CAR speed limit in an embodiment of the present application;
FIG. 9 is a diagram of an application scenario for in-band telemetry in an embodiment of the present application;
FIG. 10 is a schematic diagram of one embodiment of an in-band telemetry system in an embodiment of the present application;
fig. 11 is a schematic diagram of an embodiment of a communication device in an embodiment of the present application;
fig. 12 is a schematic diagram of another embodiment of the communication device in the embodiment of the present application.
Detailed Description
In order to make the objects, technical solutions and advantages of the present application more apparent, the embodiments of the present application are described below with reference to the accompanying drawings, and it is obvious that the described embodiments are only some embodiments of the present application, but not all embodiments. As can be known to those skilled in the art, with the advent of new application scenarios, the technical solution provided in the embodiments of the present application is also applicable to similar technical problems.
The terms "first," "second," and the like in the description and in the claims of the present application and in the above-described drawings are used for distinguishing between similar elements and not necessarily for describing a particular sequential or chronological order. It will be appreciated that the data so used may be interchanged under appropriate circumstances such that the embodiments described herein may be practiced otherwise than as specifically illustrated or described herein. Moreover, the terms "comprises," "comprising," and any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, system, article, or apparatus that comprises a list of steps or modules is not necessarily limited to those steps or modules explicitly listed, but may include other steps or modules not expressly listed or inherent to such process, method, article, or apparatus. The naming or numbering of the steps appearing in the present application does not mean that the steps in the method flow have to be executed in the chronological/logical order indicated by the naming or numbering, and the named or numbered process steps may be executed in a modified order depending on the technical purpose to be achieved, as long as the same or similar technical effects are achieved. The division of the units presented in this application is a logical division, and in practical applications, there may be another division, for example, multiple units may be combined or integrated into another system, or some features may be omitted, or not executed, and in addition, the shown or discussed coupling or direct coupling or communication connection between each other may be through some interfaces, and the indirect coupling or communication connection between the units may be in an electrical or other similar form, which is not limited in this application. Furthermore, the units or sub-units described as the separate parts may or may not be physically separate, may or may not be physical units, or may be distributed in a plurality of circuit units, and some or all of the units may be selected according to actual needs to achieve the purpose of the present disclosure.
The technical scheme of the embodiment of the invention can be applied to various communication systems, such as: a Global System for Mobile communications (GSM) System, a Code Division Multiple Access (CDMA) System, a Wideband Code Division Multiple Access (WCDMA) System, a Long Term Evolution (Long Term Evolution) System, an LTE (Frequency Division Duplex) System, an LTE Time Division Duplex (FDD) System, a Time Division Duplex (TDD) System, a Universal Mobile Telecommunications System (UMTS), a 5G Communication System, and a future wireless Communication System.
As shown in fig. 1, a system architecture applied by the embodiment of the present application may be as shown in fig. 1, and the network includes a first node, a switching device, and a second node. The first node and the second node (i.e. the edge node) both comprise a host and an intelligent network card. In an exemplary scheme, the schematic structural diagram of the edge node may be as shown in fig. 2, where a host of the edge node runs a control plane process, and is mainly responsible for managing service traffic, and generates a corresponding filtering table entry according to traffic characteristics of the service traffic, and sends the filtering table entry to the intelligent network card. And the intelligent network card performs identification of part of the service flow, for example, the service flow is identified as elephant flow or the service flow is rat flow, and reports the identification result to the host control plane. In addition, an INT unit is also operated on the intelligent network card. And in the source end node, the INT unit is mainly responsible for carrying out INT detection identification on the matched and hit service flow according to the INT filtering rule unloaded by the control surface. On the receiving end node, the INT unit is responsible for finishing the extraction and the report of INT information, and undertakes the pretreatment of partial INT information, and reduces the load of the collector. The special INT acquisition unit is operated on the switching equipment, and corresponding INT information is mainly inserted into specific service flow according to an INT strategy in a service message.
The switching device in the present application may be a switch, and the network may include a plurality of switches. The switch may be an access layer switch, a convergence layer switch, or a core layer switch, and the specific situation is not limited herein. In an exemplary scheme, a schematic structural diagram of the switch may be as shown in fig. 3, where a special INT acquisition unit is run on the switch, and corresponding INT information is mainly inserted into a specific service traffic according to an INT policy in a service packet. Meanwhile, the switch can also carry out self-adaptive acquisition by combining self state information and the INT strategy, so that the load of INT acquisition is reduced.
It will be appreciated that there is also a centralized collector in the inband telemetry system, which mainly receives INT reports reported by the receiver nodes, thus implementing the functionality responsible for aggregating INT information across the network and synchronizing the information to the analyzer or other control plane according to the actual operation and maintenance requirements.
For convenience of understanding, some terms in the embodiments of the present application are described below:
in-band Network Telemetry (INT): a framework for realizing network state collection and reporting in a data plane, which is proposed by P4 language alliance (P4.org), can integrate data according to network states and quickly detect states such as network faults, sub-health and the like. The in-band remote measurement does not affect the forwarding path of the original service message, the cost is low, and the operation and maintenance cost of the data center network is greatly reduced.
Data Center Network (DCN): data center networks are networks applied in data centers, and internal traffic of the data center networks presents characteristics of typical data concentration, east-west traffic increase and the like. The networking of the data center network has strong symmetry and regularity and good closure.
Referring specifically to fig. 4, an embodiment of an in-band telemetry method in an embodiment of the present application includes:
401. the first node identifies a traffic characteristic of the traffic flow.
When the first node runs in service flow, the host or the intelligent network card of the first node senses and identifies the flow characteristics of the service flow.
In this embodiment, the first node may generate a service flow table in advance, where the service flow table includes all service flows that may be operated in a network where the first node is located. And classifying the service flow according to the flow characteristics in the service flow table. In an exemplary scheme, the specific rules for classifying the traffic based on the traffic characteristics may be as follows:
1. the traffic is classified as elephant or rat based on its duration and bandwidth occupancy. Wherein, elephant flow refers to a large and continuous data transmission process (i.e. large bandwidth occupied by data and long data transmission time) performed through a network link. The mouse flow refers to a small and short-time data transmission process (namely, the data occupies small bandwidth and has short duration) through a network link. Information transfer such as social software belongs to the stream of mice; while the migration of virtual machines belongs to the elephant stream.
2. And based on the information such as protocol type, priority and the like of the service flow, the service flow is divided into a time delay sensitive flow or a non-time delay sensitive flow. The delay-sensitive flow refers to a data transmission process that realizes low delay, low jitter and zero packet loss rate through a network link (that is, the data transmission process needs to ensure the service quality).
3. Traffic is classified as east-west traffic or north-south traffic based on the type of destination node of the traffic. The traffic flow between the client and the server is referred to as north-south traffic, i.e., the north-south traffic is server-client traffic. Traffic between different servers and traffic between different data centers is referred to as east-west traffic, i.e., east-west traffic is server-server traffic.
It is understood that the same traffic may be tagged with at least one flow type, such as rat flow and delay sensitive flow.
402. And the first node determines an INT strategy corresponding to the service flow according to the flow characteristic.
And after recognizing the flow characteristics, the first node determines the flow type identifier of the service flow according to the service flow table and determines whether the service flow needs to be subjected to INT detection. And if the business flow needs to be subjected to INT detection, determining an INT strategy corresponding to the business flow from a pre-generated INT acquisition rule table according to the flow type identifier of the business flow, wherein the INT strategy at least comprises an INT option and the flow type identifier.
In an exemplary scheme, the specific manner of the first node in generating the INT acquisition rule table may be as follows: the host control surface of the first node performs flow characteristic identification and classification on part of the service flow to generate a corresponding service flow table, and meanwhile, the service flow table is unloaded to the intelligent network card. The intelligent network card identifies and classifies the flow characteristics of part of the service flow, records the result obtained by identification and classification to the service flow table, and finally synchronizes the updated service flow table to the control surface of the host. And the host control surface decides whether to carry out INT acquisition on each service flow, which information to acquire, acquisition interval and the like according to the information recorded by the service flow table. Besides the switch ID, the ingress port and the egress port number as the necessary options for identifying the attribution of the related information, the collected INT information can also comprise other INT options. The INT option includes at least one of: hop-by-hop forwarding delay; accumulating the time delay; queue depth; overall Buffer occupancy; port utilization; a timestamp of an ingress port and a timestamp of an egress port; the port accumulated forwarding byte number and the port accumulated forwarding message number. The control surface generates an INT acquisition rule table according to the information and unloads the INT acquisition rule table to the intelligent network card, each item of information needing to be acquired in the INT acquisition rule table is marked by an enabling bit, flow characteristics are marked by flow type identifications (the flow type identifications of the same service flow can comprise at least one), and each acquisition rule can also specify the sampling interval. An alternative format for the INT acquisition rule table is shown in table 1:
TABLE 1
Figure BDA0002678397480000081
In table 1 "queue depth: 0 "may be used to indicate the depth of the non-acquisition queue," port utilization: a 1 "may be used to indicate acquisition port utilization. Keys may be in forms other than five-tuple. While the acquisition interval may be expressed using time, such as "10 microseconds" as shown in table 1; the number of service messages may also be used for representation, for example, the collection is performed at intervals of 4 service messages.
When the first node forwards the service message, the INT acquisition rule table needs to be queried according to the traffic characteristics of the service traffic corresponding to the service message, and then the INT policy (i.e., a certain INT acquisition rule in the INT acquisition rule table) is determined according to the INT acquisition rule table under the condition of query hit.
403. And the first node sends a service message corresponding to the service flow to the switching equipment, wherein the service message carries the INT strategy.
When the first node forwards the service message, the INT acquisition rule table needs to be queried according to the traffic characteristics of the service traffic corresponding to the service message, and then the INT policy (that is, the INT acquisition rule matching the service traffic in the INT acquisition rule table) is inserted into the service message according to the acquisition interval of the INT acquisition rule table under the condition of query hit. In an exemplary scheme, the first node may insert the INT policy by inserting an INT header in the traffic message. At this time, the format of the service packet with the INT header may be as shown in fig. 5: and inserting the INT header between the message header and the load, wherein the INT header comprises a stream type identifier and an enabling identifier of an INT option in the INT acquisition rule table.
404. And the switching equipment acquires INT information corresponding to the service flow according to the INT strategy.
After receiving the service message, the switching device analyzes the service message to obtain an INT strategy carried by the service message, and then acquires INT information corresponding to the service flow according to the INT strategy.
In an exemplary scheme, to reduce the load, the switching device may adaptively acquire INT information corresponding to the traffic flow according to self-state information and the INT policy. I.e. the switching device decides the information to collect according to the stream type indicated by the stream type identification, the enable bit of the INT option and the state of the switching device itself. The switching equipment adjusts according to the self state and the stream type identified in the INT header, and meanwhile, the switching equipment further judges whether the corresponding INT information needs to be inserted into the service message or not only when the enabling bit identification is valid. For example, for a service flow sensitive to delay, the INT option is to acquire hop-by-hop forwarding delay, and at this time, the switching device may perform INT acquisition (that is, acquire the current hop-by-hop forwarding delay) only when it is checked that the current hop-by-hop forwarding delay exceeds a preset threshold. In addition, the switching device can make an acquisition decision according to the buffer use condition, the port utilization rate and the like. The general principle is to feed back sensitive states such as congestion, time delay and the like existing in the network as much as possible, and for the states such as low load and low time delay, INT acquisition can be reduced or not performed, so that the load is reduced.
In an exemplary scheme, after the switching device recognizes that the traffic flow is in the sensitive state, the switching device may further identify the INT information as the sensitive information in the INT header. The sensitive status here can be used to indicate a network status anomaly.
405. The switching device sends the INT information to the second node.
The switching equipment inserts the acquired INT information into the service message and forwards the service message to the second node.
In an exemplary scheme, the switching device inserts the INT information into the INT header and the Payload (Payload) of the service packet, where an alternative format of the service packet may be as shown in fig. 6. The INT information includes identification information of the switching device, and information corresponding to an ingress port, an egress port number and an INT option. If a plurality of switching devices are included between the first node and the second node, and at least two switching devices in the plurality of switching devices collect the INT information, the format of the service packet may be as shown in fig. 7.
406. The second node generates an INT report based on the INT information.
And after receiving the service message, the second node analyzes the service message to obtain INT information on the whole forwarding path, and finally generates an INT report according to the INT information.
In an exemplary scheme, in order to reduce the load, the second node may analyze INT information of the hop-by-hop switching device and process the INT information according to a preset rule, and only INT information satisfying the preset rule is generated into an INT report. An exemplary scheme of the preset rule and the preprocessing mode may be as shown in table 2:
TABLE 2
Figure BDA0002678397480000101
In this embodiment, the table 2 only shows a part of examples of the preset rule, and the specific preset rule may have other manners, which is not limited herein, as long as the load can be reduced as much as possible on the premise that the operation and maintenance requirements are met.
407. The second node sends the INT report to a collector.
The second node reports the INT to a collector after generating the INT report.
In an exemplary scenario, the second node may rate limit the reporting of INT reports by a hierarchical CAR in order to further control the collector's load. The work flow of the method can be as shown in fig. 8, after the INT report is generated by the second node, the INT report is subjected to speed limitation (namely, the overall speed limitation shown in fig. 8) according to the overall network, after the overall network speed limitation stage passes, the single user is subjected to speed limitation (namely, the second-stage speed limitation), and after the second-stage speed limitation stage passes, the INT report is reported by the second node. While the specific implementation of the overall speed limit and the second stage speed limit may be as follows:
when the INT report marker is a non-sensitive report, the second node reports the INT report when the CAR is stained green and discards the INT report when the CAR is stained yellow or red; when the INT report marker is a sensitive report, the second node reports the INT report when the CAR is stained green or yellow and discards the INT report when the CAR is stained red.
In this embodiment, a total speed limit is taken as an example, the second node performs CAR speed limit dyeing on the INT report, and then the second node determines whether the INT report dyeing is green, and if the INT report dyeing is green, the second node may directly determine that the INT report passes through the total speed limit; if the INT report staining is not green, the second node judges whether the INT report staining is red, and if the INT report staining is red, the second node selectively discards the INT report; if the INT report is not dyed red, the second node judges whether the INT report is marked as a sensitive state, and if so, the second node determines that the INT report passes through the overall speed limit; if not, the second node chooses to discard the INT report. In the overall speed limit, the CAR dyeing of the INT report by the second node can be realized by comparing the bandwidth required for reporting the INT report with the available bandwidth used for reporting the INT report by the overall network, and if the difference between the available bandwidth and the bandwidth required for reporting the INT report is larger, the second node determines that the INT report is dyed in green; if the available bandwidth is less than the bandwidth required by the INT report, the second node determines that the INT report is colored red; if the difference between the available bandwidth and the bandwidth required for the INT report is small (i.e., the available bandwidth is similar to the bandwidth required for the INT report), then the second node determines that the INT report is colored yellow.
Similarly, the second-stage speed limit also adopts the above process, but the available bandwidth of the second-stage speed limit is the available bandwidth of a single user. The detailed process is not described herein.
In the following, a specific application scenario is described as an in-band telemetry method in the embodiment of the present application, please refer to a two-stage three-hop networking shown in fig. 9, where the two-stage three-hop networking includes an edge node 1, a first ToR switch, a Leaf switch, a second ToR switch, and an edge node 2. Wherein this edge node 1 includes host computer 1 and intelligent network card 1, and first ToR switch is as first hop switch, and the Leaf switch is as the second hop switch, and the second ToR switch is as the third hop switch, and this edge node 2 includes host computer 2 and intelligent network card 2. In the network, the intelligent network card 1 on the host 1 cooperates with the host 1 to complete the identification of the flow characteristics of each service flow, and simultaneously, the host 1 issues a pre-generated rule table to the intelligent network card 1. A currently sent service message is matched with the hit rule table, the intelligent network card 1 identifies the current service message as a delay sensitive flow message according to the content of the rule table, and the INT information to be collected is the hop-by-hop port utilization rate.
After receiving the message, the first ToR switch on the forwarding path resolves that the service message has an INT policy (that is, the flow type identifier is a delay sensitive flow, and the INT option is a port utilization rate). At this time, the port utilization rate of the first ToR switch has reached 126%, which exceeds the collection upper limit by 120%, and there is a risk of path overload. Therefore, the first ToR switch inserts necessary information such as its own switch identifier, a message forwarding port, etc., and the current port utilization rate as INT metadata into the INT option of the original service message, and identifies the INT information in the service message as sensitive information. And then the first ToR switch forwards the service message inserted with the INT metadata to the Leaf switch.
The message is analyzed and processed on the Leaf switch according to the same flow, and at the moment, the port utilization rate of the Leaf switch is only 24%, and the risk of path overload is avoided; meanwhile, in a low-load state of a link, the low-delay forwarding of the current service message can be kept, and the delay sensitivity requirement of the current service message is met, so that port utilization rate information can be acquired on the leaf switch according to the calculated probability. In one exemplary scenario, the algorithm is described as follows:
Figure BDA0002678397480000121
according to the above algorithm, if p is 1, the port utilization information of the current switch is collected with a probability of only 6.11% under the link load with the port utilization of 24%. Assume that the leaf switch does not collect port utilization at this time. The leaf switch does not insert INT metadata into the service packet forwarded by the first ToR switch, but forwards the service packet forwarded by the first ToR switch to the second ToR switch again.
And finally, on the second ToR switch, the condition that the port utilization rate exceeds the acquisition upper limit also appears on the forwarding port of the message, so that the second ToR switch inserts necessary information such as the switch identification of the second ToR switch, the message forwarding port and the like and the current port utilization rate as INT metadata into the metadata of the service message forwarded by the first ToR switch, and the INT information in the service message is identified as sensitive information. Then, the second ToR switch forwards the service packet into which the INT metadata is inserted to the edge node 2.
And after receiving the service message, the edge node 2 strips the acquired INT data and preprocesses the INT data, checks that overload risks exist on a message forwarding path, and decides to generate an INT report and report the INT report to a collector. It is assumed that the collection of INT is strictly controlled on all source end hosts (i.e., nodes having the function of edge node 1), the total INT information collected on the network is within the processing capacity range of the collector, at this time, the two-stage rate-limiting filtering on the destination host (i.e., nodes having the function of edge node 2) is passed, and the finally generated INT report is uploaded to the collector.
In the embodiment of the application, the network edge nodes (such as the first node and the second node) are fully utilized, the flexibility and the high efficiency are realized, the capacity of sensing the flow characteristics is realized, the acquisition decision of the INT information is unloaded to the edge nodes, and the intermediate switch only acts according to a specific INT strategy, so that the flexible and self-adaptive INT acquisition is realized. Meanwhile, the collection of INT information and the INT report are limited, so that the load is reduced.
The in-band telemetry method in the embodiment of the present application is described above, and the in-band telemetry system and the communication apparatus in the embodiment of the present application are described below.
Referring specifically to fig. 10, in the embodiment of the present application, the in-band telemetry system includes a first node 1001, a switching device 1002, and a second node 1003. The first node 1001 is configured to obtain a traffic characteristic of a service traffic; determining an in-band telemetry INT strategy of the service flow according to the flow characteristics; sending the service packet of the service traffic to the switching device 1002, where the service packet carries the INT policy;
the switching device 1002 is configured to acquire INT information corresponding to the service traffic according to the INT policy; sending the INT information to the second node 1003;
the second node 1003 is configured to generate an INT report according to the INT information and report the INT report.
Optionally, the first node 1001 is specifically configured to determine an INT policy of the service traffic from an INT acquisition rule table according to the traffic characteristic, where the INT acquisition rule table is used to indicate a correspondence between the traffic characteristic and the INT policy, and the INT policy includes an INT option and a stream type identifier.
Optionally, the INT option includes at least one of: hop-by-hop forwarding delay, accumulated delay, queue depth, buffer occupancy of a total buffer area, port utilization, ingress and egress timestamps, the number of port accumulated forwarding bytes, and the number of port accumulated forwarding messages.
Optionally, the flow type identifier includes at least one of: the stream type is identified as a rat stream or elephant stream; the flow type identification is a time delay sensitive flow or a non-time delay sensitive flow; the flow type is identified as east-west or north-south traffic.
Optionally, the first node 1001 is further configured to insert the INT policy in the service message.
Optionally, the switching device 1002 is specifically configured to obtain self status information; and acquiring INT information corresponding to the service flow according to the self state information and the INT strategy.
Optionally, the second node 1003 is specifically configured to preprocess the INT information according to a preset rule to generate an INT report; and reporting the INT report.
Optionally, the preset rule includes at least one of: generating an INT report when the accumulated time delay is greater than a first threshold; generating an INT report when the link utilization is greater than a second threshold; generating an INT report when the link jitter condition exceeds a preset range; generating an INT report when the buffer occupancy rate of the switching device 1002 is greater than a third threshold; and generating an INT report when the path of the service message is switched.
Optionally, the second node 1003 is specifically configured to report the INT report according to the CAR speed limit.
Optionally, the second node 1003 is specifically configured to, when the INT report is a non-sensitive report, report the INT report when the CAR is stained green, and discard the INT report when the CAR is stained yellow or red; reporting the INT report when the INT report marker is a sensitive report when the CAR is stained green or yellow, and discarding the INT report when the CAR is stained red.
Specifically, referring to fig. 11, the communication device 1100 in the embodiment of the present application includes: the device comprises an acquisition module 1101, a processing module 1102 and a sending module 1103, wherein the acquisition module 1101, the processing module 1102 and the sending module 1103 are connected through a bus. The communication device 1100 may be the first node or the second node in the above method embodiments, and may also be configured as one or more chips within the first node or the second node. The communication device 1100 may be used to perform part or all of the functionality of the first node or the second node in the above-described method embodiments.
For example, the obtaining module 1101 may be configured to execute step 401 in the foregoing method embodiment. For example, the obtaining module 1101 obtains traffic characteristics of the service traffic; the processing module 1102 may be configured to perform steps 402 or 406 in the above method embodiments, for example, the processing module 1102 determines an in-band telemetry INT policy for the traffic flow according to the flow characteristics; the sending module 1103 may be configured to perform step 403 or step 407 in the above method embodiment. For example, the sending module 1103 sends a service packet of the service traffic to the switching device, where the service packet carries the INT policy, so that the switching device collects INT information of the service traffic according to the INT policy.
Optionally, the communication device 1100 further includes a storage module, which is coupled to the processing module, so that the processing module can execute the computer-executable instructions stored in the storage module to implement the functions of the terminal in the above-described method embodiments. In one example, the memory module optionally included in the communication apparatus 1100 may be a memory unit within a chip, such as a register, a cache, etc., and may also be a memory unit located outside the chip, such as a ROM or other types of static memory devices that can store static information and instructions, a RAM, etc.
It should be understood that the flow executed between the modules of the first node or the second node in the corresponding embodiment of fig. 11 is similar to the flow executed by the first node or the second node in the corresponding method embodiment of fig. 1 to fig. 9, and details thereof are not repeated here.
Fig. 12 shows a possible structure diagram of a communication apparatus 1200 in the above embodiment, and the communication apparatus 1200 may be configured as the aforementioned first node or second node. The communication apparatus 1200 may include: a processor 1202, a computer-readable storage medium/memory 1203, a transceiver 1204, an input device 1205, an output device 1206, and a bus 1201. Wherein the processor, transceiver, computer readable storage medium, etc. are connected by a bus. The embodiments of the present application do not limit the specific connection medium between the above components.
In one example, the processor 1202 obtains traffic characteristics of the traffic flow; determining an in-band telemetry INT strategy of the service traffic according to the traffic characteristics;
the transceiver 1204 sends the service packet of the service traffic to the switching device, where the service packet carries the INT policy, so that the switching device collects INT information of the service traffic according to the INT policy.
In yet another example, the processor 1202 may run an operating system that controls functions between various devices and appliances. The transceiver 1204 may include a baseband circuit and a radio frequency circuit, for example, the traffic message may be processed by the baseband circuit and the radio frequency circuit and then sent to a switching device or a collector.
The transceiver 1204 and the processor 1202 may implement corresponding steps in any of the embodiments of fig. 1 to 9, which are not described herein in detail.
It is understood that fig. 12 merely illustrates a simplified design of a communication device, and in practical applications, the communication device may include any number of transceivers, processors, memories, etc., and all communication devices that may implement the present application are within the scope of the present application.
The processor 1202 involved in the apparatus 1200 may be a general-purpose processor, such as a CPU, a Network Processor (NP), a microprocessor, etc., or may be an ASIC, or one or more integrated circuits for controlling the execution of the program according to the present invention. But also a Digital Signal Processor (DSP), a field-programmable gate array (FPGA) or other programmable logic device, discrete gate or transistor logic device, discrete hardware components. The controller/processor can also be a combination of computing functions, e.g., comprising one or more microprocessors, DSPs, and microprocessors, among others. Processors typically perform logical and arithmetic operations based on program instructions stored within memory.
The bus 1201 may be a Peripheral Component Interconnect (PCI) bus, an Extended Industry Standard Architecture (EISA) bus, or the like. The bus may be divided into an address bus, a data bus, a control bus, etc. For ease of illustration, only one thick line is shown in FIG. 12, but this is not intended to represent only one bus or type of bus.
The computer-readable storage medium/memory 1203 referred to above may also hold an operating system and other application programs. In particular, the program may include program code including computer operating instructions. More specifically, the memory may be ROM, other types of static storage devices that may store static information and instructions, RAM, other types of dynamic storage devices that may store information and instructions, disk storage, and so forth. The memory 1203 may be a combination of the above storage types. And the computer-readable storage medium/memory described above may be in the processor, may be external to the processor, or distributed across multiple entities including the processor or processing circuitry. The computer-readable storage medium/memory described above may be embodied in a computer program product. By way of example, a computer program product may include a computer-readable medium in packaging material.
Alternatively, embodiments of the present application also provide a general-purpose processing system, such as that commonly referred to as a chip, including one or more microprocessors that provide processor functionality; and an external memory providing at least a portion of the storage medium, all connected together with other supporting circuitry through an external bus architecture. The memory stores instructions that, when executed by the processor, cause the processor to perform some or all of the steps of the in-band telemetry method of the embodiment of fig. 1-9 of the communication device, and/or other processes for the techniques described herein.
The steps of a method or algorithm described in connection with the disclosure herein may be embodied in hardware or in software instructions executed by a processor. The software instructions may consist of corresponding software modules that may be stored in RAM memory, flash memory, ROM memory, EPROM memory, EEPROM memory, registers, a hard disk, a removable disk, a CD-ROM, or any other form of storage medium known in the art. An exemplary storage medium is coupled to the processor such the processor can read information from, and write information to, the storage medium. Of course, the storage medium may also be integral to the processor. The processor and the storage medium may reside in an ASIC. Additionally, the ASIC may reside in a communication device. Of course, the processor and the storage medium may reside as discrete components in a communication apparatus.
It is clear to those skilled in the art that, for convenience and brevity of description, the specific working processes of the above-described systems, apparatuses and units may refer to the corresponding processes in the foregoing method embodiments, and are not described herein again.
In the several embodiments provided in the present application, it should be understood that the disclosed system, apparatus and method may be implemented in other manners. For example, the above-described apparatus embodiments are merely illustrative, and for example, the division of the units is only one logical division, and other divisions may be realized in practice, for example, a plurality of units or components may be combined or integrated into another system, or some features may be omitted, or not executed. In addition, the shown or discussed mutual coupling or direct coupling or communication connection may be an indirect coupling or communication connection through some interfaces, devices or units, and may be in an electrical, mechanical or other form.
The units described as separate parts may or may not be physically separate, and parts displayed as units may or may not be physical units, may be located in one place, or may be distributed on a plurality of network units. Some or all of the units can be selected according to actual needs to achieve the purpose of the solution of the embodiment.
In addition, functional units in the embodiments of the present application may be integrated into one processing unit, or each unit may exist alone physically, or two or more units are integrated into one unit. The integrated unit can be realized in a form of hardware, and can also be realized in a form of a software functional unit.
The integrated unit, if implemented in the form of a software functional unit and sold or used as a stand-alone product, may be stored in a computer readable storage medium. Based on such understanding, the technical solution of the present application may be substantially implemented or contributed to by the prior art, or all or part of the technical solution may be embodied in a software product, which is stored in a storage medium and includes instructions for causing a computer device (which may be a personal computer, a server, or a network device) to execute all or part of the steps of the method according to the embodiments of the present application. And the aforementioned storage medium includes: a U-disk, a removable hard disk, a Read-Only Memory (ROM), a Random Access Memory (RAM), a magnetic disk or an optical disk, and other various media capable of storing program codes.
The above embodiments are only used for illustrating the technical solutions of the present application, and not for limiting the same; although the present application has been described in detail with reference to the foregoing embodiments, it should be understood by those of ordinary skill in the art that: the technical solutions described in the foregoing embodiments may still be modified, or some technical features may be equivalently replaced; and such modifications or substitutions do not depart from the spirit and scope of the corresponding technical solutions in the embodiments of the present application.

Claims (33)

1. An in-band telemetry system, comprising:
a first node, a switching device and a second node;
the first node acquires the traffic characteristics of the service traffic;
the first node determines an in-band telemetry (INT) strategy of the service traffic according to the traffic characteristics;
the first node sends the service message of the service flow to the switching equipment, and the service message carries the INT strategy;
the switching equipment acquires INT information corresponding to the service flow according to the INT strategy;
the switching equipment sends the INT information to the second node;
and the second node generates an INT report according to the INT information and reports the INT report.
2. The system of claim 1, wherein the first node determining the in-band telemetry INT policy for the traffic based on the traffic characteristics comprises:
and the first node determines an INT strategy of the service flow from an INT acquisition rule table according to the flow characteristics, wherein the INT acquisition rule table is used for indicating the corresponding relation between the flow characteristics and the INT strategy, and the INT strategy comprises INT options and flow type identifications.
3. The system according to claim 2, wherein the INT option comprises at least one of: hop-by-hop forwarding delay, accumulated delay, queue depth, buffer occupancy of a total buffer area, port utilization, ingress and egress timestamps, the number of port accumulated forwarding bytes, and the number of port accumulated forwarding messages.
4. The system of claim 2, wherein the flow type identifier comprises at least one of:
the stream type is identified as a rat stream or a elephant stream;
the stream type identification is a delay sensitive stream or a non-delay sensitive stream;
the flow type is identified as east-west traffic or north-south traffic.
5. The system according to any of claims 1 to 4, wherein before the first node sends a traffic packet of the traffic flow to the switching device, the first node is further configured to insert the INT policy in the traffic packet.
6. The system according to any one of claims 1 to 5, wherein the acquiring, by the switching device, INT information corresponding to the traffic flow according to the INT policy includes:
the switching equipment acquires self state information;
and the switching equipment acquires INT information corresponding to the service flow according to the self state information and the INT strategy.
7. The system according to any one of claims 1 to 6, wherein the generating and reporting of the INT report by the second node according to the INT information comprises:
the second node preprocesses the INT information according to a preset rule to generate an INT report;
and the second node reports the INT report.
8. The system of claim 7, wherein the preset rules include at least one of:
generating an INT report when the accumulated time delay is greater than a first threshold;
generating an INT report when the link utilization is greater than a second threshold;
generating an INT report when the link jitter condition exceeds a preset range;
generating an INT report when the buffer occupancy rate of the switching equipment is greater than a third threshold value;
and generating an INT report when the path of the service message is switched.
9. The system according to any of claims 7 or 8, wherein said second node reporting said INT report comprises:
and the second node reports the INT report according to the committed access rate CAR speed limit.
10. The system of claim 9, wherein the second node reporting the INT report according to the CAR rate limit system comprises:
when the INT report marker is a non-sensitive report, the second node reports the INT report when the CAR stain is green and discards the INT report when the CAR stain is yellow or red;
when the INT report marker is a sensitive report, the second node reports the INT report when CAR staining is green or yellow and discards the INT report when the CAR staining is red.
11. An in-band telemetry method, comprising:
a first node acquires traffic characteristics of service traffic;
the first node determines an in-band telemetry (INT) strategy of the service traffic according to the traffic characteristics;
and the first node sends the service message of the service flow to the switching equipment, and the service message carries the INT strategy, so that the switching equipment acquires the INT information of the service flow according to the INT strategy.
12. The method of claim 11, wherein the first node determining an in-band telemetry (INT) policy for the traffic based on the traffic characteristics comprises:
and the first node determines an INT strategy of the service flow from an INT acquisition rule table according to the flow characteristics, wherein the INT acquisition rule table is used for indicating the corresponding relation between the flow characteristics and the INT strategy, and the INT strategy comprises INT options and flow type identifications.
13. The method according to claim 12, wherein the INT option comprises at least one of: hop-by-hop forwarding delay, cumulative delay, queue depth, total buffer occupancy, port utilization, ingress and egress timestamps, port cumulative number of forwarding bytes, and port cumulative number of forwarding messages.
14. The method of claim 12, wherein the flow type identifier comprises at least one of:
the stream type is identified as a rat stream or a elephant stream;
the stream type identification is a delay sensitive stream or a non-delay sensitive stream;
the flow type is identified as east-west traffic or north-south traffic.
15. The method according to any of claims 11 to 14, wherein before the first node sends the traffic packet of the traffic flow to the switching device, the method further comprises:
and the first node inserts the INT strategy into the service message.
16. The method according to any one of claims 11 to 15, further comprising:
the first node receives INT information sent by the switching equipment;
and the first node generates an INT report according to the INT information and reports the INT report.
17. The method of claim 16, wherein generating and reporting an INT report by the first node based on the INT information comprises:
the first node preprocesses the INT information according to a preset rule to generate an INT report;
and the first node reports the INT report.
18. The method of claim 17, wherein the preset rules comprise at least one of:
generating an INT report when the accumulated time delay is greater than a first threshold;
generating an INT report when the link utilization is greater than a second threshold;
generating an INT report when the link jitter condition exceeds a preset range;
generating an INT report when the buffer occupancy rate of the switching equipment is greater than a third threshold value;
and generating an INT report when the path of the service message is switched.
19. The method according to any of claims 17 or 18, wherein reporting, by the first node, the INT report comprises:
and the first node reports the INT report according to the CAR speed limiting system.
20. The method of claim 19, wherein reporting, by the first node, the INT report according to a CAR rate limit system comprises:
when the INT report marker is a non-sensitive report, the first node reports the INT report when the CAR staining is green and discards the INT report when the CAR staining is yellow or red;
when the INT report marker is a sensitive report, the first node reports the INT report when CAR staining is green or yellow and discards the INT report when the CAR staining is red.
21. A communications apparatus, comprising:
the acquisition module is used for acquiring the traffic characteristics of the service traffic;
the processing module is used for determining an in-band telemetry INT strategy of the service flow according to the flow characteristics;
and the sending module is used for sending the service message of the service flow to the switching equipment, and the INT strategy is carried by the service message, so that the switching equipment acquires the INT information of the service flow according to the INT strategy.
22. The communication device according to claim 21, wherein the processing module is specifically configured to determine an INT policy of the service traffic from an INT acquisition rule table according to the traffic characteristics, wherein the INT acquisition rule table is configured to indicate a correspondence between the traffic characteristics and the INT policy, and the INT policy includes an INT option and a stream type identifier.
23. The communication device according to claim 22, wherein the INT option comprises at least one of: hop-by-hop forwarding delay, cumulative delay, queue depth, total buffer occupancy, port utilization, ingress and egress timestamps, port cumulative number of forwarding bytes, and port cumulative number of forwarding messages.
24. The communications apparatus of claim 22, wherein the flow type identifier comprises at least one of:
the stream type is identified as a rat stream or a elephant stream;
the stream type identification is a delay sensitive stream or a non-delay sensitive stream;
the flow type is identified as east-west traffic or north-south traffic.
25. The communication device according to any of claims 21 to 24, wherein the processing module is further configured to insert the INT policy in the traffic packet.
26. The communication apparatus according to any of claims 21 to 25, wherein the communication apparatus further comprises a receiving module, configured to receive INT information sent by the switching device;
the processing module is used for generating an INT report according to the INT information;
and the sending module is used for reporting the INT report.
27. The communication device according to claim 26, wherein the processing module is specifically configured to preprocess the INT information according to a preset rule to generate an INT report.
28. The communications apparatus as claimed in claim 27, wherein the preset rules comprise at least one of:
generating an INT report when the accumulated time delay is greater than a first threshold;
generating an INT report when the link utilization is greater than a second threshold;
generating an INT report when the link jitter condition exceeds a preset range;
generating an INT report when the buffer occupancy rate of the switching equipment is greater than a third threshold value;
and generating an INT report when the path of the service message is switched.
29. The communication device according to any of claims 27 or 28, wherein the sending module is configured to report the INT report according to a CAR speed limit system.
30. The communication device according to claim 29, wherein the sending module is specifically configured to report the INT report when the INT report is labeled as a non-sensitive report, when the CAR is colored green, and to discard the INT report when the CAR is colored yellow or red;
reporting the INT report when the INT report marker is a sensitive report when CAR staining is green or yellow, and discarding the INT report when the CAR staining is red.
31. A communications apparatus, comprising at least one processor and a memory, the processor configured to couple with the memory, the processor invoking instructions stored in the memory to control the communications apparatus to perform the method of any of claims 11-20.
32. A computer storage medium having computer instructions stored thereon for performing the method of any of the above claims 11 to 20.
33. A computer program product comprising instructions which, when run on a computer, cause the computer to perform the method of any one of the preceding claims 11 to 20.
CN202010955292.XA 2020-09-11 2020-09-11 In-band telemetry system, method and device Pending CN114257526A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010955292.XA CN114257526A (en) 2020-09-11 2020-09-11 In-band telemetry system, method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010955292.XA CN114257526A (en) 2020-09-11 2020-09-11 In-band telemetry system, method and device

Publications (1)

Publication Number Publication Date
CN114257526A true CN114257526A (en) 2022-03-29

Family

ID=80788771

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010955292.XA Pending CN114257526A (en) 2020-09-11 2020-09-11 In-band telemetry system, method and device

Country Status (1)

Country Link
CN (1) CN114257526A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115442275A (en) * 2022-07-27 2022-12-06 北京邮电大学 Hybrid telemetry method and system based on hierarchical trusted streams

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115442275A (en) * 2022-07-27 2022-12-06 北京邮电大学 Hybrid telemetry method and system based on hierarchical trusted streams
CN115442275B (en) * 2022-07-27 2024-02-27 北京邮电大学 Hybrid telemetry method and system based on hierarchical trusted streams

Similar Documents

Publication Publication Date Title
JP4774357B2 (en) Statistical information collection system and statistical information collection device
US9332546B2 (en) Radio resource optimizing method, apparatus, and system
CN111130946B (en) Acceleration method and device for deep packet identification and storage medium
CN103444132A (en) Network system, and switching method
EP3979577B1 (en) Queue congestion control method, device and storage medium
CN115766591B (en) Method for limiting flow rate of fragmented messages and method and device for forwarding DPU fragmented messages
CN102413054B (en) Method, device and system for controlling data traffic as well as gateway equipment and switchboard equipment
CN109952743B (en) System and method for low memory and low flow overhead high flow object detection
CN114257526A (en) In-band telemetry system, method and device
Gomez et al. Traffic classification in IP networks through Machine Learning techniques in final systems
CN114189902A (en) Customized power 5G/B5G communication access method based on power service QoS flow mapping
US11218411B2 (en) Flow monitoring in network devices
EP4181479A1 (en) Method for identifying flow, and apparatus
CN114095383B (en) Network flow sampling method and system and electronic equipment
US11477126B2 (en) Network device and method for processing data about network packets
CN112769650A (en) Multi-VLAN loop detection method and system
CN114866488A (en) Information flow identification method, network chip and network equipment
US9319327B2 (en) Packet transmission method, packet transmission apparatus, and storage medium
CN106557527A (en) A kind of pornographic image detection and system
CN114785396B (en) Logic port configuration, lookup mapping and traffic management method, system and terminal
CN111030939B (en) Method and device for realizing IPFIX output based on chip
CN114143209B (en) System and method for controlling elastic expansion of channel in network measurement
CN114826775A (en) Method, device, system, equipment and medium for generating filtering rule of data packet
US20190068475A1 (en) System and method for the utilization of mesh networks and distributed data buffering to increase data retention
CN117499352A (en) Active identification and forwarding method and system for TSN (traffic channel network) mixed traffic

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination