WO2021057517A1 - Method for monitoring multicast service and board - Google Patents

Method for monitoring multicast service and board Download PDF

Info

Publication number
WO2021057517A1
WO2021057517A1 PCT/CN2020/114858 CN2020114858W WO2021057517A1 WO 2021057517 A1 WO2021057517 A1 WO 2021057517A1 CN 2020114858 W CN2020114858 W CN 2020114858W WO 2021057517 A1 WO2021057517 A1 WO 2021057517A1
Authority
WO
WIPO (PCT)
Prior art keywords
group
board
layer
hardware
data
Prior art date
Application number
PCT/CN2020/114858
Other languages
French (fr)
Chinese (zh)
Inventor
路永斌
Original Assignee
中兴通讯股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 中兴通讯股份有限公司 filed Critical 中兴通讯股份有限公司
Publication of WO2021057517A1 publication Critical patent/WO2021057517A1/en

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04QSELECTING
    • H04Q11/00Selecting arrangements for multiplex systems
    • H04Q11/0001Selecting arrangements for multiplex systems using optical switching
    • H04Q11/0062Network aspects
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04QSELECTING
    • H04Q11/00Selecting arrangements for multiplex systems
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04QSELECTING
    • H04Q11/00Selecting arrangements for multiplex systems
    • H04Q11/0001Selecting arrangements for multiplex systems using optical switching
    • H04Q11/0003Details
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04QSELECTING
    • H04Q11/00Selecting arrangements for multiplex systems
    • H04Q11/0001Selecting arrangements for multiplex systems using optical switching
    • H04Q11/0062Network aspects
    • H04Q11/0067Provisions for optical access or distribution networks, e.g. Gigabit Ethernet Passive Optical Network (GE-PON), ATM-based Passive Optical Network (A-PON), PON-Ring
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04QSELECTING
    • H04Q11/00Selecting arrangements for multiplex systems
    • H04Q11/0001Selecting arrangements for multiplex systems using optical switching
    • H04Q11/0062Network aspects
    • H04Q2011/0079Operation or maintenance aspects
    • H04Q2011/0081Fault tolerance; Redundancy; Recovery; Reconfigurability

Abstract

Disclosed in embodiments of the present invention are a method for monitoring a multicast service and a board. The method is applied to an optical line terminal (OLT) and comprises: the board in the OLT regularly scans its own upper-layer service data and lower-layer hardware data; compare whether the upper-layer service data is consistent with the lower-layer hardware data; and perform multicast service monitoring according to the comparison result. In this way, a multicast service can be monitored in real time on an OLT device.

Description

一种组播业务监控的方法及板卡Method and board for monitoring multicast service 技术领域Technical field
本发明实施例涉及但不限于组播业务,更具体地涉及一种组播业务监控的方法及板卡。The embodiment of the present invention relates to but is not limited to a multicast service, and more specifically relates to a method and board for monitoring a multicast service.
背景技术Background technique
目前大规模商用的主要是EPON(Ethernet Passive Optical Network,以太网无源光网络)和GPON(Gigabit-capable Passive Optical Network,吉比特无源光网络)技术,与传统的接入网技术及以太网交换机技术相比,PON(Passive Optical Network,无源光纤网络)的物理层是点到多点的拓扑结构,其下行数据采用物理层广播方式,即下行数据会被传送到该链路上所有的终端设备(ONU,Optical Network Unit,光网络单元)。At present, large-scale commercial applications are mainly EPON (Ethernet Passive Optical Network, Ethernet passive optical network) and GPON (Gigabit-capable Passive Optical Network, Gigabit Passive Optical Network) technologies, which are compatible with traditional access network technologies and Ethernet technologies. Compared with switch technology, the physical layer of PON (Passive Optical Network) is a point-to-multipoint topology. The downlink data adopts the physical layer broadcast mode, that is, the downlink data will be transmitted to all the links on the link. Terminal equipment (ONU, Optical Network Unit, Optical Network Unit).
在物理层的基础上,PON链路层建立了逻辑上的点对点通道,所以其对于单播及组播业务的转发方式如下:On the basis of the physical layer, the PON link layer establishes a logical point-to-point channel, so its forwarding methods for unicast and multicast services are as follows:
a)、单播业务走逻辑单播通道发送给ONU。a) The unicast service is sent to the ONU through the logical unicast channel.
b)、组播业务利用了物理层下行广播的特性直接走广播通道发送给ONU。b) The multicast service takes advantage of the downstream broadcast characteristics of the physical layer and sends it directly to the ONU through the broadcast channel.
单播和组播业务使用转发通道不同的原因,是因为对于组播业务来说,如果也走单播通道意味着需要在OLT(Optical Line Terminal,光线路终端)侧做大量的组播数据包复制,增加了设备实现的复杂度,且大流量的复制组播包会给链路带宽造成很大的浪费。The reason why unicast and multicast services use different forwarding channels is because for multicast services, if the unicast channel is also used, it means that a large number of multicast data packets need to be made on the OLT (Optical Line Terminal) side Replication increases the complexity of device implementation, and large-traffic replication multicast packets will cause a lot of waste of link bandwidth.
如图1所示,对于局端OLT设备来说,在转发面上主流采用的都是PON板、交换板(控制板/主控板)、上联板的架构。其中PON板存储本板的用户加入信息,交换板(控制板/主控板)存储整个OLT的用户加入信息。因此通常组播业务的复制要在设备内的多个点进行,比如组播业务包在交换板(控制板/主控板)上需要复制转发给多个PON板,在PON板上需要复 制转发给多个PON口。当组播业务包被复制到PON口后,将不再进一步复制转发给多个ONU,而是采用下行的组播或广播通道以单拷贝方式(SCB,Single Copy Broadcast)转发给ONU。组播转发过程中,OLT设备从源端口接收组播业务流量,源端口作为组播的一级复制点,一般位于交换板(控制板/主控板)上;在PON板上,再次将流量复制分发到PON口,分发端口则为组播的二级复制点。例如,下行数据转发方式具体如图2所示,交换板上的1为一级复制点;PON板上的2为二级复制点,黑色方框为PON口。As shown in Figure 1, for the central office OLT equipment, the mainstream adopts the architecture of PON board, switch board (control board/main control board), and uplink board on the forwarding plane. The PON board stores the user joining information of the board, and the switch board (control board/main control board) stores the user joining information of the entire OLT. Therefore, the replication of multicast services usually needs to be performed at multiple points in the equipment. For example, multicast service packets need to be replicated and forwarded to multiple PON boards on the switch board (control board/main control board), and replicated and forwarded on the PON board. Give multiple PON ports. After the multicast service packet is copied to the PON port, it will not be further copied and forwarded to multiple ONUs. Instead, the downstream multicast or broadcast channel will be forwarded to the ONU in a single copy mode (SCB, Single Copy Broadcast). During the multicast forwarding process, the OLT device receives the multicast service traffic from the source port. The source port is used as the first-level replication point of the multicast, which is generally located on the switch board (control board/main control board); on the PON board, the traffic is transferred again The replication is distributed to the PON port, and the distribution port is the secondary replication point of the multicast. For example, the downstream data forwarding method is specifically shown in Figure 2. The 1 on the switch board is the first-level replication point; the 2 on the PON board is the second-level replication point, and the black box is the PON port.
随着PON网络在现网上的普及,运营商对OLT设备的业务稳定性要求也越来越高。组播作为一种非常重要且使用广泛的业务,也会在现网运行中出现各种问题。通常出现故障时,维护人员都只能被动等待用户报障,然后再去现场进行处理,效率较低,且易导致用户投诉。同时,OLT设备无法在出现故障后进行自诊断、自恢复,也导致了问题无法快速解决。With the popularization of PON networks on existing networks, operators have higher and higher requirements for the business stability of OLT equipment. As a very important and widely used service, multicast will also have various problems in the operation of the existing network. Generally, when a fault occurs, the maintenance personnel can only passively wait for the user to report the fault, and then go to the scene to deal with it, which is inefficient and easily leads to user complaints. At the same time, the OLT device cannot perform self-diagnosis and self-recovery after a failure, which also leads to problems that cannot be solved quickly.
为此,迫切需要对组播这样高价值业务进行实时监控,一旦出现问题能及时进行自诊断自恢复,确实无法自动恢复的,能及时上报告警,便于维护人员立即介入处理。但是目前现网上的OLT设备绝并不支持此功能。For this reason, there is an urgent need for real-time monitoring of high-value services such as multicast. Once a problem occurs, it can perform self-diagnosis and self-recovery in a timely manner. If it does not recover automatically, it can report an alarm in time so that maintenance personnel can immediately intervene and deal with it. However, the OLT equipment on the current network does not support this function at all.
发明内容Summary of the invention
有鉴于此,本发明实施例提供了一种组播业务监控的方法,应用于光线路终端OLT,包括:In view of this, an embodiment of the present invention provides a method for monitoring multicast services, which is applied to an optical line terminal OLT, and includes:
所述OLT中的板卡定时扫描自身的上层业务数据和底层硬件数据;The board in the OLT regularly scans its own upper-layer service data and lower-layer hardware data;
比较所述上层业务数据和底层硬件数据是否一致;Compare whether the upper-layer business data and the underlying hardware data are consistent;
根据比较结果进行组播业务监控。Monitor the multicast service according to the comparison result.
本发明实施例还提供了一种板卡,应用于光线路终端OLT,包括:The embodiment of the present invention also provides a board, which is applied to an optical line terminal OLT, and includes:
扫描单元,设置为定时扫描自身的上层业务数据和底层硬件数据;The scanning unit is set to scan its own upper-layer business data and lower-layer hardware data regularly;
比较单元,设置为比较所述上层业务数据和底层硬件数据是否一致;The comparing unit is set to compare whether the upper-layer business data and the lower-layer hardware data are consistent;
监控单元,设置为根据比较结果进行组播业务监控。The monitoring unit is set to monitor the multicast service according to the comparison result.
本发明实施例提供的技术方案,能够在OLT设备上对组播业务进行实时监控。The technical solution provided by the embodiment of the present invention can perform real-time monitoring of the multicast service on the OLT device.
本发明的其它特征和优点将在随后的说明书中阐述,并且,部分地从说明书中变得显而易见,或者通过实施本发明而了解。本发明的目的和其他优点可通过在说明书、权利要求书以及附图中所特别指出的结构来实现和获得。Other features and advantages of the present invention will be described in the following description, and partly become obvious from the description, or understood by implementing the present invention. The purpose and other advantages of the present invention can be realized and obtained through the structures specifically pointed out in the specification, claims and drawings.
附图说明Description of the drawings
附图用来提供对本发明技术方案的进一步理解,并且构成说明书的一部分,与本申请的实施例一起用于解释本发明的技术方案,并不构成对本发明技术方案的限制。The accompanying drawings are used to provide a further understanding of the technical solution of the present invention, and constitute a part of the specification. Together with the embodiments of the present application, they are used to explain the technical solution of the present invention, and do not constitute a limitation to the technical solution of the present invention.
图1为现有OLT设备的架构示意图;Figure 1 is a schematic diagram of the architecture of an existing OLT device;
图2为现有OLT设备下行数据转发的示意图;Figure 2 is a schematic diagram of downlink data forwarding of existing OLT equipment;
图3为现有OLT设备中数据转发端口的示意图;Figure 3 is a schematic diagram of a data forwarding port in an existing OLT device;
图4为本发明一实施例提供的一种组播业务监控的方法的流程示意图;4 is a schematic flowchart of a method for monitoring multicast services according to an embodiment of the present invention;
图5为本发明另一实施例提供的一种组播业务监控的方法的流程示意图;5 is a schematic flowchart of a method for monitoring multicast services according to another embodiment of the present invention;
图6为本发明另一实施例提供的一种组播业务监控的方法的流程示意图;6 is a schematic flowchart of a method for monitoring multicast services according to another embodiment of the present invention;
图7为本发明的另一实施例中板卡内监控比较结果不一致时的流程示意图;FIG. 7 is a schematic diagram of a flow when the results of monitoring and comparison in the board are inconsistent in another embodiment of the present invention;
图8为本发明的另一实施例中板卡内监控比较结果一致时的流程示意图;FIG. 8 is a schematic diagram of the flow when the comparison results of the internal monitoring of the boards are consistent in another embodiment of the present invention; FIG.
图9为本发明一实施例提供的一种板卡的结构示意图。FIG. 9 is a schematic structural diagram of a board card provided by an embodiment of the present invention.
具体实施方式detailed description
为使本发明的目的、技术方案和优点更加清楚明白,下文中将结合附 图对本发明的实施例进行详细说明。需要说明的是,在不冲突的情况下,本申请中的实施例及实施例中的特征可以相互任意组合。In order to make the objectives, technical solutions and advantages of the present invention clearer, the embodiments of the present invention will be described in detail below in conjunction with the accompanying drawings. It should be noted that the embodiments in this application and the features in the embodiments can be combined with each other arbitrarily if there is no conflict.
在附图的流程图示出的步骤可以在诸如一组计算机可执行指令的计算机系统中执行。并且,虽然在流程图中示出了逻辑顺序,但是在某些情况下,可以以不同于此处的顺序执行所示出或描述的步骤。The steps shown in the flowcharts of the drawings can be executed in a computer system such as a set of computer-executable instructions. And, although a logical sequence is shown in the flowchart, in some cases, the steps shown or described may be performed in a different order than here.
针对目前OLT的转发架构,组播的关键转发端口如图3所示,1为PON板上的PON口;2为PON板侧与交换板(控制板/主控板)链接的口;3为交换板(控制板/主控板)侧与PON板连接的口;4为交换板(控制板/主控板)侧与上联板链接的口;5为上联板侧与交换板(控制板/主控板)链接的口;6为组播源端口。For the current OLT forwarding architecture, the key forwarding ports for multicast are shown in Figure 3. 1 is the PON port on the PON board; 2 is the port that connects the PON board to the switch board (control board/main control board); 3 is The port connecting the switch board (control board/main control board) side to the PON board; 4 is the port connecting the switch board (control board/main control board) side to the upper link board; 5 is the upper link board side and the switch board (control Board/main control board) link port; 6 is the multicast source port.
本发明实施例提供的技术方案,基于组播的关键转发节点,在OLT设备上对组播业务进行实时监控。The technical solution provided by the embodiment of the present invention is based on the key forwarding node of the multicast, and the multicast service is monitored in real time on the OLT device.
图4为本发明一实施例提供的一种组播业务监控的方法的流程示意图。该方法应用于光线路终端OLT,如图4所示,该方法包括:FIG. 4 is a schematic flowchart of a method for monitoring multicast services according to an embodiment of the present invention. The method is applied to the optical line terminal OLT, as shown in Fig. 4, the method includes:
步骤401,所述OLT中的板卡定时扫描自身的上层业务数据和底层硬件数据;Step 401: The board in the OLT periodically scans its own upper-layer service data and lower-layer hardware data;
步骤402,比较所述上层业务数据和底层硬件数据是否一致;Step 402: Compare whether the upper-layer service data and the lower-layer hardware data are consistent;
步骤403,根据比较结果进行组播业务监控。Step 403: Perform multicast service monitoring according to the comparison result.
其中,所述上层业务数据包括上层业务的组播组Group和每个Group对应的出口列表;Wherein, the upper-layer service data includes the multicast group group of the upper-layer service and the export list corresponding to each group;
所述底层硬件数据包括底层硬件的Group、每个Group对应的出口列表和每个Group的入口和出口流量。The bottom-level hardware data includes the Group of the bottom-level hardware, the export list corresponding to each group, and the ingress and egress traffic of each group.
其中,所述比较所述上层业务数据和底层硬件数据是否一致,包括:Wherein, the comparing whether the upper-layer business data and the lower-layer hardware data are consistent includes:
比较上层业务的组播组Group与底层硬件的Group是否一致;Compare whether the multicast group group of the upper-layer service is consistent with the group of the underlying hardware;
比较上层业务每个Group对应的出口列表与底层硬件每个Group对应的出口列表是否一致。Compare whether the export list corresponding to each Group of the upper-level business is consistent with the export list corresponding to each Group of the underlying hardware.
其中,所述根据比较结果进行组播业务监控,包括:Wherein, the monitoring of the multicast service according to the comparison result includes:
当所述上层业务和底层硬件的第一Group不一致,和/或,所述上层业务和底层硬件的第一Group对应的出口列表不一致时,进行故障恢复,包括:When the upper-layer service and the first group of the lower-layer hardware are inconsistent, and/or the exit lists corresponding to the first group of the upper-layer service and the lower-layer hardware are inconsistent, performing fault recovery includes:
向所有用户发送第一Group的特定查询报文;Send a specific query message of the first Group to all users;
如果有用户应答,则记录所有应答用户的端口信息,根据所述所有应答用户的端口信息更新所述上层业务和底层硬件的所述第一Group对应的出口列表;If there is a user response, record the port information of all the responding users, and update the export list corresponding to the first Group of the upper-layer service and the underlying hardware according to the port information of all the responding users;
如果没有用户应答,则通知上层业务和底层硬件删除所述第一Group对应的条目。If there is no user response, the upper-layer service and the lower-layer hardware are notified to delete the entry corresponding to the first Group.
其中,所述根据比较结果进行组播业务监控,包括:Wherein, the monitoring of the multicast service according to the comparison result includes:
当上层业务和底层硬件的Group一致且上层业务和底层硬件的Group对应的出口列表一致时,比较所述每个Group的入口和出口流量;When the upper-layer service and the group of the lower-layer hardware are consistent, and the export list corresponding to the upper-layer service and the group of the lower-layer hardware is consistent, compare the ingress and egress traffic of each group;
当任一Group的入口和出口流量差异超过阈值时,确认丢包位置并进行故障恢复。When the difference between the ingress and egress traffic of any group exceeds the threshold, confirm the location of the packet loss and perform fault recovery.
其中,所述板卡为以下之一:无源光网络PON板、交换板、上联板。Wherein, the board card is one of the following: a passive optical network PON board, a switch board, and an uplink board.
其中,当所述板卡为交换板时,该方法还包括:Wherein, when the board is a switch board, the method further includes:
所述交换板定时扫描不同PON板及上联板上的底层硬件数据;The switch board regularly scans the underlying hardware data on different PON boards and uplink boards;
比较自身的上层业务数据与所述不同PON板及上联板上的底层硬件数据是否一致;Compare whether its own upper-layer service data is consistent with the bottom-layer hardware data on the different PON boards and uplink boards;
当所述交换板的上层业务和所述不同PON板及上联板上的底层硬件的第一Group不一致,和/或,所述交换板的上层业务和所述不同PON板及上联板上的底层硬件的第一Group对应的出口列表不一致时,进行故障恢复,包括:When the upper-layer service of the switch board is inconsistent with the first Group of the underlying hardware on the different PON board and the uplink board, and/or, the upper-layer service of the switch board is different from the different PON board and the uplink board When the export list corresponding to the first Group of the underlying hardware is inconsistent, perform fault recovery, including:
向所有用户发送第一Group的特定查询报文;Send a specific query message of the first Group to all users;
如果有用户应答,则记录所有应答用户的端口信息,根据所述所有应 答用户的端口信息更新所述交换板的上层业务和所述不同PON板及上联板上的底层硬件的所述第一Group对应的出口列表;If a user responds, record the port information of all the responding users, and update the upper-layer services of the switch board and the first layer of the underlying hardware on the different PON boards and the upper-link board according to the port information of all the responding users. The export list corresponding to the Group;
如果没有用户应答,则通知所述交换板的上层业务和所述不同PON板及上联板上的底层硬件删除所述第一Group对应的条目。If there is no response from the user, the upper-layer service of the switch board and the lower-layer hardware on the different PON board and the upper-link board are notified to delete the entry corresponding to the first Group.
其中,该方法还包括:Among them, the method also includes:
当所述交换板的上层业务和所述不同PON板及上联板上的底层硬件的Group一致,且所述交换板的上层业务和所述不同PON板及上联板上的底层硬件的Group对应的出口列表一致时,When the upper-layer service of the switch board is consistent with the Group of the lower-layer hardware on the different PON board and the uplink board, and the upper-layer service of the switch board is the Group of the lower-layer hardware on the different PON board and the uplink board When the corresponding export lists are consistent,
比较每个Group的所述交换板的出口流量和所述不同PON板的入口流量;Comparing the egress traffic of the switch board of each group with the ingress traffic of the different PON boards;
比较每个Group的所述交换板的入口流量和所述上联板的出口流量;Compare the inlet flow rate of the switch board and the outlet flow rate of the uplink board of each group;
当任一Group的入口和出口流量差异超过阈值时,确认丢包位置并进行故障恢复。When the difference between the ingress and egress traffic of any group exceeds the threshold, confirm the location of the packet loss and perform fault recovery.
其中,进行故障恢复之后,该方法还包括:Among them, after the failure recovery, the method also includes:
再次扫描上层业务数据和底层硬件数据;Scan the upper-level business data and the underlying hardware data again;
比较对应的上层业务数据和底层硬件数据是否一致;Compare whether the corresponding upper-layer business data and the underlying hardware data are consistent;
当一致时,故障恢复成功,当不一致时故障恢复失败,上报告警。When they are consistent, the fault recovery is successful; when they are inconsistent, the fault recovery fails, and an alarm is reported.
图5为本发明另一实施例提供的一种组播业务监控的方法的流程示意图。FIG. 5 is a schematic flowchart of a method for monitoring a multicast service according to another embodiment of the present invention.
本实施例应用于板卡内监控的场景。This embodiment is applied to the scene of monitoring in the board.
该方法应用于光线路终端OLT,如图5所示,该方法包括:The method is applied to the optical line terminal OLT, as shown in Fig. 5, the method includes:
步骤501,板卡定时扫描,从上层业务和底层硬件获取上层业务数据和底层硬件数据; Step 501, the board scans regularly, and obtains the upper-layer business data and the lower-layer hardware data from the upper-layer business and the lower-layer hardware;
其中,目前主流OLT设备,均是采用分布式处理方式,即在PON板、 交换板(控制板/主控板)、上联板上均会处理组播业务,维护组播转发条目,可以在PON板、交换板(控制板/主控板)、上联板上均启用定时器,定时对对应单板(板卡)的动态条目进行扫描,并从硬件获取组播转发数据即底层硬件数据。Among them, the current mainstream OLT equipment adopts a distributed processing method, that is, the PON board, the switch board (control board/main control board), and the uplink board will all process the multicast service and maintain the multicast forwarding entries. The PON board, switch board (control board/main control board), and uplink board all enable timers to scan the dynamic entries of the corresponding single board (board card) regularly, and obtain multicast forwarding data from the hardware, that is, the underlying hardware data .
其中,所述板卡可以为PON板、交换板(控制板/主控板)、或上联板。Wherein, the board can be a PON board, a switch board (control board/main control board), or an uplink board.
其中,所述上层业务数据包括上层业务的组播组Group和每个Group对应的出口列表;Wherein, the upper-layer service data includes the multicast group group of the upper-layer service and the export list corresponding to each group;
所述底层硬件数据包括底层硬件的Group、每个Group对应的出口列表和每个Group的入口和出口流量。The bottom-level hardware data includes the Group of the bottom-level hardware, the export list corresponding to each group, and the ingress and egress traffic of each group.
具体而言,从底层硬件获取的数据有两个:Specifically, there are two data obtained from the underlying hardware:
1,硬件转发的Group信息,包括硬件上动态Group个数、具体的Group信息和Srouce信息、每个Group对应的出端口列表等;1. Group information forwarded by the hardware, including the number of dynamic groups on the hardware, specific group information and source information, and the list of outgoing ports corresponding to each group;
其中,可以在触发扫描时抓取组播转发数据和上层业务数据。Among them, the multicast forwarding data and upper-layer service data can be captured when scanning is triggered.
2,每个Group转发报文总数,包括每个Group的入端口统计和出端口统计。2. The total number of packets forwarded by each group, including the inbound port statistics and the outbound port statistics of each group.
其中,入端口统计是指上次扫描到本次扫描这段时间的入口流量累加数量,出端口统计是指上次扫描到本次扫描这段时间的出口流量累加数量。Among them, the ingress port statistics refers to the cumulative number of ingress traffic from the last scan to this scan, and the egress port statistics refers to the cumulative number of egress traffic from the last scan to this scan.
步骤502,比较所述上层业务数据和底层硬件数据是否一致;Step 502: Compare whether the upper-layer service data and the lower-layer hardware data are consistent;
其中,所述比较所述上层业务数据和底层硬件数据是否一致,包括:Wherein, the comparing whether the upper-layer business data and the lower-layer hardware data are consistent includes:
比较上层业务的组播组Group与底层硬件的Group是否一致;Compare whether the multicast group group of the upper-layer service is consistent with the group of the underlying hardware;
比较上层业务每个Group对应的出口列表与底层硬件每个Group对应的出口列表是否一致。Compare whether the export list corresponding to each Group of the upper-level business is consistent with the export list corresponding to each Group of the underlying hardware.
当比较结果不一致时,说明业务存在故障,需要进行故障恢复,执行步骤503,当比较结果一致时,执行步骤506。When the comparison results are inconsistent, it means that there is a fault in the service and fault recovery is required. Step 503 is executed. When the comparison results are consistent, step 506 is executed.
其中,所述比较结果不一致,包括:所述上层业务和底层硬件的任一Group不一致,和/或,所述上层业务和底层硬件的任一Group对应的出口 列表不一致。Wherein, the comparison result is inconsistent, including: any Group of the upper-layer service and the lower-layer hardware is inconsistent, and/or, the export list corresponding to any Group of the upper-layer service and the lower-layer hardware is inconsistent.
所述比较结果一致,包括:上层业务和底层硬件的Group一致且上层业务和底层硬件的Group对应的出口列表一致。The comparison result is consistent, including: the upper-layer business and the group of the underlying hardware are consistent, and the export list corresponding to the upper-layer business and the group of the underlying hardware is consistent.
步骤503,向所有用户发送异常Group的特定查询报文;Step 503: Send a specific query message of the abnormal group to all users;
其中,板内监控,用于PON板或上联板中,对OLT整个系统来说,PON板或上联板仅对本板用户负责,它不关心其他板卡的用户加入情况。所以对于它来说,所有用户就是本板上配置的接收端口(即合法用户)。板间监控,用于交换板中,对OLT整个系统来说,交换板上存储了整个系统的用户信息,它会关心所有板卡的加入情况。所以对于他来说,所有用户,就是整个系统上配置的接收端口(即合法用户)。Among them, the in-board monitoring is used in the PON board or the uplink board. For the entire OLT system, the PON board or the uplink board is only responsible for the user of the board, and it does not care about the addition of users of other boards. So for it, all users are the receiving ports configured on the board (ie legal users). Inter-board monitoring is used in the switch board. For the entire OLT system, the switch board stores the user information of the entire system, and it cares about the joining of all boards. So for him, all users are the receiving ports (ie legal users) configured on the entire system.
其中,异常Group是指上层业务和底层硬件中存在状态不一致的Group,或者出口列表不一致的Group。Among them, the abnormal group refers to a group with inconsistent status in the upper-layer business and the underlying hardware, or a group with inconsistent export lists.
如果有用户应答,执行步骤504,如果没有用户应答,则执行步骤506;If there is a user answer, go to step 504, if there is no user answer, go to step 506;
步骤504,记录所有应答用户的端口信息,根据所述所有应答用户的端口信息更新所述上层业务和底层硬件的所述异常Group对应的出口列表;Step 504: Record the port information of all responding users, and update the exit list corresponding to the abnormal group of the upper-layer service and the underlying hardware according to the port information of all the responding users;
步骤505,通知上层业务和底层硬件删除所述异常Group对应的条目;Step 505: Notify the upper-layer service and the lower-layer hardware to delete the entry corresponding to the abnormal Group;
因为是没有用户应答,所以表明该Group当前已经无人观看,所以这里删除的条目具体指整体的Group条目,包括Group组信息、Group的源信息以及Group出口列表信息。Because there is no user response, it indicates that the Group is currently unwatched, so the deleted entry here specifically refers to the entire Group entry, including Group group information, Group source information, and Group export list information.
具体而言,所述比较结果不一致包括两种情况:Specifically, the inconsistent comparison results include two situations:
a,某Group的存在状态不一致;a, the existence status of a group is inconsistent;
其中,上层业务和底层硬件的任一Group不一致具体就是指Groop存在状态不一致。例如上层业务存在Group条目,但底层硬件不存在Group条目;或者上层业务不存在Group条目,而底层硬件存在Group条目。Among them, the inconsistency of any Group between the upper-layer business and the underlying hardware specifically refers to the inconsistent status of Groop. For example, there is a Group entry in the upper-layer business, but there is no Group entry in the underlying hardware; or there is no Group entry in the upper-layer business, but there is a Group entry in the underlying hardware.
b,某Group中的出口列表不一致;当发生任一种情况时,会向所有 用户发送异常Group的特定查询报文,根据该报文的应答情况来处理。例如,发送特定查询报文后,如果没有收到任何用户的应答,则通知上层业务和底层硬件将该条目进行删除;如果有用户应答,会记录所有应答用户的端口信息,并与当前业务与硬件的出口列表进行比较,记录比较结果;然后根据最新出口列表信息来重新更新上层业务和底层硬件针对该Group的出口列表。其中,这里的特定查询报文,是针对异常Group发送的查询报文,其根据rfc2236、rfc3376等标准中Group-Specific Query的定义进行构造和发送。b. The export list in a certain group is inconsistent; when either situation occurs, a specific query message of the abnormal group will be sent to all users, and the message will be processed according to the response of the message. For example, after sending a specific query message, if no user's response is received, the upper-layer service and the underlying hardware are notified to delete the entry; if there is a user's response, the port information of all responding users will be recorded, and it will be compared with the current service. The hardware export list is compared, and the comparison result is recorded; then the export list of the upper-level business and the lower-level hardware for the group is updated according to the latest export list information. Among them, the specific query message here is a query message sent for an abnormal group, which is constructed and sent according to the definition of Group-Specific Query in standards such as rfc2236 and rfc3376.
无论情况a或情况b,在自动恢复后,还可以立即针对恢复Group再次获取一次底层硬件的相关信息,与上层业务进行比较,以用来确认自动恢复的效果。Regardless of the situation a or the situation b, after the automatic recovery, the relevant information of the underlying hardware can be obtained again for the recovery group immediately, and compared with the upper-layer business to confirm the effect of the automatic recovery.
当比较结果不一致时,通过步骤503-505,进行用户实际状态的查询(查询方式可以通过特定查询报文进行),根据查询结果进行强制恢复,并根据恢复情况来决定是否上报告警。When the comparison results are inconsistent, through steps 503-505, the user's actual status is inquired (the inquiry method can be carried out by a specific inquiry message), the forced recovery is carried out according to the inquiry result, and whether to report an alarm is determined according to the recovery situation.
步骤506,比较每个Group的入口和出口流量,当任一Group的入口和出口流量差异超过阈值时,确认丢包位置并进行故障恢复;Step 506: Compare the ingress and egress traffic of each group, and when the difference between the ingress and egress traffic of any group exceeds the threshold, confirm the packet loss location and perform fault recovery;
例如,针对IPTV组播业务,该阈值可以取5%。即入口和出口的流量差异超过5%便认为出现异常。因为根据经验,当流量差异超过5%时,用户就会明显的感知到IPTV画面异常,如马赛克或者黑屏。For example, for IPTV multicast services, the threshold may be 5%. That is, if the flow rate difference between the inlet and the outlet exceeds 5%, it is considered an abnormality. Because according to experience, when the traffic difference exceeds 5%, users will obviously perceive IPTV picture abnormalities, such as mosaic or black screen.
具体而言,在发现入口流量与出口流量差异超过5%后,因为硬件转发可能涉及多个环节,所以首先需要确认是在哪个环节报文转发异常。可以通过硬件进行分析定位,直接调用芯片提供的命令进行查询;也可以通过软件手段进行分析定位。确认丢包位置后,通过预埋命令进行该位置的状态查询,确认丢包具体原因并进行强制恢复。如果恢复成功,记录日志;如果恢复失败,上报告警。其中,目前调用芯片的查询手段都是通过提前将芯片的查询命令预埋到系统中,一旦出现问题进行调用处理,确认丢包位置;以GPON线卡为例,其芯片的数据转发,涉及到如pon-tm、pon-mac 等内部的多个模块,出现问题后,调用提前预埋的这些模块的查询命令,确认其流量转发是否正常。一旦确认芯片内部的异常点,如某个寄存器位出现异常,则对其进行强制修改置位,保证其恢复到正确的状态。Specifically, after it is found that the difference between the ingress flow and the egress flow exceeds 5%, because hardware forwarding may involve multiple links, it is necessary to confirm at which link the packet is forwarded abnormally. The hardware can be used for analysis and positioning, and the commands provided by the chip can be directly called for query; it can also be analyzed and positioned by software means. After confirming the location of the packet loss, use the embedded command to query the status of the location, confirm the specific cause of the packet loss, and perform forced recovery. If the recovery is successful, record the log; if the recovery fails, report an alarm. Among them, the current query method for calling the chip is to embed the query command of the chip in the system in advance. Once a problem occurs, the call is processed to confirm the location of the packet loss; taking the GPON line card as an example, the data forwarding of the chip involves Such as pon-tm, pon-mac and other internal modules, after a problem occurs, call the pre-embedded query commands of these modules to confirm whether the traffic is forwarded normally. Once the abnormal point inside the chip is confirmed, if a certain register bit is abnormal, it will be forcibly modified and set to ensure that it is restored to the correct state.
步骤507,再次扫描自身的上层业务数据和底层硬件数据;比较所述上层业务数据和底层硬件数据是否一致;Step 507: Scan its own upper-layer business data and lower-layer hardware data again; compare whether the upper-layer business data and the lower-layer hardware data are consistent;
当一致时,执行步骤508,故障恢复成功,当不一致时,执行步骤509,故障恢复失败,上报告警。When they are consistent, perform step 508, and the fault recovery is successful; when they are inconsistent, perform step 509, and the fault recovery fails, and an alarm is reported.
本实施例中,在每次扫描中,还可以将获取到的Group信息及其对应的入口和出口流量进行记录,生成log日志,并以定时自动上传或人为触发的方式进行获取,以便于维护人员可以对某个时刻某个Group的历史流量转发情况进行查询,利于更细致的掌握设备的运行状况。In this embodiment, in each scan, you can also record the obtained Group information and its corresponding ingress and egress traffic, generate log logs, and obtain them by means of scheduled automatic uploads or manual triggers for easy maintenance Personnel can inquire about the historical traffic forwarding situation of a certain group at a certain time, which is conducive to a more detailed grasp of the operating status of the equipment.
本实施例中,当下一个扫描周期到达后,重复步骤501-509的操作。In this embodiment, when the next scanning period arrives, the operations of steps 501-509 are repeated.
本实施例提供的技术方案,能够在板卡内实时监控、自恢复组播业务。The technical solution provided in this embodiment is capable of real-time monitoring and self-recovery multicast services in the board.
在本发明的另一实施例中,与上一实施例的区别在于,分为板卡内实时监控和板卡间实时监控两种,以板卡内实时监控为主,板卡间实时监控为辅,共同协调完成整个系统的实时监控功能。对于板内检测,由本板触发定时扫描,仅比较本板的底层硬件与上层业务的数据。而本实施例中,在板内检测的基础上,增加板间检测,由交换板(控制板/主控板)触发定时扫描,获取不同PON板及上联板上的底层硬件数据,交换板(控制板/主控板)汇总各板底层硬件数据后,与交换板(控制板/主控板)自身的上层业务数据进行比较,判断是否一致。In another embodiment of the present invention, the difference from the previous embodiment is that it is divided into real-time monitoring within the board and real-time monitoring between the boards. The real-time monitoring within the board is the main one, and the real-time monitoring between the boards is Auxiliary, coordinate to complete the real-time monitoring function of the entire system. For on-board detection, the board triggers a regular scan, and only compares the data of the bottom-layer hardware of the board with the upper-layer business data. In this embodiment, on the basis of in-board detection, inter-board detection is added, and the switch board (control board/main control board) triggers timing scanning to obtain the bottom hardware data of different PON boards and uplink boards. (Control board/Main control board) After summarizing the bottom hardware data of each board, compare it with the upper-layer business data of the switch board (control board/main control board) to determine whether they are consistent.
当所述交换板的上层业务和所述不同PON板及上联板上的底层硬件的第一Group不一致,和/或,所述交换板的上层业务和所述不同PON板及上联板上的底层硬件的第一Group对应的出口列表不一致时,向所有用户发送第一Group的特定查询报文;如果有用户应答,则记录所有应答用户的端口信息,根据所述所有应答用户的端口信息更新所述交换板的上层 业务和所述不同PON板及上联板上的底层硬件的所述第一Group对应的出口列表;如果没有用户应答,则通知所述交换板的上层业务和所述不同PON板及上联板上的底层硬件删除所述第一Group对应的条目。When the upper-layer service of the switch board is inconsistent with the first Group of the underlying hardware on the different PON board and the uplink board, and/or, the upper-layer service of the switch board is different from the different PON board and the uplink board When the export list corresponding to the first Group of the underlying hardware is inconsistent, send a specific query message of the first Group to all users; if there is a user response, record the port information of all the responding users, according to the port information of all the responding users Update the upper-layer service of the switch board and the export list corresponding to the first Group of the bottom hardware on the different PON board and the upper-link board; if there is no user response, notify the upper-layer service of the switch board and the The bottom layer hardware on different PON boards and uplink boards deletes the entry corresponding to the first Group.
当所述交换板的上层业务和所述不同PON板及上联板上的底层硬件的Group一致,且所述交换板的上层业务和所述不同PON板及上联板上的底层硬件的Group对应的出口列表一致时,比较每个Group的所述交换板的出口流量和所述不同PON板的入口流量;比较每个Group的所述交换板的入口流量和所述上联板的出口流量;当任一Group的入口和出口流量差异超过阈值时,确认丢包位置并进行故障恢复。When the upper-layer service of the switch board is consistent with the Group of the lower-layer hardware on the different PON board and the uplink board, and the upper-layer service of the switch board is the Group of the lower-layer hardware on the different PON board and the uplink board When the corresponding outlet lists are the same, compare the outlet traffic of the switch board of each group with the inlet traffic of the different PON boards; compare the inlet traffic of the switch board of each group and the outlet traffic of the uplink board ; When the difference between the ingress and egress traffic of any Group exceeds the threshold, confirm the packet loss location and perform fault recovery.
其中,进行故障恢复之后,还可以进行板内监控和板间监控,即再次扫描上层业务数据和底层硬件数据;比较对应的上层业务数据和底层硬件数据是否一致;当一致时,说明故障恢复成功,结束流程;当不一致时,说明故障恢复失败,上报告警。Among them, after fault recovery, you can also perform on-board monitoring and inter-board monitoring, that is, scan the upper-layer business data and the underlying hardware data again; compare whether the corresponding upper-layer business data and the underlying hardware data are consistent; when they are consistent, the fault recovery is successful , End the process; when it is inconsistent, it means that the fault recovery has failed, and an alarm is reported.
图6为本发明另一实施例提供的一种组播业务监控的方法的流程示意图。FIG. 6 is a schematic flowchart of a method for monitoring a multicast service according to another embodiment of the present invention.
本实施例应用于板卡内和板卡间同时监控的场景。This embodiment is applied to the scenario of simultaneous monitoring within and between boards.
本实施例应用于图3所示的系统架构中。如图3所示,各个数据转发节点的入口分别为2、4、6;出口分别为1、3、5。This embodiment is applied to the system architecture shown in FIG. 3. As shown in Figure 3, the entrances of each data forwarding node are 2, 4, and 6; the exits are 1, 3, and 5 respectively.
如图6所示,该方法包括:As shown in Figure 6, the method includes:
步骤601,OLT的PON板、交换板、上联板定时扫描,从上层业务和底层硬件获取自身的上层业务数据和底层硬件数据;交换板触发定时扫描,获取不同PON板及上联板上的底层硬件数据; Step 601, the OLT's PON board, switch board, and uplink board periodically scan, and obtain its own upper-layer service data and lower-layer hardware data from the upper-layer service and the underlying hardware; the switch board triggers the regular scan to obtain different PON boards and uplink boards. Low-level hardware data;
具体而言,对于板内检测,由本板触发定时扫描。对于板间检测,由交换板(控制板/主控板)触发定时扫描。Specifically, for in-board detection, the board triggers the timing scan. For inter-board detection, the switch board (control board/main control board) triggers the timing scan.
其中,所述上层业务数据包括上层业务的组播组Group和每个Group 对应的出口列表;Wherein, the upper-layer service data includes the multicast group group of the upper-layer service and the export list corresponding to each group;
所述底层硬件数据包括底层硬件的Group、每个Group对应的出口列表和每个Group的入口和出口流量。The bottom-level hardware data includes the Group of the bottom-level hardware, the export list corresponding to each group, and the ingress and egress traffic of each group.
其中,可以在PON板、交换板(控制板/主控板)、上联板均启用定时器,定时对对应单板的动态条目进行扫描,并从硬件获取组播转发数据。Among them, timers can be enabled on the PON board, switch board (control board/main control board), and uplink board to scan the dynamic entries of the corresponding single board regularly, and obtain multicast forwarding data from the hardware.
步骤602,PON板、交换板、上联板分别比较自身的上层业务数据和底层硬件数据是否一致,交换板比较自身的上层业务数据和不同PON板及上联板上的底层硬件数据是否一致;Step 602: The PON board, the switch board, and the uplink board respectively compare whether their upper-layer service data is consistent with the underlying hardware data, and the switch board compares whether its own upper-layer service data is consistent with the bottom-layer hardware data on different PON boards and uplink boards;
所述PON板、交换板、上联板分别比较自身的上层业务数据和底层硬件数据是否一致,包括:The PON board, the switch board, and the uplink board respectively compare whether their upper-layer service data and the underlying hardware data are consistent, including:
比较自身的上层业务的组播组Group与底层硬件的Group是否一致;Compare whether the multicast group group of its own upper-layer service is consistent with the group of the underlying hardware;
比较自身的上层业务每个Group对应的出口列表与底层硬件每个Group对应的出口列表是否一致。Compare whether the export list corresponding to each Group of its upper-level business is consistent with the export list corresponding to each Group of the underlying hardware.
交换板比较自身的上层业务数据和不同PON板及上联板上的底层硬件数据是否一致,包括:The switch board compares whether its upper-layer service data is consistent with the bottom-layer hardware data on different PON boards and uplink boards, including:
比较自身的上层业务的组播组Group与不同PON板及上联板上的底层硬件的Group是否一致;Compare whether the multicast group group of its own upper-layer service is consistent with the group of the underlying hardware on different PON boards and uplink boards;
比较自身的上层业务每个Group对应的出口列表与不同PON板及上联板上的底层硬件每个Group对应的出口列表是否一致。Compare whether the export list corresponding to each Group of its own upper-level business is consistent with the export list corresponding to each Group of the bottom-level hardware on different PON boards and uplink boards.
当比较结果不一致时,说明业务存在故障,需要进行故障恢复,执行步骤603,当比较结果一致时,执行步骤606:When the comparison results are inconsistent, it means that there is a business failure and fault recovery is required. Step 603 is executed. When the comparison results are consistent, step 606 is executed:
其中,比较结果不一致包括板内比较不一致或板间比较不一致,而比较结果一致是指板内比较一致且板间比较一致。Among them, the inconsistent comparison results include inconsistent comparison within the board or inconsistency between the boards, and the consistency of the comparison result refers to the comparison within the board and the comparison between the boards.
步骤603,向所有用户发送异常Group的特定查询报文;Step 603: Send a specific query message of the abnormal group to all users;
其中,板内监控,用于PON板或上联板中,对OLT整个系统来说,PON板或上联板仅对本板用户负责,它不关心其他板卡的用户加入情况。 所以对于它来说,所有用户就是本板上配置的接收端口(即合法用户)。板间监控,用于交换板中,对OLT整个系统来说,交换板上存储了整个系统的用户信息,它会关心所有板卡的加入情况。所以对于他来说,所有用户,就是整个系统上配置的接收端口(即合法用户)。Among them, the in-board monitoring is used in the PON board or the uplink board. For the entire OLT system, the PON board or the uplink board is only responsible for the user of the board, and it does not care about the addition of users of other boards. So for it, all users are the receiving ports configured on the board (ie legal users). Inter-board monitoring is used in the switch board. For the entire OLT system, the switch board stores the user information of the entire system, and it cares about the joining of all boards. So for him, all users are the receiving ports (ie legal users) configured on the entire system.
如果有用户应答,执行步骤604,如果没有用户应答,执行步骤605,If there is a user answer, go to step 604, if there is no user answer, go to step 605,
步骤604,记录所有应答用户的端口信息,根据所述所有应答用户的端口信息更新异常Group对应的出口列表;Step 604: Record the port information of all responding users, and update the exit list corresponding to the abnormal Group according to the port information of all responding users;
步骤605,通知对应上层业务和底层硬件删除异常Group对应的条目;Step 605: Notify the corresponding upper-layer service and lower-layer hardware to delete the entry corresponding to the abnormal Group;
具体而言,当比较结果不一致时,进行用户实际状态的查询,根据查询结果进行强制恢复,并根据恢复情况来决定是否上报告警。其中用户实际状态查询方式可以通过特定查询报文进行。Specifically, when the comparison results are inconsistent, the user's actual status is inquired, the forced recovery is performed according to the query result, and the decision is made to report an alarm according to the recovery situation. Among them, the user's actual status query can be performed through a specific query message.
例如,当本板(即PON板、交换板或上联板)的上层业务和底层硬件的第一Group不一致,和/或,本板的上层业务和底层硬件的第一Group对应的出口列表不一致时,向本板的所有用户发送第一Group的特定查询报文;For example, when the upper-layer services of the board (ie, PON board, switch board, or uplink board) are inconsistent with the first group of the underlying hardware, and/or the upper-layer services of the board and the first group of the underlying hardware are inconsistent with the export list corresponding to the first group When, send a specific query message of the first Group to all users on the board;
如果有用户应答,则记录所有应答用户的端口信息,根据所述所有应答用户的端口信息更新所述本板的上层业务和底层硬件的所述第一Group对应的出口列表;If there is a user response, record the port information of all the responding users, and update the export list corresponding to the first Group of the upper-layer services of the board and the first group of the underlying hardware according to the port information of all the responding users;
如果没有用户应答,则通知本板的上层业务和底层硬件删除所述第一Group对应的条目。If there is no response from the user, the upper-layer service and the lower-layer hardware of the board are notified to delete the entry corresponding to the first Group.
又例如,当所述交换板的上层业务和所述不同PON板及上联板上的底层硬件的第一Group不一致,和/或,所述交换板的上层业务和所述不同PON板及上联板上的底层硬件的第一Group对应的出口列表不一致时,向所有用户发送第一Group的特定查询报文;For another example, when the upper-layer service of the switch board is inconsistent with the first Group of the lower-layer hardware on the different PON board and the uplink board, and/or, the upper-layer service of the switch board is inconsistent with the different PON board and the upper layer. When the export list corresponding to the first group of the underlying hardware on the connection board is inconsistent, send a specific query message of the first group to all users;
如果有用户应答,则记录所有应答用户的端口信息,根据所述所有应答用户的端口信息更新所述交换板的上层业务和所述不同PON板及上联板上的底层硬件的所述第一Group对应的出口列表;If a user responds, record the port information of all the responding users, and update the upper-layer services of the switch board and the first layer of the underlying hardware on the different PON boards and the upper-link board according to the port information of all the responding users. The export list corresponding to the Group;
如果没有用户应答,则通知所述交换板的上层业务和所述不同PON板及上联板上的底层硬件删除所述第一Group对应的条目。If there is no response from the user, the upper-layer service of the switch board and the lower-layer hardware on the different PON board and the upper-link board are notified to delete the entry corresponding to the first Group.
步骤606,PON板、交换板、上联板分别比较自身每个Group的入口和出口的流量;交换板比较自身每个Group的入口和出口、不同PON板的入口流量数据、上联板的出口流量数据;当任一Group的入口和出口流量差异超过阈值时,确认丢包位置并进行故障恢复; Step 606, the PON board, the switch board, and the uplink board respectively compare the ingress and egress traffic of each group; the switch board compares the ingress and egress of each group, the ingress traffic data of different PON boards, and the egress of the uplink board. Traffic data; when the difference between the ingress and egress traffic of any Group exceeds the threshold, confirm the location of the packet loss and perform fault recovery;
具体到本实施例,对于板内检测,由PON板比较入口2和出口1的流量,由交换板比较入口4和出口3的流量,由上联板比较入口6和出口5的流量。对于板间检测,由交换板比较入口2和出口3的流量以及入口4和出口5的流量。Specifically in this embodiment, for in-board detection, the PON board compares the flow rates of the inlet 2 and the outlet 1, the switch board compares the flows of the inlet 4 and the outlet 3, and the upper link board compares the flow of the inlet 6 and the outlet 5. For inter-board detection, the exchange board compares the flow rates of the inlet 2 and the outlet 3 and the flow rates of the inlet 4 and the outlet 5.
例如,针对IPTV组播业务,该阈值可以取5%。即入口和出口的流量差异超过5%便认为出现异常。因为根据经验,当流量差异超过5%时,用户就会明显的感知到IPTV画面异常,如马赛克或者黑屏。For example, for IPTV multicast services, the threshold may be 5%. That is, if the flow rate difference between the inlet and the outlet exceeds 5%, it is considered an abnormality. Because according to experience, when the traffic difference exceeds 5%, users will obviously perceive IPTV picture abnormalities, such as mosaic or black screen.
其中,当所有Group的入口和出口流量差异不超过阈值时,业务正常,结束流程。Among them, when the difference between the ingress and egress traffic of all groups does not exceed the threshold, the business is normal and the process ends.
步骤607,再次扫描上层业务数据和底层硬件数据;比较对应的上层业务数据和底层硬件数据是否一致;Step 607: Scan the upper-layer business data and the lower-layer hardware data again; compare whether the corresponding upper-layer business data and the lower-layer hardware data are consistent;
具体而言,步骤607重复执行步骤601-602。Specifically, step 607 repeats steps 601-602.
当一致时,执行步骤608,故障恢复成功;当不一致时,执行步骤609,故障恢复失败,上报告警。When they are consistent, perform step 608, and the fault recovery is successful; when they are inconsistent, perform step 609, and the fault recovery fails, and an alarm is reported.
当下一个扫描周期到达后,重复上面步骤601-609操作。When the next scan period arrives, repeat the above steps 601-609.
本实施例中,在每次扫描时,还可以将获取到的Group信息及其对应的入口和出口流量进行记录,生成log日志,并以定时自动上传或人为触发的方式进行获取,以便于维护人员可以对某个时刻某个Group的历史流量转发情况进行查询,利于更细致的掌握设备的运行状况。In this embodiment, during each scan, the obtained Group information and its corresponding ingress and egress traffic can also be recorded, log logs are generated, and they can be obtained by means of scheduled automatic uploading or manual triggering, so as to facilitate maintenance. Personnel can inquire about the historical traffic forwarding situation of a certain group at a certain time, which is conducive to a more detailed grasp of the operating status of the equipment.
本发明实施例提供的技术方案,采用从OLT入口一直到OLT出口全 流程,针对OLT每一个环节进行转发条目检查及流量监控,在出现问题后,快速的自动检测并定位出问题节点,记录问题信息,同时进行业务的自动恢复。The technical solution provided by the embodiment of the present invention adopts the whole process from the OLT entrance to the OLT exit, and performs forwarding entry inspection and flow monitoring for each link of the OLT. After a problem occurs, it automatically detects and locates the problem node and records the problem. Information, and automatic recovery of business at the same time.
本发明的另一实施例应用于第一板卡自身上层业务数据和底层硬件数据比较结果不一致时的场景。当第一板卡自身上层业务数据和底层硬件数据比较结果不一致时,说明需要进行故障恢复,后续执行图7所示的流程。Another embodiment of the present invention is applied to a scenario when the comparison result between the upper-layer service data and the lower-layer hardware data of the first board card itself is inconsistent. When the comparison result between the upper-layer service data and the lower-layer hardware data of the first board card itself is inconsistent, it indicates that fault recovery is required, and the process shown in FIG. 7 is subsequently executed.
如图7所示,包括:As shown in Figure 7, it includes:
步骤701,第一板卡向所有用户发送异常Group的特定查询报文;Step 701: The first board sends a specific query message of the abnormal group to all users;
其中,第一板卡为PON板、交换板或上联板。Among them, the first board is a PON board, a switch board or an uplink board.
根据该报文的应答情况来进行后续处理,当有用户应答时执行步骤702,当没有用户应答时执行步骤705:Follow-up processing is performed according to the response of the message, step 702 is executed when there is a user response, and step 705 is executed when there is no user response:
步骤702,记录所有应答用户的端口信息,生成最新端口列表;Step 702: Record the port information of all responding users, and generate the latest port list;
步骤703,比较最新端口列表与现有端口列表,并记录比较结果;Step 703: Compare the latest port list with the existing port list, and record the comparison result;
步骤704,通知上层业务和底层硬件更新为最新端口列表;Step 704: Notify the upper-layer services and the underlying hardware to update to the latest port list;
之后执行步骤706。Step 706 is then executed.
步骤705,通知上层业务和底层硬件删除异常Group对应的条目; Step 705, notify the upper-layer business and the lower-layer hardware to delete the entry corresponding to the abnormal group;
步骤706,再次获取上层业务数据和底层硬件数据,比较上层业务数据和底层硬件数据是否一致;Step 706: Obtain the upper-layer business data and the lower-layer hardware data again, and compare whether the upper-layer business data and the lower-layer hardware data are consistent;
一致时,执行步骤707,不一致时执行步骤708:If they are consistent, execute step 707, if they are inconsistent, execute step 708:
步骤707,恢复成功,记录日志; Step 707, the recovery is successful, and the log is recorded;
步骤708,恢复失败,上报告警。In step 708, the recovery fails, and an alarm is reported.
具体而言,在自动恢复后,会立即针对恢复Group再次获取一次底层硬件的相关信息,与上层业务进行比较,以用来确认自动恢复的效果。Specifically, after the automatic recovery, the relevant information of the underlying hardware will be obtained again for the recovery group and compared with the upper-level business to confirm the effect of the automatic recovery.
本发明的另一实施例中,当交换板比较自身的上层业务数据和不同PON板及上联板上的底层硬件数据不一致时,第一板卡为交换板,同样执行步骤701-708。由于板卡间出现问题的概率要远小于板卡内,所以板卡间的实时监控可以作为板卡内监控的辅助手段,两种监控同时运行,在出现问题后,根据两种监控的监控结果进行交叉对比,可以快速恢复问题或者通知维护人员快速介入分析,节省定位时间,提高问题解决效率。In another embodiment of the present invention, when the switching board compares its own upper-layer service data and the lower-layer hardware data on different PON boards and uplink boards are inconsistent, the first board is the switching board, and steps 701-708 are also executed. Since the probability of problems between the boards is much smaller than that in the boards, real-time monitoring between boards can be used as an auxiliary means of monitoring within the boards. The two types of monitoring run at the same time. After a problem occurs, the monitoring results of the two types of monitoring are used. Cross-comparison can quickly recover problems or notify maintenance personnel to quickly intervene in analysis, save positioning time and improve problem solving efficiency.
本发明的另一实施例应用于第一板卡自身上层业务数据和底层硬件数据比较结果一致的场景。当第一板卡自身上层业务数据和底层硬件数据比较结果一致时,后续执行图8所示的流程。Another embodiment of the present invention is applied to a scenario where the comparison result of the upper-layer service data and the lower-layer hardware data of the first board card is consistent. When the comparison result of the upper-layer service data and the lower-layer hardware data of the first board is consistent, the process shown in FIG. 8 is subsequently executed.
如图8所示,包括:As shown in Figure 8, it includes:
步骤801,比较第一板卡内任一Group的入口和出口流量差异是否超过阈值;Step 801: Compare whether the difference between the ingress and egress traffic of any Group in the first board exceeds a threshold;
当超过阈值时,进行故障恢复,执行步骤802,当不超过阈值时,业务正常,结束流程When the threshold is exceeded, perform fault recovery and go to step 802. When the threshold is not exceeded, the business is normal and the process ends
步骤802,第一板卡确认丢包位置; Step 802, the first board confirms the packet loss position;
其中,第一板卡为PON板、交换板或上联板。Among them, the first board is a PON board, a switch board or an uplink board.
具体而言,因为硬件转发可能涉及多个环节,所以首先需要确认是在哪个环节报文转发异常。可以通过硬件进行分析定位,直接调用芯片提供的命令进行查询;也可以通过软件手段进行分析定位。Specifically, because hardware forwarding may involve multiple links, it is first necessary to confirm at which link the message is forwarded abnormally. The hardware can be used for analysis and positioning, and the commands provided by the chip can be directly called for query; it can also be analyzed and positioned by software means.
步骤803,确认丢包具体原因并进行强制恢复; Step 803, confirm the specific cause of the packet loss and perform forced recovery;
具体而言,确认丢包位置后,可以通过预埋命令进行该位置的状态查询,确认丢包具体原因并进行强制恢复。Specifically, after confirming the location of the packet loss, you can use the embedded command to query the status of the location, confirm the specific cause of the packet loss, and perform forced recovery.
步骤804,再次获取上层业务数据和底层硬件数据,比较上层业务数 据和底层硬件数据是否一致;Step 804: Obtain the upper-layer business data and the lower-layer hardware data again, and compare whether the upper-layer business data and the lower-layer hardware data are consistent;
一致时,执行步骤805,不一致时执行步骤806:If they are consistent, execute step 805, if they are inconsistent, execute step 806:
步骤805,恢复成功,记录日志; Step 805, the recovery is successful, and the log is recorded;
步骤806,恢复失败,上报告警。In step 806, the recovery fails, and an alarm is reported.
具体而言,在自动恢复后,会立即针对恢复Group再次获取一次底层硬件的相关信息,与上层业务进行比较,以用来确认自动恢复的效果。Specifically, after the automatic recovery, the relevant information of the underlying hardware will be obtained again for the recovery group and compared with the upper-level business to confirm the effect of the automatic recovery.
本发明的另一实施例中,当交换板比较自身的上层业务数据和不同PON板及上联板上的底层硬件数据一致且第一Group的入口和出口流量差异超过阈值时的场景时,第一板卡为交换板,同样执行步骤801-806。由于板卡间出现问题的概率要远小于板卡内,所以板卡间的实时监控可以作为板卡内监控的辅助手段,两种监控同时运行,在出现问题后,根据两种监控的监控结果进行交叉对比,可以快速恢复问题或者通知维护人员快速介入分析,节省定位时间,提高问题解决效率。In another embodiment of the present invention, when the switch board compares its own upper-layer service data with the bottom-layer hardware data on different PON boards and uplink boards that are consistent, and the difference between the ingress and egress traffic of the first group exceeds the threshold, the first group One board is a switch board, and steps 801-806 are also executed. Since the probability of problems between the boards is much smaller than that in the boards, real-time monitoring between boards can be used as an auxiliary means of monitoring within the boards. The two types of monitoring run at the same time. After a problem occurs, the monitoring results of the two types of monitoring are used. Cross-comparison can quickly recover problems or notify maintenance personnel to quickly intervene in analysis, save positioning time and improve problem solving efficiency.
图9为本发明一实施例提供的一种板卡的结构示意图。该板卡应用于光线路终端OLT,如图9所示,包括:FIG. 9 is a schematic structural diagram of a board card provided by an embodiment of the present invention. The board is applied to the optical line terminal OLT, as shown in Figure 9, including:
扫描单元,设置为定时扫描自身的上层业务数据和底层硬件数据;The scanning unit is set to scan its own upper-layer business data and lower-layer hardware data regularly;
比较单元,设置为比较所述上层业务数据和底层硬件数据是否一致;The comparing unit is set to compare whether the upper-layer business data and the lower-layer hardware data are consistent;
监控单元,设置为根据比较结果进行组播业务监控。The monitoring unit is set to monitor the multicast service according to the comparison result.
其中,所述上层业务数据包括上层业务的组播组Group和每个Group对应的出口列表;Wherein, the upper-layer service data includes the multicast group group of the upper-layer service and the export list corresponding to each group;
所述底层硬件数据包括底层硬件的Group、每个Group对应的出口列表和每个Group的入口和出口流量。The bottom-level hardware data includes the Group of the bottom-level hardware, the export list corresponding to each group, and the ingress and egress traffic of each group.
其中,比较单元,具体设置为Among them, the comparison unit is specifically set as
比较上层业务的组播组Group与底层硬件的Group是否一致;Compare whether the multicast group group of the upper-layer service is consistent with the group of the underlying hardware;
比较上层业务每个Group对应的出口列表与底层硬件每个Group对应的出口列表是否一致。Compare whether the export list corresponding to each Group of the upper-level business is consistent with the export list corresponding to each Group of the underlying hardware.
其中,所述监控单元,具体设置为Wherein, the monitoring unit is specifically set as
当所述上层业务和底层硬件的第一Group不一致,和/或,所述上层业务和底层硬件的第一Group对应的出口列表不一致时,进行故障恢复,包括:When the upper-layer service and the first group of the lower-layer hardware are inconsistent, and/or the exit lists corresponding to the first group of the upper-layer service and the lower-layer hardware are inconsistent, performing fault recovery includes:
向所有用户发送第一Group的特定查询报文;Send a specific query message of the first Group to all users;
如果有用户应答,则记录所有应答用户的端口信息,根据所述所有应答用户的端口信息更新所述上层业务和底层硬件的所述第一Group对应的出口列表;If there is a user response, record the port information of all the responding users, and update the export list corresponding to the first Group of the upper-layer service and the underlying hardware according to the port information of all the responding users;
如果没有用户应答,则通知上层业务和底层硬件删除所述第一Group对应的条目。If there is no user response, the upper-layer service and the lower-layer hardware are notified to delete the entry corresponding to the first Group.
其中,所述监控单元,具体设置为Wherein, the monitoring unit is specifically set as
当上层业务和底层硬件的Group一致且上层业务和底层硬件的Group对应的出口列表一致时,比较所述每个Group的入口和出口流量;When the upper-layer service and the group of the lower-layer hardware are consistent, and the export list corresponding to the upper-layer service and the group of the lower-layer hardware is consistent, compare the ingress and egress traffic of each group;
当任一Group的入口和出口流量差异超过阈值时,确认丢包位置并进行故障恢复。When the difference between the ingress and egress traffic of any group exceeds the threshold, confirm the location of the packet loss and perform fault recovery.
其中,所述板卡为以下之一:无源光网络PON板、交换板、上联板。Wherein, the board card is one of the following: a passive optical network PON board, a switch board, and an uplink board.
其中,当所述板卡为交换板时,Wherein, when the board is a switch board,
所述扫描单元,还设置为定时扫描不同PON板及上联板上的底层硬件数据;The scanning unit is also configured to scan the bottom hardware data on different PON boards and uplink boards at regular intervals;
所述比较单元,还设置为比较自身的上层业务数据与所述不同PON板及上联板上的底层硬件数据是否一致;The comparison unit is further configured to compare whether its own upper-layer service data is consistent with the bottom-layer hardware data on the different PON boards and uplink boards;
所述监控单元,还设置为当所述交换板的上层业务和所述不同PON板及上联板上的底层硬件的第一Group不一致,和/或,所述交换板的上 层业务和所述不同PON板及上联板上的底层硬件的第一Group对应的出口列表不一致时,进行故障恢复,包括:The monitoring unit is further configured to: when the upper-layer service of the switch board is inconsistent with the first Group of the lower-layer hardware on the different PON board and the uplink board, and/or, the upper-layer service of the switch board is inconsistent with the first Group of the lower-layer hardware on the uplink board. When the export lists corresponding to the first Group of the bottom hardware on different PON boards and uplink boards are inconsistent, perform fault recovery, including:
向所有用户发送第一Group的特定查询报文;Send a specific query message of the first Group to all users;
如果有用户应答,则记录所有应答用户的端口信息,根据所述所有应答用户的端口信息更新所述交换板的上层业务和所述不同PON板及上联板上的底层硬件的所述第一Group对应的出口列表;If a user responds, record the port information of all the responding users, and update the upper-layer services of the switch board and the first layer of the underlying hardware on the different PON boards and the upper-link board according to the port information of all the responding users. The export list corresponding to the Group;
如果没有用户应答,则通知所述交换板的上层业务和所述不同PON板及上联板上的底层硬件删除所述第一Group对应的条目。If there is no response from the user, the upper-layer service of the switch board and the lower-layer hardware on the different PON board and the upper-link board are notified to delete the entry corresponding to the first Group.
其中,当所述交换板的上层业务和所述不同PON板及上联板上的底层硬件的Group一致,且所述交换板的上层业务和所述不同PON板及上联板上的底层硬件的Group对应的出口列表一致时,Wherein, when the upper-layer service of the switch board is consistent with the Group of the lower-layer hardware on the different PON board and the uplink board, and the upper-layer service of the switch board is the same as the lower-layer hardware on the different PON board and the uplink board When the export list corresponding to the Group is consistent,
所述比较单元,还设置为The comparison unit is also set to
比较每个Group的所述交换板的出口流量和所述不同PON板的入口流量;Comparing the egress traffic of the switch board of each group with the ingress traffic of the different PON boards;
比较每个Group的所述交换板的入口流量和所述上联板的出口流量;Compare the inlet flow rate of the switch board and the outlet flow rate of the uplink board of each group;
所述监控单元,还设置为当任一Group的入口和出口流量差异超过阈值时,确认丢包位置并进行故障恢复。The monitoring unit is further configured to confirm the location of the packet loss and perform fault recovery when the difference between the ingress and egress traffic of any Group exceeds a threshold.
其中,在进行故障恢复之后,Among them, after the failure recovery,
所述扫描单元,还设置为再次扫描自身的上层业务数据和底层硬件数据;The scanning unit is further configured to scan its own upper-layer business data and lower-layer hardware data again;
所述比较单元,还设置为比较所述上层业务数据和底层硬件数据是否一致;The comparing unit is further configured to compare whether the upper-layer service data and the lower-layer hardware data are consistent;
所述监控单元,还设置为当一致时,故障恢复成功,当不一致时故障恢复失败,上报告警。The monitoring unit is also set to succeed in failure recovery when they are consistent, and to report an alarm when failure recovery fails when they are inconsistent.
本发明实施例还提供了一种板卡,包括:包括存储器、处理器及存储 在所述存储器上并可在所述处理器上运行的计算机程序,所述计算机程序被所述处理器执行时实现上述任一项所述组播业务监控的方法。The embodiment of the present invention also provides a board card, including: a memory, a processor, and a computer program stored on the memory and running on the processor. When the computer program is executed by the processor, A method for realizing any one of the foregoing multicast service monitoring.
本发明实施例还提供了一种计算机可读存储介质,所述计算机可读存储介质上存储有信息处理程序,所述信息处理程序被处理器执行时实现上述任一项所述组播业务监控的方法的步骤。The embodiment of the present invention also provides a computer-readable storage medium, the computer-readable storage medium stores an information processing program, and when the information processing program is executed by a processor, the monitoring of any one of the foregoing multicast services is implemented. Steps of the method.
本领域普通技术人员可以理解,上文中所公开方法中的全部或某些步骤、系统、装置中的功能模块/单元可以被实施为软件、固件、硬件及其适当的组合。在硬件实施方式中,在以上描述中提及的功能模块/单元之间的划分不一定对应于物理组件的划分;例如,一个物理组件可以具有多个功能,或者一个功能或步骤可以由若干物理组件合作执行。某些组件或所有组件可以被实施为由处理器,如数字信号处理器或微处理器执行的软件,或者被实施为硬件,或者被实施为集成电路,如专用集成电路。这样的软件可以分布在计算机可读介质上,计算机可读介质可以包括计算机存储介质(或非暂时性介质)和通信介质(或暂时性介质)。如本领域普通技术人员公知的,术语计算机存储介质包括在用于存储信息(诸如计算机可读指令、数据结构、程序模块或其他数据)的任何方法或技术中实施的易失性和非易失性、可移除和不可移除介质。计算机存储介质包括但不限于RAM、ROM、EEPROM、闪存或其他存储器技术、CD-ROM、数字多功能盘(DVD)或其他光盘存储、磁盒、磁带、磁盘存储或其他磁存储装置、或者可以用于存储期望的信息并且可以被计算机访问的任何其他的介质。此外,本领域普通技术人员公知的是,通信介质通常包含计算机可读指令、数据结构、程序模块或者诸如载波或其他传输机制之类的调制数据信号中的其他数据,并且可包括任何信息递送介质。A person of ordinary skill in the art can understand that all or some of the steps, functional modules/units in the system, and apparatus in the methods disclosed above can be implemented as software, firmware, hardware, and appropriate combinations thereof. In the hardware implementation, the division between functional modules/units mentioned in the above description does not necessarily correspond to the division of physical components; for example, a physical component may have multiple functions, or a function or step may consist of several physical components. The components are executed cooperatively. Some or all of the components may be implemented as software executed by a processor, such as a digital signal processor or a microprocessor, or as hardware, or as an integrated circuit, such as an application specific integrated circuit. Such software may be distributed on a computer-readable medium, and the computer-readable medium may include a computer storage medium (or non-transitory medium) and a communication medium (or transitory medium). As is well known to those of ordinary skill in the art, the term computer storage medium includes volatile and non-volatile data implemented in any method or technology for storing information (such as computer-readable instructions, data structures, program modules, or other data). Sexual, removable and non-removable media. Computer storage media include but are not limited to RAM, ROM, EEPROM, flash memory or other memory technologies, CD-ROM, digital versatile disk (DVD) or other optical disk storage, magnetic cassettes, magnetic tapes, magnetic disk storage or other magnetic storage devices, or Any other medium used to store desired information and that can be accessed by a computer. In addition, as is well known to those of ordinary skill in the art, communication media usually contain computer-readable instructions, data structures, program modules, or other data in a modulated data signal such as carrier waves or other transmission mechanisms, and may include any information delivery media. .

Claims (10)

  1. 一种组播业务监控的方法,应用于光线路终端OLT,包括:A method for monitoring multicast services, applied to an optical line terminal OLT, includes:
    所述OLT中的板卡定时扫描自身的上层业务数据和底层硬件数据;The board in the OLT regularly scans its own upper-layer service data and lower-layer hardware data;
    比较所述上层业务数据和底层硬件数据是否一致;Compare whether the upper-layer business data and the underlying hardware data are consistent;
    根据比较结果进行组播业务监控。Monitor the multicast service according to the comparison result.
  2. 根据权利要求1所述的方法,其中,The method of claim 1, wherein:
    所述上层业务数据包括上层业务的组播组Group和每个Group对应的出口列表;The upper-layer service data includes the multicast group group of the upper-layer service and the export list corresponding to each group;
    所述底层硬件数据包括底层硬件的Group、每个Group对应的出口列表和每个Group的入口和出口流量。The bottom-level hardware data includes the Group of the bottom-level hardware, the export list corresponding to each group, and the ingress and egress traffic of each group.
  3. 根据权利要求2所述的方法,其中,所述比较所述上层业务数据和底层硬件数据是否一致,包括:The method according to claim 2, wherein the comparing whether the upper-layer service data and the lower-layer hardware data are consistent, comprises:
    比较上层业务的组播组Group与底层硬件的Group是否一致;Compare whether the multicast group group of the upper-layer service is consistent with the group of the underlying hardware;
    比较上层业务每个Group对应的出口列表与底层硬件每个Group对应的出口列表是否一致。Compare whether the export list corresponding to each Group of the upper-level business is consistent with the export list corresponding to each Group of the underlying hardware.
  4. 根据权利要求3所述的方法,其中,所述根据比较结果进行组播业务监控,包括:The method according to claim 3, wherein the monitoring of the multicast service according to the comparison result comprises:
    当所述上层业务和底层硬件的第一Group不一致,和/或,所述上层业务和底层硬件的第一Group对应的出口列表不一致时,进行故障恢复,包括:When the upper-layer service and the first group of the lower-layer hardware are inconsistent, and/or the exit lists corresponding to the first group of the upper-layer service and the lower-layer hardware are inconsistent, performing fault recovery includes:
    向所有用户发送第一Group的特定查询报文;Send a specific query message of the first Group to all users;
    如果有用户应答,则记录所有应答用户的端口信息,根据所述所有应答用户的端口信息更新所述上层业务和底层硬件的所述第一Group对应的出口列表;If there is a user response, record the port information of all the responding users, and update the export list corresponding to the first Group of the upper-layer service and the underlying hardware according to the port information of all the responding users;
    如果没有用户应答,则通知上层业务和底层硬件删除所述第一Group对应的条目。If there is no user response, the upper-layer service and the lower-layer hardware are notified to delete the entry corresponding to the first Group.
  5. 根据权利要求3所述的方法,其中,所述根据比较结果进行组播业务监控,包括:The method according to claim 3, wherein the monitoring of the multicast service according to the comparison result comprises:
    当上层业务和底层硬件的Group一致且上层业务和底层硬件的Group对应的出口列表一致时,比较所述每个Group的入口和出口流量;When the upper-layer service and the group of the lower-layer hardware are consistent, and the export list corresponding to the upper-layer service and the group of the lower-layer hardware is consistent, compare the ingress and egress traffic of each group;
    当任一Group的入口和出口流量差异超过阈值时,确认丢包位置并进行故障恢复。When the difference between the ingress and egress traffic of any group exceeds the threshold, confirm the location of the packet loss and perform fault recovery.
  6. 根据权利要求1所述的方法,其中,The method of claim 1, wherein:
    所述板卡为以下之一:无源光网络PON板、交换板、上联板。The board card is one of the following: a passive optical network PON board, a switch board, and an uplink board.
  7. 根据权利要求6所述的方法,其中,当所述板卡为交换板时,该方法还包括:The method according to claim 6, wherein when the board is a switch board, the method further comprises:
    所述交换板定时扫描不同PON板及上联板上的底层硬件数据;The switch board regularly scans the underlying hardware data on different PON boards and uplink boards;
    比较自身的上层业务数据与所述不同PON板及上联板上的底层硬件数据是否一致;Compare whether its own upper-layer service data is consistent with the bottom-layer hardware data on the different PON boards and uplink boards;
    当所述交换板的上层业务和所述不同PON板及上联板上的底层硬件的第一Group不一致,和/或,所述交换板的上层业务和所述不同PON板及上联板上的底层硬件的第一Group对应的出口列表不一致时,进行故障恢复,包括:When the upper-layer service of the switch board is inconsistent with the first Group of the underlying hardware on the different PON board and the uplink board, and/or, the upper-layer service of the switch board is different from the different PON board and the uplink board When the export list corresponding to the first Group of the underlying hardware is inconsistent, perform fault recovery, including:
    向所有用户发送第一Group的特定查询报文;Send a specific query message of the first Group to all users;
    如果有用户应答,则记录所有应答用户的端口信息,根据所述所有应答用户的端口信息更新所述交换板的上层业务和所述不同PON板及上联板上的底层硬件的所述第一Group对应的出口列表;If there is a user answer, record the port information of all the answering users, and update the upper-layer services of the switch board and the first layer of the underlying hardware on the different PON boards and the upper-link board according to the port information of all the answering users. The export list corresponding to the Group;
    如果没有用户应答,则通知所述交换板的上层业务和所述不同PON板及上联板上的底层硬件删除所述第一Group对应的条目。If there is no response from the user, the upper-layer service of the switch board and the lower-layer hardware on the different PON board and the upper-link board are notified to delete the entry corresponding to the first Group.
  8. 根据权利要求7所述的方法,其中,该方法还包括:The method according to claim 7, wherein the method further comprises:
    当所述交换板的上层业务和所述不同PON板及上联板上的底层 硬件的Group一致,且所述交换板的上层业务和所述不同PON板及上联板上的底层硬件的Group对应的出口列表一致时,When the upper-layer service of the switch board is consistent with the Group of the lower-layer hardware on the different PON board and the uplink board, and the upper-layer service of the switch board is the same as the Group of the lower-layer hardware on the different PON board and the uplink board When the corresponding export lists are consistent,
    比较每个Group的所述交换板的出口流量和所述不同PON板的入口流量;Comparing the egress traffic of the switch board of each group with the ingress traffic of the different PON boards;
    比较每个Group的所述交换板的入口流量和所述上联板的出口流量;Compare the inlet flow rate of the switch board and the outlet flow rate of the uplink board of each group;
    当任一Group的入口和出口流量差异超过阈值时,确认丢包位置并进行故障恢复。When the difference between the ingress and egress traffic of any group exceeds the threshold, confirm the location of the packet loss and perform fault recovery.
  9. 根据权利要求4、5、7、8任一项所述的方法,其中,进行故障恢复之后,该方法还包括:The method according to any one of claims 4, 5, 7, and 8, wherein, after performing failure recovery, the method further comprises:
    再次扫描上层业务数据和底层硬件数据;Scan the upper-level business data and the underlying hardware data again;
    比较对应的上层业务数据和底层硬件数据是否一致;Compare whether the corresponding upper-layer business data and the underlying hardware data are consistent;
    当一致时,故障恢复成功,当不一致时故障恢复失败,上报告警。When they are consistent, the fault recovery is successful; when they are inconsistent, the fault recovery fails, and an alarm is reported.
  10. 一种板卡,应用于光线路终端OLT,包括:A board, applied to an optical line terminal OLT, including:
    扫描单元,设置为定时扫描自身的上层业务数据和底层硬件数据;The scanning unit is set to scan its own upper-layer business data and lower-layer hardware data regularly;
    比较单元,设置为比较所述上层业务数据和底层硬件数据是否一致;The comparing unit is set to compare whether the upper-layer business data and the lower-layer hardware data are consistent;
    监控单元,设置为根据比较结果进行组播业务监控。The monitoring unit is set to monitor the multicast service according to the comparison result.
PCT/CN2020/114858 2019-09-24 2020-09-11 Method for monitoring multicast service and board WO2021057517A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201910905763.3A CN112637708A (en) 2019-09-24 2019-09-24 Method and board card for monitoring multicast service
CN201910905763.3 2019-09-24

Publications (1)

Publication Number Publication Date
WO2021057517A1 true WO2021057517A1 (en) 2021-04-01

Family

ID=75166412

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2020/114858 WO2021057517A1 (en) 2019-09-24 2020-09-11 Method for monitoring multicast service and board

Country Status (2)

Country Link
CN (1) CN112637708A (en)
WO (1) WO2021057517A1 (en)

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1972223A (en) * 2006-12-13 2007-05-30 中兴通讯股份有限公司 A method and apparatus for monitoring multicast data flow
CN101056191A (en) * 2007-06-14 2007-10-17 中兴通讯股份有限公司 Multicast processing method in the GPON system
CN102905268A (en) * 2012-09-21 2013-01-30 福建三元达通讯股份有限公司 Method for improving multicast safety
WO2014101185A1 (en) * 2012-12-31 2014-07-03 华为技术有限公司 Method, apparatus and system for detecting performance of multicast channel
US20170055133A1 (en) * 2015-08-17 2017-02-23 Adtran, Inc. Multicast connection admission control

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1972223A (en) * 2006-12-13 2007-05-30 中兴通讯股份有限公司 A method and apparatus for monitoring multicast data flow
CN101056191A (en) * 2007-06-14 2007-10-17 中兴通讯股份有限公司 Multicast processing method in the GPON system
CN102905268A (en) * 2012-09-21 2013-01-30 福建三元达通讯股份有限公司 Method for improving multicast safety
WO2014101185A1 (en) * 2012-12-31 2014-07-03 华为技术有限公司 Method, apparatus and system for detecting performance of multicast channel
US20170055133A1 (en) * 2015-08-17 2017-02-23 Adtran, Inc. Multicast connection admission control

Also Published As

Publication number Publication date
CN112637708A (en) 2021-04-09

Similar Documents

Publication Publication Date Title
JP4610621B2 (en) Network system
JP6678781B2 (en) Optical network unit reset message
US8755685B2 (en) Joint switching method for an aggregation node, aggregation node and system
CN102137009B (en) Method, system and equipment for processing Dual-layer service in network
US7719956B2 (en) Trunk network system for multipoint-to-multipoint relay
WO2010075702A1 (en) Method and system for implementing automatic work order in ethernet passive optical network (epon)
CN106375384A (en) Management system of mirror network flow in virtual network environment and control method
CN103840960A (en) Passive optical network service configuration method and system
CN103763148B (en) ONU automatic authorization configuration recovery method
CN110300283B (en) Monitoring and inspection control method and device based on video networking
CN109561072B (en) Link detection method and system
CN109379254B (en) Network connection detection method and system based on video conference
CN101094087A (en) Controllable multicast system under environment of passive optical network, and implementing method
CN106412730A (en) Method for carrying out remote batch upgrading on ONU (Optical Network Unit) system software in Boot environment
CN111614953B (en) Restarting control method and system for fault IP camera
CN102611519A (en) Method and device for link protection of passive optical network
WO2009055995A1 (en) Maintaining method for automatic switched optical network system when operation engenders alarm
WO2016188196A1 (en) Method and device for detecting abnormal pon terminal
CN109729388B (en) Method and device for switching video networking servers
WO2017166922A1 (en) Multicast service monitoring method and device
WO2021057517A1 (en) Method for monitoring multicast service and board
WO2016124044A1 (en) Method for processing warning information of passive optical device and optical line terminal
CN110166363B (en) Multicast link monitoring method and device
WO2012000314A1 (en) Method, device and system for single board main-standby switching
CN111614480A (en) Service detection method, device and storage medium

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 20868745

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 20868745

Country of ref document: EP

Kind code of ref document: A1

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 21/02/2023)

122 Ep: pct application non-entry in european phase

Ref document number: 20868745

Country of ref document: EP

Kind code of ref document: A1