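WO2015109443A1 - Network service fault processing method, service management system, and system management module - Google Patents

Network service fault processing method, service management system, and system management module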

Info

Publication number
WO2015109443A1
Authority
WO
WIPO (PCT)
Prior art keywords
fault
network service
vnf
sms
information
Prior art date
Application number
PCT/CN2014/071007
Other languages
English (en)
French (fr)
Inventor
刘建宁
朱雷
Original Assignee
华为技术有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 华为技术有限公司
Priority to CN201480000951.7A priority Critical patent/CN105165054B/zh
Priority to BR112016016656A priority patent/BR112016016656A2/pt
Priority to MX2016009433A priority patent/MX2016009433A/es
Priority to PCT/CN2014/071007 priority patent/WO2015109443A1/zh
Priority to EP14880365.3A priority patent/EP3089505B1/en
Priority to RU2016134016A priority patent/RU2641706C1/ru
Publication of WO2015109443A1 publication Critical patent/WO2015109443A1/zh
Priority to US15/215,294 priority patent/US10680874B2/en
Priority to ZA2016/05062A priority patent/ZA201605062B/en

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00 Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/04 Network management architectures or arrangements
    • H04L41/046 Network management architectures or arrangements comprising network management agents or mobile agents therefor
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00 Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/06 Management of faults, events, alarms or notifications
    • H04L41/0631 Management of faults, events, alarms or notifications using root cause analysis; using analysis of correlation between notifications, alarms or events based on decision criteria, e.g. hierarchy, tree or time analysis
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00 Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/06 Management of faults, events, alarms or notifications
    • H04L41/0654 Management of faults, events, alarms or notifications using network fault recovery
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04W WIRELESS COMMUNICATION NETWORKS
    • H04W24/00 Supervisory, monitoring or testing arrangements
    • H04W24/04 Arrangements for maintaining operational condition

Definitions

  • The present invention relates to the field of communications, and in particular, to a network service fault processing method, a service management system, and a system management module.
  • Background
  • Network Function Virtualization (NFV)
  • Many equipment vendors and IT vendors participate in the organization to define the requirements for operator network function virtualization and related technical reports.
  • NFV uses the virtualization technology of information technology (IT), leveraging common high-performance, high-capacity servers, switches, and storage to implement some network functions in software.
  • Various types of network devices such as servers, routers, storage devices, CDNs, switches, etc., can be separated by software virtualization technology, so that they can be deployed in data centers, network nodes, or users' homes.
  • NFV completely separates the software that defines network functions from common high-performance hardware servers, storage, and network switches, enabling independent modularization of software and hardware components, and fully automated remote installation and management of software on common hardware devices.
  • the embodiment of the invention provides a network service fault processing method, a service management system and a system management module, which are used for handling network service faults in an NFV environment and maintaining the stability of the NFV system.
  • a first aspect of the embodiments of the present invention provides a service management system, including:
  • a monitoring module configured to monitor running performance data of the virtual network function entity VNF;
  • a determining module configured to determine, according to the running performance data of the VNF monitored by the monitoring module, whether the network service performance is abnormal;
  • an information requesting module configured to send network service fault association request information to the system management module when the determining module determines that the network service performance is abnormal, where the network service fault association request information is used by the system management module to query faults and feed back a network service fault association response to the fault diagnosis module;
  • the fault diagnosis module, configured to perform fault diagnosis according to the network service fault association response and the running performance data of the VNF monitored by the monitoring module, and obtain a network service fault diagnosis report;
  • a report sending module configured to send the network service fault diagnosis report obtained by the fault diagnosis module to the system management module, where the network service fault diagnosis report is used by the system management module to perform fault recovery.
  • the monitoring module is specifically configured to: periodically receive the running performance data of the VNF sent by the network element management system (EMS).
  • the system further includes:
  • a configuration receiving module configured to receive a range of configuration parameter values sent by the orchestrator (Orchestrator);
  • the determining module specifically includes:
  • a determining unit configured to determine whether the running performance data of the VNF is within the range of configuration parameter values received by the configuration receiving module;
  • a normal determining unit configured to determine that the network service performance is normal when the determining unit determines that the running performance data of the VNF is within the range of configuration parameter values;
  • an abnormality determining unit configured to determine that the network service performance is abnormal when the determining unit determines that the running performance data of the VNF is not within the range of configuration parameter values.
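The range check performed by the determining unit can be sketched as follows. This is a minimal illustration, not the patented implementation: the metric names, the `ParamRange` structure, and the choice to ignore metrics that have no configured range are all assumptions made for the example.

```python
from dataclasses import dataclass
from typing import Dict


@dataclass(frozen=True)
class ParamRange:
    """Closed interval allowed for one metric (hypothetical structure)."""
    low: float
    high: float

    def contains(self, value: float) -> bool:
        return self.low <= value <= self.high


def abnormal_metrics(perf_data: Dict[str, float],
                     ranges: Dict[str, ParamRange]) -> Dict[str, float]:
    """Return the metrics whose values fall outside their configured range.

    Metrics without a configured range are ignored; the source does not say
    how such metrics are treated, so this is an assumption.
    """
    return {
        name: value
        for name, value in perf_data.items()
        if name in ranges and not ranges[name].contains(value)
    }


def is_service_abnormal(perf_data: Dict[str, float],
                        ranges: Dict[str, ParamRange]) -> bool:
    """Treat the network service as abnormal if any metric is out of range."""
    return bool(abnormal_metrics(perf_data, ranges))


# Hypothetical ranges as an Orchestrator might send them, and one EMS sample.
ranges = {"cpu_load": ParamRange(0.0, 0.8), "latency_ms": ParamRange(0.0, 50.0)}
sample = {"cpu_load": 0.95, "latency_ms": 12.0}
```

Returning the out-of-range metrics, rather than only a boolean, keeps the abnormal data available for the later fault association request.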
  • the fault diagnosis module specifically includes:
  • a fault determining unit configured to determine a root cause fault and a pseudo fault according to the running performance data of the VNF monitored by the monitoring module and the network service fault association response, where a pseudo fault is a fault that recovers once the root cause fault recovers;
  • a fault analysis unit configured to analyze the root cause fault and the pseudo fault determined by the fault determining unit to obtain a processing strategy;
  • a report generating unit configured to generate the network service fault diagnosis report according to the root cause fault, the pseudo fault, and the processing strategy obtained by the fault analysis unit.
  • the fault determining unit is specifically configured to: combine the abnormal data in the running performance data with the fault information in the network service fault association response, look up a data fault information list, and obtain the corresponding root cause fault and pseudo fault, where a pseudo fault is a fault that recovers once the root cause fault recovers.
  • the fifth implementation manner of the first aspect of the embodiment of the present invention is characterized in that:
  • the service management system is deployed in the Orchestrator
  • the service management system is deployed in the system management module
  • the service management system is deployed in the EMS.
  • a second aspect of the embodiments of the present invention provides a system management module, including:
  • a request receiving unit configured to receive network service fault association request information sent by the service management system SMS;
  • the fault query unit is configured to query the fault information in the management scope according to the network service fault association request information received by the request receiving unit;
  • An information conversion unit configured to convert the fault information queried by the fault query unit to a network service fault association response
  • a sending unit configured to send a network service fault association response converted by the information conversion unit to the SMS;
  • a report receiving unit configured to receive a network service fault diagnosis report sent by the SMS, where the network service fault diagnosis report is performed by the SMS according to the network service fault association response sent by the sending unit and the running performance data of the virtual network function entity VNF Obtained after diagnosis;
  • a fault recovery unit configured to perform fault recovery according to the network service fault diagnosis report received by the report receiving unit.
  • the fault query unit is specifically configured to query the fault information of the managed device according to the network service fault association request information received by the request receiving unit.
  • NMS network management system
  • system management module is a virtual network function manager VNFM
  • the fault query unit specifically includes:
  • a first query subunit configured to query, according to the network service fault association request information received by the request receiving unit, fault information of a VNF associated with the network service requested by the network service fault association request information;
  • a second query subunit configured to query, according to the network service fault association request information received by the request receiving unit, the fault information of the network function virtualization infrastructure NFVI associated with the network service by using the virtualization infrastructure manager VIM ;
  • the information conversion unit is specifically configured to: convert the fault information of the VNF and the fault information of the NFVI into a network service fault association response.
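The two query subunits and the information conversion unit of a VNFM could be sketched roughly as below. The two query callables stand in for whatever interfaces a real VNFM and VIM expose; their names, the dictionary shapes, and the `layer` tag are assumptions introduced only for illustration.

```python
from typing import Callable, Dict, List

FaultRecord = Dict[str, str]  # hypothetical shape: {"source": ..., "type": ...}


def build_association_response(
    request: Dict[str, str],
    query_vnf_faults: Callable[[str], List[FaultRecord]],
    query_nfvi_faults_via_vim: Callable[[str], List[FaultRecord]],
) -> Dict[str, object]:
    """Query VNF and NFVI faults for one network service and merge them."""
    ns_id = request["network_service_id"]
    vnf_faults = query_vnf_faults(ns_id)            # first query subunit
    nfvi_faults = query_nfvi_faults_via_vim(ns_id)  # second query subunit (through the VIM)
    # Information conversion unit: merge both lists into a single response.
    faults = [{"layer": "VNF", **f} for f in vnf_faults]
    faults += [{"layer": "NFVI", **f} for f in nfvi_faults]
    return {"network_service_id": ns_id, "faults": faults}


# Stub queries standing in for real VNFM and VIM lookups.
def fake_vnf_query(ns_id: str) -> List[FaultRecord]:
    return [{"source": "vnf-1", "type": "overload"}]


def fake_vim_query(ns_id: str) -> List[FaultRecord]:
    return [{"source": "host-7", "type": "power_failure"}]


response = build_association_response(
    {"network_service_id": "ns-42"}, fake_vnf_query, fake_vim_query
)
```

Passing the queries in as callables keeps the sketch independent of any particular VNFM or VIM interface.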
  • a third aspect of the embodiments of the present invention provides a service management system, including:
  • an input device, an output device, a processor, a memory, and a bus;
  • the processor performs the following operations:
  • the network service fault association request information is sent to the system management module, where the network service fault association request information is used by the system management module to query faults and feed back the network service fault association response;
  • the processor performs the following operations: periodically receiving the running performance data of the VNF sent by the network element management system (EMS).
  • in a second implementation of the third aspect of the embodiments of the present invention,
  • the processor also performs the following operations:
  • the processor specifically performs the following operations:
  • the processor specifically performs the following operations:
  • determining a root cause fault and a pseudo fault according to the running performance data of the VNF and the network service fault association response, where a pseudo fault is a fault that recovers once the root cause fault recovers;
  • analyzing the root cause fault and the pseudo fault to obtain a processing strategy;
  • generating the network service fault diagnosis report according to the root cause fault, the pseudo fault, and the processing strategy.
  • the processor specifically performs the following operations:
  • a fourth aspect of the embodiments of the present invention provides a system management module, including:
  • an input device, an output device, a processor, a memory, and a bus;
  • the processor performs the following operations:
  • the fault recovery is performed according to the network service fault diagnosis report.
  • the processor specifically performs the following operations:
  • the processor specifically performs the following operations:
  • the fault information of the VNF and the fault information of the NFVI are converted into a network service fault correlation response.
  • a fifth aspect of the embodiments of the present invention provides a network service fault processing method, including:
  • the service management system (SMS) monitors the running performance data of the virtual network function entity (VNF); the SMS determines whether the network service performance is abnormal according to the running performance data of the VNF; if the network service performance is abnormal, the SMS sends network service fault association request information to the system management module, where the network service fault association request information is used by the system management module to query faults and feed back a network service fault association response to the SMS;
  • the SMS performs fault diagnosis according to the network service fault association response and the running performance data of the VNF, and obtains a network service fault diagnosis report;
  • SMS sends the network service fault diagnosis report to the system management module, where the network service fault diagnosis report is used by the system management module to perform fault recovery.
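The SMS-side steps above (monitor, determine, request, diagnose, report) can be sketched as one function. The `FakeSystemManager` class, its method names, and the dictionary message shapes are hypothetical stand-ins for an NMS or VNFM; the source does not define a programming interface.

```python
from typing import Callable, Dict, List, Optional


class FakeSystemManager:
    """Stand-in for an NMS or VNFM; records the report it is asked to act on."""

    def __init__(self, faults: List[Dict[str, str]]) -> None:
        self._faults = faults
        self.received_report: Optional[Dict[str, object]] = None

    def query_faults(self, request: Dict[str, object]) -> Dict[str, object]:
        # Answers a fault association request with a fault association response.
        return {"network_service_id": request["network_service_id"],
                "faults": self._faults}

    def recover(self, report: Dict[str, object]) -> None:
        # A real module would restart or migrate here; the sketch only stores the report.
        self.received_report = report


def process_sample(
    ns_id: str,
    perf_data: Dict[str, float],
    is_abnormal: Callable[[Dict[str, float]], bool],
    diagnose: Callable[[Dict[str, object], Dict[str, float]], Dict[str, object]],
    manager: FakeSystemManager,
) -> Optional[Dict[str, object]]:
    """Handle one monitored sample; return the report, or None if performance is normal."""
    if not is_abnormal(perf_data):
        return None
    request = {"network_service_id": ns_id, "abnormal_data": perf_data}
    response = manager.query_faults(request)
    report = diagnose(response, perf_data)
    manager.recover(report)
    return report


manager = FakeSystemManager([{"source": "vnf-1", "type": "overload"}])
report = process_sample(
    "ns-42",
    {"cpu_load": 0.95},
    is_abnormal=lambda d: d["cpu_load"] > 0.8,
    diagnose=lambda resp, d: {
        "network_service_id": resp["network_service_id"],
        "root_cause_fault": resp["faults"][0]["type"],
        "decision": "restart",
    },
    manager=manager,
)
```

A normal sample returns `None` without contacting the system management module, so the caller can simply continue with the next periodic sample.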
  • that the service management system SMS monitors the running performance data of the virtual network function entity VNF includes: the SMS periodically receives the running performance data of the VNF sent by the network element management system (EMS).
  • the running performance data includes:
  • before the SMS determines whether the network service performance is abnormal according to the running performance data of the VNF, the method includes:
  • the SMS receives a range of configuration parameter values sent by the orchestrator (Orchestrator);
  • that the SMS determines whether the network service performance is abnormal according to the running performance data of the VNF includes:
  • that the SMS performs fault diagnosis according to the network service fault association response and the running performance data of the VNF and obtains the network service fault diagnosis report includes:
  • the SMS determines the root cause fault and the pseudo fault according to the running performance data of the VNF and the network service fault association response, where a pseudo fault is a fault that recovers once the root cause fault recovers;
  • the SMS analyzes the root cause fault and the pseudo fault, and obtains a processing strategy;
  • the SMS generates the network service fault diagnosis report according to the root cause fault, the pseudo fault, and the processing strategy.
  • that the SMS determines the root cause fault and the pseudo fault according to the running performance data of the VNF and the network service fault association response includes:
  • the SMS combines the abnormal data in the running performance data of the VNF with the fault information in the network service fault association response, looks up a data fault information list, and obtains the corresponding root cause fault and pseudo fault.
  • the network service fault association request information includes:
  • a network service identifier and network service performance abnormality data, where the network service identifier is used to uniquely identify a network service in the network, and the network service performance abnormality data represents statistics of one or more performance abnormalities of the network service.
  • the network service fault association response includes:
  • the network service identifier, fault information, and a fault type, where the fault information identifies the entity in which the fault occurred, and the fault type identifies the type of the fault.
  • the network service fault diagnosis report includes:
  • the network service identifier, a root cause fault, and a root cause fault processing decision, where the root cause fault indicates the root cause of the network service abnormality,
  • and the root cause fault processing decision indicates a policy for processing the root cause fault.
  • the system management module is: a network management system (NMS), and/or a virtual network function manager (VNFM).
  • a sixth aspect of the embodiments of the present invention provides a network service fault processing method, including: a system management module receives network service fault association request information sent by a service management system (SMS); the system management module queries, according to the network service fault association request information, fault information within the management scope of the system management module;
  • SMS service management system
  • the system management module converts the fault information into a network service fault association response;
  • the system management module sends the network service fault association response to the SMS; the system management module receives a network service fault diagnosis report sent by the SMS, where the network service fault diagnosis report is obtained by the SMS after performing fault diagnosis according to the network service fault association response and the running performance data of the virtual network function entity VNF;
  • the system management module performs fault recovery according to the network service fault diagnosis report.
  • the querying of fault information within the management scope of the system management module includes:
  • the NMS queries the fault information of the device managed by the NMS.
  • the querying the fault information in the management scope of the system management module includes:
  • the VNFM queries the fault information of the VNF associated with the network service requested by the network service failure association request information;
  • the VNFM queries the network function virtualized infrastructure NFVI fault information associated with the network service through the virtualization infrastructure manager VIM;
  • the system management module converting the fault information into a network service fault association response includes: the VNFM converting the fault information of the VNF and the fault information of the NFVI into a network service fault association response.
  • the embodiment of the present invention has the following advantages:
  • when the service management system (SMS) detects that the running performance data of the VNF is abnormal, it sends the network service fault association request information to the system management module, and the system management module queries faults and feeds back the network service fault association response. The SMS then performs fault diagnosis according to the network service fault association response and the running performance data of the VNF, obtains the network service fault diagnosis report, and sends the report to the system management module, which can perform fault recovery according to it. Network service faults are thereby handled in the NFV environment: when a network service fault occurs, it can be found and resolved quickly, which helps keep the NFV system stable and improves its operating efficiency.
  • FIG. 1 is a schematic structural diagram of a service management system according to an embodiment of the present invention.
  • FIG. 2 is another schematic structural diagram of a service management system according to an embodiment of the present invention.
  • FIG. 3 is another schematic structural diagram of a service management system according to an embodiment of the present invention.
  • FIG. 4 is another schematic structural diagram of a service management system according to an embodiment of the present invention.
  • FIG. 5 is a schematic structural diagram of a system management module according to an embodiment of the present invention.
  • FIG. 6 is another schematic structural diagram of a system management module according to an embodiment of the present invention.
  • FIG. 7 is another schematic structural diagram of a system management module according to an embodiment of the present invention.
  • FIG. 8 is a schematic flowchart of a network service fault processing method according to an embodiment of the present invention.
  • FIG. 9 is another schematic flowchart of a network service fault processing method according to an embodiment of the present invention.
  • FIG. 11 is another schematic flowchart of a network service fault processing method according to an embodiment of the present invention.
  • FIG. 12 is another schematic flowchart of a network service fault processing method according to an embodiment of the present invention.
  • Detailed description
  • an embodiment of a service management system in an embodiment of the present invention includes:
  • the monitoring module 101 is configured to monitor the running performance data of the virtual network function entity VNF.
  • the determining module 102 is configured to determine, according to the running performance data of the VNF monitored by the monitoring module 101, whether the network service performance is abnormal.
  • the information requesting module 103 is configured to: when the determining module 102 determines that the network service performance is abnormal, send the network service fault association request information to the system management module, where the network service fault association request information is used by the system management module to query faults and feed back a network service fault association response to the fault diagnosis module 104;
  • the fault diagnosis module 104 is configured to perform fault diagnosis according to the network service fault association response and the running performance data of the VNF, and obtain a network service fault diagnosis report;
  • the report sending module 105 is configured to send the network service fault diagnosis report obtained by the fault diagnosis module 104 to the system management module, where the network service fault diagnosis report is used by the system management module to perform fault recovery.
  • the information requesting module 103 sends the network service fault association request information to the system management module, and the system management module queries faults and feeds back the network service fault association response; the fault diagnosis module 104 performs fault diagnosis according to the network service fault association response and the running performance data of the VNF and obtains the network service fault diagnosis report; the report sending module 105 sends the network service fault diagnosis report to the system management module, and the system management module can perform fault recovery according to the report. Network service faults are thereby handled in the NFV environment: when a network service fault occurs, it can be found and resolved quickly, which helps keep the NFV system stable and improves its operating efficiency.
  • the monitoring module 101 monitors the running performance data of the VNF, and the determining module 102 determines whether the network service performance is abnormal according to the running performance data of the VNF.
  • the monitoring module 101 can receive the running performance data of the VNF sent by the network element management system (EMS); the SMS may also receive the range of configuration parameter values sent by the orchestrator (Orchestrator), and the determining module 102 then determines whether the network service performance is abnormal according to the range of configuration parameter values and the running performance data of the VNF.
  • the following is a service management system in the embodiment of the present invention.
  • another embodiment of the service management system in the embodiment of the present invention includes:
  • the monitoring module 201 is configured to monitor the running performance data of the virtual network function entity VNF.
  • the determining module 202 is configured to determine, according to the running performance data of the VNF monitored by the monitoring module 201, whether the network service performance is abnormal.
  • the information requesting module 203 is configured to: when the determining module 202 determines that the network service performance is abnormal, send the network service fault association request information to the system management module, where the network service fault association request information is used by the system management module to query faults and feed back a network service fault association response to the fault diagnosis module 204;
  • the fault diagnosis module 204 is configured to perform fault diagnosis according to the network service fault association response and the running performance data of the VNF, and obtain a network service fault diagnosis report;
  • the report sending module 205 is configured to send the network service fault diagnosis report obtained by the fault diagnosis module 204 to the system management module, where the network service fault diagnosis report is used by the system management module to perform fault recovery;
  • the monitoring module 201 is specifically configured to: periodically receive the running performance data of the VNF sent by the network element management system EMS.
  • the service management system may also include:
  • the configuration receiving module 206 is configured to receive a range of configuration parameter values sent by the orchestrator Orchestrator;
  • the determining module 202 specifically includes:
  • the determining unit 2021 is configured to determine whether the running performance data of the VNF is within a range of configuration parameter values received by the configuration receiving module 206.
  • the normal determining unit 2022 is configured to: when the determining unit 2021 determines that the running performance data of the VNF is within the configuration parameter value range, determining that the network service performance is normal;
  • the abnormality determining unit 2023 is configured to determine that the network service performance is abnormal when the determining unit 2021 determines that the running performance data of the VNF is not within the configuration parameter value range.
  • the configuration receiving module 206 may first receive the range of configuration parameter values, and the determining module 202 determines whether the network service performance is abnormal according to that range and the running performance data of the VNF, which makes the determination more accurate. If the network service performance is determined to be normal, the step of receiving the running performance data of the VNF is triggered again, which enhances the continuity of fault monitoring and ensures timely fault handling.
  • the fault diagnosis module 204 performs fault diagnosis according to the network service fault association response and the running performance data of the VNF, and obtains a network service fault diagnosis report.
  • the fault diagnosis module 204 may first combine the network service fault association response with the running performance data of the VNF to determine the root cause fault and the pseudo fault, and then perform the subsequent processing.
  • the following is a detailed description of the service management system in the embodiment of the present invention.
  • another embodiment of the service management system in the embodiment of the present invention includes:
  • the monitoring module 301 is configured to monitor the running performance data of the virtual network function entity VNF.
  • the determining module 302 is configured to determine, according to the running performance data of the VNF that is monitored by the monitoring module 301, whether the network service performance is abnormal.
  • the information requesting module 303 is configured to: when the determining module 302 determines that the network service performance is abnormal, send the network service fault association request information to the system management module, where the network service fault association request information is used by the system management module to query faults and feed back a network service fault association response to the fault diagnosis module 304;
  • the fault diagnosis module 304 is configured to perform fault diagnosis according to the network service fault association response and the running performance data of the VNF, and obtain a network service fault diagnosis report;
  • a report sending module 305 configured to send the network service fault diagnosis report obtained by the fault diagnosis module 304 to the system management module, where the network service fault diagnosis report is used by the system management module to perform fault recovery;
  • the monitoring module 301 is specifically configured to: periodically receive the running performance data of the VNF sent by the network element management system EMS.
  • the service management system may also include:
  • the configuration receiving module 306 is configured to receive a range of configuration parameter values sent by the orchestrator (Orchestrator);
  • the determining module 302 specifically includes:
  • the determining unit 3021 is configured to determine whether the running performance data of the VNF is within a range of configuration parameter values received by the configuration receiving module 306;
  • the normal determining unit 3022 is configured to: when the determining unit 3021 determines that the running performance data of the VNF is within the configuration parameter value range, determining that the network service performance is normal;
  • the abnormality determining unit 3023 is configured to determine that the network service performance is abnormal when the determining unit 3021 determines that the running performance data of the VNF is not within the configuration parameter value range;
  • the fault diagnosis module 304 specifically includes:
  • the fault determining unit 3041 is configured to determine a root cause fault and a pseudo fault according to the running performance data of the VNF monitored by the monitoring module and the network service fault association response, where a pseudo fault is a fault that recovers once the root cause fault recovers;
  • the fault analysis unit 3042 is configured to analyze the root fault and the pseudo fault determined by the fault determining unit 3041 to obtain a processing strategy.
  • the report generating unit 3043 is configured to generate the network service fault diagnosis report according to the root source fault and the pseudo fault and the processing strategy analyzed by the fault analyzing unit 3042;
  • the fault determining unit 3041 is specifically configured to: combine the abnormal data in the running performance data of the VNF monitored by the monitoring module 301 with the fault information in the network service fault association response, look up a data fault information list, and obtain the corresponding root cause fault and pseudo fault, where a pseudo fault is a fault that recovers once the root cause fault recovers.
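One way to picture the lookup in a data fault information list is a table keyed by an abnormal metric and a reported fault type. The source does not describe the contents or layout of that list, so the table entries, fault names, and the rule that only observed faults are reported as pseudo faults are assumptions of this sketch.

```python
from typing import Dict, List, NamedTuple, Set, Tuple


class Diagnosis(NamedTuple):
    root_faults: List[str]
    pseudo_faults: List[str]


# Hypothetical data fault information list: for a pair of
# (abnormal metric, reported fault type), the root cause fault and the
# faults expected to clear once that root cause is fixed.
FAULT_TABLE: Dict[Tuple[str, str], Tuple[str, List[str]]] = {
    ("throughput", "nfvi_power_failure"): ("nfvi_power_failure", ["vnf_service_suspended"]),
    ("latency_ms", "vnf_overload"): ("vnf_overload", []),
}


def determine_faults(abnormal: Set[str], reported: Set[str]) -> Diagnosis:
    """Classify reported faults into root cause faults and pseudo faults."""
    roots: List[str] = []
    pseudos: List[str] = []
    for (metric, fault), (root, dependents) in FAULT_TABLE.items():
        if metric in abnormal and fault in reported:
            if root not in roots:
                roots.append(root)
            for dep in dependents:
                # Only faults that were actually reported count as pseudo faults.
                if dep in reported and dep not in pseudos:
                    pseudos.append(dep)
    # A fault already classified as a root cause is not also listed as pseudo.
    pseudos = [p for p in pseudos if p not in roots]
    return Diagnosis(roots, pseudos)


result = determine_faults(
    {"throughput"}, {"nfvi_power_failure", "vnf_service_suspended"}
)
```

With these inputs the power failure is the root cause fault and the suspended VNF service is a pseudo fault, so recovery only needs to target the former.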
  • the network service fault association request information includes a network service identifier, which uniquely identifies a network service in the network. It may further include network service performance abnormality data, which represents statistics on one or more abnormal performance items of the network service. It may also include the topology information of the network service or the time at which the network service abnormality occurred: the topology information represents all network devices or network functions included in the network service, and the occurrence time indicates when the fault occurred. Other network-service-related data may also be included, which is not limited herein.
  • the network service association response includes the network service identifier and the fault information. The fault information identifies the entity on which the fault occurred, so that it can be determined which part of the management scope of the system management module is faulty. The network service association response may further include a fault type, which identifies the type of the fault, such as overload, service suspension, or power failure. It may also include the topology information of the network service, network service performance abnormality data, the time at which the network service abnormality occurred, or other network-service or fault-related data, which is not limited herein.
  • the network service fault diagnosis report includes a network service identifier, a root cause fault, and a root cause fault processing decision, and the root source fault is used to indicate a root cause of the network service abnormality, and the root source fault processing decision is used to indicate a policy for processing the root cause fault, such as restarting, migrating, and the like.
  • the network service fault diagnosis report may further include a root fault type, a pseudo fault, a pseudo fault type, or a pseudo fault processing decision. It may also include the topology information of the network service, network service performance abnormality data, the time at which the network service abnormality occurred, or other network-service or fault-related data, which is not limited herein.
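  • The three messages described above can be sketched as plain data structures. This is only an illustrative sketch: the class and field names are assumptions chosen for readability, not names defined in the text, and only the fields the text calls mandatory are required.

```python
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class FaultAssociationRequest:
    """Network service fault association request information (SMS to system management module)."""
    network_service_id: str                            # mandatory: uniquely identifies the network service
    abnormal_performance_data: Optional[dict] = None   # optional: statistics on abnormal performance items
    topology: Optional[list] = None                    # optional: devices / functions in the service
    abnormality_time: Optional[str] = None             # optional: when the abnormality occurred


@dataclass
class FaultInfo:
    """One piece of fault information: which entity is faulty, and optionally how."""
    subject: str                       # entity on which the fault occurred, e.g. "GW2"
    fault_type: Optional[str] = None   # optional, e.g. "overload", "power failure"


@dataclass
class FaultAssociationResponse:
    """Network service association response (system management module to SMS)."""
    network_service_id: str
    faults: list = field(default_factory=list)   # list of FaultInfo


@dataclass
class FaultDiagnosisReport:
    """Network service fault diagnosis report (SMS to system management module)."""
    network_service_id: str
    root_fault: str
    root_fault_decision: str                            # e.g. "restart", "migrate"
    pseudo_faults: list = field(default_factory=list)   # optional


# "ns-1" is a hypothetical network service identifier.
request = FaultAssociationRequest("ns-1", abnormal_performance_data={"throughput": 0})
report = FaultDiagnosisReport("ns-1", root_fault="GW2", root_fault_decision="restart",
                              pseudo_faults=["GW1", "VNF-4"])
```

  • Keeping the optional fields defaulted to empty values mirrors the wording that they "may further" be included, so a minimal request carries only the network service identifier.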
  • the fault determining unit 3041 first combines the network service fault association response with the running performance data of the VNF to determine the root fault and the pseudo fault; the fault analyzing unit 3042 then analyzes the root fault and the pseudo fault to obtain a processing strategy; and the report generating unit 3043 generates the network service fault diagnosis report according to the root fault, the pseudo fault, and the processing strategy. This makes the analysis of network service faults more accurate and improves the accuracy of fault handling.
  • the service management system SMS can be deployed in different management systems, for example in the network element management system EMS, in the orchestrator Orchestrator, or in the NMS or the VNFM of the system management module, which is not limited herein.
  • when the SMS is deployed in different management systems, the path over which it receives or sends messages may differ.
  • the SMS can transmit various messages through the Orchestrator.
  • the configuration receiving module 306 receives the range of configuration parameter values sent by the Orchestrator
  • the monitoring module 301 receives the running performance data of the VNF sent by the EMS in a cycle of 10 seconds;
  • the determining unit 3021 determines that the service performance of the VNF-4 in the running performance data of the VNF exceeds the configuration parameter value range, and the abnormality determining unit 3023 determines that the network service performance of the VNF-4 is abnormal;
  • the information requesting module 303 sends the network service fault association request information to a system management module, where the network service fault association request information includes the network identifier of the VNF-4 and network service performance abnormality data, and is used by the system management module to query the fault and feed back the network service fault association response to the fault determining unit 3041;
  • the fault determining unit 3041 combines the running performance data of the VNF with the network service fault association response (which reports that the device GW2 is faulty with a throughput of 0, the device GW1 is normal but its service performance is overloaded, and the VNF-4 is normal but its service performance is overloaded), and determines that the device GW2 is the root fault and that the device GW1 and the VNF-4 are pseudo faults;
  • the fault analysis unit 3042 analyzes the root fault and the pseudo fault, and obtains a processing strategy for restarting the device GW2;
  • the report generating unit 3043 generates a network service fault diagnosis report according to the root fault, the pseudo fault, and the processing strategy, where the root fault is GW2 and the root fault processing decision is to restart GW2;
  • the report sending module 305 sends the network service fault diagnosis report to the system management module, and the network service fault diagnosis report is used by the system management module for failure recovery.
  • the service management system in the embodiment of the present invention is described above with reference to FIG. 4 from the perspective of a unitized functional entity.
  • Another embodiment of the service management system 400 in the embodiment of the present invention includes:
  • the input device 401, the output device 402, the processor 403, and the memory 404 (wherein the number of processors 403 in the service management system may be one or more, and one processor 403 is taken as an example in Fig. 4).
  • the input device 401, the output device 402, the processor 403, and the memory 404 may be connected by a bus or other means, wherein the bus connection is taken as an example in FIG.
  • the processor 403 is configured to perform the following steps by calling the operation instructions stored in the memory 404: monitoring the running performance data of the virtual network function entity VNF;
  • sending network service fault association request information to the system management module, where the network service fault association request information is used by the system management module to query a fault and feed back a network service fault association response;
  • the processor 403 specifically performs the following operations:
  • the processor 403 specifically performs the following operations:
  • the processor 403 specifically performs the following operations:
  • determining a root fault and a pseudo fault according to the running performance data of the VNF and the network service fault association response, where the pseudo fault is a fault that can be recovered as the root fault recovers; and analyzing the root fault and the pseudo fault to obtain a processing strategy;
  • the processor 403 specifically performs the following operations:
  • an embodiment of the system management module in the embodiment of the present invention includes:
  • the request receiving unit 501 is configured to receive network service fault association request information sent by the service management system SMS;
  • the fault query unit 502 is configured to query the fault information within the management scope according to the network service fault association request information received by the request receiving unit 501;
  • the information conversion unit 503 is configured to convert the fault information queried by the fault query unit 502 into a network service fault association response;
  • the sending unit 504 is configured to send the network service fault association response converted by the information conversion unit 503 to the SMS;
  • the report receiving unit 505 is configured to receive the network service fault diagnosis report sent by the SMS, where the report is obtained by the SMS after performing fault diagnosis according to the network service fault association response and the running performance data of the virtual network function entity VNF;
  • the fault recovery unit 506 is configured to perform fault recovery according to the network service fault diagnosis report received by the report receiving unit 505.
  • the fault query unit 502 queries the fault information according to the network service fault association request information received by the request receiving unit 501, and the information conversion unit 503 converts the fault information into a network service fault association response, and then sends the fault information to the SMS by the sending unit 504. Then, the fault recovery unit 506 performs fault recovery according to the network service fault diagnosis report sent by the SMS, and implements the network service fault processing in the NFV environment.
  • when a network service fails, the fault can be found and handled quickly, which ensures the stability of the NFV system and improves the operating efficiency of the NFV system.
  • the system management module may be a network management system NMS, or a virtual network function manager VNFM, or a combination of an NMS and a VNFM.
  • NMS network management system
  • VNFM virtual network function manager
  • Referring to FIG. 6, another embodiment of the system management module in the embodiment of the present invention includes:
  • the request receiving unit 601 is configured to receive network service fault association request information sent by the service management system SMS;
  • the fault query unit 602 is configured to query the fault information in the management scope according to the network service fault association request information received by the request receiving unit 601.
  • the information conversion unit 603 is configured to convert the fault information queried by the fault query unit 602 into a network service fault association response;
  • the sending unit 604 is configured to send the network service fault association response converted by the information conversion unit 603 to the SMS;
  • the report receiving unit 605 is configured to receive a network service fault diagnosis report sent by the SMS, where the report is obtained by the SMS after performing fault diagnosis according to the network service fault association response and the running performance data of the virtual network function entity VNF;
  • the fault recovery unit 606 is configured to perform fault recovery according to the network service fault diagnosis report received by the report receiving unit 605;
  • the fault query unit 602 is specifically configured to query the fault information of the managed device according to the network service fault association request information received by the request receiving unit.
  • the fault query unit 602 includes:
  • the first query subunit 6021 is configured to query the fault information of the VNF associated with the network service requested by the network service fault association request information according to the network service fault association request information received by the request receiving unit 601;
  • the second query sub-unit 6022 is configured to query, through the virtualization infrastructure manager VIM and according to the network service fault association request information received by the request receiving unit 601, the fault information of the network function virtualization infrastructure NFVI associated with the network service;
  • the information conversion unit 603 is specifically configured to convert the fault information of the VNF and the fault information of the NFVI into a network service fault association response.
  • when the system management module is the VNFM, after the request receiving unit 601 receives the network service fault association request information, the first query subunit 6021 can query the fault information of the VNF and the second query subunit 6022 can query the fault information of the NFVI through the VIM. The information conversion unit 603 then converts the fault information of the VNF and the fault information of the NFVI into a network service fault association response. Because the fault query for the VNF and the NFVI is performed by the virtual network function manager VNFM, the fault information can be obtained faster than by querying VNF and NFVI faults in other ways, which improves the efficiency of fault query.
  • the request receiving unit 601 receives the network service fault association request information sent by the SMS, where the network service fault association request information includes the VNF-4 overload;
  • the first query sub-unit 6021 queries the VNF fault information associated with the VNF-4 according to the network service fault association request information, and finds that the VNF-2 is normal but its service performance is overloaded, and that the VNF-4 is normal but its service performance is overloaded;
  • the second query sub-unit 6022 queries the fault information of the NFVI associated with the VNF-4 through the VIM to obtain an NFVI fault with a throughput of 0;
  • the information conversion unit 603 converts the VNF fault information (VNF-2 normal, service performance overloaded; VNF-4 normal, service performance overloaded) and the NFVI fault information (NFVI faulty, throughput 0) into a network service fault association response;
  • the sending unit 604 sends the network service fault association response to the SMS;
  • the report receiving unit 605 receives the network service fault diagnosis report sent by the SMS, where the root fault is the NFVI, the root fault processing decision is to restart the NFVI, and the pseudo faults are VNF-1 and VNF-4;
  • the failure recovery unit 606 restarts the NFVI based on the network service failure diagnosis report.
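  • The query and conversion steps of this VNFM scenario can be sketched as follows. This is a minimal illustration under stated assumptions: the function names, the dictionary layout, and the stand-in VIM callback are all hypothetical, and real VNFM or VIM interfaces are not modeled.

```python
def query_vnf_faults(vnf_status):
    """First query subunit (sketch): collect fault information of the associated VNFs."""
    return [{"subject": name, **status} for name, status in vnf_status.items()]


def query_nfvi_faults(network_service_id, vim_query):
    """Second query subunit (sketch): ask the VIM for NFVI fault information."""
    return vim_query(network_service_id)


def to_association_response(network_service_id, vnf_faults, nfvi_faults):
    """Information conversion unit (sketch): merge both results into one response."""
    return {"network_service_id": network_service_id,
            "faults": vnf_faults + nfvi_faults}


# Data taken from the scenario above; "ns-1" is a hypothetical service identifier.
vnf_status = {
    "VNF-2": {"state": "normal", "performance": "overload"},
    "VNF-4": {"state": "normal", "performance": "overload"},
}


def fake_vim_query(network_service_id):
    # Stand-in for a VIM lookup; returns the NFVI fault from the scenario.
    return [{"subject": "NFVI", "state": "fault", "throughput": 0}]


response = to_association_response(
    "ns-1",
    query_vnf_faults(vnf_status),
    query_nfvi_faults("ns-1", fake_vim_query),
)
```

  • Passing the VIM lookup in as a callable keeps the sketch independent of any particular VIM interface.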
  • system management module 700 includes:
  • the input device 701, the output device 702, the processor 703, and the memory 704 (wherein the number of processors 703 in the service management system may be one or more, and one processor 703 is taken as an example in Fig. 7).
  • the input device 701, the output device 702, the processor 703, and the memory 704 may be connected by a bus or other means, wherein the bus connection is taken as an example in FIG.
  • the processor 703 is configured to perform the following steps by calling the operation instructions stored in the memory 704: receiving network service fault association request information sent by the service management system SMS;
  • the network service fault diagnosis report is obtained by the SMS after performing fault diagnosis according to the network service fault association response and the running performance data of the virtual network function entity VNF;
  • the processor 703 specifically performs the following operation: querying fault information of the managed device;
  • when the system management module is a virtual network function manager VNFM, the processor 703 performs the following operations:
  • the fault information of the VNF and the fault information of the NFVI are converted into a network service fault correlation response.
  • an embodiment of the network service fault processing method in the embodiment of the present invention includes:
  • the service management system SMS monitors the operational performance data of the virtual network functional entity VNF.
  • the SMS determines whether the network service performance is abnormal according to the running performance data of the VNF.
  • After the SMS obtains the running performance data of the VNF through monitoring, it determines whether the network service performance is abnormal according to the running performance data of the VNF;
  • the SMS sends the network service fault association request information to the system management module.
  • When the SMS determines that the network service performance is abnormal, the SMS sends the network service fault association request information to the system management module, where the network service fault association request information is used by the system management module to query the fault and feed back the network service association response to the SMS.
  • the SMS performs fault diagnosis according to the network service fault association response and the running performance data of the VNF, and obtains a network service fault diagnosis report;
  • After receiving the network service association response fed back by the system management module, the SMS performs fault diagnosis according to the network service fault association response and the running performance data of the VNF, and obtains a network service fault diagnosis report.
  • the SMS sends the network service fault diagnosis report to the system management module.
  • After obtaining the network service fault diagnosis report, the SMS sends it to the system management module, and the report is used by the system management module for fault recovery.
  • In this embodiment, when the SMS determines that the network service performance is abnormal, it sends the network service fault association request information to the system management module; the system management module queries the fault and feeds back the network service fault association response; the SMS performs fault diagnosis according to the network service fault association response and the running performance data of the VNF, obtains a network service fault diagnosis report, and sends the report to the system management module; and the system management module can perform fault recovery according to the report. Network service fault handling is thereby implemented in the NFV environment: when a network service fails, the fault can be found and resolved quickly, which ensures the stability of the NFV system and improves its operating efficiency.
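  • The five steps of this method can be sketched as a single pass of control flow on the SMS side. The sketch below is an assumption-laden illustration: the stub class, its method names `query_faults` and `recover`, and the message layout are hypothetical, and the judging and diagnosing logic is passed in as callables rather than specified.

```python
class StubSystemManagementModule:
    """Stand-in for an NMS or VNFM; both method names are hypothetical."""

    def __init__(self, known_faults):
        self.known_faults = known_faults
        self.recovered = []

    def query_faults(self, request):
        # Query the fault and feed back a network service fault association response.
        return {"network_service_id": request["network_service_id"],
                "faults": self.known_faults}

    def recover(self, report):
        # Perform fault recovery according to the diagnosis report (recorded only).
        self.recovered.append(report)


def handle_once(perf_data, is_abnormal, diagnose, smm):
    """One pass of the SMS flow: judge, request, diagnose, report."""
    if not is_abnormal(perf_data):
        return None  # performance is normal: nothing to request
    request = {"network_service_id": perf_data["network_service_id"],
               "abnormal_data": perf_data["metrics"]}
    response = smm.query_faults(request)
    report = diagnose(response, perf_data)
    smm.recover(report)
    return report


smm = StubSystemManagementModule(known_faults=[{"subject": "GW2", "state": "fault"}])
report = handle_once(
    perf_data={"network_service_id": "ns-1", "metrics": {"throughput": 0}},
    is_abnormal=lambda data: data["metrics"]["throughput"] == 0,
    diagnose=lambda resp, data: {"network_service_id": resp["network_service_id"],
                                 "root_fault": resp["faults"][0]["subject"],
                                 "decision": "restart"},
    smm=smm,
)
```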
  • the SMS monitors the running performance data of the VNF, and determines whether the network service performance is abnormal according to the running performance data of the VNF.
  • the SMS can receive the running performance data of the VNF sent by the network element management system EMS, and can also receive the configuration parameter value range sent by the orchestrator Orchestrator, and then determine whether the network service performance is abnormal according to the configuration parameter value range and the running performance data of the VNF.
  • the following describes the network service fault handling method in the embodiment of the present invention, as shown in FIG.
  • Another embodiment of the network service fault processing method in the embodiment of the present invention includes:
  • the SMS periodically receives the running performance data of the VNF sent by the network element management system EMS.
  • the service management system SMS periodically receives the running performance data of the VNF sent by the network element management system EMS.
  • the SMS may periodically receive the running performance data of the VNF in either of two ways: the EMS periodically detects the running performance data of the VNF and then sends it to the SMS, or the SMS periodically sends a data request to the EMS, and the EMS then monitors the running performance data of the VNF and feeds it back to the SMS. This is not limited herein.
  • the period in which the SMS receives the running performance data of the VNF may be set according to the actual situation.
  • the period may be one second or one minute, which is not limited herein.
  • the running performance data may be a data throughput, a delay, or a bandwidth, may be a parameter indicating a data throughput, a delay, or a bandwidth, and may also include other parameters describing the network service characteristics, which are not limited herein.
  • the SMS receives a configuration parameter value range sent by the orchestrator Orchestrator.
  • the SMS receives the range of configuration parameter values sent by Orchestrator.
  • the range of values of the configuration parameters indicates the range of values of the running performance data of the VNF when the network service is running normally.
  • step 202 may be performed at any time before step 203, which is not limited herein. Further, if the configuration parameter value range is already stored in the SMS, step 202 may be omitted, which is not limited herein.
  • the SMS determines whether the running performance data of the VNF is within the configuration parameter value range;
  • after the SMS receives the running performance data of the VNF and the configuration parameter value range, the SMS determines whether the running performance data of the VNF is within the configuration parameter value range;
  • if the SMS determines that the running performance data of the VNF is not within the configuration parameter value range, it determines that the network service performance is abnormal and triggers step 204;
  • if the SMS determines that the running performance data of the VNF is within the configuration parameter value range, step 201 may be triggered; alternatively, the SMS may first pause for a preset time and then trigger step 201, which is not limited herein.
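  • The range check in this step can be sketched as a comparison of each monitored value against its configured range. This is a minimal sketch: the metric names, the representation of a range as a `(low, high)` pair, and the choice to skip metrics without a configured range are assumptions made for illustration.

```python
def out_of_range_metrics(perf_data, config_ranges):
    """Return the metrics whose value lies outside the configured [low, high] range."""
    abnormal = {}
    for metric, value in perf_data.items():
        if metric not in config_ranges:
            continue  # assumption: a metric without a configured range is not judged
        low, high = config_ranges[metric]
        if not (low <= value <= high):
            abnormal[metric] = value
    return abnormal


# Hypothetical configuration parameter value range, as it might be sent by the Orchestrator.
config_ranges = {"throughput_mbps": (100, 1000), "delay_ms": (0, 50)}

normal = out_of_range_metrics({"throughput_mbps": 400, "delay_ms": 20}, config_ranges)
abnormal = out_of_range_metrics({"throughput_mbps": 0, "delay_ms": 20}, config_ranges)
```

  • An empty result corresponds to normal network service performance, in which case monitoring simply continues; a non-empty result corresponds to the abnormal case and can be carried in the request as the network service performance abnormality data.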
  • the SMS sends a network service fault association request message to the system management module.
  • when the SMS determines that the running performance data of the VNF is not within the configuration parameter value range, and thereby determines that the network service performance is abnormal, the SMS sends the network service fault association request information to the system management module, where the network service fault association request information is used by the system management module to query the fault and feed back the network service association response to the SMS.
  • the SMS performs fault diagnosis according to the network service fault association response and the running performance data of the VNF, and obtains a network service fault diagnosis report;
  • After receiving the network service association response fed back by the system management module, the SMS performs fault diagnosis according to the network service fault association response and the running performance data of the VNF, and obtains a network service fault diagnosis report.
  • the SMS sends the network service fault diagnosis report to the system management module.
  • the SMS may first receive the configuration parameter value range and determine whether the network service performance is abnormal according to that range and the running performance data of the VNF, which makes the determination of network service performance more accurate; and if the network service performance is determined to be normal, the step of receiving the running performance data of the VNF is triggered again, which enhances the continuity of fault monitoring and ensures the timeliness of fault handling.
  • the SMS performs fault diagnosis according to the network service fault association response and the running performance data of the VNF, and obtains a network service fault diagnosis report.
  • the SMS may first determine the root fault and the pseudo fault according to the network service fault association response and the running performance data of the VNF, and then process them further.
  • the network fault processing method in the embodiment of the present invention is described in detail below. Referring to FIG. 10, another embodiment of the network fault processing method in the embodiment of the present invention includes:
  • the SMS periodically receives the running performance data of the VNF sent by the network element management system EMS;
  • when the NFV system is running, the service management system SMS periodically receives the running performance data of the VNF sent by the network element management system EMS.
  • the SMS may periodically receive the running performance data of the VNF in either of two ways: the EMS periodically detects the running performance data of the VNF and then sends it to the SMS, or the SMS periodically sends a data request to the EMS, and the EMS then monitors the running performance data of the VNF and feeds it back to the SMS. This is not limited herein.
  • the period in which the SMS receives the running performance data of the VNF may be set according to the actual situation.
  • the period may be one second or one minute, which is not limited herein.
  • the running performance data may be a data throughput, a delay or a bandwidth, and may also be a parameter indicating a data throughput, a delay or a bandwidth, and may also include other parameters describing the network service characteristics, which are not limited herein.
  • the SMS receives the range of configuration parameter values sent by Orchestrator.
  • the range of values of the configuration parameters indicates the range of values of the running performance data of the VNF when the network service is running normally.
  • step 1002 may be performed at any time before step 1003, which is not limited herein. Further, if the configuration parameter value range is already stored in the SMS, step 1002 may be omitted, which is not limited herein.
  • the SMS determines whether the running performance data of the VNF is within the configuration parameter value range;
  • after the SMS receives the running performance data of the VNF and the configuration parameter value range, the SMS determines whether the running performance data of the VNF is within the configuration parameter value range;
  • if the SMS determines that the running performance data of the VNF is not within the configuration parameter value range, it determines that the network service performance is abnormal and triggers step 1004;
  • if the SMS determines that the running performance data of the VNF is within the configuration parameter value range, step 1001 may be triggered; alternatively, the SMS may first pause for a preset time and then trigger step 1001, which is not limited herein.
  • the SMS sends a network service fault association request information to the system management module.
  • when the SMS determines that the running performance data of the VNF is not within the configuration parameter value range, and thereby determines that the network service performance is abnormal, the SMS sends the network service fault association request information to the system management module, where the network service fault association request information is used by the system management module to query the fault and feed back the network service association response to the SMS.
  • the network service fault association request information includes a network service identifier, which uniquely identifies a network service in the network. It may further include network service performance abnormality data, which represents statistics on one or more abnormal performance items of the network service. It may also include the topology information of the network service or the time at which the network service abnormality occurred: the topology information represents all network devices or network functions included in the network service, and the occurrence time indicates when the fault occurred. Other network-service-related data may also be included, which is not limited herein.
  • the system management module can be either the network management system NMS or the virtual network function manager VNFM. It can also include NMS and VNFM, which are not limited here.
  • SMS combines VNF operational performance data and network service fault correlation response to determine root faults and pseudo faults
  • After receiving the network service association response fed back by the system management module, the SMS determines the root fault and the pseudo fault according to the running performance data of the VNF and the network service fault association response, where the root fault indicates the root cause of the network service abnormality, and the pseudo fault is a fault that can be recovered as the root fault recovers.
  • the network service association response includes a network service identifier and fault information. The fault information identifies the entity on which the fault occurred, so that it can be determined which part of the management scope of the system management module is faulty. The network service association response may also include a fault type, which identifies the type of the fault, such as overload, service suspension, or power failure. It may also include other network-service or fault-related data such as the topology information of the network service, network service performance abnormality data, or the time at which the network service abnormality occurred, which is not limited herein.
  • the SMS can combine the abnormal data in the running performance data of the VNF with the fault information in the network service fault association response, look up the data fault information list, and obtain the corresponding root fault and pseudo fault.
  • alternatively, the running performance data of the VNF and the network service fault association response may be fed into an analysis tool that determines the root fault and the pseudo fault; other methods for determining the root fault and the pseudo fault may also be used, which are not limited herein.
  • the data fault information list can be manually set, or the experience data can be imported, and can also be updated by machine learning, which is not limited herein.
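  • One way to picture the data fault information list is as a table that maps a set of observations to a diagnosis. The sketch below is hypothetical: the text does not define the list's structure, so the `(subject, condition)` tuples, the subset-matching rule, and the single table entry (taken from the GW2 scenario) are assumptions.

```python
# Hypothetical data fault information list: each entry maps a set of observations
# (abnormal VNF data plus fault information from the response) to a diagnosis.
FAULT_INFO_LIST = [
    {
        "observations": frozenset({("GW2", "fault"), ("GW1", "overload"), ("VNF-4", "overload")}),
        "root_fault": "GW2",
        "pseudo_faults": ["GW1", "VNF-4"],
    },
]


def determine_faults(abnormal_data, fault_info, fault_info_list=FAULT_INFO_LIST):
    """Return (root_fault, pseudo_faults) for the first entry whose observations all occur."""
    observed = frozenset(abnormal_data) | frozenset(fault_info)
    for entry in fault_info_list:
        if entry["observations"] <= observed:  # every listed observation was seen
            return entry["root_fault"], entry["pseudo_faults"]
    return None, []  # no matching entry in the list


root, pseudo = determine_faults(
    abnormal_data=[("VNF-4", "overload")],
    fault_info=[("GW2", "fault"), ("GW1", "overload"), ("VNF-4", "overload")],
)
```

  • Because the list is ordinary data, it could be filled in by hand, loaded from imported experience data, or rewritten by a learning component without changing the lookup.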
  • the SMS analyzes the root cause failure and the pseudo fault, and obtains a processing strategy
  • the root cause and the pseudo fault are analyzed to obtain a processing strategy.
  • the SMS generates a network service fault diagnosis report according to the root cause failure and the pseudo fault and the processing policy
  • the network service fault diagnosis report is generated according to the root source fault and the pseudo fault and the processing strategy.
  • the network service fault diagnosis report includes a network service identifier, a root cause fault, and a root cause fault processing decision, and the root source fault is used to indicate a root cause of the network service abnormality, and the root source fault processing decision is used to indicate a policy for processing the root cause fault, such as restarting, migrating, and the like.
  • the network service fault diagnosis report may further include a root fault type, a pseudo fault, a pseudo fault type, or a pseudo fault processing decision. It may also include the topology information of the network service, network service performance abnormality data, the time at which the network service abnormality occurred, or other network-service or fault-related data, which is not limited herein.
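  • The analysis and report-generation steps can be sketched together. The mapping from fault type to processing decision below is purely an assumption for illustration (the text only names restarting and migrating as example decisions), as are the function names and the report layout.

```python
# Hypothetical mapping from root fault type to processing decision.
DECISION_BY_FAULT_TYPE = {"overload": "migrate", "service suspension": "restart"}
DEFAULT_DECISION = "restart"  # assumption: fall back to a restart for unknown types


def analyze(root_fault_type):
    """Pick a processing decision for the root fault (sketch)."""
    return DECISION_BY_FAULT_TYPE.get(root_fault_type, DEFAULT_DECISION)


def generate_report(network_service_id, root_fault, root_fault_type=None, pseudo_faults=()):
    """Build a diagnosis report with the mandatory fields plus any optional ones supplied."""
    report = {
        "network_service_id": network_service_id,
        "root_fault": root_fault,
        "root_fault_decision": analyze(root_fault_type),
    }
    if root_fault_type is not None:
        report["root_fault_type"] = root_fault_type
    if pseudo_faults:
        report["pseudo_faults"] = list(pseudo_faults)
    return report


# "ns-1" is a hypothetical network service identifier.
report = generate_report("ns-1", "GW2", "service suspension", ["GW1", "VNF-4"])
```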
  • SMS sends a network service fault diagnosis report to the system management module.
  • After the SMS generates the network service fault diagnosis report, it sends the report to the system management module, and the report is used by the system management module for fault recovery.
  • the SMS may also receive a network service fault recovery notification fed back by the system management module after the fault has been recovered; the notification is used to indicate that the system management module has recovered the fault.
  • In this embodiment, the SMS first determines the root-cause fault and the pseudo fault according to the network service fault association response and the running performance data of the VNF, then analyzes them to obtain a processing strategy, and then generates a network service fault diagnosis report according to the root-cause fault, the pseudo fault, and the processing strategy and sends it to the system management module. This makes the analysis of the network service fault more accurate and improves the accuracy of fault processing.
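The report contents described above can be pictured as a small record type. The following is a minimal sketch, not the patent's implementation: the class name `FaultDiagnosisReport`, the function `generate_report`, and the identifier `"ns-1"` are all assumptions made for illustration; only the field meanings come from the text.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class FaultDiagnosisReport:
    # Mandatory fields named in the text.
    service_id: str            # network service identifier
    root_fault: str            # root cause of the service abnormality
    root_fault_decision: str   # e.g. "restart" or "migrate"
    # Optional fields named in the text.
    root_fault_type: Optional[str] = None
    pseudo_faults: List[str] = field(default_factory=list)


def generate_report(service_id, root_fault, pseudo_faults, strategy):
    """Last diagnosis step: package the findings into a report."""
    return FaultDiagnosisReport(
        service_id=service_id,
        root_fault=root_fault,
        root_fault_decision=strategy,
        pseudo_faults=list(pseudo_faults),
    )


# Hypothetical values; "ns-1" is an invented service identifier.
report = generate_report("ns-1", "GW2", ["GW1", "VNF-4"], "restart")
print(report)
```

Keeping the optional fields defaulted means a report that carries only the three mandatory items is still valid, which matches the "may further include" wording.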
  • It can be understood that the SMS can be deployed in different management systems: for example, in the element management system (EMS), in the Orchestrator, or in the NMS or the VNFM within the system management module. This is not limited here.
  • When the SMS is deployed in different management systems, the paths for receiving or sending various messages may differ.
  • For example, when the SMS is deployed in the NMS within the system management module, the SMS can transmit messages through the Orchestrator.
  • The SMS receives the configuration parameter value range sent by the Orchestrator;
  • The SMS receives the running performance data of the VNF sent by the EMS at 10-second intervals;
  • The SMS determines that the service performance of VNF-4 in the running performance data exceeds the configuration parameter value range, and therefore determines that the network service performance of VNF-4 is abnormal;
  • The SMS sends network service fault association request information to the system management module. The request information includes the network identifier of VNF-4 and the network service performance abnormality data, and is used by the system management module to query faults and feed back a network service fault association response to the SMS;
  • The SMS combines the running performance data of the VNF with the network service fault association response (which includes: device GW2 faulty, throughput 0; device GW1 normal, service performance overloaded; VNF-4 normal, service performance overloaded) and determines that device GW2 is the root-cause fault and that device GW1 and VNF-4 are pseudo faults;
  • The SMS analyzes the root-cause fault and the pseudo faults, and obtains the processing strategy of restarting device GW2;
  • The SMS generates a network service fault diagnosis report according to the root-cause fault, the pseudo faults, and the processing strategy. The report states that the root-cause fault is GW2 and that the root-cause fault processing decision is to restart GW2;
  • The SMS sends the network service fault diagnosis report to the system management module, which uses it for fault recovery.
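The scenario above can be sketched as a lookup against a data fault information list. This is only an illustration under stated assumptions: the table `FAULT_LIST`, its keys, and the tuple layout of the response are invented here, and the text does not specify how the real list is structured.

```python
# Hypothetical "data fault information list": maps an observed
# (device status, symptom) pair to a fault class. Invented for illustration.
FAULT_LIST = {
    ("fault", "throughput_zero"): "root",
    ("normal", "overload"): "pseudo",
}


def classify(response):
    """Split the entities in a fault association response into
    root-cause faults and pseudo faults using the lookup table."""
    root, pseudo = [], []
    for entity, status, symptom in response:
        kind = FAULT_LIST.get((status, symptom))
        if kind == "root":
            root.append(entity)
        elif kind == "pseudo":
            pseudo.append(entity)
    return root, pseudo


# Entries mirror the GW2 / GW1 / VNF-4 scenario in the text.
response = [
    ("GW2", "fault", "throughput_zero"),
    ("GW1", "normal", "overload"),
    ("VNF-4", "normal", "overload"),
]
root, pseudo = classify(response)
print(root, pseudo)
```

A table-driven rule set was chosen because the text says the list can be set manually, imported, or learned; any of those would only change the table, not the lookup.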
  • Another embodiment of the network service fault processing method in the embodiment of the present invention includes:
  • the system management module receives the network service fault association request information sent by the SMS.
  • the system management module queries the fault information in the management scope of the system management module according to the network service fault association request information.
  • After receiving the network service fault association request information, the system management module queries the fault information within its management scope according to that request information.
  • the system management module converts the fault information into a network service fault association response. After the system management module queries the fault information, the fault information is converted into a network service fault association response.
  • the system management module sends a network service fault association response to the SMS.
  • After obtaining the network service fault association response, the system management module sends it to the SMS, which uses it to perform fault analysis on the network service.
  • the system management module receives a network service fault diagnosis report sent by the SMS.
  • The system management module receives the network service fault diagnosis report sent by the SMS. The SMS obtains this report by performing fault diagnosis according to the network service fault association response and the running performance data of the virtual network function (VNF) entity.
  • the system management module performs fault recovery according to the network service fault diagnosis report.
  • After obtaining the network service fault diagnosis report, the system management module performs fault recovery according to it.
  • In this embodiment, the system management module queries the fault information according to the received network service fault association request information, converts the fault information into a network service fault association response, and sends the response to the SMS. It then performs fault recovery according to the network service fault diagnosis report sent by the SMS, so that network service faults are handled in the NFV environment. When a network service fails, the fault can be found and handled quickly, which ensures the stability of the NFV system and improves its operating efficiency.
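The query-and-convert step on the system management module side might look like the sketch below. The function name, the dictionary keys, and the sample records are assumptions for illustration; the text only states that the response carries a network service identifier, fault information identifying the faulty entity, and optionally a fault type.

```python
def build_association_response(request, faults_in_scope):
    """Convert raw fault records into a fault association response.

    request: dict holding the network service identifier.
    faults_in_scope: list of (service_id, faulty_entity, fault_type)
        tuples standing in for whatever the module can see within its
        management scope (a hypothetical data source).
    """
    service_id = request["service_id"]
    matched = [
        {"fault_info": entity, "fault_type": fault_type}
        for sid, entity, fault_type in faults_in_scope
        if sid == service_id
    ]
    return {"service_id": service_id, "faults": matched}


# Invented sample records for two services.
faults = [
    ("ns-1", "GW2", "service interruption"),
    ("ns-2", "GW7", "overload"),
]
response = build_association_response({"service_id": "ns-1"}, faults)
print(response)
```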
  • In practical applications, the system management module may be a network management system (NMS), a virtual network function manager (VNFM), or a combination of the NMS and the VNFM.
  • NMS network management system
  • VNFM virtual network function manager
  • another embodiment of the network service fault processing method in the embodiment of the present invention includes:
  • the network management system receives the network service fault association request information sent by the SMS.
  • the SMS sends the network service fault association request information to the NMS
  • the NMS receives the network service fault association request information sent by the SMS.
  • the NMS queries the fault information of the device managed by the NMS according to the network service fault association request information.
  • After receiving the network service fault association request information, the NMS queries the fault information of the devices it manages according to that request information.
  • The NMS converts the fault information into a network service fault association response.
  • After the NMS queries the fault information, it converts the fault information into a network service fault association response.
  • The NMS sends the network service fault association response to the SMS.
  • After obtaining the network service fault association response, the NMS sends it to the SMS.
  • The network service fault association response is used by the SMS to perform fault analysis on the network service.
  • the NMS receives a network service fault diagnosis report sent by the SMS.
  • The NMS receives the network service fault diagnosis report sent by the SMS. The SMS obtains this report by performing fault diagnosis according to the network service fault association response and the running performance data of the virtual network function (VNF) entity.
  • The NMS performs fault recovery according to the network service fault diagnosis report.
  • After obtaining the network service fault diagnosis report, the NMS performs fault recovery according to it.
  • The NMS can recover from a fault in various ways, for example by restarting the device or migrating the service. This is not limited here.
  • After the NMS recovers from the fault, it can feed back a fault recovery notification to the SMS to notify the SMS that the fault has been recovered.
  • After the SMS receives the fault recovery notification and detects that the fault has been recovered, it can send a network service fault recovery notification to the NMS to notify the NMS that the fault has been recovered.
  • In this embodiment, when the system management module is the NMS, the NMS can query the fault information of the devices it manages after receiving the network service fault association request information, and then convert the fault information into a network service fault association response and send it to the SMS.
  • Using the network management system (NMS) to query device faults in this way speeds up the fault query, makes the fault query result more accurate, and improves the efficiency of fault processing.
  • Referring to FIG. 13, another embodiment of the network service fault processing method in the embodiments of the present invention includes:
  • The virtual network function manager (VNFM) receives the network service fault association request information sent by the SMS;
  • The VNFM receives the network service fault association request information sent by the SMS.
  • The VNFM queries, according to the network service fault association request information, the fault information of the VNF associated with the network service that the request information refers to;
  • After receiving the network service fault association request information, the VNFM queries the fault information of the VNF associated with the network service that the request information refers to.
  • The VNFM queries, through the virtualized infrastructure manager (VIM), the fault information of the network functions virtualization infrastructure (NFVI) associated with the network service;
  • The VNFM may send the network service fault association request information to the VIM, so that the VIM queries the NFVI fault information associated with the network service.
  • The VNFM converts the fault information of the VNF and the fault information of the NFVI into a network service fault association response;
  • After the VNFM queries the fault information of the VNF and of the NFVI, it converts that fault information into a network service fault association response.
  • the VNFM sends a network service fault association response to the SMS;
  • After obtaining the network service fault association response, the VNFM sends it to the SMS, which uses it to perform fault analysis on the network service.
  • The VNFM receives a network service fault diagnosis report sent by the SMS.
  • The VNFM receives the network service fault diagnosis report sent by the SMS. The SMS obtains this report by performing fault diagnosis according to the network service fault association response and the running performance data of the virtual network function (VNF) entity.
  • The VNFM performs fault recovery according to the network service fault diagnosis report.
  • After the VNFM obtains the network service fault diagnosis report, it performs fault recovery according to it.
  • The VNFM can parse the root-cause fault and the root-cause fault processing decision in the network service fault diagnosis report. If the fault occurs in a VNF, the VNF can be restarted, migrated, or re-instantiated; if the fault occurs in the NFVI, the NFVI can be restarted or migrated, and so on, which is not limited here.
  • After the VNFM recovers from the fault, it can feed back a fault recovery notification to the SMS to notify the SMS that the fault has been recovered.
  • After the SMS receives the fault recovery notification and detects that the fault has been recovered, it can send a network service fault recovery notification to the VNFM to notify the VNFM that the fault has been recovered.
  • In this embodiment, when the system management module is the VNFM, the VNFM can query the VNF fault information after receiving the network service fault association request information, query the NFVI fault information through the VIM, and then convert the VNF fault information and the NFVI fault information into a network service fault association response. Querying VNF and NFVI faults through the VNFM in this way obtains the fault information faster than other ways of querying those faults and improves the efficiency of the fault query.
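The two-source query performed by the VNFM can be sketched as follows. The function and key names are assumptions, and the two query callables are stubs: the text does not describe the actual VNFM or VIM interfaces, so nothing here should be read as a real API.

```python
def vnfm_collect_faults(service_id, query_vnf_faults, query_nfvi_faults_via_vim):
    """Merge VNF faults and NFVI faults into one association response."""
    vnf_faults = query_vnf_faults(service_id)            # VNFM's own view
    nfvi_faults = query_nfvi_faults_via_vim(service_id)  # delegated to the VIM
    return {
        "service_id": service_id,
        "vnf_faults": vnf_faults,
        "nfvi_faults": nfvi_faults,
    }


# Stub data sources; the values follow the VNF-4 example used in the text.
def stub_vnf_query(service_id):
    return [("VNF-2", "normal", "overload"), ("VNF-4", "normal", "overload")]


def stub_vim_query(service_id):
    return [("NFVI", "fault", "throughput 0")]


response = vnfm_collect_faults("VNF-4", stub_vnf_query, stub_vim_query)
print(response)
```

Passing the queries in as callables keeps the sketch independent of how a given deployment reaches the VIM.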
  • It can be understood that, in addition to being the NMS or the VNFM alone, the system management module may be a combination of the two. When the system management module includes both the NMS and the VNFM, the SMS may send the network service fault association request information to the NMS and the VNFM at the same time.
  • For ease of understanding, the network service fault processing method in the embodiments of the present invention is described below in a specific application scenario:
  • The VNFM receives the network service fault association request information sent by the SMS, which indicates that VNF-4 is overloaded;
  • The VNFM queries the fault information of the VNFs associated with VNF-4 according to the network service fault association request information, and obtains: VNF-2 normal, service performance overloaded; VNF-4 normal, service performance overloaded;
  • The VNFM queries, through the VIM, the fault information of the NFVI associated with VNF-4, and obtains: NFVI faulty, throughput 0;
  • The VNFM converts the VNF fault information (VNF-2 normal, service performance overloaded; VNF-4 normal, service performance overloaded) and the NFVI fault information (NFVI faulty, throughput 0) into a network service fault association response;
  • The VNFM sends the network service fault association response to the SMS;
  • The VNFM receives the network service fault diagnosis report sent by the SMS.
  • The network service fault diagnosis report states that the root-cause fault is the NFVI, that the root-cause fault processing decision is to restart the NFVI, and that the pseudo faults are VNF-1 and VNF-4;
  • The VNFM restarts the NFVI according to the network service fault diagnosis report.
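The final recovery step, in which the decision in the report selects an action, can be sketched as a simple dispatch. The `recover` function, the action table, and the stub actions are assumptions for illustration; real restart or migration would go through management interfaces that the text does not describe.

```python
def recover(report, actions):
    """Run the recovery action named by the root-cause fault decision."""
    decision = report["root_fault_decision"]
    if decision not in actions:
        raise ValueError(f"no recovery action for decision {decision!r}")
    return actions[decision](report["root_fault"])


performed = []  # records what the stub actions were asked to do


def restart(target):
    performed.append(("restart", target))
    return f"restarted {target}"


def migrate(target):
    performed.append(("migrate", target))
    return f"migrated {target}"


actions = {"restart": restart, "migrate": migrate}
# Report contents follow the NFVI scenario in the text.
report = {"root_fault": "NFVI", "root_fault_decision": "restart"}
result = recover(report, actions)
print(result)
```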
  • the disclosed system, apparatus, and method may be implemented in other manners.
  • the device embodiments described above are merely illustrative.
  • the division of the unit is only a logical function division.
  • In actual implementation there may be other ways of dividing them; for example, multiple units or components may be combined or integrated into another system, or some features may be ignored or not executed.
  • the mutual coupling or direct coupling or communication connection shown or discussed may be an indirect coupling or communication connection through some interface, device or unit, and may be in an electrical, mechanical or other form.
  • The components displayed as units may or may not be physical units; that is, they may be located in one place or distributed across multiple network units. Some or all of the units may be selected according to actual needs to achieve the purpose of the solution of the embodiment.
  • each functional unit in each embodiment of the present invention may be integrated into one processing unit, or each unit may exist physically separately, or two or more units may be integrated into one unit.
  • the above integrated unit can be implemented in the form of hardware or in the form of a software functional unit.
  • the integrated unit if implemented in the form of a software functional unit and sold or used as a standalone product, may be stored in a computer readable storage medium.
  • The part of the technical solution of the present invention that contributes to the prior art, or all or part of the technical solution, may be embodied in the form of a software product stored in a storage medium, including a number of instructions for causing a computer device (which may be a personal computer, a server, or the like) to perform all or part of the steps of the methods in the embodiments. The storage medium includes various media that can store program code, such as a USB flash drive, a removable hard disk, a read-only memory (ROM), a random access memory (RAM), a magnetic disk, or an optical disc.

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Data Exchanges In Wide-Area Networks (AREA)

Abstract

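Embodiments of the present invention disclose a network service fault processing method, a service management system, and a system management module, which are used to handle network service faults in an NFV environment and keep the NFV system stable. The method in the embodiments includes: after a service management system (SMS) detects that the running performance data of a VNF is abnormal, it sends network service fault association request information to a system management module, where the request information is used by the system management module to query faults and feed back a network service fault association response to the SMS; the SMS performs fault diagnosis according to the network service fault association response and the running performance data of the VNF to obtain a network service fault diagnosis report, and sends the report to the system management module, which uses it for fault recovery. Network service fault handling is thereby implemented in the NFV environment.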

Description

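Network service fault processing method, service management system, and system management module

Technical Field

The present invention relates to the field of communications, and in particular to a network service fault processing method, a service management system, and a system management module.

Background

Network Function Virtualization (NFV) was initiated by 13 major telecom operators worldwide, with many equipment vendors and IT vendors participating. It aims to define the requirements for virtualizing operator network functions and the related technical reports, and hopes to draw on virtualization technology from information technology (IT) to implement some network functions in software on general-purpose, high-performance, high-capacity servers, switches, and storage. In this way, various types of network equipment, such as servers, routers, storage devices (CDN), and switches, can have their software separated from their hardware through NFV technology, so that they can be deployed in data centers, at network nodes, or in users' homes.
NFV can completely separate the software that defines network functions from general-purpose high-performance hardware servers, storage, and network switches, giving software and hardware components independent, modular characteristics, and allows software devices to be installed and managed remotely and fully automatically on general-purpose hardware.
However, there is currently no method for managing network service faults in an NFV environment. Fault management based on traditional physical network elements is not suited to the NFV virtual environment, so when a network service fails in an NFV environment, the fault cannot be handled in time to keep the NFV system stable.

Summary

Embodiments of the present invention provide a network service fault processing method, a service management system, and a system management module, which are used to handle network service faults in an NFV environment and keep the NFV system stable.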
本发明实施例第一方面提供了一种服务管理系统, 包括:
监测模块, 用于监测虚拟网络功能实体 VNF的运行性能数据;
判断模块, 用于根据所述监测模块监测到的 VNF的运行性能数据, 判断 网络服务性能是否异常;
信息请求模块, 用于当所述判断模块判定网络服务性能异常时,发送网络 服务故障关联请求信息给系统管理模块,所述网络服务故障关联请求信息用于 所述系统管理模块查询故障并反馈网络服务故障关联响应给故障诊断模块; 故障诊断模块,用于根据网络服务故障关联响应与所述监测模块监测到的 VNF的运行性能数据进行故障诊断, 获得网络服务故障诊断报告;
报告发送模块,用于将所述故障诊断模块得到的网络服务故障诊断报告发 送给所述系统管理模块,所述网络服务故障诊断报告用于所述系统管理模块进 行故障恢复。
结合本发明实施例的第一方面,本发明实施例的第一方面的第一种实现方 式中, 所述监测模块具体用于: 周期性的接收网元管理系统 EMS发送的 VNF 的运行性能数据。
结合本发明实施例的第一方面的第一种实现方式,本发明实施例的第一方 面的第二种实现方式中, 所述系统还包括:
配置接收模块, 用于接收编排器 Orchestrator发送的配置参数数值范围; 所述判断模块具体包括:
判断单元, 用于判断所述 VNF的运行性能数据是否在所述配置接收模块 接收的配置参数数值范围内;
正常确定单元, 用于当所述判断单元判定所述 VNF的运行性能数据在所 述配置参数数值范围内时, 确定所述网络服务性能正常;
异常确定单元, 用于当所述判断单元判定所述 VNF的运行性能数据不在 所述配置参数数值范围内时, 确定所述网络服务性能异常。
结合本发明实施例的第一方面至第一方面的第二种实现方式中任一种实 现方式, 本发明实施例的第一方面的第三种实现方式中, 所述故障诊断模块具 体包括:
故障确定单元, 用于结合所述监测模块监测到的 VNF的运行性能数据及 所述网络服务故障关联响应,确定根源故障与伪故障,所述伪故障为能随着所 述根源故障的恢复而恢复的故障; 析, 得到处理策略;
报告生成单元,用于按照所述根源故障与伪故障和所述故障分析单元分析 出的处理策略生成所述网络服务故障诊断报告。
结合本发明实施例的第一方面的第三种实现方式,本发明实施例的第一方 面的第四种实现方式中, 所述故障确定单元具体用于, 结合所述监测模块监测 到的 VNF的运行性能数据中的异常数据及所述网络服务故障关联响应中的故 障信息, 查找数据故障信息列表, 得到对应的根源故障与伪故障, 所述伪故障 为能随着所述根源故障的恢复而恢复的故障。
结合本发明实施例的第一方面的第四种实现方式,本发明实施例的第一方 面的第五种实现方式中, 其特征在于,
所述服务管理系统部署在所述 Orchestrator中;
或,
所述服务管理系统部署在所述系统管理模块中;
或,
所述服务管理系统部署在所述 EMS中。
本发明实施例第二方面提供了一种系统管理模块, 包括:
请求接收单元, 用于接收服务管理系统 SMS发送的网络服务故障关联请 求信息;
故障查询单元,用于按照所述请求接收单元接收的网络服务故障关联请求 信息, 查询管理范围内的故障信息;
信息转换单元,用于转换所述故障查询单元查询到的故障信息为网络服务 故障关联响应;
发送单元,用于发送所述信息转换单元转换出的网络服务故障关联响应给 所述 SMS;
报告接收单元, 用于接收所述 SMS发送的网络服务故障诊断报告, 所述 网络服务故障诊断报告是 SMS根据所述发送单元发送的网络服务故障关联响 应与虚拟网络功能实体 VNF的运行性能数据进行故障诊断后获得;
故障恢复单元,用于根据所述报告接收单元接收的网络服务故障诊断报告 进行故障恢复。
结合本发明实施例的第二方面,本发明实施例的第二方面的第一种实现方 式中, 当所述系统管理模块为网络管理系统 NMS时, 所述故障查询单元具体 用于,按照所述请求接收单元接收的网络服务故障关联请求信息, 查询管理的 设备的故障信息。
结合本发明实施例的第二方面,本发明实施例的第二方面的第二种实现方 式中, 当所述系统管理模块为虚拟网络功能管理器 VNFM时,
所述故障查询单元具体包括:
第一查询子单元,用于按照所述请求接收单元接收的网络服务故障关联请 求信息, 查询与所述网络服务故障关联请求信息请求的网络服务相关联的 VNF的故障信息;
第二查询子单元,用于按照所述请求接收单元接收的网络服务故障关联请 求信息, 通过虚拟化基础设施管理器 VIM查询与所述网络服务相关联的网络 功能虚拟化基础设施 NFVI的故障信息;
所述信息转换单元具体用于: 将所述 VNF的故障信息和所述 NFVI的故 障信息转换为网络服务故障关联响应。
本发明实施例第三方面提供了一种服务管理系统, 包括:
输入装置、 输出装置、 处理器、 存储器和总线;
所述处理器执行如下操作:
监测虚拟网络功能实体 VNF的运行性能数据;
根据所述 VNF的运行性能数据, 判断网络服务性能是否异常;
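when the network service performance is abnormal, sending network service fault association request information to the system management module, where the network service fault association request information is used by the system management module to query faults and feed back a network service fault association response;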
根据所述网络服务故障关联响应与所述 VNF的运行性能数据进行故障诊 断, 获得网络服务故障诊断报告;
将所述网络服务故障诊断报告发送给所述系统管理模块,所述网络服务故 障诊断报告用于所述系统管理模块进行故障恢复。
结合本发明实施例的第三方面,本发明实施例的第三方面的第一种实现方 式中, 所述处理器具体执行如下操作: 周期性的接收网元管理系统 EMS发送 的 VNF的运行性能数据。
结合本发明实施例的第三方面的第一种实现方式,本发明实施例的第三方 面的第二种实现方式中,
所述处理器还执行如下操作:
接收编排器 Orchestrator发送的配置参数数值范围;
所述处理器具体执行如下操作:
判断所述 VNF的运行性能数据是否在所述配置参数数值范围内; 当所述 VNF的运行性能数据在所述配置参数数值范围内时, 确定所述网 络服务性能正常;
当所述 VNF的运行性能数据不在所述配置参数数值范围内时, 确定所述 网络服务性能异常。
结合本发明实施例的第三方面至第三方面的第二种实现方式中任一种实 现方式, 本发明实施例的第三方面的第三种实现方式中, 所述处理器具体执行 如下操作:
结合所述 VNF的运行性能数据及所述网络服务故障关联响应, 确定根源 故障与伪故障, 所述伪故障为能随着所述根源故障的恢复而恢复的故障; 对所述根源故障与伪故障进行分析, 得到处理策略;
按照所述根源故障与伪故障和所述处理策略生成所述网络服务故障诊断 报告。
结合本发明实施例的第三方面的第三种实现方式,本发明实施例的第三方 面的第四种实现方式中, 所述处理器具体执行如下操作:
结合所述 VNF的运行性能数据中的异常数据及所述网络服务故障关联响 应中的故障信息, 查找数据故障信息列表, 得到对应的根源故障与伪故障。
本发明实施例第四方面提供了一种系统管理模块, 包括:
输入装置、 输出装置、 处理器、 存储器和总线;
所述处理器执行如下操作:
接收服务管理系统 SMS发送的网络服务故障关联请求信息;
按照所述网络服务故障关联请求信息, 查询管理范围内的故障信息; 转换所述故障信息为网络服务故障关联响应;
发送所述网络服务故障关联响应给所述 SMS;
接收 SMS发送的网络服务故障诊断报告, 所述网络服务故障诊断报告是 SMS根据所述网络服务故障关联响应与虚拟网络功能实体 VNF的运行性能数 据进行故障诊断后获得;
根据所述网络服务故障诊断报告进行故障恢复。
结合本发明实施例的第四方面,本发明实施例的第四方面的第一种实现方 式中, 当所述系统管理模块为网络管理系统 NMS时, 所述处理器具体执行如 下操作:
查询管理的设备的故障信息。
结合本发明实施例的第四方面,本发明实施例的第四方面的第二种实现方 式中, 当所述系统管理模块为虚拟网络功能管理器 VNFM时,
所述处理器具体执行如下操作:
查询与所述网络服务故障关联请求信息请求的网络服务相关联的 VNF的 故障信息;
通过虚拟化基础设施管理器 VIM查询与所述网络服务相关联的网络功能 虚拟化基础设施 NFVI的故障信息;
将所述 VNF的故障信息和所述 NFVI的故障信息转换为网络服务故障关 联响应。
本发明实施例第五方面提供了一种网络服务故障处理方法包括:
服务管理系统 SMS监测虚拟网络功能实体 VNF的运行性能数据; 所述 SMS根据所述 VNF的运行性能数据, 判断网络服务性能是否异常; 若网络服务性能异常, 则所述 SMS发送网络服务故障关联请求信息给系 统管理模块,所述网络服务故障关联请求信息用于所述系统管理模块查询故障 并反馈网络服务故障关联响应给所述 SMS;
所述 SMS根据所述网络服务故障关联响应与所述 VNF的运行性能数据进 行故障诊断, 获得网络服务故障诊断报告;
所述 SMS将所述网络服务故障诊断报告发送给所述系统管理模块, 所述 网络服务故障诊断报告用于所述系统管理模块进行故障恢复。
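With reference to the fifth aspect of the embodiments of the present invention, in a first implementation of the fifth aspect, the monitoring, by the service management system SMS, of the running performance data of the virtual network function VNF includes: the SMS periodically receives the running performance data of the VNF sent by the element management system (EMS).
With reference to the first implementation of the fifth aspect, in a second implementation of the fifth aspect, the running performance data includes:
throughput, latency, or bandwidth.
With reference to the first implementation of the fifth aspect, in a third implementation of the fifth aspect, before the step in which the SMS determines, according to the running performance data of the VNF, whether the network service performance is abnormal, the method includes:
the SMS receives the configuration parameter value range sent by the Orchestrator;
the determining, by the SMS according to the running performance data of the VNF, whether the network service performance is abnormal includes:
the SMS determines whether the running performance data of the VNF is within the configuration parameter value range;
if so, the network service performance is determined to be normal;
if not, the network service performance is determined to be abnormal.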
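The range check described in this implementation can be sketched in a few lines. This is a minimal illustration, assuming the configuration parameter value range is delivered as a (low, high) pair per metric; the metric names and units below are invented, since the text names only throughput, latency, and bandwidth without fixing a format.

```python
def is_service_abnormal(perf_data, param_ranges):
    """Return True if any monitored metric falls outside its configured range."""
    for metric, value in perf_data.items():
        if metric not in param_ranges:
            continue  # no configured range: nothing to compare against
        low, high = param_ranges[metric]
        if not (low <= value <= high):
            return True
    return False


# Hypothetical ranges as they might be received from the Orchestrator.
ranges = {"throughput_mbps": (100, 1000), "latency_ms": (0, 50)}
normal = is_service_abnormal({"throughput_mbps": 400, "latency_ms": 12}, ranges)
abnormal = is_service_abnormal({"throughput_mbps": 0, "latency_ms": 12}, ranges)
print(normal, abnormal)
```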
结合本发明实施例的第五方面,本发明实施例的第五方面的第四种实现方 式中,所述 SMS根据所述网络服务故障关联响应与所述 VNF的运行性能数据 进行故障诊断, 获得网络服务故障诊断报告包括:
所述 SMS结合所述 VNF的运行性能数据及所述网络服务故障关联响应, 确定根源故障与伪故障,所述伪故障为能随着所述根源故障的恢复而恢复的故 障;
所述 SMS对所述根源故障与伪故障进行分析, 得到处理策略;
所述 SMS按照所述根源故障与伪故障和所述处理策略生成所述网络服务 故障诊断报告。
结合本发明实施例的第五方面的第四种实现方式,本发明实施例的第五方 面的第五种实现方式中,所述 SMS结合所述 VNF的运行性能数据及所述网络 服务故障关联响应, 确定根源故障与伪故障包括:
所述 SMS结合所述 VNF的运行性能数据中的异常数据及所述网络服务故 障关联响应中的故障信息, 查找数据故障信息列表,得到对应的根源故障与伪 故障。
结合本发明实施例的第五方面至第五方面的第五种实现方式中任一种实 现方式, 本发明实施例的第五方面的第六种实现方式中, 所述网络服务故障关 联请求信息包括:
网络服务标识和网络服务性能异常数据,所述网络服务标识用于在网络中 唯一标识一个网络服务,所述网络服务性能异常数据用于表示网络服务某一项 或几项性能异常的统计数据。
结合本发明实施例的第五方面的第六种实现方式,本发明实施例的第五方 面的第七种实现方式中, 所述网络服务故障关联响应包括:
网络服务标识,故障信息和故障类型,所述故障信息用于标识故障发生体, 所述故障类型用于标识故障的类型。
结合本发明实施例的第五方面的第七种实现方式,本发明实施例的第五方 面的第八种实现方式中, 所述网络服务故障诊断报告包括:
网络服务标识, 根源故障, 根源故障处理决策, 所述根源故障用于表示网 络服务异常的根源, 所述根源故障处理决策用于表示处理根源故障的策略。
结合本发明实施例的第五方面至第五方面的第五种实现方式中任一种实 现方式,本发明实施例的第五方面的第九种实现方式中,所述系统管理模块为: 网络管理系统 NMS, 和 /或, 虚拟网络功能管理器 VNFM。
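A sixth aspect of the embodiments of the present invention provides a network service fault processing method, including:
a system management module receives network service fault association request information sent by a service management system (SMS);
the system management module queries, according to the network service fault association request information, the fault information within the management scope of the system management module;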
所述系统管理模块转换所述故障信息为网络服务故障关联响应;
所述系统管理模块发送所述网络服务故障关联响应给所述 SMS; 所述系统管理模块接收 SMS发送的网络服务故障诊断报告, 所述网络服 务故障诊断报告是 SMS根据所述网络服务故障关联响应与虚拟网络功能实体 VNF的运行性能数据进行故障诊断后获得;
所述系统管理模块根据所述网络服务故障诊断报告进行故障恢复。
结合本发明实施例的第六方面,本发明实施例的第六方面的第一种实现方 式中, 当所述系统管理模块为网络管理系统 NMS时, 所述查询所述系统管理 模块管理范围内的故障信息包括:
所述 NMS查询所述 NMS管理的设备的故障信息。
结合本发明实施例的第六方面,本发明实施例的第六方面的第二种实现方 式中, 当所述系统管理模块为虚拟网络功能管理器 VNFM时,
所述查询所述系统管理模块管理范围内的故障信息包括:
所述 VNFM查询与所述网络服务故障关联请求信息请求的网络服务相关 联的 VNF的故障信息;
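the VNFM queries, through the virtualized infrastructure manager (VIM), the fault information of the network functions virtualization infrastructure (NFVI) associated with the network service;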
所述系统管理模块转换所述故障信息为网络服务故障关联响应包括: 所述 VNFM将所述 VNF的故障信息和所述 NFVI的故障信息转换为网络 服务故障关联响应。
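As can be seen from the above technical solutions, the embodiments of the present invention have the following advantages: after the service management system (SMS) detects that the running performance data of a VNF is abnormal, it sends network service fault association request information to the system management module; the system management module queries faults and feeds back a network service fault association response; the SMS then performs fault diagnosis according to the network service fault association response and the running performance data of the VNF, obtains a network service fault diagnosis report, and sends the report to the system management module, which can perform fault recovery according to it. Network service fault handling is thereby implemented in the NFV environment: when a network service fails, the fault can be found and resolved quickly, which ensures the stability of the NFV system and improves its operating efficiency.

Brief Description of the Drawings

FIG. 1 is a schematic structural diagram of a service management system in an embodiment of the present invention;
FIG. 2 is another schematic structural diagram of the service management system in an embodiment of the present invention;
FIG. 3 is another schematic structural diagram of the service management system in an embodiment of the present invention;
FIG. 4 is another schematic structural diagram of the service management system in an embodiment of the present invention;
FIG. 5 is a schematic structural diagram of a system management module in an embodiment of the present invention;
FIG. 6 is another schematic structural diagram of the system management module in an embodiment of the present invention;
FIG. 7 is another schematic structural diagram of the system management module in an embodiment of the present invention;
FIG. 8 is a schematic flowchart of a network service fault processing method in an embodiment of the present invention;
FIG. 9 is another schematic flowchart of the network service fault processing method in an embodiment of the present invention;
FIG. 10 is another schematic flowchart of the network service fault processing method in an embodiment of the present invention;
FIG. 11 is another schematic flowchart of the network service fault processing method in an embodiment of the present invention;
FIG. 12 is another schematic flowchart of the network service fault processing method in an embodiment of the present invention;
FIG. 13 is another schematic flowchart of the network service fault processing method in an embodiment of the present invention.

Detailed Description

The technical solutions in the embodiments of the present invention are described clearly and completely below with reference to the accompanying drawings. Obviously, the described embodiments are only some rather than all of the embodiments of the present invention. All other embodiments obtained by a person skilled in the art based on the embodiments of the present invention without creative effort fall within the protection scope of the present invention.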
请参阅图 1 , 本发明实施例中服务管理系统一个实施例包括:
监测模块 101 , 用于监测虚拟网络功能实体 VNF的运行性能数据; 判断模块 102, 用于根据所述监测模块 101监测到的 VNF的运行性能数 据, 判断网络服务性能是否异常;
信息请求模块 103, 用于当所述判断模块 102判定网络服务性能异常时, 发送网络服务故障关联请求信息给系统管理模块,所述网络服务故障关联请求 信息用于所述系统管理模块查询故障并反馈网络服务故障关联响应给故障诊 断模块 104;
故障诊断模块 104, 用于根据网络服务故障关联响应与所述 VNF的运行 性能数据进行故障诊断, 获得网络服务故障诊断报告;
报告发送模块 105 , 用于将所述故障诊断模块 104得到的网络服务故障诊 断报告发送给所述系统管理模块,所述网络服务故障诊断报告用于所述系统管 理模块进行故障恢复。
本发明实施例中判断模块 102判定监测模块 101监测到的 VNF的运行性 能数据异常后, 信息请求模块 103 发送网络服务关联请求信息给系统管理模 块, 系统管理模块查询故障并反馈网络服务故障关联响应, 故障诊断模块 104 再根据网络服务故障关联响应和 VNF的运行性能数据, 进行故障诊断, 得到 网络服务故障诊断报告,报告发送模块 105将该网络服务故障诊断报告发送给 系统管理模块, 系统管理模块能根据该网络服务故障诊断报告进行故障恢复, 这样就在 NFV环境下实现了网络服务故障的处理,当有网络服务发生故障时, 能快速的发现并解决故障,保证了 NFV系统的稳定性,提高了 NFV系统的运 行效率。
上面实施例中, 监测模块 101监测 VNF的运行性能数据, 判断模块 102 根据 VNF的运行性能数据判断网络服务性能是否异常, 在实际应用中, 监测 模块 101可以接收网元管理系统 EMS发送的 VNF的运行性能数据, SMS还 可以接收编排器 Orchestrator发送的配置参数数值范围, 判断模块 102根据该 配置参数数值范围与 VNF的运行性能数据来判断网络服务性能是否异常, 下 面对本发明实施例中服务管理系统进行具体描述, 请参阅图 2, 本发明实施例 中服务管理系统另一个实施例包括:
监测模块 201 , 用于监测虚拟网络功能实体 VNF的运行性能数据; 判断模块 202, 用于根据所述监测模块 201监测到的 VNF的运行性能数 据, 判断网络服务性能是否异常;
信息请求模块 203, 用于当所述判断模块 202判定网络服务性能异常时, 发送网络服务故障关联请求信息给系统管理模块,所述网络服务故障关联请求 信息用于所述系统管理模块查询故障并反馈网络服务故障关联响应给故障诊 断模块 204;
故障诊断模块 204, 用于根据网络服务故障关联响应与所述 VNF的运行 性能数据进行故障诊断, 获得网络服务故障诊断报告;
报告发送模块 205 , 用于将所述故障诊断模块 204得到的网络服务故障诊 断报告发送给所述系统管理模块,所述网络服务故障诊断报告用于所述系统管 理模块进行故障恢复;
本实施例中, 监测模块 201具体用于: 周期性的接收网元管理系统 EMS 发送的 VNF的运行性能数据。
The service management system may further include:
配置接收模块 206, 用于接收编排器 Orchestrator发送的配置参数数值范 围; 判断模块 202具体包括:
判断单元 2021 , 用于判断所述 VNF的运行性能数据是否在所述配置接收 模块 206接收的配置参数数值范围内;
正常确定单元 2022, 用于当所述判断单元 2021判定所述 VNF的运行性 能数据在所述配置参数数值范围内时, 确定所述网络服务性能正常;
异常确定单元 2023, 用于当所述判断单元 2021判定所述 VNF的运行性 能数据不在所述配置参数数值范围内时, 确定所述网络服务性能异常。
本发明实施例中, 配置接收模块 206可以先接收配置参数数值范围, 判断 模块 202根据该配置参数数值范围与 VNF的运行性能数据来判断网络服务性 能是否异常, 这样使得对网络服务性能的判定更加准确,且若判定网络服务性 能正常时,触发接收 VNF的运行性能数据的步骤,增强了故障监测的连续性, 保证了故障处理的时效性。
上面实施例中, 故障诊断模块 204根据网络服务故障关联响应与 VNF的 运行性能数据进行故障诊断, 获得网络服务故障诊断报告, 在实际应用中, 故 障诊断模块 204可以先结合网络服务故障关联响应与 VNF的运行性能数据确 定根源故障与伪故障再进行后续处理,下面对本发明实施例中服务管理系统进 行详细描述, 请参阅图 3, 本发明实施例中服务管理系统另一个实施例包括: 监测模块 301 , 用于监测虚拟网络功能实体 VNF的运行性能数据; 判断模块 302, 用于根据监测模块 301监测到的所述 VNF的运行性能数 据, 判断网络服务性能是否异常;
信息请求模块 303, 用于当所述判断模块 302判定网络服务性能异常时, 发送网络服务故障关联请求信息给系统管理模块,所述网络服务故障关联请求 信息用于所述系统管理模块查询故障并反馈网络服务故障关联响应给故障诊 断模块 304;
故障诊断模块 304, 用于根据网络服务故障关联响应与所述 VNF的运行 性能数据进行故障诊断, 获得网络服务故障诊断报告;
报告发送模块 305 , 用于将所述故障诊断模块 304得到的网络服务故障诊 断报告发送给所述系统管理模块,所述网络服务故障诊断报告用于所述系统管 理模块进行故障恢复; 监测模块 301具体用于: 周期性的接收网元管理系统 EMS发送的 VNF 的运行性能数据。
The service management system may further include:
配置接收模块 306, 用于接收编排器 Orchestrator发送的配置参数数值范 围;
判断模块 302具体包括:
判断单元 3021 , 用于判断所述 VNF的运行性能数据是否在所述配置接收 模块 306接收的配置参数数值范围内;
正常确定单元 3022, 用于当所述判断单元 3021判定所述 VNF的运行性 能数据在所述配置参数数值范围内时, 确定所述网络服务性能正常;
异常确定单元 3023, 用于当所述判断单元 3021判定所述 VNF的运行性 能数据不在所述配置参数数值范围内时, 确定所述网络服务性能异常;
本实施例中, 该故障诊断模块 304具体包括:
故障确定单元 3041 , 用于结合所述监测模块监测到的 VNF的运行性能数 据及所述网络服务故障关联响应,确定根源故障与伪故障, 所述伪故障为能随 着所述根源故障的恢复而恢复的故障;
故障分析单元 3042, 用于对所述故障确定单元 3041确定的根源故障与伪 故障进行分析, 得到处理策略;
报告生成单元 3043, 用于按照所述根源故障与伪故障和所述故障分析单 元 3042分析出的处理策略生成所述网络服务故障诊断报告;
该故障确定单元 3041 具体可以用于, 结合所述监测模块 301 监测到的 VNF 的运行性能数据中的异常数据及所述网络服务故障关联响应中的故障信 息, 查找数据故障信息列表, 得到对应的根源故障与伪故障, 所述伪故障为能 随着所述根源故障的恢复而恢复的故障。
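The network service fault association request information includes a network service identifier, which uniquely identifies a network service within the network. It may also include network service performance abnormality data, which represents statistics on one or more abnormal performance items of the network service. It may further include topology information of the network service or the time at which the network service abnormality occurred: the topology information represents all network devices or network functions contained in the network service, and the abnormality occurrence time indicates when the fault occurred. Other data related to the network service may also be included, which is not limited here.
The network service fault association response includes a network service identifier and fault information. The fault information identifies the faulty entity, making it possible to distinguish which part within the management scope of the system management module has failed. The response may also include a fault type, which identifies the type of fault, for example overload, service interruption, or power loss. It may further include other data related to the network service or the fault, such as topology information of the network service, network service performance abnormality data, or the time at which the network service abnormality occurred, which is not limited here.
The network service fault diagnosis report includes a network service identifier, a root-cause fault, and a root-cause fault processing decision. The root-cause fault indicates the root cause of the network service abnormality, and the root-cause fault processing decision indicates the policy for handling the root-cause fault, such as restarting or migrating. The report may also include a root-cause fault type, a pseudo fault, a pseudo fault type, or a pseudo fault processing decision, as well as other data related to the network service or the fault, such as topology information of the network service, network service performance abnormality data, and the time at which the network service abnormality occurred, which is not limited here.
In this embodiment of the present invention, the fault determining unit 3041 first determines the root-cause fault and the pseudo fault by combining the network service fault association response with the running performance data of the VNF; the fault analysis unit 3042 then analyzes the root-cause fault and the pseudo fault to obtain a processing strategy; and the report generating unit 3043 generates a network service fault diagnosis report according to the root-cause fault, the pseudo fault, and the processing strategy, which is sent to the system management module. This makes the analysis of the network service fault more accurate and improves the accuracy of fault processing.
It can be understood that the service management system (SMS) can be deployed in different management systems: for example, in the element management system (EMS), in the Orchestrator, or in the NMS or the VNFM within the system management module, which is not limited here. When the SMS is deployed in different management systems, the paths for receiving or sending various messages may differ; for example, when the SMS is deployed in the NMS within the system management module, the SMS can transmit messages through the Orchestrator.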
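For ease of understanding, the interaction among the units of the service management system described above is explained below in a specific application scenario:
The configuration receiving module 306 receives the configuration parameter value range sent by the Orchestrator;
The monitoring module 301 receives the running performance data of the VNF sent by the EMS at 10-second intervals;
The judging unit 3021 determines that the service performance of VNF-4 in the running performance data of the VNF exceeds the configuration parameter value range, and the abnormality determining unit 3023 determines that the network service performance of VNF-4 is abnormal;
The information requesting module 303 sends network service fault association request information to the system management module. The request information includes the network identifier of VNF-4 and the network service performance abnormality data, and is used by the system management module to query faults and feed back a network service fault association response to the fault determining unit 3041;
The fault determining unit 3041 combines the running performance data of the VNF with the network service fault association response (which includes: device GW2 faulty, throughput 0; device GW1 normal, service performance overloaded; VNF-4 normal, service performance overloaded) and determines that device GW2 is the root-cause fault and that device GW1 and VNF-4 are pseudo faults;
The fault analysis unit 3042 analyzes the root-cause fault and the pseudo faults, and obtains the processing strategy of restarting device GW2;
The report generating unit 3043 generates a network service fault diagnosis report according to the root-cause fault, the pseudo faults, and the processing strategy. The report states that the root-cause fault is GW2 and that the root-cause fault processing decision is to restart GW2;
The report sending module 305 sends the network service fault diagnosis report to the system management module, which uses it for fault recovery.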
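The service management system in the embodiments of the present invention has been described above from the perspective of unitized functional entities. Referring to FIG. 4, another embodiment of the service management system 400 in the embodiments of the present invention includes:
an input apparatus 401, an output apparatus 402, a processor 403, and a memory 404 (the service management system may have one or more processors 403; one processor 403 is used as an example in FIG. 4). In some embodiments of the present invention, the input apparatus 401, the output apparatus 402, the processor 403, and the memory 404 may be connected by a bus or in other ways; a bus connection is used as an example in FIG. 4.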
其中,
通过调用存储器 404存储的操作指令, 处理器 403, 用于执行如下步骤: 监测虚拟网络功能实体 VNF的运行性能数据;
根据所述 VNF的运行性能数据, 判断网络服务性能是否异常;
当网络服务性能异常时, 发送网络服务故障关联请求信息给系统管理模 块,所述网络服务故障关联请求信息用于所述系统管理模块查询故障并反馈网 络服务故障关联响应;
根据所述网络服务故障关联响应与所述 VNF的运行性能数据进行故障诊 断, 获得网络服务故障诊断报告;
将所述网络服务故障诊断报告发送给所述系统管理模块,所述网络服务故 障诊断报告用于所述系统管理模块进行故障恢复;
本发明的一些实施例中, 该处理器 403具体执行如下操作:
周期性的接收网元管理系统 EMS发送的 VNF的运行性能数据; 本发明的一些实施例中, 该处理器 403具体执行如下操作:
接收编排器 Orchestrator发送的配置参数数值范围;
判断所述 VNF的运行性能数据是否在所述配置参数数值范围内; 当所述 VNF的运行性能数据在所述配置参数数值范围内时, 确定所述网 络服务性能正常;
当所述 VNF的运行性能数据不在所述配置参数数值范围内时, 确定所述 网络服务性能异常;
本发明的一些实施例中, 该处理器 403具体执行如下操作:
结合所述 VNF的运行性能数据及所述网络服务故障关联响应, 确定根源 故障与伪故障, 所述伪故障为能随着所述根源故障的恢复而恢复的故障; 对所述根源故障与伪故障进行分析, 得到处理策略;
按照所述根源故障与伪故障和所述处理策略生成所述网络服务故障诊断 报告;
本发明的一些实施例中, 该处理器 403具体执行如下操作:
结合所述 VNF的运行性能数据中的异常数据及所述网络服务故障关联响 应中的故障信息, 查找数据故障信息列表, 得到对应的根源故障与伪故障。
下面对本发明实施例中的系统管理模块进行描述, 请参阅图 5, 本发明实 施例中系统管理模块一个实施例包括:
请求接收单元 501 , 用于接收服务管理系统 SMS发送的网络服务故障关 联请求信息;
故障查询单元 502, 用于按照所述请求接收单元 501接收的网络服务故障 关联请求信息, 查询管理范围内的故障信息;
信息转换单元 503, 用于转换所述故障查询单元 502查询到的故障信息为 网络服务故障关联响应;
发送单元 504, 用于发送所述信息转换单元 503转换出的网络服务故障关 联响应给所述 SMS;
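a report receiving unit 505, configured to receive the network service fault diagnosis report sent by the SMS, where the report is obtained by the SMS performing fault diagnosis according to the network service fault association response and the running performance data of the virtual network function (VNF) entity;
a fault recovery unit 506, configured to perform fault recovery according to the network service fault diagnosis report received by the report receiving unit 505.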
本发明实施例中,故障查询单元 502按照请求接收单元 501接收到的网络 服务故障关联请求信息查询故障信息,信息转换单元 503将故障信息转换为网 络服务故障关联响应后由发送单元 504发送给 SMS, 然后故障恢复单元 506 根据 SMS发送的网络服务故障诊断报告进行故障恢复,在 NFV环境下实现了 网络服务故障的处理, 当网络服务发生故障时, 能迅速查找故障并对故障进行 处理, 保证了 NFV系统的稳定性, 提高了 NFV系统的运行效率。
在实际应用中, 系统管理模块可以是网络管理系统 NMS, 也可以是虚拟 网络功能管理器 VNFM, 还可以是 NMS与 VNFM的组合, 请参阅图 6, 本发 明实施例中系统管理模块另一个实施例包括:
请求接收单元 601 , 用于接收服务管理系统 SMS发送的网络服务故障关 联请求信息;
故障查询单元 602, 用于按照所述请求接收单元 601接收的网络服务故障 关联请求信息, 查询管理范围内的故障信息;
信息转换单元 603, 用于转换所述故障查询单元 602查询到的故障信息为 网络服务故障关联响应;
发送单元 604, 用于发送所述信息转换单元 603转换出的网络服务故障关 联响应给所述 SMS;
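a report receiving unit 605, configured to receive the network service fault diagnosis report sent by the SMS, where the report is obtained by the SMS performing fault diagnosis according to the network service fault association response and the running performance data of the virtual network function (VNF) entity;
a fault recovery unit 606, configured to perform fault recovery according to the network service fault diagnosis report received by the report receiving unit 605;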
本实施例中,当系统管理模块为网络管理系统 NMS时,故障查询单元 602 具体用于按照所述请求接收单元接收的网络服务故障关联请求信息,查询管理 的设备的故障信息。
当系统管理模块为虚拟网络功能管理器 VNFM时, 故障查询单元 602具 体包括:
第一查询子单元 6021 , 用于按照所述请求接收单元 601接收的网络服务 故障关联请求信息,查询与所述网络服务故障关联请求信息请求的网络服务相 关联的 VNF的故障信息;
第二查询子单元 6022, 用于按照所述请求接收单元 601接收的网络服务 故障关联请求信息, 通过虚拟化基础设施管理器 VIM查询与所述网络服务相 关联的网络功能虚拟化基础设施 NFVI的故障信息;
该信息转换单元 603具体用于, 将所述 VNF的故障信息和所述 NFVI的 故障信息转换为网络服务故障关联响应。
本发明实施例中, 当系统管理模块为 VNFM时, 请求接收单元 601接收 到网络服务故障关联响应信息后,第一查询子单元 6021可以查询 VNF的故障 信息, 第二查询子单元 6022可以通过 VIM查询 NFVI的故障信息, 然后信息 转换单元 603将 VNF的故障信息与 NFVI的故障信息转换为网络服务故障关 联响应,这样通过虚拟网络功能管理器 VNFM对 VNF和 NFVI进行故障查询, 相比于使用其他方式查询 VNF和 NFVI的故障, 能更快的获得故障信息, 提 高了故障查询的效率。
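To make the foregoing embodiment easier to understand, the interaction between the units of the system management module is described below for a specific application scenario:
The request receiving unit 601 receives network service fault association request information sent by the SMS; the request indicates that VNF-4 is overloaded.
The first query subunit 6021 queries, according to the request, the fault information of the VNFs associated with VNF-4 and obtains: VNF-2 normal, service performance overloaded; VNF-4 normal, service performance overloaded.
The second query subunit 6022 queries, through the VIM, the fault information of the NFVI associated with VNF-4 and obtains: NFVI faulty, throughput 0.
The information conversion unit 603 converts the VNF fault information (VNF-2 normal, service performance overloaded; VNF-4 normal, service performance overloaded) and the NFVI fault information (NFVI faulty, throughput 0) into a network service fault association response.
The sending unit 604 sends the network service fault association response to the SMS.
The report receiving unit 605 receives the network service fault diagnosis report sent by the SMS; the report states that the root-cause fault is the NFVI, the root-cause fault handling decision is to restart the NFVI, and the pseudo faults are VNF-2 and VNF-4.
The fault recovery unit 606 restarts the NFVI according to the network service fault diagnosis report.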
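The query-and-convert part of this scenario can be sketched in a few lines of Python. This is only an illustrative sketch under stated assumptions: the function name `build_association_response`, the dictionary layout of the response, the service identifier `ns-1`, and the two stub query functions are hypothetical and do not come from the patent, which does not define a message encoding.

```python
from typing import Callable, Dict

FaultInfo = Dict[str, str]  # entity name -> fault description (assumed layout)


def build_association_response(
    service_id: str,
    query_vnf_faults: Callable[[str], FaultInfo],
    query_nfvi_faults_via_vim: Callable[[str], FaultInfo],
) -> Dict[str, object]:
    """Merge VNF and NFVI fault information into one association response."""
    vnf_faults = query_vnf_faults(service_id)            # role of subunit 6021
    nfvi_faults = query_nfvi_faults_via_vim(service_id)  # role of subunit 6022
    # Role of the information conversion unit 603: one response for the SMS.
    return {"service_id": service_id, "faults": {**vnf_faults, **nfvi_faults}}


def vnf_query_stub(service_id: str) -> FaultInfo:
    # Stand-in for the VNFM's own VNF fault query (values from the scenario).
    return {
        "VNF-2": "normal, service performance overloaded",
        "VNF-4": "normal, service performance overloaded",
    }


def vim_query_stub(service_id: str) -> FaultInfo:
    # Stand-in for the query that is relayed through the VIM.
    return {"NFVI": "fault, throughput 0"}


response = build_association_response("ns-1", vnf_query_stub, vim_query_stub)
```

Passing the two queries in as callables keeps the sketch independent of how a real VNFM or VIM would be reached.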
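The system management module has been described above in terms of functional units; it is described below in terms of hardware processing. Referring to FIG. 7, another embodiment of the system management module 700 includes:
an input apparatus 701, an output apparatus 702, a processor 703, and a memory 704 (the system management module may contain one or more processors 703; FIG. 7 uses one processor 703 as an example). In some embodiments of the present invention, the input apparatus 701, the output apparatus 702, the processor 703, and the memory 704 may be connected by a bus or in another manner; FIG. 7 uses a bus connection as an example.
By invoking the operation instructions stored in the memory 704, the processor 703 is configured to perform the following steps:
receiving network service fault association request information sent by a service management system (SMS);
querying fault information within the management scope according to the network service fault association request information;
converting the fault information into a network service fault association response;
sending the network service fault association response to the SMS;
receiving a network service fault diagnosis report sent by the SMS, where the network service fault diagnosis report is obtained by the SMS by performing fault diagnosis based on the network service fault association response and the operating performance data of a virtualized network function (VNF) entity;
performing fault recovery according to the network service fault diagnosis report.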
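In some embodiments of the present invention, when the system management module is a network management system (NMS), the processor 703 specifically performs the following operation: querying fault information of the managed devices.
In some embodiments of the present invention, when the system management module is a virtualized network function manager (VNFM), the processor 703 specifically performs the following operations:
querying fault information of the VNFs associated with the network service named in the network service fault association request information;
querying, through a virtualized infrastructure manager (VIM), fault information of the network functions virtualization infrastructure (NFVI) associated with the network service;
converting the fault information of the VNF and the fault information of the NFVI into a network service fault association response.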
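The network service fault handling method in the embodiments of the present invention is described below. Referring to FIG. 8, one embodiment of the method includes:
801. The SMS monitors the operating performance data of the VNF.
While the NFV system is running, the service management system (SMS) monitors the operating performance data of the virtualized network function (VNF) entity.
802. The SMS determines, according to the operating performance data of the VNF, whether network service performance is abnormal.
After obtaining the operating performance data of the VNF, the SMS determines from that data whether network service performance is abnormal.
803. If network service performance is abnormal, the SMS sends network service fault association request information to the system management module.
When the SMS determines that network service performance is abnormal, it sends network service fault association request information to the system management module. The system management module uses this request to query faults and to feed a network service fault association response back to the SMS.
804. The SMS performs fault diagnosis according to the network service fault association response and the operating performance data of the VNF, and obtains a network service fault diagnosis report.
After receiving the network service fault association response fed back by the system management module, the SMS performs fault diagnosis according to that response and the operating performance data of the VNF, and obtains a network service fault diagnosis report.
805. The SMS sends the network service fault diagnosis report to the system management module.
After performing fault diagnosis and obtaining the network service fault diagnosis report, the SMS sends the report to the system management module, which uses it to perform fault recovery.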
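In this embodiment of the present invention, after the service management system (SMS) detects that the operating performance data of the VNF is abnormal, it sends network service fault association request information to the system management module; the system management module queries faults and feeds back a network service fault association response; the SMS then performs fault diagnosis according to that response and the operating performance data of the VNF, obtains a network service fault diagnosis report, and sends the report to the system management module, which can perform fault recovery according to it. Network service fault handling is thereby implemented in an NFV environment: when a network service fails, the fault can be found and resolved quickly, which helps keep the NFV system stable and improves its operating efficiency.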
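Steps 801 to 805 can be summarized as a short control flow. The sketch below is an illustration only: the function `handle_network_service_fault`, its five callable parameters, and the dictionary payloads in the example are assumptions made for this sketch, since the patent does not prescribe any programming interface.

```python
from typing import Any, Callable, Optional


def handle_network_service_fault(
    monitor: Callable[[], Any],
    is_abnormal: Callable[[Any], bool],
    request_association: Callable[[Any], Any],
    diagnose: Callable[[Any, Any], Any],
    send_report: Callable[[Any], None],
) -> Optional[Any]:
    """Run one pass of steps 801-805; return the report, or None if healthy."""
    perf = monitor()                        # 801: monitor VNF performance data
    if not is_abnormal(perf):               # 802: judge network service performance
        return None
    response = request_association(perf)    # 803: request/receive association response
    report = diagnose(response, perf)       # 804: diagnose from response + data
    send_report(report)                     # 805: hand the report over for recovery
    return report


# Example with stand-in callables (all values are illustrative).
sent_reports = []
report = handle_network_service_fault(
    monitor=lambda: {"VNF-4": "overloaded"},
    is_abnormal=lambda perf: "overloaded" in perf.values(),
    request_association=lambda perf: {"NFVI": "fault, throughput 0"},
    diagnose=lambda resp, perf: {"root_fault": "NFVI", "decision": "restart"},
    send_report=sent_reports.append,
)
```

When `is_abnormal` returns `False`, the function returns `None` without contacting the system management module, which mirrors the condition in step 803.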
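In the foregoing embodiment, the SMS monitors the operating performance data of the VNF and determines from it whether network service performance is abnormal. In practice, the SMS may receive the operating performance data of the VNF from an element management system (EMS), and may also receive a configuration parameter value range from the Orchestrator and use that range together with the operating performance data to make the determination. The method is described in detail below. Referring to FIG. 9, another embodiment of the network service fault handling method includes: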
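901. The SMS periodically receives the operating performance data of the VNF sent by the element management system (EMS).
While the NFV system is running, the service management system (SMS) periodically receives the operating performance data of the VNF sent by the EMS.
It can be understood that this periodic reception may work in either of two ways: the EMS may periodically and actively monitor the operating performance data of the VNF and send it to the SMS, or the SMS may periodically send a data request to the EMS so that the EMS monitors the operating performance data of the VNF and feeds it back to the SMS. This is not limited here.
The period at which the SMS receives the operating performance data of the VNF can be set as needed, for example one second or one minute. This is not limited here.
The operating performance data may be data throughput, latency, or bandwidth, may be parameters that represent data throughput, latency, or bandwidth, and may also include other parameters that describe network service characteristics. This is not limited here.
902. The SMS receives the configuration parameter value range sent by the Orchestrator.
The SMS receives the configuration parameter value range sent by the Orchestrator. This range is the range of values that the operating performance data of the VNF takes when the network service is running normally.
It can be understood that, in practice, step 902 may be performed at any time before step 903. Furthermore, if the SMS already stores a configuration parameter value range, step 902 may be skipped. This is not limited here.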
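903. The SMS determines whether the operating performance data of the VNF falls within the configuration parameter value range.
After receiving the operating performance data of the VNF and the configuration parameter value range, the SMS determines whether the operating performance data falls within that range.
If the SMS determines that the operating performance data of the VNF does not fall within the configuration parameter value range, it determines that network service performance is abnormal and triggers step 904.
If the SMS determines that the operating performance data of the VNF falls within the configuration parameter value range, it determines that network service performance is normal and may trigger step 901.
In practice, when the SMS determines that the operating performance data of the VNF falls within the configuration parameter value range, it may also pause for a preset time before triggering step 901. This is not limited here.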
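The range check in step 903 can be illustrated with a small Python sketch. It assumes, purely for illustration, that the performance data is a mapping from metric name to a numeric value and that the configuration parameter value range is a closed interval per metric; the names `ParamRange`, `out_of_range`, `service_is_abnormal`, and the metric names and limits are hypothetical.

```python
from dataclasses import dataclass
from typing import Dict


@dataclass(frozen=True)
class ParamRange:
    """Closed interval [low, high] for one metric (assumed representation)."""
    low: float
    high: float

    def contains(self, value: float) -> bool:
        return self.low <= value <= self.high


def out_of_range(perf: Dict[str, float],
                 ranges: Dict[str, ParamRange]) -> Dict[str, float]:
    """Return the metrics whose values fall outside their configured range."""
    return {
        name: value
        for name, value in perf.items()
        if name in ranges and not ranges[name].contains(value)
    }


def service_is_abnormal(perf: Dict[str, float],
                        ranges: Dict[str, ParamRange]) -> bool:
    """Performance is abnormal as soon as any metric is out of range."""
    return bool(out_of_range(perf, ranges))


# Illustrative values only; the patent gives no concrete thresholds.
ranges = {
    "throughput_mbps": ParamRange(100.0, 1000.0),
    "latency_ms": ParamRange(0.0, 50.0),
}
normal_sample = {"throughput_mbps": 400.0, "latency_ms": 12.0}
overloaded_sample = {"throughput_mbps": 40.0, "latency_ms": 180.0}
```

Returning the offending metrics from `out_of_range`, rather than only a boolean, gives the later steps the abnormal data that the request information may carry.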
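904. The SMS sends network service fault association request information to the system management module.
When the SMS determines that the operating performance data of the VNF does not fall within the configuration parameter value range, and therefore that network service performance is abnormal, it sends network service fault association request information to the system management module. The system management module uses this request to query faults and to feed a network service fault association response back to the SMS.
905. The SMS performs fault diagnosis according to the network service fault association response and the operating performance data of the VNF, and obtains a network service fault diagnosis report.
After receiving the network service fault association response fed back by the system management module, the SMS performs fault diagnosis according to that response and the operating performance data of the VNF, and obtains a network service fault diagnosis report.
906. The SMS sends the network service fault diagnosis report to the system management module.
After performing fault diagnosis and obtaining the network service fault diagnosis report, the SMS sends the report to the system management module, which uses it to perform fault recovery.
In this embodiment of the present invention, the SMS may first receive the configuration parameter value range and then judge whether network service performance is abnormal from that range and the operating performance data of the VNF, which makes the judgment more accurate. When network service performance is judged normal, the step of receiving the operating performance data of the VNF is triggered again, which keeps fault monitoring continuous and fault handling timely.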
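In the foregoing embodiment, the SMS performs fault diagnosis according to the network service fault association response and the operating performance data of the VNF, and obtains a network service fault diagnosis report. In practice, the SMS may first combine the response with the operating performance data to determine the root-cause fault and the pseudo faults, and then carry out the subsequent processing. The method is described in detail below. Referring to FIG. 10, another embodiment of the network service fault handling method includes: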
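1001. The SMS periodically receives the operating performance data of the VNF sent by the element management system (EMS).
While the NFV system is running, the service management system (SMS) periodically receives the operating performance data of the VNF sent by the EMS.
It can be understood that this periodic reception may work in either of two ways: the EMS may periodically and actively monitor the operating performance data of the VNF and send it to the SMS, or the SMS may periodically send a data request to the EMS so that the EMS monitors the operating performance data of the VNF and feeds it back to the SMS. This is not limited here.
The period at which the SMS receives the operating performance data of the VNF can be set as needed, for example one second or one minute. This is not limited here.
The operating performance data may be data throughput, latency, or bandwidth, may be parameters that represent data throughput, latency, or bandwidth, and may also include other parameters that describe network service characteristics. This is not limited here.
1002. The SMS receives the configuration parameter value range sent by the Orchestrator.
The SMS receives the configuration parameter value range sent by the Orchestrator. This range is the range of values that the operating performance data of the VNF takes when the network service is running normally.
It can be understood that, in practice, step 1002 may be performed at any time before step 1003. Furthermore, if the SMS already stores a configuration parameter value range, step 1002 may be skipped. This is not limited here.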
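1003. The SMS determines whether the operating performance data of the VNF falls within the configuration parameter value range.
After receiving the operating performance data of the VNF and the configuration parameter value range, the SMS determines whether the operating performance data falls within that range.
If the SMS determines that the operating performance data of the VNF does not fall within the configuration parameter value range, it determines that network service performance is abnormal and triggers step 1004.
If the SMS determines that the operating performance data of the VNF falls within the configuration parameter value range, it determines that network service performance is normal and may trigger step 1001.
In practice, when the SMS determines that the operating performance data of the VNF falls within the configuration parameter value range, it may also pause for a preset time before triggering step 1001. This is not limited here.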
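1004. The SMS sends network service fault association request information to the system management module.
When the SMS determines that the operating performance data of the VNF does not fall within the configuration parameter value range, and therefore that network service performance is abnormal, it sends network service fault association request information to the system management module. The system management module uses this request to query faults and to feed a network service fault association response back to the SMS.
The network service fault association request information includes a network service identifier, which uniquely identifies a network service in the network. It may also include network service performance anomaly data, that is, statistics on the one or more performance items of the network service that are abnormal. It may further include the topology information of the network service or the time at which the anomaly occurred: the topology information lists all network devices or network functions that make up the network service, and the anomaly occurrence time indicates when the fault occurred. Other data related to the network service may also be included. This is not limited here.
Depending on the deployment, the system management module may be a network management system (NMS), may be a virtualized network function manager (VNFM), or may include both an NMS and a VNFM. This is not limited here.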
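1005. The SMS determines the root-cause fault and the pseudo faults by combining the operating performance data of the VNF with the network service fault association response.
After receiving the network service fault association response fed back by the system management module, the SMS combines the operating performance data of the VNF with that response to determine the root-cause fault and the pseudo faults. The root-cause fault is the root cause of the network service anomaly; a pseudo fault is a fault that recovers once the root-cause fault recovers.
The network service fault association response includes a network service identifier and fault information. The fault information identifies the entity in which the fault occurred, so that the faulty part of the system management module's management scope can be told apart. The response may also include a fault type, which identifies the kind of fault, for example overload, service interruption, or power failure. It may further include the topology information of the network service, network service performance anomaly data, the anomaly occurrence time, or other data related to the network service or the fault. This is not limited here.
The SMS can determine the root-cause fault and the pseudo faults in several ways. It may search a data fault information list to find the root-cause fault and pseudo faults that correspond to the abnormal data in the operating performance data of the VNF and to the fault information in the network service fault association response. It may instead submit the operating performance data and the response to an analysis tool that works out the root-cause fault and pseudo faults. Other ways are also possible. This is not limited here. The data fault information list may be set manually, may be populated from empirical data, and may be updated through machine learning. This is not limited here.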
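The list-lookup variant of step 1005 might look like the sketch below. The patent does not specify the structure of the data fault information list, so everything here is an assumption: each entry pairs a set of expected anomalies and a set of expected fault records with one root-cause fault and its pseudo faults, and an entry matches when both sets are contained in what was observed. The string encoding such as `"GW2:fault"` is likewise hypothetical; the single example entry reuses the GW2/GW1/VNF-4 case from the application scenario in this description.

```python
from typing import FrozenSet, List, NamedTuple, Optional, Set, Tuple


class ListEntry(NamedTuple):
    """One row of an assumed data fault information list."""
    abnormal_data: FrozenSet[str]   # anomalies expected in VNF performance data
    fault_info: FrozenSet[str]      # fault records expected in the response
    root_fault: str
    pseudo_faults: Tuple[str, ...]


FAULT_LIST: List[ListEntry] = [
    ListEntry(
        abnormal_data=frozenset({"VNF-4:overload"}),
        fault_info=frozenset({"GW2:fault", "GW1:overload"}),
        root_fault="GW2",
        pseudo_faults=("GW1", "VNF-4"),
    ),
]


def lookup_faults(
    abnormal_data: Set[str],
    fault_info: Set[str],
    fault_list: List[ListEntry] = FAULT_LIST,
) -> Optional[Tuple[str, Tuple[str, ...]]]:
    """Return (root_fault, pseudo_faults) for the first matching entry."""
    for entry in fault_list:
        # An entry matches when everything it expects was actually observed.
        if entry.abnormal_data <= abnormal_data and entry.fault_info <= fault_info:
            return entry.root_fault, entry.pseudo_faults
    return None  # no entry matched; another analysis method would be needed


result = lookup_faults(
    {"VNF-4:overload"},
    {"GW2:fault", "GW1:overload", "VNF-4:overload"},
)
```

A first-match policy is the simplest choice; a list maintained from empirical data or machine learning could rank entries instead.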
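1006. The SMS analyzes the root-cause fault and the pseudo faults to obtain a handling policy.
After determining the root-cause fault and the pseudo faults, the SMS analyzes them to obtain a handling policy.
1007. The SMS generates the network service fault diagnosis report according to the root-cause fault, the pseudo faults, and the handling policy.
After obtaining the handling policy, the SMS generates the network service fault diagnosis report according to the root-cause fault, the pseudo faults, and the handling policy.
The network service fault diagnosis report includes a network service identifier, the root-cause fault, and the root-cause fault handling decision. The root-cause fault is the root cause of the network service anomaly, and the root-cause fault handling decision is the policy for handling it, such as restart or migration. The report may also include the root-cause fault type, the pseudo faults, the pseudo fault types, or the pseudo fault handling decisions. It may further include the topology information of the network service, network service performance anomaly data, the anomaly occurrence time, or other data related to the network service or the fault. This is not limited here.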
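The three messages described in this embodiment (request information, association response, diagnosis report) can be modeled as plain data classes. This is a sketch under assumptions: the class and field names are invented for illustration, optional fields correspond to items the description says "may also" be included, and the service identifier `ns-1` is hypothetical. The patent does not define a wire format.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional


@dataclass
class FaultAssociationRequest:
    service_id: str                                   # required: network service identifier
    abnormal_data: Dict[str, float] = field(default_factory=dict)  # optional anomaly statistics
    topology: List[str] = field(default_factory=list)              # optional devices/functions
    occurred_at: Optional[str] = None                 # optional anomaly occurrence time


@dataclass
class FaultAssociationResponse:
    service_id: str
    fault_info: List[str]                             # entities in which faults occurred
    fault_types: Dict[str, str] = field(default_factory=dict)      # e.g. overload, power failure


@dataclass
class FaultDiagnosisReport:
    service_id: str
    root_fault: str                                   # root cause of the anomaly
    root_fault_decision: str                          # e.g. "restart" or "migrate"
    pseudo_faults: List[str] = field(default_factory=list)         # optional


example_report = FaultDiagnosisReport(
    service_id="ns-1",
    root_fault="GW2",
    root_fault_decision="restart",
    pseudo_faults=["GW1", "VNF-4"],
)
```

Using `default_factory` for the list and dictionary fields avoids sharing one mutable default between instances.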
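1008. The SMS sends the network service fault diagnosis report to the system management module.
After generating the network service fault diagnosis report, the SMS sends it to the system management module, which uses it to perform fault recovery.
After sending the report, the SMS may also receive a fault recovery message fed back by the system management module, which indicates that the system management module has carried out fault recovery. After receiving the fault recovery message, and once it observes that the fault has indeed been cleared, the SMS may feed a network service fault recovery notification back to the system management module to inform it that the fault has been cleared.
In this embodiment of the present invention, the SMS first determines the root-cause fault and the pseudo faults by combining the network service fault association response with the operating performance data of the VNF, then analyzes them to obtain a handling policy, and then generates the network service fault diagnosis report from the root-cause fault, the pseudo faults, and the handling policy and sends it to the system management module. This makes the analysis of network service faults more accurate and improves the accuracy of fault handling.
It can be understood that the SMS may be deployed in different management systems: for example in the element management system (EMS), in the Orchestrator, or in the NMS or VNFM of the system management module. This is not limited here. The paths over which messages are received or sent may differ with the deployment; for example, when the SMS is deployed in the NMS of the system management module, the SMS may transmit messages through the Orchestrator.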
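For ease of understanding, the network service fault handling method is described below for a specific application scenario:
The SMS receives the configuration parameter value range sent by the Orchestrator.
The SMS receives the operating performance data of the VNFs sent by the EMS at a period of 10 seconds.
The SMS finds that, in the operating performance data, the service performance of VNF-4 exceeds the configuration parameter value range, and determines that the network service performance of VNF-4 is abnormal.
The SMS sends network service fault association request information to the system management module. The request includes the network service identifier of VNF-4 and the network service performance anomaly data; the system management module uses it to query faults and to feed a network service fault association response back to the SMS.
The SMS combines the operating performance data of the VNFs with the network service fault association response (which reports: device GW2 faulty, throughput 0; device GW1 normal, service performance overloaded; VNF-4 normal, service performance overloaded) and determines that device GW2 is the root-cause fault and that device GW1 and VNF-4 are pseudo faults.
The SMS analyzes the root-cause fault and the pseudo faults and obtains the handling policy: restart device GW2.
The SMS generates the network service fault diagnosis report according to the root-cause fault, the pseudo faults, and the handling policy. The report states that the root-cause fault is GW2 and that the root-cause fault handling decision is to restart GW2.
The SMS sends the network service fault diagnosis report to the system management module, which uses it to perform fault recovery.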
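The network service fault handling method is described below from the perspective of the system management module. Referring to FIG. 11, another embodiment of the method includes:
1101. The system management module receives network service fault association request information sent by the SMS.
When the SMS sends network service fault association request information to the system management module, the system management module receives it.
1102. The system management module queries fault information within its management scope according to the network service fault association request information.
After receiving the network service fault association request information, the system management module queries the fault information within its management scope according to that request.
1103. The system management module converts the fault information into a network service fault association response.
After finding the fault information, the system management module converts it into a network service fault association response.
1104. The system management module sends the network service fault association response to the SMS.
After obtaining the network service fault association response, the system management module sends it to the SMS, which uses it to analyze the network service fault.
1105. The system management module receives the network service fault diagnosis report sent by the SMS.
When the SMS sends a network service fault diagnosis report to the system management module, the system management module receives it. The report is obtained by the SMS by performing fault diagnosis based on the network service fault association response and the operating performance data of the virtualized network function (VNF) entity.
1106. The system management module performs fault recovery according to the network service fault diagnosis report.
After obtaining the network service fault diagnosis report, the system management module performs fault recovery according to it.
In this embodiment of the present invention, the system management module queries fault information according to the received network service fault association request information, converts the fault information into a network service fault association response and sends it to the SMS, and then performs fault recovery according to the network service fault diagnosis report sent by the SMS. Network service fault handling is thereby implemented in an NFV environment: when a network service fails, the fault can be located and handled quickly, which helps keep the NFV system stable and improves its operating efficiency.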
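In practice, the system management module may be a network management system (NMS), a virtualized network function manager (VNFM), or a combination of an NMS and a VNFM. The cases where the system management module is an NMS and where it is a VNFM are described in turn below.
When the system management module is an NMS, referring to FIG. 12, another embodiment of the network service fault handling method includes:
1201. The network management system (NMS) receives network service fault association request information sent by the SMS.
When the SMS sends network service fault association request information to the NMS, the NMS receives it.
1202. The NMS queries fault information of the devices it manages according to the network service fault association request information.
After receiving the network service fault association request information, the NMS queries the fault information of the devices it manages according to that request.
1203. The NMS converts the fault information into a network service fault association response.
After finding the fault information of the devices, the NMS converts it into a network service fault association response.
1204. The NMS sends the network service fault association response to the SMS.
After obtaining the network service fault association response, the NMS sends it to the SMS, which uses it to analyze the network service fault.
1205. The NMS receives the network service fault diagnosis report sent by the SMS.
When the SMS sends a network service fault diagnosis report to the NMS, the NMS receives it. The report is obtained by the SMS by performing fault diagnosis based on the network service fault association response and the operating performance data of the virtualized network function (VNF) entity.
1206. The NMS performs fault recovery according to the network service fault diagnosis report.
After obtaining the network service fault diagnosis report, the NMS performs fault recovery according to it.
The NMS can perform fault recovery in several ways, for example by restarting a device or by migrating services. This is not limited here.
In practice, after the NMS has carried out fault recovery, it may feed a fault recovery notification back to the SMS to inform the SMS that recovery has been carried out. After receiving this notification, and once it observes that the fault has been cleared, the SMS may send a network service fault recovery notification to the NMS to inform the NMS that the fault has been cleared.
In this embodiment of the present invention, when the system management module is an NMS, after receiving the network service fault association request information the NMS can query the fault information of the devices it manages and then convert that fault information into a network service fault association response that is fed back to the SMS. Using the NMS to query device faults in this way speeds up the fault query, makes the query result more accurate, and improves the efficiency of fault handling.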
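When the system management module is a virtualized network function manager (VNFM), referring to FIG. 13, another embodiment of the network service fault handling method includes:
1301. The VNFM receives network service fault association request information sent by the SMS.
When the SMS sends network service fault association request information to the VNFM, the VNFM receives it.
1302. The VNFM queries, according to the network service fault association request information, fault information of the VNFs associated with the network service named in that request.
After receiving the network service fault association request information, the VNFM queries the fault information of the VNFs associated with the network service named in the request.
1303. The VNFM queries, through the virtualized infrastructure manager (VIM), fault information of the network functions virtualization infrastructure (NFVI) associated with the network service.
After receiving the network service fault association request information, the VNFM may send it to the VIM so that the VIM queries the fault information of the NFVI associated with the network service.
1304. The VNFM converts the fault information of the VNF and the fault information of the NFVI into a network service fault association response.
After finding the fault information of the VNF and the fault information of the NFVI, the VNFM converts that fault information into a network service fault association response.
1305. The VNFM sends the network service fault association response to the SMS.
After obtaining the network service fault association response, the VNFM sends it to the SMS, which uses it to analyze the network service fault.
1306. The VNFM receives the network service fault diagnosis report sent by the SMS.
When the SMS sends a network service fault diagnosis report to the VNFM, the VNFM receives it. The report is obtained by the SMS by performing fault diagnosis based on the network service fault association response and the operating performance data of the virtualized network function (VNF) entity.
1307. The VNFM performs fault recovery according to the network service fault diagnosis report.
After obtaining the network service fault diagnosis report, the VNFM performs fault recovery according to it.
The VNFM can parse the root-cause fault and the root-cause fault handling decision out of the network service fault diagnosis report. If the faulty entity is a VNF, the VNFM may restart the VNF, migrate the VNF, or re-instantiate the VNF, among other options. If the faulty entity is the NFVI, it may be restarted or migrated, among other options. This is not limited here.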
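How a VNFM might turn the parsed report into a recovery action can be sketched as follows. The report keys (`root_fault`, `root_fault_kind`, `root_fault_decision`), the action names, and the `SUPPORTED_ACTIONS` table are assumptions for this sketch; the table contains only the actions the description names explicitly, although the description leaves the set open ("among other options").

```python
from typing import Dict, Set, Tuple

# Actions explicitly named in the description, per kind of faulty entity.
SUPPORTED_ACTIONS: Dict[str, Set[str]] = {
    "VNF": {"restart", "migrate", "reinstantiate"},
    "NFVI": {"restart", "migrate"},
}


def plan_recovery(report: Dict[str, str]) -> Tuple[str, str]:
    """Return (action, target) for the root-cause fault in a diagnosis report."""
    kind = report["root_fault_kind"]          # "VNF" or "NFVI" (assumed field)
    action = report["root_fault_decision"]    # handling decision from the SMS
    if action not in SUPPORTED_ACTIONS.get(kind, set()):
        raise ValueError(f"unsupported action {action!r} for {kind!r}")
    return action, report["root_fault"]


plan = plan_recovery({
    "root_fault": "NFVI",
    "root_fault_kind": "NFVI",
    "root_fault_decision": "restart",
})
```

Only the root-cause fault is acted on here, since pseudo faults are defined as faults that recover once the root-cause fault recovers.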
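In practice, after the VNFM has carried out fault recovery, it may feed a fault recovery notification back to the SMS to inform the SMS that recovery has been carried out. After receiving this notification, and once it observes that the fault has been cleared, the SMS may send a network service fault recovery notification to the VNFM to inform the VNFM that the fault has been cleared.
In this embodiment of the present invention, when the system management module is a VNFM, after receiving the network service fault association request information the VNFM can query the fault information of the VNFs and, through the VIM, the fault information of the NFVI, and then convert both into a network service fault association response. Querying VNF and NFVI faults through the VNFM in this way obtains the fault information faster than other query methods and improves the efficiency of fault queries.
It can be understood that, besides being an NMS or a VNFM alone, the system management module may include a combination of an NMS and a VNFM. When it includes both, the SMS may send the network service fault association request information to the NMS and the VNFM at the same time. This is not limited here.
For ease of understanding, the network service fault handling method is described below for a specific application scenario:
The VNFM receives network service fault association request information sent by the SMS; the request indicates that VNF-4 is overloaded.
The VNFM queries, according to the request, the fault information of the VNFs associated with VNF-4 and obtains: VNF-2 normal, service performance overloaded; VNF-4 normal, service performance overloaded.
The VNFM queries, through the VIM, the fault information of the NFVI associated with VNF-4 and obtains: NFVI faulty, throughput 0.
The VNFM converts the VNF fault information (VNF-2 normal, service performance overloaded; VNF-4 normal, service performance overloaded) and the NFVI fault information (NFVI faulty, throughput 0) into a network service fault association response.
The VNFM sends the network service fault association response to the SMS.
The VNFM receives the network service fault diagnosis report sent by the SMS; the report states that the root-cause fault is the NFVI, the root-cause fault handling decision is to restart the NFVI, and the pseudo faults are VNF-2 and VNF-4.
The VNFM restarts the NFVI according to the network service fault diagnosis report.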
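A person skilled in the art will clearly understand that, for convenience and brevity of description, the specific working processes of the systems, apparatuses, and units described above can be found in the corresponding processes of the foregoing method embodiments and are not repeated here.
In the several embodiments provided in this application, it should be understood that the disclosed systems, apparatuses, and methods may be implemented in other ways. For example, the apparatus embodiments described above are merely illustrative: the division into units is only a division by logical function, and other divisions are possible in an actual implementation; for example, multiple units or components may be combined or integrated into another system, or some features may be ignored or not performed. In addition, the mutual coupling, direct coupling, or communication connection shown or discussed may be indirect coupling or communication connection through some interfaces, apparatuses, or units, and may be electrical, mechanical, or in other forms.
Components displayed as units may or may not be physical units; that is, they may be located in one place or distributed over multiple network units. Some or all of the units may be selected as actually needed to achieve the purpose of the solution of this embodiment.
In addition, the functional units in the embodiments of the present invention may be integrated into one processing unit, each unit may exist physically on its own, or two or more units may be integrated into one unit. The integrated unit may be implemented in the form of hardware or in the form of a software functional unit.
If the integrated unit is implemented in the form of a software functional unit and sold or used as an independent product, it may be stored in a computer-readable storage medium. Based on this understanding, the technical solution of the present invention in essence, or the part that contributes to the prior art, or all or part of the technical solution, may be embodied in the form of a software product. The computer software product is stored in a storage medium and includes several instructions for causing a computer device (which may be a personal computer, a server, or the like) to perform all or some of the steps of the methods described in the embodiments of the present invention. The storage medium includes any medium that can store program code, such as a USB flash drive, a removable hard disk, a read-only memory (ROM), a random access memory (RAM), a magnetic disk, or an optical disc.
In conclusion, the foregoing embodiments are intended only to illustrate the technical solutions of the present invention, not to limit them. Although the present invention has been described in detail with reference to the foregoing embodiments, a person of ordinary skill in the art should understand that the technical solutions recorded in the foregoing embodiments may still be modified, or some of their technical features may be replaced with equivalents, and that such modifications or replacements do not cause the essence of the corresponding technical solutions to depart from the spirit and scope of the technical solutions of the embodiments of the present invention.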

Claims

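1. A service management system, comprising:
a monitoring module, configured to monitor the operating performance data of a virtualized network function (VNF) entity;
a judging module, configured to determine, according to the operating performance data of the VNF monitored by the monitoring module, whether network service performance is abnormal;
an information request module, configured to send network service fault association request information to a system management module when the judging module determines that network service performance is abnormal, wherein the network service fault association request information is used by the system management module to query faults and to feed a network service fault association response back to a fault diagnosis module;
the fault diagnosis module, configured to perform fault diagnosis according to the network service fault association response and the operating performance data of the VNF monitored by the monitoring module, to obtain a network service fault diagnosis report;
a report sending module, configured to send the network service fault diagnosis report obtained by the fault diagnosis module to the system management module, wherein the network service fault diagnosis report is used by the system management module to perform fault recovery.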
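2. The system according to claim 1, wherein the monitoring module is specifically configured to: periodically receive the operating performance data of the VNF sent by an element management system (EMS).
3. The system according to claim 2, wherein the system further comprises:
a configuration receiving module, configured to receive a configuration parameter value range sent by an Orchestrator;
and the judging module specifically comprises:
a judging unit, configured to determine whether the operating performance data of the VNF falls within the configuration parameter value range received by the configuration receiving module;
a normal determining unit, configured to determine that the network service performance is normal when the judging unit determines that the operating performance data of the VNF falls within the configuration parameter value range;
an abnormal determining unit, configured to determine that the network service performance is abnormal when the judging unit determines that the operating performance data of the VNF does not fall within the configuration parameter value range.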
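4. The system according to any one of claims 1 to 3, wherein the fault diagnosis module specifically comprises:
a fault determining unit, configured to determine a root-cause fault and a pseudo fault by combining the operating performance data of the VNF monitored by the monitoring module with the network service fault association response, wherein the pseudo fault is a fault that recovers once the root-cause fault recovers;
a fault analysis unit, configured to analyze the root-cause fault and the pseudo fault determined by the fault determining unit, to obtain a handling policy;
a report generating unit, configured to generate the network service fault diagnosis report according to the root-cause fault, the pseudo fault, and the handling policy obtained by the fault analysis unit.
5. The system according to claim 4, wherein the fault determining unit is specifically configured to search a data fault information list by combining the abnormal data in the operating performance data of the VNF monitored by the monitoring module with the fault information in the network service fault association response, to obtain the corresponding root-cause fault and pseudo fault, wherein the pseudo fault is a fault that recovers once the root-cause fault recovers.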
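6. The system according to claim 5, wherein
the service management system is deployed in the Orchestrator;
or,
the service management system is deployed in the system management module;
or,
the service management system is deployed in the EMS.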
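7. A system management module, comprising:
a request receiving unit, configured to receive network service fault association request information sent by a service management system (SMS);
a fault query unit, configured to query fault information within the management scope according to the network service fault association request information received by the request receiving unit;
an information conversion unit, configured to convert the fault information found by the fault query unit into a network service fault association response;
a sending unit, configured to send the network service fault association response produced by the information conversion unit to the SMS;
a report receiving unit, configured to receive a network service fault diagnosis report sent by the SMS, wherein the network service fault diagnosis report is obtained by the SMS by performing fault diagnosis based on the network service fault association response sent by the sending unit and the operating performance data of a virtualized network function (VNF) entity;
a fault recovery unit, configured to perform fault recovery according to the network service fault diagnosis report received by the report receiving unit.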
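8. The system management module according to claim 7, wherein, when the system management module is a network management system (NMS), the fault query unit is specifically configured to query fault information of the managed devices according to the network service fault association request information received by the request receiving unit.
9. The system management module according to claim 7, wherein, when the system management module is a virtualized network function manager (VNFM),
the fault query unit specifically comprises:
a first query subunit, configured to query, according to the network service fault association request information received by the request receiving unit, fault information of the VNFs associated with the network service named in the network service fault association request information;
a second query subunit, configured to query, according to the network service fault association request information received by the request receiving unit and through a virtualized infrastructure manager (VIM), fault information of the network functions virtualization infrastructure (NFVI) associated with the network service;
and the information conversion unit is specifically configured to convert the fault information of the VNF and the fault information of the NFVI into a network service fault association response.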
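10. A service management system, comprising:
an input apparatus, an output apparatus, a processor, a memory, and a bus;
wherein the processor performs the following operations:
monitoring the operating performance data of a virtualized network function (VNF) entity;
determining, according to the operating performance data of the VNF, whether network service performance is abnormal;
when network service performance is abnormal, sending network service fault association request information to a system management module, wherein the network service fault association request information is used by the system management module to query faults and to feed back a network service fault association response;
performing fault diagnosis according to the network service fault association response and the operating performance data of the VNF, to obtain a network service fault diagnosis report;
sending the network service fault diagnosis report to the system management module, wherein the network service fault diagnosis report is used by the system management module to perform fault recovery.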
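11. The system according to claim 10, wherein the processor specifically performs the following operation: periodically receiving the operating performance data of the VNF sent by an element management system (EMS).
12. The system according to claim 11, wherein
the processor further performs the following operation: receiving a configuration parameter value range sent by an Orchestrator;
and the processor specifically performs the following operations:
determining whether the operating performance data of the VNF falls within the configuration parameter value range;
when the operating performance data of the VNF falls within the configuration parameter value range, determining that the network service performance is normal;
when the operating performance data of the VNF does not fall within the configuration parameter value range, determining that the network service performance is abnormal.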
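13. The system according to any one of claims 10 to 12, wherein the processor specifically performs the following operations:
determining a root-cause fault and a pseudo fault by combining the operating performance data of the VNF with the network service fault association response, wherein the pseudo fault is a fault that recovers once the root-cause fault recovers;
analyzing the root-cause fault and the pseudo fault to obtain a handling policy;
generating the network service fault diagnosis report according to the root-cause fault, the pseudo fault, and the handling policy.
14. The system according to claim 13, wherein the processor specifically performs the following operation:
searching a data fault information list by combining the abnormal data in the operating performance data of the VNF with the fault information in the network service fault association response, to obtain the corresponding root-cause fault and pseudo fault.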
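15. A system management module, comprising:
an input apparatus, an output apparatus, a processor, a memory, and a bus;
wherein the processor performs the following operations:
receiving network service fault association request information sent by a service management system (SMS);
querying fault information within the management scope according to the network service fault association request information;
converting the fault information into a network service fault association response;
sending the network service fault association response to the SMS;
receiving a network service fault diagnosis report sent by the SMS, wherein the network service fault diagnosis report is obtained by the SMS by performing fault diagnosis based on the network service fault association response and the operating performance data of a virtualized network function (VNF) entity;
performing fault recovery according to the network service fault diagnosis report.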
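16. The module according to claim 15, wherein, when the system management module is a network management system (NMS), the processor specifically performs the following operation:
querying fault information of the managed devices.
17. The module according to claim 15, wherein, when the system management module is a virtualized network function manager (VNFM),
the processor specifically performs the following operations:
querying fault information of the VNFs associated with the network service named in the network service fault association request information;
querying, through a virtualized infrastructure manager (VIM), fault information of the network functions virtualization infrastructure (NFVI) associated with the network service;
converting the fault information of the VNF and the fault information of the NFVI into a network service fault association response.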
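18. A network service fault handling method, comprising:
monitoring, by a service management system (SMS), the operating performance data of a virtualized network function (VNF) entity;
determining, by the SMS according to the operating performance data of the VNF, whether network service performance is abnormal;
if network service performance is abnormal, sending, by the SMS, network service fault association request information to a system management module, wherein the network service fault association request information is used by the system management module to query faults and to feed a network service fault association response back to the SMS;
performing, by the SMS, fault diagnosis according to the network service fault association response and the operating performance data of the VNF, to obtain a network service fault diagnosis report;
sending, by the SMS, the network service fault diagnosis report to the system management module, wherein the network service fault diagnosis report is used by the system management module to perform fault recovery.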
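19. The method according to claim 18, wherein the monitoring, by the SMS, of the operating performance data of the VNF comprises:
periodically receiving, by the SMS, the operating performance data of the VNF sent by an element management system (EMS).
20. The method according to claim 19, wherein the operating performance data comprises: throughput, latency, or bandwidth.
21. The method according to claim 19, wherein,
before the step of determining, by the SMS according to the operating performance data of the VNF, whether network service performance is abnormal, the method comprises:
receiving, by the SMS, a configuration parameter value range sent by an Orchestrator;
and the determining, by the SMS according to the operating performance data of the VNF, whether network service performance is abnormal comprises:
determining, by the SMS, whether the operating performance data of the VNF falls within the configuration parameter value range;
if so, determining that the network service performance is normal;
if not, determining that the network service performance is abnormal.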
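22. The method according to claim 18, wherein the performing, by the SMS, of fault diagnosis according to the network service fault association response and the operating performance data of the VNF, to obtain a network service fault diagnosis report, comprises:
determining, by the SMS, a root-cause fault and a pseudo fault by combining the operating performance data of the VNF with the network service fault association response, wherein the pseudo fault is a fault that recovers once the root-cause fault recovers;
analyzing, by the SMS, the root-cause fault and the pseudo fault to obtain a handling policy;
generating, by the SMS, the network service fault diagnosis report according to the root-cause fault, the pseudo fault, and the handling policy.
23. The method according to claim 22, wherein the determining, by the SMS, of a root-cause fault and a pseudo fault by combining the operating performance data of the VNF with the network service fault association response comprises:
searching, by the SMS, a data fault information list by combining the abnormal data in the operating performance data of the VNF with the fault information in the network service fault association response, to obtain the corresponding root-cause fault and pseudo fault.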
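24. The method according to any one of claims 18 to 23, wherein the network service fault association request information comprises:
a network service identifier and network service performance anomaly data, wherein the network service identifier uniquely identifies a network service in the network, and the network service performance anomaly data represents statistics on one or more abnormal performance items of the network service.
25. The method according to claim 24, wherein the network service fault association response comprises:
a network service identifier, fault information, and a fault type, wherein the fault information identifies the entity in which the fault occurred, and the fault type identifies the type of the fault.
26. The method according to claim 25, wherein the network service fault diagnosis report comprises:
a network service identifier, a root-cause fault, and a root-cause fault handling decision, wherein the root-cause fault represents the root cause of the network service anomaly, and the root-cause fault handling decision represents the policy for handling the root-cause fault.
27. The method according to any one of claims 18 to 23, wherein the system management module is:
a network management system (NMS), and/or a virtualized network function manager (VNFM).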
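28. A network service fault handling method, comprising:
receiving, by a system management module, network service fault association request information sent by a service management system (SMS);
querying, by the system management module according to the network service fault association request information, fault information within the management scope of the system management module;
converting, by the system management module, the fault information into a network service fault association response;
sending, by the system management module, the network service fault association response to the SMS;
receiving, by the system management module, a network service fault diagnosis report sent by the SMS, wherein the network service fault diagnosis report is obtained by the SMS by performing fault diagnosis based on the network service fault association response and the operating performance data of a virtualized network function (VNF) entity;
performing, by the system management module, fault recovery according to the network service fault diagnosis report.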
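29. The method according to claim 28, wherein, when the system management module is a network management system (NMS), the querying of fault information within the management scope of the system management module comprises:
querying, by the NMS, fault information of the devices managed by the NMS.
30. The method according to claim 28, wherein, when the system management module is a virtualized network function manager (VNFM),
the querying of fault information within the management scope of the system management module comprises:
querying, by the VNFM, fault information of the VNFs associated with the network service named in the network service fault association request information;
querying, by the VNFM through a virtualized infrastructure manager (VIM), fault information of the network functions virtualization infrastructure (NFVI) associated with the network service;
and the converting, by the system management module, of the fault information into a network service fault association response comprises:
converting, by the VNFM, the fault information of the VNF and the fault information of the NFVI into a network service fault association response.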
PCT/CN2014/071007, priority date 2014-01-21, filing date 2014-01-21: Network service fault handling method, service management system, and system management module. Published as WO2015109443A1 (zh) on 2015-07-30.
