TWI690894B - Automatic round-collecting for broadband customer and automatic round-collecting server - Google Patents

Automatic round-collecting for broadband customer and automatic round-collecting server Download PDF

Info

Publication number
TWI690894B
TWI690894B TW107133259A TW107133259A TWI690894B TW I690894 B TWI690894 B TW I690894B TW 107133259 A TW107133259 A TW 107133259A TW 107133259 A TW107133259 A TW 107133259A TW I690894 B TWI690894 B TW I690894B
Authority
TW
Taiwan
Prior art keywords
rate
user
sampling
user groups
samples
Prior art date
Application number
TW107133259A
Other languages
Chinese (zh)
Other versions
TW202013306A (en
Inventor
陳梅苑
卓清波
王長智
洪健哲
Original Assignee
中華電信股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 中華電信股份有限公司 filed Critical 中華電信股份有限公司
Priority to TW107133259A priority Critical patent/TWI690894B/en
Publication of TW202013306A publication Critical patent/TW202013306A/en
Application granted granted Critical
Publication of TWI690894B publication Critical patent/TWI690894B/en

Links

Images

Landscapes

  • Data Exchanges In Wide-Area Networks (AREA)

Abstract

An automatic round-collecting for broadband customer and an automatic round-collecting server are provided. Multiple users are grouped into multiple user group according to attribute thereof. The attribute of each user includes located area and rented rate. Sample size of each user group is determined according to corresponding rented rate. Then, the round-collecting operation for the user groups are performed according to the determined sample sizes. Accordingly, an easy and smart determination sampling method are provided, so as to achieve a best representation and collecting data without interruption.

Description

寬頻服務的自動輪查方法及自動化輪查伺服器Automatic polling method and broadband polling server for broadband service

本發明是有關於一種電信服務抽查技術,且特別是有關於一種寬頻服務的自動輪查方法及自動化輪查伺服器。The invention relates to a telecommunication service spot check technology, and in particular to an automatic polling method and an automatic polling server for broadband services.

對電信業者而言,用戶的訊務(流量)資料是大數據分析中最基本的一種應用。然而,以大型業者為例,網路內的用戶數量動輒百萬。若業者想要同時收集所有用戶的訊務資料,將往往受限於接取設備及收集設備效能而不易達成。在現有技術中,雖然有借重抽樣方法,但傳統的簡單抽樣通常無法滿足電信用戶多樣化及與隨時間改變的特性。For telecom operators, users' traffic (traffic) data is the most basic application in big data analysis. However, taking the example of a large-scale industry, the number of users in the network is easily one million. If the industry wants to collect all users' communication information at the same time, it will often be limited by the performance of the access equipment and the collection equipment, which is not easy to achieve. In the prior art, although there is a re-sampling method, the traditional simple sampling usually cannot satisfy the characteristics of diversification and time-varying changes of telecommunication users.

有鑑於此,本發明提供一種寬頻服務的自動輪查方法及自動化輪查伺服器,其針對用戶屬性分群,並依據承租速度決定各群組的抽樣比例,再進行批次輪查。In view of this, the present invention provides an automatic polling method and an automated polling server for broadband services, which are grouped according to user attributes, and determine the sampling ratio of each group according to the lease rate, and then perform batch polling.

本發明寬頻服務的自動輪查方法,其包括下列步驟。對用戶依據其用戶屬性分群,以產生用戶群組。而各用戶的用戶屬性包括所在區域及承租速率。依據那些用戶群組對應承租速率,分別決定各用戶群組的抽樣數。依據決定的用戶群組的抽樣數執行輪查作業。The automatic polling method of the broadband service of the present invention includes the following steps. Group users according to their user attributes to generate user groups. The user attributes of each user include the location and lease rate. According to the corresponding lease rates of those user groups, the number of samples of each user group is determined separately. Carry out the polling operation based on the determined sample number of the user group.

本發明的自動化輪查伺服器,其包括通訊模組、儲存器及處理器。通訊模組用以傳送及接收資料。儲存器記錄數個模組。處理器耦接通訊模組及儲存器,並存取且載入儲存器所記錄的那些模組。那些模組包括屬性分析模組、分群樣本決策模組及輪查用戶取樣及設定模組。屬性分析模組對用戶依據其用戶屬性分群,以產生用戶群組。而各用戶的用戶屬性包括所在區域及承租速率。分群樣本決策模組依據那些用戶群組對應承租速率,分別決定各用戶群組的抽樣數。輪查用戶取樣及設定模組依據決定的用戶群組的抽樣數執行輪查作業。The automatic polling server of the present invention includes a communication module, a storage and a processor. The communication module is used to send and receive data. The memory records several modules. The processor is coupled to the communication module and the memory, and accesses and loads those modules recorded in the memory. Those modules include attribute analysis module, grouping sample decision module and polling user sampling and setting module. The attribute analysis module groups users according to their user attributes to generate user groups. The user attributes of each user include the location and lease rate. The grouping sample decision module determines the sampling number of each user group according to the corresponding lease rate of those user groups. The polling user sampling and setting module performs the polling operation according to the determined number of samples of the user group.

基於上述,本發明實施例的寬頻服務的自動輪查方法及自動化輪查伺服器,從取得的全區用戶資料,依據用戶屬性(例如,所在區域及承租速率等)進行分層抽樣。其中,依據各承租速率用戶的數量及重要性,自動將所有用戶分類成數種用戶群組(例如,分成主要速率(申裝用戶數最多的前幾個速率)、重要速率(新服務或欲特別觀察的速率、或由使用者決定)及次要速率(非前述速率)三類)。接著,依據不同類別的速率,分配抽樣數(例如,主要速率用戶採用分層簡單抽樣;重要速率用戶採用全面抽查;非主要速率則採用全區抽樣或依抽樣狀況忽略不抽)。彙整前述各速率類別的總抽樣數,並與設備總查測容許量比較。若超過總查測容許量,則降低精確度或次要項不抽,再決定抽樣數,調整至總抽查數符合總查測容許量為止。依序排定每一批次要進行查測的名單,直到所有的用戶都抽完為止。接著,按照批次進行固定週期的資料收集,進行反覆輪查,以維持資料取得的不間斷。而在查測期間若某一批次內有用戶退租或其他異動,則將新增用戶補上,以維持總抽樣數。Based on the above, the automatic polling method and the automatic polling server of the broadband service according to the embodiments of the present invention perform stratified sampling based on user attributes (for example, location and lease rate, etc.) from the obtained user data of the whole area. Among them, according to the number and importance of users at each lease rate, all users are automatically classified into several user groups (for example, divided into main rates (the first few rates with the largest number of users), important rates (new services or special Observed rate, or determined by the user) and secondary rate (not the aforementioned three types). Then, according to the different types of rate, assign the number of samples (for example, the main rate users use stratified simple sampling; the important rate users use comprehensive spot checks; the non-primary rate use the whole area sampling or ignore sampling according to the sampling situation). Summarize the total number of samples for each of the aforementioned rate categories and compare it with the total inspection allowance of the equipment. If the total inspection allowance is exceeded, the accuracy is reduced or the secondary items are not selected, and then the number of samples is determined and adjusted until the total inspection number meets the total inspection allowance. Arrange the list of each batch to be checked in order until all users are selected. Then, according to the batch, a fixed period of data collection and repeated polling are carried out to maintain the uninterrupted data acquisition. During the inspection period, if there are users withdrawing rent or other changes within a certain batch, new users will be added to maintain the total number of samples.

為讓本發明的上述特徵和優點能更明顯易懂,下文特舉實施例,並配合所附圖式作詳細說明如下。In order to make the above-mentioned features and advantages of the present invention more obvious and understandable, the embodiments are specifically described below in conjunction with the accompanying drawings for detailed description as follows.

圖1是依據本發明一實施例的自動化輪查伺服器1的元件方塊圖。請參照圖1,自動化輪查伺服器1可以是各類型伺服器、桌上型電腦、筆記型電腦、工作站、後台主機等電子裝置。自動化輪查伺服器1至少包括但不僅限於儲存器120、通訊模組130及處理器150。FIG. 1 is a block diagram of components of an automated polling server 1 according to an embodiment of the invention. Referring to FIG. 1, the automatic polling server 1 may be various types of servers, desktop computers, notebook computers, workstations, back-end hosts, and other electronic devices. The automatic polling server 1 includes at least but not limited to the storage 120, the communication module 130 and the processor 150.

儲存器120可以是任何型態的固定或可移動隨機存取記憶體(Radom Access Memory,RAM)、唯讀記憶體(Read Only Memory,ROM)、快閃記憶體(flash memory)、傳統硬碟(Hard Disk Drive,HDD)、固態硬碟(Solid-State Drive,SSD)或類似元件,並用以記錄程式碼、軟體模組(例如,屬性分析模組111、屬性權重管理模組112、用戶局情模組113、分群抽樣數產生器114、分權樣本決策模組115、輪查用戶取樣及設定模組116、電路異動管理模組117、網管(Network Management)模組118等)、用戶清單119、分群樣本數矩陣、及其他資料或檔案,其詳細內容待後續實施例詳述。The storage 120 can be any type of fixed or removable random access memory (RAM), read only memory (Read Only Memory, ROM), flash memory (flash memory), traditional hard disk (Hard Disk Drive, HDD), Solid-State Drive (SSD) or similar components, and used to record code, software modules (for example, attribute analysis module 111, attribute weight management module 112, user office Module 113, group sampling number generator 114, decentralized sample decision module 115, polling user sampling and setting module 116, circuit transaction management module 117, network management module 118, etc.), user list 119. A matrix of grouping sample numbers, and other data or files, the details of which are to be detailed in subsequent embodiments.

通訊模組130可以是支援諸如乙太網路(Ethernet)、光纖等有線通訊技術、或者是第四代(4G)、或更後代行動通訊、Wi-Fi等無線通訊技術的通訊收發器。通訊模組130用以與外界傳送及接收資料。The communication module 130 may be a communication transceiver supporting wired communication technologies such as Ethernet and optical fiber, or fourth-generation (4G) or later generation mobile communication, Wi-Fi and other wireless communication technologies. The communication module 130 is used to transmit and receive data with the outside world.

處理器150耦接儲存器120及通訊模組130,處理器150並可以是中央處理器(Central Processing Unit,CPU)、微控制器、可程式化控制器、特殊應用積體電路(ASIC)、晶片或其他類似元件或上述元件的組合。於本實施例中,處理器150執行布建決策裝置1的所有操作,處理器150並可存取並載入儲存器120所記錄的那些軟體模組。The processor 150 is coupled to the storage 120 and the communication module 130. The processor 150 may also be a central processing unit (CPU), a microcontroller, a programmable controller, an application specific integrated circuit (ASIC), Wafer or other similar components or a combination of the above components. In this embodiment, the processor 150 executes all operations of the deployment decision device 1, and the processor 150 can access and load those software modules recorded in the storage 120.

為了方便理解本發明實施例的操作流程,以下將舉諸多實施例詳細說明本發明實施例中針對用戶輪查的流程。下文中,將搭配自動化輪查伺服器1中的各項裝置、元件及模組說明本發明實施例所述之方法。本方法的各個流程可依照實施情形而隨之調整,且並不僅限於此。In order to facilitate the understanding of the operation process of the embodiment of the present invention, a number of embodiments will be given below to describe in detail the process of polling for users in the embodiment of the present invention. Hereinafter, the methods described in the embodiments of the present invention will be described with various devices, components, and modules in the automatic polling server 1. The various processes of the method can be adjusted according to the implementation situation, and it is not limited to this.

由於用戶端訊務進行大數據分析時需要同時收集所有用戶的訊務資料,但卻受限於用戶接取設備眾多,且設備收集效能有限,致使無法全部完整收集的困難。而本發明實施例即借重抽樣方法來解決前述問題。雖然傳統的分層抽樣簡單且容易實現,但粗略分層或未分層恐造成抽測名單對用戶母體的代表性及準確度偏離,進而無法滿足電信用戶多樣化及隨時間改變的特性。本發明實施例即是提出一種簡單且智慧決策的抽樣查測方法,以達成具代表性及不間斷收集資料的目的。以下將詳細說明。The user-side communication needs to collect all user's communication data at the same time for big data analysis, but it is limited by the large number of user access devices and the limited collection efficiency of the device, which makes it difficult to collect all of them. The embodiment of the present invention solves the aforementioned problem by using the resampling method. Although traditional stratified sampling is simple and easy to implement, rough or unstratified sampling may deviate from the representativeness and accuracy of the sampling list to the user's parent, and thus fail to meet the characteristics of telecommunications users that diversify and change over time. The embodiment of the present invention proposes a simple and intelligent decision-making sampling inspection method to achieve the purpose of representative and uninterrupted data collection. This will be explained in detail below.

圖2是依據本發明一實施例之自動輪查方法的流程圖。請參照圖2,用戶局情模組113先取得受寬頻服務的所有用戶清單119。屬性分析模組111對所有用戶依據用戶清單119所記錄的用戶屬性來進行分群,以產生一個或更多個用戶群組(步驟S210)。而用戶的用戶屬性包括所在區域(例如,分公司/營運處)及承租速率等。具體而言,當自動化輪查伺服器1進行查測時,可藉由對用戶樣本分群,並對各用戶群組挑選查測用戶,以避免查測過分集中於某些區域或某些承租速率的用戶,進而造成抽測名單對用戶母體的代表性及準確度之偏離。此外,對用戶依據屬性分群,亦保有針對各用戶群組個別抽樣樣本進行調整的彈性,以利後續最佳化分群樣本之決策、及自動評估不同用戶群組個別最小抽樣數之依據。2 is a flowchart of an automatic polling method according to an embodiment of the invention. Referring to FIG. 2, the user situation module 113 first obtains a list 119 of all users receiving broadband services. The attribute analysis module 111 groups all users according to the user attributes recorded in the user list 119 to generate one or more user groups (step S210). The user attributes of the user include the location (for example, branch office/operation office) and lease rate. Specifically, when the automated polling server 1 performs the survey, the user samples can be grouped and the survey users can be selected for each user group to avoid over-concentration of the survey in certain areas or certain lease rates Users, which in turn leads to a deviation of the representativeness and accuracy of the random test list from the user’s parent. In addition, the grouping of users according to attributes also maintains the flexibility to adjust the individual sampling samples of each user group, so as to facilitate the subsequent decision-making to optimize the grouping samples and the basis for automatically evaluating the individual minimum sampling numbers of different user groups.

舉例而言,圖3是一範例說明初始的分群樣本數矩陣。每一列依據所在區域(例如,分公司/營運處)、及承租速率等用戶屬性區分。例如,用戶群組1的單位層級二是A1B1而承租速率為SP1;而相同單位層級二為A1B1,但其承租速率為SP2不同於用戶群組1的SP1,故被分類到用戶群組2;其餘依此類推。For example, FIG. 3 is an example illustrating the initial clustering sample number matrix. Each column is distinguished according to user attributes such as location (for example, branch office/operation office) and lease rate. For example, the unit level 2 of user group 1 is A1B1 and the lease rate is SP1; while the same unit level 2 is A1B1, but the lease rate is SP2 different from SP1 of user group 1, so it is classified into user group 2; The rest can be deduced by analogy.

而分群之後,屬性權重管理模組112會針對這些用戶群組對應承租速率來決定不同各用戶群組的重要性(步驟S220)。於一實施例中,屬性權重管理模組112設定主要速率、重要速率及次要速率三種重要等級。屬性權重管理模組112係將那些用戶群組中對應承租速率的用戶數量最多的至少一者(例如,用戶數量最多的前3個、前5個、或前6個等承租速率)作為主要速率。以圖3為例,用戶數量最多的前4個承租速率是SP1~SP4。屬性權重管理模組112並將那些用戶群組中對應承租速率需要關注的至少一者(例如,最近新增加的、或依據實際需求而需要特別觀察的承租速率)作為重要速率。以圖3為例,最近新增或需要關注的承租速率為SP5~SP8。此外,屬性權重管理模組112將那些用戶群組中不為主要速率及重要速率的至少一者作為次要速率。即,未被歸類到主要速率或重要速率的其他承租速率將作為次要速率。After grouping, the attribute weight management module 112 determines the importance of different user groups according to the lease rates of these user groups (step S220). In one embodiment, the attribute weight management module 112 sets three major levels: primary rate, important rate, and secondary rate. The attribute weight management module 112 uses at least one of the users with the largest number of users corresponding to the lease rate in the user group (for example, the top 3, top 5, or top 6 lease rates with the largest number of users) as the main rate . Taking Figure 3 as an example, the first four lease rates with the largest number of users are SP1~SP4. The attribute weight management module 112 regards at least one of those user groups that needs to pay attention to the lease rate (for example, a lease rate that has recently been newly added or needs special observation according to actual needs) as an important rate. Taking Figure 3 as an example, the lease rates recently added or need attention are SP5~SP8. In addition, the attribute weight management module 112 regards at least one of those user groups that are not the primary rate and the important rate as the secondary rate. That is, other lease rates that are not categorized as a primary rate or an important rate will be considered secondary rates.

接著,屬性權重管理模組112依據多個抽樣信心水準來設定主要速率的那些用戶群組的抽樣數(即,採用分層抽樣查測),將重要速率的那些用戶群組的抽樣數設定為百分百(即,採用全部查測),並將次要速率的那些用戶群組的抽樣數設定為零(即,不查測)。此外,屬性權重管理模組112還可以進一步對各重要等級中的承租速率進行優先排序。以圖3的主要速率為例, SP1的重要性最優先,SP2第二,SP3第三,而SP4則最後。Next, the attribute weight management module 112 sets the sampling rate of those user groups whose main rate is based on multiple sampling confidence levels (that is, using stratified sampling survey), and sets the sampling rate of those user groups whose important rate is One hundred percent (ie, all surveys are used), and the sampling rate of those user groups of the secondary rate is set to zero (ie, no surveys). In addition, the attribute weight management module 112 can further prioritize the lease rates in each important level. Taking the main rate in Figure 3 as an example, SP1 has the highest priority, SP2 is second, SP3 is third, and SP4 is last.

需說明的是,在其他實施例中,若不考慮重要性,則處理器150亦可能是對所有用戶群組都採用分層抽樣查測。It should be noted that, in other embodiments, if importance is not considered, the processor 150 may also adopt stratified sampling inspection for all user groups.

分群樣本產生器114針對每一用戶群組設計不同之抽樣信心水準,於誤差範圍(Margin of Error)(百分比%)為3的條件下,規劃三種抽樣信心水準(例如,99%、95%及90%),從而得出符合對各用戶群組抽樣之最小抽樣數,以供主要速率分群分層抽樣數之決策依據(步驟S230)。以圖3之分群抽樣矩陣為例。用戶群組1(A1/A1B1/SP1)之用戶數為12323,分群樣本產生器114接著可計算出於誤差範圍百分比為3的條件下,分別依據99%,95%及90%之抽樣信心水準所需的抽樣數為1608、983及705。接著,如圖3所示的初始的分群樣本數矩陣即能建立得出(步驟S240),其中重要速率因全部測查,故各抽樣信心水準對應的抽樣數與此用戶群組內的用戶數量相同。The grouped sample generator 114 designs different sampling confidence levels for each user group, and plans three sampling confidence levels (for example, 99%, 95% and 95%) under the condition that the margin of error (percentage %) is 3 90%), so as to obtain the minimum sampling number that meets the sampling of each user group for the decision basis of the main rate grouping stratified sampling number (step S230). Take the grouped sampling matrix of Figure 3 as an example. The number of users in user group 1 (A1/A1B1/SP1) is 12323, and the grouped sample generator 114 can then calculate the sampling confidence level based on 99%, 95%, and 90% for the error range percentage of 3, respectively The required number of samples is 1608, 983 and 705. Next, the initial matrix of sample numbers for clustering as shown in FIG. 3 can be established (step S240), where the important rates are all tested, so the number of samples corresponding to each sampling confidence level and the number of users in this user group the same.

分群樣本決策模組115接著依據那些用戶群組對應承租速率,分別決定各用戶群組的抽樣數。具體而言,分群樣本決策模組115依據分群抽樣矩陣,設定初始抽樣信心水準為95%。分群樣本決策模組115並計算主要速率所有用戶群組(SP1~SP4)於抽樣信心水準95%下之最小抽樣數之加總,再加上重要速率所需之全查測數(即,所有用戶數量),即可求得初始總抽樣數(即,所有用戶群組的抽樣數之加總)(步驟S250)。The grouping sample decision module 115 then determines the number of samples of each user group according to the corresponding lease rates of those user groups. Specifically, the group sampling decision module 115 sets the initial sampling confidence level to 95% based on the group sampling matrix. The grouping sample decision module 115 calculates the sum of the minimum sampling number of all user groups (SP1~SP4) at the sampling rate at the sampling confidence level of 95% of the main rate, plus the total number of full inspections required for the important rate (ie, all Number of users), that is, the initial total number of samples (that is, the sum of the number of samples of all user groups) can be obtained (step S250).

分群樣本決策模組115會判斷當次的總抽樣數是否符合總查測容量(步驟S260)。總查測容量是事先決定的,並可依據實際需求而調整。若未超過此總查測容量,則分群樣本決策模組115記錄結果(步驟S265)並依序提升主要速率的那些用戶群組對應的抽樣信心水準,以使總抽樣數趨近總查測容量,從而發揮最大訊務收集效能。The grouped sample decision module 115 determines whether the current total sampling number matches the total search capacity (step S260). The total inspection capacity is determined in advance and can be adjusted according to actual needs. If the total search capacity is not exceeded, the grouped sample decision module 115 records the results (step S265) and sequentially increases the sampling confidence level corresponding to those user groups of the main rate, so that the total sample number approaches the total search capacity , So as to maximize the efficiency of communication collection.

具體而言,分群樣本決策模組115可提高不同用戶群組之抽樣信心水準。例如,分群樣本決策模組115先將主要速率 SP1的抽樣信心水準由95%調高至99%進行計算,而主要速率SP2~SP4仍保持95%之抽樣信心水準。接著,分群樣本決策模組115將主要速率經調整抽樣信心水準後之抽樣數與重要速率(SP5~SP8)所需之全查測數加總,即可取得新的用戶群組所需之抽樣總數。若此抽樣總數仍未超過總查測容量,則分群樣本決策模組115依據屬性權重管理模組112對於各承租速率的重要性設定,將主要速率SP2的抽樣信心水準由95%調高至99%,使主要速率SP1~SP2 將採99%之抽樣數。分群樣本決策模組115再將主要速率SP1~SP2調整後之抽樣數與維持95%之主要速率SP3~SP4之抽樣數、及重要速率所需之全查測數(SP5~SP8)加總。若抽樣總數仍未超過總查測容量,則分群樣本決策模組115依此往下一主要速率SP3(下一重要性)進行持續調增,直到超過總查測容量,則此前一回合之抽樣總數,即為每批輪查各分群抽樣數之最佳解。Specifically, the grouped sample decision module 115 can improve the sampling confidence level of different user groups. For example, the grouped sample decision module 115 first increases the sampling confidence level of the main rate SP1 from 95% to 99% for calculation, while the main rates SP2~SP4 still maintain the sampling confidence level of 95%. Next, the grouped sample decision module 115 adds the sample number after adjusting the sampling confidence level of the main rate and the total inspection number required by the important rate (SP5~SP8) to obtain the sample required by the new user group total. If the total number of samples still does not exceed the total search capacity, the cluster sample decision module 115 sets the importance of the attribute weight management module 112 for each lease rate, and increases the sampling confidence level of the main rate SP2 from 95% to 99 %, so that the main speeds SP1~SP2 will adopt 99% sampling. The grouped sample decision module 115 then adds the adjusted sample numbers of the main rates SP1~SP2, the number of samples maintained at 95% of the main rate SP3~SP4, and the total number of inspections required for the important rate (SP5~SP8). If the total number of samples still does not exceed the total inspection capacity, the grouped sample decision module 115 will continue to increase to the next major rate SP3 (next importance) until the total inspection capacity is exceeded, then the previous round of sampling The total number is the best solution for the number of samples of each group in each batch.

另一方面,若抽樣總數超過總查測容許量,則分群樣本決策模組115需適時降低用戶群組之抽樣總數,直與總查測容量之差異在可接受範圍內,即能保有最大訊務收集效能。分群樣本決策模組115還會適當降低不同用戶群組之抽樣信心水準。例如,分群樣本決策模組115先將主要速率重要性順序最低之SP4的抽樣信心水準由95%調降至90%,而主要速率SP1~SP3仍維持95%之抽樣數。分群樣本決策模組115再將主要速率的總抽樣數與重要速率(SP5~SP8)所需之全查測數加總,則可取得新的用戶群組所需之抽樣總數。若仍超過全查測數加總,則分群樣本決策模組115繼續將主要速率SP3的抽樣信心水準由95%調降至90%,即主要速率SP3~SP4 將採90%之抽樣數。分群樣本決策模組115再將主要速率的總抽樣數與維持95%之主要速率SP1~SP2樣本數合計,再與重要速率所需之全查測數(SP5~SP8)加總。若仍超過,則依此往下一主要速率SP2進行持續調降,直到小於設備總查測容許量,則此回合之抽樣總數,即為每批輪查各分群抽樣數之最佳解。On the other hand, if the total number of samples exceeds the total inspection allowance, the grouped sample decision module 115 needs to reduce the total number of samples of the user group in a timely manner until the difference from the total inspection capacity is within the acceptable range, that is, the maximum information can be retained Collection efficiency. The grouping sample decision module 115 will also appropriately reduce the sampling confidence level of different user groups. For example, the grouping sample decision module 115 first reduces the sampling confidence level of the SP4 with the lowest main rate importance order from 95% to 90%, while the main rates SP1~SP3 still maintain the sampling rate of 95%. The grouped sample decision module 115 adds the total number of samples for the main rate and the total number of tests required for the important rate (SP5~SP8) to obtain the total number of samples required for the new user group. If the total number of total surveys is still exceeded, the cluster sample decision module 115 continues to reduce the sampling confidence level of the main rate SP3 from 95% to 90%, that is, the main rate SP3~SP4 will take 90% of the samples. The grouped sample decision module 115 adds the total number of samples of the main rate and the number of samples SP1~SP2 that maintain 95% of the main rate, and then adds up the total number of inspections (SP5~SP8) required for the important rate. If it is still exceeded, continue to decrease the SP2 to the next main rate until it is less than the total equipment inspection tolerance, then the total number of samples in this round is the best solution for the number of samples of each group in each round of inspection.

以圖3為例,若初始採抽樣信心水準為95%之抽樣數,則假設所有用戶加總為97534(小於120000的總查測容許值)。因總抽樣數未超過總查測容許值,則分群樣本決策模組115可適當提高不同分群之抽樣信心水準。分群樣本決策模組115先將主要速率重要性最高之SP1的抽樣信心水準由95%調高至99%,則主要速率之抽樣數將取主要速率採99%的SP1加上主要速率採95%的SP2~SP4之樣本數合計。而主要速率之抽樣數將與重要速率所需之全查測數加總,即得總抽樣數為109198,但仍小於總查測容許值。Taking Figure 3 as an example, if the initial sampling confidence level is 95% of the samples, it is assumed that the sum of all users is 97534 (less than the total check allowable value of 120000). Since the total number of samples does not exceed the allowable value of the total inspection, the group sample decision module 115 can appropriately increase the sampling confidence level of different groups. The cluster sample decision module 115 first increases the sampling confidence level of the SP1 with the highest main rate importance from 95% to 99%, then the sampling rate of the main rate will be taken from the SP1 with 99% of the main rate plus 95% with the main rate The total number of samples of SP2~SP4. The number of samples for the main rate will be added to the total number of inspections required for the important rate, that is, the total number of samples is 109198, but it is still less than the allowable value of the total inspection.

因仍未超過總查測容許值,則分群樣本決策模組115繼續將主要速率SP2的抽樣信心水準由95%調高至99%,則主要速路的抽樣數即為主要速率採99%的SP1~SP2加上主要速率採95%的SP3~SP4之樣本數合計。主要速率的抽樣數將與重要速率所需之全查測數加總,而得出總抽樣數為120657,即超過總查測容許值。分群樣本決策模組115可將前一回合主要速率採99%的SP1的抽樣數加上主要速率採95%的SP2~SP4之抽樣數,再與重要速率所需之全查測數加總,即得出總抽樣數為109198 (如圖4中所示粗體字之加總)。此總抽樣數與總查測容許值差異小於門檻值(例如,500、1000、或5000等),可視為符合總查測容許值,故亦為最佳解。Since the total inspection tolerance value has not been exceeded, the cluster sample decision module 115 continues to increase the sampling confidence level of the main rate SP2 from 95% to 99%, then the sampling rate of the main speed is 99% of the main speed. The total number of samples of SP1~SP2 plus 95% of SP3~SP4 at the main rate. The number of samples for the main rate will be added to the total number of inspections required for the important rate, and the total number of samples is 120657, which exceeds the allowable value of the total inspection. The grouping sample decision module 115 can add 99% of the SP1 sampling rate of the main rate in the previous round plus 95% of the SP2~SP4 sampling rate of the main rate, and then add up the total number of inspections required for the important rate. That is, the total number of samples is 109198 (as shown in the sum of bold in Figure 4). The difference between the total number of samples and the total allowable value of the search is less than the threshold value (for example, 500, 1000, or 5000, etc.), which can be regarded as conforming to the allowable value of the total search, so it is also the best solution.

分群樣本決策模組115依據此最佳解可得出每一用戶群組之抽樣數。以用戶群組1為例,可知要採99%抽樣信心水準下的數據,而此群取的抽樣數為1608。而用戶群組2~4皆要採95%抽樣信心水準下的數據。以用戶群組4為例,此群取的抽樣數為915。用戶群組5~8則因是重要速率,所以全面查測。以用戶群組6為例,此用戶群組須取的抽樣數為144(等同於總用戶數量)。The grouping sample decision module 115 can obtain the number of samples for each user group based on this optimal solution. Taking user group 1 as an example, it can be seen that the data under the confidence level of 99% sampling is to be taken, and the number of samples taken by this group is 1608. The user groups 2~4 all need to use the data with 95% sampling confidence level. Taking user group 4 as an example, the number of samples taken by this group is 915. User groups 5 to 8 are fully investigated because they are important rates. Taking user group 6 as an example, the number of samples that this user group must take is 144 (equivalent to the total number of users).

接著,輪查用戶取樣及設定模組116取得最佳化後每一用戶群組之抽樣數(即,最佳解中各用戶群組對應的抽樣數),以作為後續每批輪查每一用戶群組所需之抽樣樣本數。接著,網管模組118透過通訊模組130而對主要速率的用戶群組以分層抽查,對重要速率的用戶群組全查,並採用不重複抽樣的方式,預先逐批排定每一批次要進行查測的名單(步驟S270)。輪查用戶取樣及設定模組116接著依批次名單,循序進行逐批輪查(步驟S285),直至所有規劃之主要速率/重要速率中的用戶名單都抽查完成為止。Next, the polling user sampling and setting module 116 obtains the optimized sampling number of each user group (that is, the sampling number corresponding to each user group in the optimal solution), as each subsequent polling batch The number of sampling samples required by the user group. Next, the network management module 118 uses the communication module 130 to conduct a hierarchical random check on the user groups at the main rate, and to check all the user groups at the important rate in a non-repetitive sampling manner. List of secondary inspections (step S270). The round-robin user sampling and setting module 116 then performs batch-by-batch round-by-block round-robin inspection (step S285) until all the user lists in the planned main rate/important rate have been spot-checked.

舉例而言,輪查用戶取樣及設定模組116依據各分群最佳抽樣數,其中對主要速率的用戶群組以分層抽查,對重要速率的用戶群組全查。輪查用戶取樣及設定模組116並採不重複抽樣,排定每一批次查測名單,接著進行逐批輪查,直至所有的用戶都抽完為止。以圖4為例,若用戶群組1採99%信心水準,如依最佳抽樣數1608,針對用戶群組1所有用戶(12323)予以排定,則可分為8批,即可涵蓋用戶群組1所有主要速率/重要速率用戶。For example, the polling user sampling and setting module 116 is based on the optimal number of samples for each sub-group, wherein the user groups of the main rate are sampled in layers, and the user groups of the important rate are fully checked. The polling user sampling and setting module 116 adopts non-repetitive sampling, arranges the check list for each batch, and then performs the batch-by-batch polling until all users are selected. Taking Figure 4 as an example, if the user group 1 adopts a 99% confidence level, if all users (12323) in user group 1 are scheduled according to the best sampling number 1608, they can be divided into 8 batches, which can cover users Group 1 all major rate/important rate users.

值得注意的是,電路異動管理模組117會對每批輪查中相鄰兩批輪查間之新增/離開(電路異動)及服務異動之用戶(可能已不符原先用戶群組之規劃或是新用戶),納入該批查測名單(即,直接作為抽查對象),以反應於每一輪查之即時性及代表性(步驟S290)。It is worth noting that the circuit change management module 117 will add/leave (circuit change) and service change users between two adjacent batches of polls in each batch of polls (may not conform to the original user group plan or Is a new user), included in the batch inspection list (ie, directly as a random inspection object), to reflect the immediacy and representativeness of each round of inspection (step S290).

由於每批次持續輪查,是採最佳分群及屬性重要性來度量進行,並以接取設備之總查測容許量取最大化的抽樣樣本進行查測,因此自動化輪查伺服器1所取得之用戶資訊可即時反應接近的所有用戶之最佳代表。同時,隨著持續輪查進行,藉由用戶母體之涵蓋範圍逐步擴充,可逐步提高接近收集所有用戶訊務資料之準確性及大數據預測/分析之所需基礎資訊之品質。Since each batch of continuous polling is measured by the best clustering and attribute importance, and the maximum sampling tolerance of the total inspection of the receiving equipment is used to maximize the sampling sample for testing, the automatic polling server 1 The obtained user information can instantly reflect the best representative of all users in proximity. At the same time, as the continuous polling progresses, by gradually expanding the coverage of the user matrix, it is possible to gradually improve the accuracy of collecting all user communication data and the quality of basic information required for big data prediction/analysis.

綜上所述,本發明實施例的寬頻服務的自動輪查方法及自動化輪查伺服器,其具有以下特點。In summary, the automatic polling method and the automatic polling server of the broadband service according to the embodiments of the present invention have the following characteristics.

本發明實施例可自動依據接取設備及收集設備效能,並利用簡單決策分析法進行抽樣查測,從而解決無法於短期內進行一次全體用戶訊務查測的問題。The embodiments of the present invention can automatically perform sampling inspection based on the performance of the access equipment and the collection equipment, and use a simple decision analysis method, thereby solving the problem that it is impossible to conduct an inspection of all users' communications in a short time.

本發明實施例可依據用戶屬性分群,並自動評估不同分群之個別最小抽樣數,決定每一批次抽查的樣本數,從而確保每批輪查均有最佳代表性。In the embodiment of the present invention, groups can be grouped according to user attributes, and the individual minimum sampling numbers of different groups can be automatically evaluated to determine the number of samples in each batch of random inspections, thereby ensuring that each batch of rounds has the best representativeness.

本發明實施例可依據用戶的速率重要性,決定每一類用戶群組的精確度及誤差水準,並研判是要提高或減少抽樣數。The embodiments of the present invention can determine the accuracy and error level of each type of user group according to the importance of the user's rate, and determine whether to increase or decrease the number of samples.

本發明實施例是事先規劃好的抽樣查測,若於每一批次查測期間發生用戶異動(新增或退租),則下一批次可自動進行增補調整。The embodiment of the present invention is a pre-planned sampling inspection. If a user change (addition or cancellation of rent) occurs during the inspection of each batch, the next batch may be automatically supplemented and adjusted.

雖然本發明已以實施例揭露如上,然其並非用以限定本發明,任何所屬技術領域中具有通常知識者,在不脫離本發明的精神和範圍內,當可作些許的更動與潤飾,故本發明的保護範圍當視後附的申請專利範圍所界定者為準。Although the present invention has been disclosed as above with examples, it is not intended to limit the present invention. Any person with ordinary knowledge in the technical field can make some changes and modifications without departing from the spirit and scope of the present invention. The scope of protection of the present invention shall be subject to the scope defined in the appended patent application.

1:自動化輪查伺服器 120:儲存器 111:屬性分析模組 112:屬性權重管理模組 113:用戶局情模組 114:分群抽樣數產生器 115:分群樣本決策模組 116:輪查用路取樣及設定模組 117:電路異動管理模組 118:網管模組 119:用戶清單 130:通訊模組 150:處理器 S210~S290:步驟1: Automated polling server 120: storage 111: attribute analysis module 112: attribute weight management module 113: user situation module 114: cluster sample number generator 115: cluster sample decision module 116: for polling Road sampling and setting module 117: circuit change management module 118: network management module 119: user list 130: communication module 150: processor S210~S290: steps

圖1是依據本發明一實施例的自動化輪查伺服器的元件方塊圖。 圖2是依據本發明一實施例的自動輪查方法的流程圖。 圖3是一範例說明初始的分群樣本數矩陣。 圖4是一範例說明最佳化的分群樣本數矩陣。FIG. 1 is a block diagram of components of an automated polling server according to an embodiment of the invention. 2 is a flowchart of an automatic polling method according to an embodiment of the invention. FIG. 3 is an example illustrating the initial clustering sample number matrix. FIG. 4 is an example illustrating the optimized clustering sample number matrix.

S210~S290:步驟 119:用戶清單S210~S290: Step 119: User list

Claims (8)

一種寬頻服務的自動輪查方法,包括:對多個用戶依據其用戶屬性分群,以產生多個用戶群組,其中每一該用戶的用戶屬性包括所在區域及承租速率;依據該些用戶群組對應承租速率,分別決定每一該用戶群組的抽樣數,其中包括:將該些用戶群組中對應承租速率的用戶數量最多的至少一者作為主要速率;將該些用戶群組中對應承租速率需要關注的至少一者作為重要速率;以及將該些用戶群組中不為該主要速率及該重要速率的至少一者作為次要速率;以及依據決定的該些用戶群組的抽樣數執行輪查作業。 An automatic polling method for broadband service includes: grouping multiple users according to their user attributes to generate multiple user groups, wherein each user's user attributes include location and lease rate; based on the user groups Corresponding to the rental rate, determine the sampling number of each user group separately, including: use at least one of the users with the largest number of users corresponding to the rental rate in these user groups as the main rate; At least one of the rates that needs attention is regarded as an important rate; and at least one of the user groups that is not the primary rate and the important rate is regarded as a secondary rate; and the execution is based on the determined number of samples of the user groups Polling homework. 如申請專利範圍第1項所述的自動輪查方法,將該些用戶群組中不為該主要速率及該重要速率的至少一者作為次要速率的步驟之後,更包括:依據多個抽樣信心水準來設定該主要速率的該些用戶群組的抽樣數;將該重要速率的該些用戶群組的抽樣數設定為百分百;以及將該次要速率的該些用戶群組的抽樣數設定為零。 The automatic polling method as described in item 1 of the patent application scope, after the step of using at least one of the primary rate and the important rate in these user groups as the secondary rate, further includes: Confidence level to set the sampling rate of the user groups of the primary rate; the sampling rate of the user groups of the important rate to 100%; and the sampling of the user groups of the secondary rate The number is set to zero. 如申請專利範圍第1項所述的自動輪查方法,其中將該次要速率的該些用戶群組的抽樣數設定為零的步驟之後,更包括: 判斷該些用戶群組的抽樣數之加總是否符合一總查測容量;若未超過該總查測容量,則依序提升該主要速率的該些用戶群組對應的抽樣信心水準;以及若超過該總查測容量,則依序下降該主要速率的該些用戶群組對應的抽樣信心水準。 The automatic polling method as described in item 1 of the patent application scope, wherein after the step of setting the sampling rate of the user groups of the secondary rate to zero, it further includes: Determine whether the sum of the sampling numbers of the user groups conforms to a total search capacity; if the total search capacity is not exceeded, increase the sampling confidence level corresponding to the user groups of the main rate in sequence; and if If the total search capacity is exceeded, the sampling confidence level corresponding to the user groups of the main rate is sequentially decreased. 如申請專利範圍第1項所述的自動輪查方法,其中依據決定的該些用戶群組的抽樣數執行輪查作業的步驟包括:將兩次輪查作業之間電路異動及服務異動的用戶作為抽查對象。 The automatic polling method as described in item 1 of the scope of the patent application, wherein the step of performing the polling operation based on the determined number of samples of these user groups includes: changing the circuit between the two polling operations and the users with service changes As a random inspection object. 一種自動化輪查伺服器,包括:一通訊模組,傳送及接收資料;一儲存器,記錄多個模組;以及一處理器,耦接該通訊模組及該儲存器,並存取且載入該儲存器所記錄的該些模組,而該些模組包括:一屬性分析模組,對多個用戶依據其用戶屬性分群,以產生多個用戶群組,其中每一該用戶的用戶屬性包括所在區域及承租速率;一分群樣本決策模組,依據該些用戶群組對應承租速率,分別決定每一該用戶群組的抽樣數;一輪查用戶取樣及設定模組,依據決定的該些用戶群組的抽樣數而透過該通訊模組執行輪查作業;以及一屬性權重管理模組,將該些用戶群組中對應承租速率 的用戶數量最多的至少一者作為主要速率,將該些用戶群組中對應承租速率需要關注的至少一者作為重要速率,並將該些用戶群組中不為該主要速率及該重要速率的至少一者作為次要速率。 An automatic polling server includes: a communication module to transmit and receive data; a storage to record multiple modules; and a processor to couple the communication module and the storage and access and load The modules recorded in the storage, and the modules include: an attribute analysis module, grouping multiple users according to their user attributes to generate multiple user groups, each of which is a user of the user Attributes include location and lease rate; a sub-group sample decision module, which determines the number of samples for each user group according to the corresponding lease rates of these user groups; a round of user sampling and configuration modules, based on the determined The number of samples of these user groups is used to perform the polling operation through the communication module; and an attribute weight management module to correspond the lease rate in these user groups At least one of the largest number of users is regarded as the main rate, and at least one of the user groups corresponding to the lease rate needs to be regarded as the important rate, and the user group is not the main rate and the important rate. At least one is used as the secondary rate. 如申請專利範圍第5項所述的自動化輪查伺服器,其中該屬性權重管理模組依據多個抽樣信心水準來設定該主要速率的該些用戶群組的抽樣數,將該重要速率的該些用戶群組的抽樣數設定為百分百,並將該次要速率的該些用戶群組的抽樣數設定為零。 The automatic polling server as described in item 5 of the patent application scope, wherein the attribute weight management module sets the sampling rate of the user groups of the main rate based on multiple sampling confidence levels, The number of samples of these user groups is set to 100%, and the number of samples of these user groups of the secondary rate is set to zero. 如申請專利範圍第5項所述的自動化輪查伺服器,其中該分群樣本決策模組判斷該些用戶群組的抽樣數之加總是否符合一總查測容量;而若未超過該總查測容量,則依序提升該主要速率的該些用戶群組對應的抽樣信心水準;以及若超過該總查測容量,則依序下降該主要速率的該些用戶群組對應的抽樣信心水準。 The automatic polling server as described in item 5 of the patent application scope, wherein the grouped sample decision module determines whether the sum of the sampling numbers of the user groups conforms to a total search capacity; and if the total search is not exceeded The measurement capacity increases the sampling confidence level corresponding to the user groups of the main rate in sequence; and if the total search capacity is exceeded, it sequentially decreases the sampling confidence level corresponding to the user groups of the main rate. 如申請專利範圍第5項所述的自動化輪查伺服器,其中該輪查用戶取樣及設定模組將兩次輪查作業之間電路異動及服務異動的用戶作為抽查對象。 The automatic polling server as described in item 5 of the patent application scope, wherein the polling user sampling and setting module selects users for circuit changes and service changes between the two polling operations as spot checks.
TW107133259A 2018-09-20 2018-09-20 Automatic round-collecting for broadband customer and automatic round-collecting server TWI690894B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
TW107133259A TWI690894B (en) 2018-09-20 2018-09-20 Automatic round-collecting for broadband customer and automatic round-collecting server

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
TW107133259A TWI690894B (en) 2018-09-20 2018-09-20 Automatic round-collecting for broadband customer and automatic round-collecting server

Publications (2)

Publication Number Publication Date
TW202013306A TW202013306A (en) 2020-04-01
TWI690894B true TWI690894B (en) 2020-04-11

Family

ID=71130638

Family Applications (1)

Application Number Title Priority Date Filing Date
TW107133259A TWI690894B (en) 2018-09-20 2018-09-20 Automatic round-collecting for broadband customer and automatic round-collecting server

Country Status (1)

Country Link
TW (1) TWI690894B (en)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101990003A (en) * 2010-10-22 2011-03-23 西安交通大学 User action monitoring system and method based on IP address attribute
US20140334304A1 (en) * 2013-05-13 2014-11-13 Hui Zang Content classification of internet traffic
TW201614981A (en) * 2014-10-07 2016-04-16 Chunghwa Telecom Co Ltd Broadband internet service quality monitoring system
TW201618027A (en) * 2014-11-03 2016-05-16 Chunghwa Telecom Co Ltd Network traffic prediction method and its computer program product

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101990003A (en) * 2010-10-22 2011-03-23 西安交通大学 User action monitoring system and method based on IP address attribute
US20140334304A1 (en) * 2013-05-13 2014-11-13 Hui Zang Content classification of internet traffic
TW201614981A (en) * 2014-10-07 2016-04-16 Chunghwa Telecom Co Ltd Broadband internet service quality monitoring system
TW201618027A (en) * 2014-11-03 2016-05-16 Chunghwa Telecom Co Ltd Network traffic prediction method and its computer program product

Also Published As

Publication number Publication date
TW202013306A (en) 2020-04-01

Similar Documents

Publication Publication Date Title
CN107171848B (en) Flow prediction method and device
KR102125219B1 (en) Data processing methods, devices, systems, programs and computer readable recording media
US7984147B2 (en) Apparatus and method for identifying a requested level of service for a transaction
CN102694868B (en) A kind of group system realizes and task dynamic allocation method
JP2019521427A (en) Network Advisor Based on Artificial Intelligence
EP2563062A1 (en) Long connection management apparatus and link resource management method for long connection communication
CN113467944B (en) Resource deployment device and method for complex software system
WO2021027783A1 (en) Method, apparatus, device, and system for allocating radio frequency resources, and storage medium
CN112020098A (en) Load balancing method and device, computing equipment and computer storage medium
CN111541628A (en) Power communication network service resource allocation method and related device
CN101199162B (en) Method, system and device for controlling communication network
CN110555452A (en) network problem processing method and device based on intelligent clustering
CN110162379A (en) Virtual machine migration method, device and computer equipment
Nunes et al. State of the art on microservices autoscaling: An overview
US20140379100A1 (en) Method for requesting control and information processing apparatus for same
CN107172193A (en) A kind of load-balancing method and its device based on cluster
TWI690894B (en) Automatic round-collecting for broadband customer and automatic round-collecting server
CN110502346A (en) Resource information management system and method under a kind of cluster environment
CN112488491B (en) Petri network-based train control vehicle-mounted equipment access process reliability evaluation method
CN110309223A (en) Select method, system and the relevant device of back end
CN107517474A (en) A kind of network analysis optimization method and device
CN116910602B (en) Line loss analysis method and system based on relevance analysis
JP7413443B2 (en) Radio frequency resource allocation methods, apparatus, devices and systems, and storage media
WO2021052387A1 (en) Method, apparatus, device, and system for detecting source of interference, and storage medium
CN114793200B (en) Important internet of things node identification method based on electric power internet of things network structure