TWI779311B - Occupational category determination system and occupational category determination method - Google Patents

Occupational category determination system and occupational category determination method Download PDF

Info

Publication number
TWI779311B
TWI779311B TW109123015A TW109123015A TWI779311B TW I779311 B TWI779311 B TW I779311B TW 109123015 A TW109123015 A TW 109123015A TW 109123015 A TW109123015 A TW 109123015A TW I779311 B TWI779311 B TW I779311B
Authority
TW
Taiwan
Prior art keywords
identification information
category
occupational
database
occupation
Prior art date
Application number
TW109123015A
Other languages
Chinese (zh)
Other versions
TW202203131A (en
Inventor
張志宏
蔣琴韻
賴意潔
柯士文
李昕儒
陳芃諭
黃軍儒
Original Assignee
合作金庫人壽保險股份有限公司
國立中央大學
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 合作金庫人壽保險股份有限公司, 國立中央大學 filed Critical 合作金庫人壽保險股份有限公司
Priority to TW109123015A priority Critical patent/TWI779311B/en
Publication of TW202203131A publication Critical patent/TW202203131A/en
Application granted granted Critical
Publication of TWI779311B publication Critical patent/TWI779311B/en

Links

Images

Landscapes

  • Image Processing (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

A system for occupational category determination and a method for occupational category determination are provided. The system for occupational category determination includes an electronic device and a server. The server is coupled to the electronic device, and includes a database and a processor. The server stores a plurality of category tags and customer information. The electronic device uploads at least one occupational identification information to the server, and the processor in the server determines whether a corresponding category tag exists in the database in the server. When the category tag corresponding to the at least one occupational identification information exists in the database, the server replies an occupational category determination result to the electronic device. When the category tag corresponding to the at least one occupational identification information does not exist in the database, the electronic device connects to an external server by the internet to perform searching, to obtain the category tag corresponding to the at least one occupational identification information.

Description

職業類別判斷系統和職業類別判斷方法Occupational Category Judgment System and Occupational Category Judgment Method

本發明是有關於一種職業類別判斷系統和職業類別判斷方法。The invention relates to a system for judging occupational categories and a method for judging occupational categories.

目前,保險商品的販售方,在判斷客戶的職業類別時,仍然依賴人工作業。然而,人工作業存在著許多缺點,例如,在判斷對應職業識別資訊的類別標籤是否存在資料庫中時,必須依賴判斷者對職業識別資訊和類別標籤的個人經驗,另外,當客戶數量龐大,以人工作業判斷對應職業識別資訊的類別標籤是否存在資料庫將會消耗過多時間。At present, sellers of insurance products still rely on manual work when judging the occupational category of customers. However, there are many shortcomings in manual operation. For example, when judging whether the category label corresponding to the occupation identification information exists in the database, it must rely on the personal experience of the judge on the occupation identification information and category labels. In addition, when the number of customers is large, and the It will consume too much time to manually determine whether the category tag corresponding to the occupation identification information exists in the database.

本發明提供一種職業類別判斷系統和職業類別判斷方法,在判斷職業類別時,可以利用類別標籤的關鍵字以匹配職業識別資訊,提高了職業類別判斷的準確性、一致性和效率。The present invention provides an occupational category judgment system and an occupational category determination method. When judging an occupational category, the keywords of the category label can be used to match the occupational identification information, which improves the accuracy, consistency and efficiency of occupational category determination.

本發明的職業類別判斷系統包括電子裝置以及伺服器。伺服器耦接至電子裝置,包括資料庫及處理器,資料庫儲存多個類別標籤及客戶資訊。電子裝置將至少一職業識別資訊上傳至伺服器,並由伺服器中的處理器根據至少一職業識別資訊判斷在伺服器的資料庫中是否存在對應的類別標籤,當對應至少一職業識別資訊的類別標籤存在於資料庫中時,則伺服器回傳一職業類別判斷結果至電子裝置。當對應至少一職業識別資訊的類別標籤不存在於資料庫中時,則電子裝置經由網際網路連接至一外部伺服器進行搜尋,以取得對應於至少一職業識別資訊的類別標籤。The occupation category judging system of the present invention includes an electronic device and a server. The server is coupled to the electronic device and includes a database and a processor. The database stores a plurality of category labels and customer information. The electronic device uploads at least one piece of occupational identification information to the server, and the processor in the server judges whether there is a corresponding category tag in the database of the server according to the at least one occupational identification information. When the class label exists in the database, the server returns a judgment result of the occupation class to the electronic device. When the category tag corresponding to the at least one occupation identification information does not exist in the database, the electronic device is connected to an external server through the Internet to search to obtain the category tag corresponding to the at least one occupation identification information.

在本發明的一實施例中,上述的職業類別判斷系統的類別標籤分別具有多個關鍵字,用以匹配至少一職業識別資訊。In an embodiment of the present invention, the category tags of the above-mentioned occupation category determination system respectively have a plurality of keywords for matching at least one occupation identification information.

在本發明的一實施例中,上述的職業類別判斷系統的至少一職業識別資訊包括公司名稱、公司所在地、職稱及年齡的至少其中之一。In an embodiment of the present invention, the at least one occupation identification information in the above occupation category judgment system includes at least one of company name, company location, job title and age.

在本發明的一實施例中,上述的職業類別判斷系統的處理器根據至少一職業識別資訊中的公司名稱與客戶資訊進行比對,以判斷在該資料庫中是否存在對應的類別標籤。In an embodiment of the present invention, the processor of the above-mentioned occupational category determination system compares the company name in at least one occupational identification information with customer information to determine whether there is a corresponding category label in the database.

在本發明的一實施例中,上述的職業類別判斷系統的處理器更根據至少一職業識別資訊中的公司所在地與客戶資訊進行比對,以判斷在資料庫中是否存在對應的類別標籤。In an embodiment of the present invention, the processor of the above-mentioned occupation category determination system further compares the company location in at least one occupation identification information with the customer information to determine whether there is a corresponding category label in the database.

在本發明的一實施例中,上述的職業類別判斷系統的處理器根據至少一職業識別資訊中的年齡是否大於預設年齡門檻值,以判斷在資料庫中是否存在對應的類別標籤。In an embodiment of the present invention, the processor of the above-mentioned occupation category determination system determines whether there is a corresponding category label in the database according to whether the age in at least one occupation identification information is greater than a preset age threshold.

在本發明的一實施例中,上述的職業類別判斷系統的處理器根據至少一職業識別資訊中的職稱與該些關鍵字進行比對,以判斷在資料庫中是否存在對應的類別標籤。In an embodiment of the present invention, the processor of the above-mentioned occupational category determination system compares the professional title in at least one occupation identification information with the keywords to determine whether there is a corresponding category label in the database.

在本發明的一實施例中,上述的職業類別判斷系統的處理器對至少一職業識別資訊中的公司名稱執行文字語義處理,以判斷在資料庫中是否存在對應的類別標籤。In an embodiment of the present invention, the processor of the above-mentioned occupation category determination system performs text semantic processing on the company name in at least one occupation identification information to determine whether there is a corresponding category label in the database.

在本發明的一實施例中,上述的職業類別判斷系統的文字語義處理包括根據斷詞技術、詞向量以及機器學習相似詞比對。In an embodiment of the present invention, the text semantic processing of the occupational category judgment system includes comparison of similar words based on word segmentation technology, word vectors and machine learning.

在本發明的一實施例中,上述的職業類別判斷系統的處理器對至少一職業識別資訊中的公司名稱執行錯別字校正。In an embodiment of the present invention, the processor of the above-mentioned occupation category determination system performs typo correction on the company name in at least one occupation identification information.

在本發明的一實施例中,上述的職業類別判斷系統的外部伺服器包括一外部資料庫,該外部資料庫是一上市櫃公司類別資訊和財政部營業稅籍登記資訊的其中之一。In an embodiment of the present invention, the external server of the above-mentioned occupational category judgment system includes an external database, which is one of the category information of listed companies and the business tax registration information of the Ministry of Finance.

本發明的職業類別判斷方法,包括:將至少一職業識別資訊上傳至伺服器,根據至少一職業識資訊判斷在伺服器的資料庫中是否存在對應的類別資訊,當對應至少一職業識別資訊的類別標籤存在於資料庫中時,則回傳類別資訊至電子裝置,當對應至少一職業識別資訊的類別標籤不存在於資料庫中時,則經由網際網路連接至一外部伺服器進行搜尋,以取得對應於至少一職業識別資訊的類別標籤。The occupation category determination method of the present invention includes: uploading at least one occupation identification information to the server, judging whether there is corresponding category information in the database of the server according to the at least one occupation identification information, and when the at least one occupation identification information corresponds When the category tag exists in the database, return the category information to the electronic device; when the category tag corresponding to at least one occupation identification information does not exist in the database, connect to an external server via the Internet for searching, to obtain a category label corresponding to at least one occupation identification information.

基於上述,本發明的職業類別判斷方法和職業類別判斷系統,在判斷職業類別時,可以利用類別標籤的關鍵字以匹配職業識別資訊,也可利用職業識別資訊的公司所在地與客戶資訊進行比對,以判斷在資料庫中是否存在對應的類別標籤。藉由本發明,職業類別的判斷結果的準確性和職業類別的判斷效率可以被提高。Based on the above, the occupation category judgment method and occupation category judgment system of the present invention can use the keywords of the category label to match the occupation identification information when determining the occupation category, and can also use the company location of the occupation identification information to compare with the customer information , to determine whether the corresponding category label exists in the database. With the present invention, the accuracy of the judgment result of the occupation category and the judgment efficiency of the occupation category can be improved.

為讓本發明的上述特徵和優點能更明顯易懂,下文特舉實施例,並配合所附圖式作詳細說明如下。In order to make the above-mentioned features and advantages of the present invention more comprehensible, the following specific embodiments are described in detail together with the accompanying drawings.

圖1是根據本發明的一實施例繪示的一種職業類別判斷系統100的示意圖,請參照圖1。職業類別判斷系統100包括電子裝置110、伺服器120以及外部伺服器130。電子裝置110可以是供職業類別判斷系統100的職業類別判斷者輸入資訊的裝置,以銀行業者或人壽業者為例,可以是行員的桌上型電腦、筆記型電腦或平板電腦等,但不以此為限制。FIG. 1 is a schematic diagram of an occupation category determination system 100 according to an embodiment of the present invention, please refer to FIG. 1 . The occupation category determination system 100 includes an electronic device 110 , a server 120 and an external server 130 . The electronic device 110 can be a device for the occupational category judges of the occupational category determination system 100 to input information. For example, a banker or a life insurance company can be a desktop computer, a notebook computer or a tablet computer of an operator, but not in the form of This is a limitation.

伺服器120耦接至電子裝置110,用以從電子裝置110接收至少一職業識別資訊。伺服器120包括處理器121以及資料庫122,其中處理器121根據至少一職業識別資訊判斷在伺服器120的資料庫122中是否存在對應的類別標籤。The server 120 is coupled to the electronic device 110 for receiving at least one job identification information from the electronic device 110 . The server 120 includes a processor 121 and a database 122 , wherein the processor 121 judges whether there is a corresponding category tag in the database 122 of the server 120 according to at least one occupation identification information.

處理器121例如是中央處理單元(central processing unit,CPU),或是其他可程式化之一般用途或特殊用途的微控制單元(micro control unit,MCU)、微處理器(microprocessor)、數位信號處理器(digital signal processor,DSP)、可程式化控制器、特殊應用積體電路(application specific integrated circuit,ASIC)、圖形處理器(graphics processing unit,GPU)、算數邏輯單元(arithmetic logic unit,ALU)、複雜可程式邏輯裝置(complex programmable logic device,CPLD)、現場可程式化邏輯閘陣列(field programmable gate array,FPGA)或其他類似元件或上述元件的組合。The processor 121 is, for example, a central processing unit (central processing unit, CPU), or other programmable general purpose or special purpose micro control unit (micro control unit, MCU), microprocessor (microprocessor), digital signal processing Digital signal processor (DSP), programmable controller, application specific integrated circuit (ASIC), graphics processing unit (graphics processing unit, GPU), arithmetic logic unit (arithmetic logic unit, ALU) , complex programmable logic device (complex programmable logic device, CPLD), field programmable logic gate array (field programmable gate array, FPGA) or other similar components or a combination of the above components.

資料庫122可儲存於伺服器120的儲存裝置(未繪示)中,其中所述儲存裝置可以是任意型式的固定式或可移動式隨機存取記憶體(RAM)、唯讀記憶體(ROM)、快閃記憶體(flash memory)、硬碟或其他類似裝置、積體電路及其組合。需注意的是,資料庫122用以儲存多個類別標籤及客戶資訊,其中類別標籤可以包括,例如「餐飲食品業G1」、「水泥工業D11」等等,而客戶資訊可以包括公司名稱、職稱及公司所在地等等,以公司名稱為例,例如「頂好烤肉」、「富蘭克林語言中心」、「台積電」等,但不以此為限。The database 122 can be stored in a storage device (not shown) of the server 120, wherein the storage device can be any type of fixed or removable random access memory (RAM), read only memory (ROM) ), flash memory (flash memory), hard disk or other similar devices, integrated circuits and combinations thereof. It should be noted that the database 122 is used to store a plurality of category tags and customer information, wherein the category tags may include, for example, "catering and food industry G1", "cement industry D11", etc., and the customer information may include company name, job title and the location of the company, etc., take the company name as an example, such as "Dinghao Barbecue", "Franklin Language Center", "TSMC", etc., but not limited to this.

值得注意的是,資料庫122所儲存的多個類別標籤可以分別具有多個關鍵字,用以匹配職業識別資訊。具體而言,類別標籤「軍職B1」可以具有多個關鍵字「國軍」、「陸軍」、「海軍」,當電子裝置110將職業識別資訊「陸軍」上傳至伺服器120,類別標籤的多個關鍵字可用以匹配職業識別資訊,也就是說,處理器121可以根據類別標籤的多個關鍵字是否與職業識別資訊匹配,來判斷資料庫122是否存在和職業識別資訊對應的類別標籤。在此例子中,由於類別標籤「軍職B1」的其中一個關鍵字「陸軍」和上傳至伺服器120的職業識別資訊「陸軍」匹配,處理器121可以藉此判斷資料庫122存在和職業識別資訊「陸軍」對應的類別標籤「軍職B1」。由於處理器121判斷對應職業識別資訊「陸軍」的類別標籤「軍職B1」存在於資料庫中122,伺服器120可以回傳職業類別判斷結果至電子裝置110,此例子中的職業類別判斷結果可以為「軍職B1」,也就是說,當對應職業識別資訊的類別標籤存在於資料庫122中時,伺服器120可以將此類別標籤作為職業類別判斷結果,並回傳至電子裝置110。It should be noted that the multiple category tags stored in the database 122 may respectively have multiple keywords for matching the occupation identification information. Specifically, the category label "military occupation B1" may have multiple keywords "National Army", "Army", and "Navy". When the electronic device 110 uploads the occupation identification information "Army" to the server 120, the category label A plurality of keywords can be used to match the occupation identification information, that is, the processor 121 can determine whether there is a category tag corresponding to the occupation identification information in the database 122 according to whether the keywords of the category tag match the occupation identification information. In this example, since one of the keywords "Army" in the category label "Military B1" matches the occupation identification information "Army" uploaded to the server 120, the processor 121 can use this to determine the existence of the database 122 and the occupation identification The category label "Military B1" corresponding to the information "Army". Since the processor 121 judges that the class label "Military Class B1" corresponding to the occupation identification information "Army" exists in the database 122, the server 120 can return the occupation class judgment result to the electronic device 110. In this example, the occupation class judgment result It can be “military occupation B1”, that is, when the category tag corresponding to the occupation identification information exists in the database 122, the server 120 can use the category tag as the occupation category judgment result and send it back to the electronic device 110.

在一實施例中,職業識別資訊包括公司名稱、公司所在地、職稱及年齡的至少其中之一。具體而言,職業識別資訊可以是,例如「台積電」、「台積電:新竹」、「高雄」、「經理」、「台積電:經理」、「30歲」、「台積電:50歲」等資訊,但不以此為限。In one embodiment, the occupation identification information includes at least one of company name, company location, job title and age. Specifically, the occupation identification information may be information such as "TSMC", "TSMC: Hsinchu", "Kaohsiung", "Manager", "TSMC: Manager", "30 years old", "TSMC: 50 years old", etc., but This is not the limit.

在一實施例中,處理器121根據職業識別資訊中的職稱與關鍵字進行比對,以判斷在資料庫122中是否存在對應的類別標籤。具體而言,類別標籤「教職員B3」可以具有多個關鍵字,例如「教授」、「系主任」、「高中老師」、「國小老師」等,而職業識別資訊可以是「中央大學:教授」,由於職業識別資訊「中央大學:教授」中的職稱「教授」與類別標籤「教職員B3」的多個關鍵字「教授」、「系主任」、「高中老師」、「國小老師」的其中一個關鍵字「教授」比對結果為相符,處理器121可以藉此判斷資料庫122存在對應職業識別資訊「中央大學:教授」的類別標籤「教職員B3」。In one embodiment, the processor 121 compares the job title and the keyword in the job identification information to determine whether there is a corresponding category tag in the database 122 . Specifically, the category label "teacher B3" can have multiple keywords, such as "professor", "department head", "high school teacher", "elementary school teacher", etc., and the occupation identification information can be "Central University: Professor ", due to the job title "Professor" in the occupation identification information "Central University: Professor" and the multiple keywords "Professor", "Department Head", "High School Teacher", and "Elementary School Teacher" in the category label "Faculty B3" One of the keywords "professor" is a matching result, and the processor 121 can judge that the database 122 has the category label "staff B3" corresponding to the occupation identification information "Central University: Professor".

在一實施例中,處理器121根據至少一職業識別資訊中的公司名稱與客戶資訊進行比對,以判斷在資料庫122中是否存在對應的類別標籤。具體而言,資料庫122可以儲存類別標籤,例如「餐飲食品業G1」、「水泥工業D11」以及客戶資訊「頂好烤肉」、「富蘭克林語言中心」、「台積電」等。當電子裝置110將職業識別資訊「台積電:新竹」上傳至伺服器120,處理器121可以根據職業識別資訊「台積電:新竹」中的公司名稱「台積電」與客戶資訊「頂好烤肉」、「富蘭克林語言中心」、「台積電」進行比對。由於職業識別資訊「台積電:新竹」中的公司名稱「台積電」與客戶資訊的其中之一「台積電」匹配,處理器121可以藉此判斷資料庫122存在和職業識別資訊對應的類別標籤。由於處理器121判斷對應職業識別資訊「台積電:新竹」的類別標籤存在於資料庫122中,伺服器120可以回傳職業類別判斷結果至電子裝置110。In one embodiment, the processor 121 compares the company name in at least one occupation identification information with the customer information to determine whether there is a corresponding category tag in the database 122 . Specifically, the database 122 can store category labels, such as "catering and food industry G1", "cement industry D11" and customer information "Top Good Barbecue", "Franklin Language Center", "TSMC" and so on. When the electronic device 110 uploads the occupational identification information "TSMC: Hsinchu" to the server 120, the processor 121 can use the company name "TSMC" in the occupational identification information "TSMC: Hsinchu" and the customer information "Dinghao BBQ", "Franklin Language Center" and "TSMC" for comparison. Since the company name “TSMC” in the occupational identification information “TSMC: Hsinchu” matches one of the customer information “TSMC”, the processor 121 can determine that there is a category tag corresponding to the occupational identification information in the database 122 . Since the processor 121 determines that the category tag corresponding to the occupation identification information “TSMC: Hsinchu” exists in the database 122 , the server 120 may return the occupation category determination result to the electronic device 110 .

在一實施例中,處理器121更根據至少一職業識別資訊中的公司所在地與客戶資訊進行比對,以判斷在資料庫122中是否存在對應的類別標籤。圖2是根據本發明一實施例繪示的一種判斷在資料庫122中是否存在對應的類別標籤的示意圖。請參照圖2,資料庫122可以儲存客戶資訊211「廣隆昌機械:新竹」、客戶資訊212「廣隆昌國際貿易:高雄」、客戶資訊213「廣隆昌工程股份有限公司:台北」。當電子裝置110將包括公司所在地的職業識別資訊220「廣隆昌:台北」上傳至伺服器120,處理器121除了根據職業識別資訊220「廣隆昌:台北」中的公司名稱「廣隆昌」與客戶資訊進行比對之外,處理器121更根據職業識別資訊220「廣隆昌:台北」中的公司所在地「台北」與客戶資訊211~213進行比對。在此例子中,職業識別資訊220「廣隆昌:台北」中的公司名稱「廣隆昌」與客戶資訊211「廣隆昌機械:新竹」、客戶資訊212「廣隆昌國際貿易:高雄」、客戶資訊213「廣隆昌工程股份有限公司:台北」均非完全匹配,然而,職業識別資訊220「廣隆昌:台北」中的公司所在地「台北」與客戶資訊中的其中一個客戶資訊213「廣隆昌工程股份有限公司:台北」為匹配,處理器121可以藉此判斷資料庫122存在和職業識別資訊220「廣隆昌:台北」對應的類別標籤。由於處理器121判斷對應職業識別資訊220「廣隆昌:台北」的類別標籤存在於資料庫122中,伺服器120可以回傳職業類別判斷結果至電子裝置110。In one embodiment, the processor 121 further compares the company location in the at least one job identification information with the customer information to determine whether there is a corresponding category tag in the database 122 . FIG. 2 is a schematic diagram of judging whether there is a corresponding category tag in the database 122 according to an embodiment of the present invention. Please refer to FIG. 2 , the database 122 can store customer information 211 "Longchang Machinery: Hsinchu", customer information 212 "Longchang International Trading: Kaohsiung", and customer information 213 "Longchang Engineering Co., Ltd.: Taipei". When the electronic device 110 uploads the occupational identification information 220 "Guanglongchang: Taipei" including the location of the company to the server 120, the processor 121, in addition to the company name "Guanglongchang" in the occupational identification information 220 "Guanglongchang: Taipei" and the customer In addition to information comparison, the processor 121 further compares the company location "Taipei" in the occupation identification information 220 "Guanglongchang: Taipei" with the customer information 211-213. In this example, the company name "Guanglongchang" in the occupation identification information 220 "Guanglongchang: Taipei" and the customer information 211 "Guanglongchang Machinery: Hsinchu", customer information 212 "Guanglongchang International Trade: Kaohsiung", customer information 213 "Guanglongchang Engineering Co., Ltd.: Taipei" is not an exact match. However, the company location "Taipei" in the occupation identification information 220 "Guanglongchang: Taipei" and one of the customer information 213 "Guanglongchang Engineering Co., Ltd. Company: Taipei" is a match, and the processor 121 can judge that there is a category label corresponding to the occupation identification information 220 "Guanglongchang: Taipei" in the database 122 . Since the processor 121 determines that the category tag corresponding to the occupation identification information 220 “Guang Longchang: Taipei” exists in the database 122 , the server 120 can return the occupation category determination result to the electronic device 110 .

在一實施例中,處理器121根據職業識別資訊和客戶資訊的文字重疊率,以判斷在該資料庫中是否存在對應的類別標籤。圖3是根據本發明一實施例繪示的一種判斷在資料庫122中是否存在對應的類別標籤的另一示意圖。請參照圖3,在圖3與圖2資料庫122中,其差別為,圖2中的客戶資訊213為「廣隆昌工程股份有限公司:台北」,而圖3中的客戶資訊313為「廣隆昌工程股份有限公司:基隆」。當電子裝置110將包括公司所在地的職業識別資訊320「廣隆昌:台北」上傳至伺服器120,由於職業識別資訊320「廣隆昌:台北」的公司名稱「廣隆昌」和公司所在地「台北」與客戶資訊311~313均非完全匹配,處理器121可以依據職業識別資訊和客戶資訊的文字重疊率判斷在資料庫122中是否存在對應的類別標籤。具體而言,文字重疊率可以是「職業識別資訊和客戶資訊的相同文字數」與「客戶資訊文字數」的比例。在此例子中,職業識別資訊320「廣隆昌:台北」與客戶資訊311的「廣隆昌機械:新竹」的相同文字數為3,並且客戶資訊311「廣隆昌機械:新竹」的文字數為7,因此職業識別資訊320「廣隆昌:台北」與客戶資訊311的「廣隆昌機械:新竹」的文字重疊率是3/7。相似的,職業識別資訊320「廣隆昌:台北」與客戶資訊312「廣隆昌國際貿易:高雄」的文字重疊率是3/9,職業識別資訊320「廣隆昌:台北」與客戶資訊313「廣隆昌工程股份有限公司:基隆」的文字重疊率是3/13。處理器121還可以根據文字重疊率門檻值來判斷在資料庫122中是否存在對應的類別標籤。舉例來說,文字重疊率門檻值可以設置為1/3,由於職業識別資訊320「廣隆昌:台北」與客戶資訊311的「廣隆昌機械:新竹」的文字重疊率3/7大於文字重疊率門檻值,處理器121可以判斷在資料庫122中存在對應的類別標籤。In one embodiment, the processor 121 determines whether there is a corresponding category tag in the database according to the text overlap rate of the occupation identification information and the customer information. FIG. 3 is another schematic diagram of determining whether a corresponding category tag exists in the database 122 according to an embodiment of the present invention. Please refer to Fig. 3, in Fig. 3 and Fig. 2 database 122, its difference is, the customer information 213 in Fig. Longchang Engineering Co., Ltd.: Keelung". When the electronic device 110 uploads the occupational identification information 320 "Guanglongchang: Taipei" including the location of the company to the server 120, the company name "Guanglongchang" and the company location "Taipei" in the occupational identification information 320 "Guanglongchang: Taipei" are the same as None of the customer information 311 to 313 is a complete match, and the processor 121 can determine whether there is a corresponding category label in the database 122 according to the text overlap rate of the occupation identification information and the customer information. Specifically, the character overlap rate may be the ratio of "the number of identical characters in the occupation identification information and the customer information" to the "number of characters in the customer information". In this example, the occupation identification information 320 "Guanglongchang: Taipei" and the customer information 311 "Guanglongchang Machinery: Hsinchu" have the same number of characters as 3, and the number of characters in the customer information 311 "Guanglongchang Machinery: Hsinchu" is 7 , so the text overlap rate of occupation identification information 320 "Guanglongchang: Taipei" and customer information 311 "Guanglongchang Machinery: Hsinchu" is 3/7. Similarly, the occupational identification information 320 "Guanglongchang: Taipei" and the customer information 312 "Guanglongchang International Trade: Kaohsiung" have a text overlap rate of 3/9, and the occupational identification information 320 "Guanglongchang: Taipei" and the customer information 313 "Guanglongchang Long Cheong Engineering Co., Ltd.: Keelung" has a text overlap rate of 3/13. The processor 121 can also determine whether there is a corresponding category label in the database 122 according to the text overlap ratio threshold. For example, the threshold value of the text overlap rate can be set to 1/3, because the text overlap rate of 3/7 of the occupation identification information 320 "Guang Long Chang: Taipei" and the customer information 311 "Guang Long Chang Machinery: Hsinchu" is greater than the text overlap rate threshold value, the processor 121 can determine that there is a corresponding category label in the database 122 .

在一實施例中,處理器121根據職業識別資訊中的年齡是否大於預設年齡門檻值,以判斷在資料庫122中是否存在對應的類別標籤。具體而言,資料庫122可以儲存客戶資訊「中央大學」,並且預設年齡門檻值可以是30歲。當電子裝置110將職業識別資訊「中央大學:50歲」上傳至伺服器120,由於職業識別資訊中的年齡「50歲」大於預設年齡門檻值,處理器121可以藉此判斷資料庫122存在和職業識別資訊「中央大學:50歲」對應的類別標籤。In one embodiment, the processor 121 determines whether there is a corresponding category tag in the database 122 according to whether the age in the occupation identification information is greater than a preset age threshold. Specifically, the database 122 can store customer information "Central University", and the default age threshold can be 30 years old. When the electronic device 110 uploads the occupation identification information "Central University: 50 years old" to the server 120, since the age "50 years old" in the occupation identification information is greater than the preset age threshold, the processor 121 can use this to determine the existence of the database 122 A category tag corresponding to occupational identification information "Central University: 50 years old".

在一實施例中,處理器121對職業識別資訊中的公司名稱執行文字語義處理,以判斷在資料庫122中是否存在對應的類別標籤。假設資料庫122儲存客戶資訊「頂好烤肉」、「富蘭克林語言中心」,當電子裝置110將包括公司名稱的職業識別資訊「新港生炒鴨肉」上傳至伺服器120,處理器121可以對職業識別資訊的公司名稱「新港生炒鴨肉」執行文字語義處理。處理器121可以利用斷詞技術,由職業識別資訊的公司名稱「新港生炒鴨肉」獲得斷詞結果為「鴨肉」。接著,處理器1210可以利用詞向量(word2vec),對斷詞結果「鴨肉」執行語義擴展,語義擴展的結果可以是「牛肉和豬肉」。處理器121可以將語義擴展的結果「牛肉和豬肉」利用機器學習相似詞比對,和客戶資訊「頂好烤肉」、「富蘭克林語言中心」分別作比對,比對的結果可以是『客戶資訊「頂好烤肉」與 職業識別資訊「新港生炒鴨肉」相似』,藉此,處理器121可以判斷資料庫122存在和職業識別資訊「新港生炒鴨肉」對應的類別標籤。In one embodiment, the processor 121 performs text semantic processing on the company name in the occupation identification information to determine whether there is a corresponding category tag in the database 122 . Assuming that the database 122 stores customer information "Dinghao Barbecue" and "Franklin Language Center", when the electronic device 110 uploads the occupation identification information including the company name "Xinggang Sheng Fried Duck" to the server 120, the processor 121 can The company name "Xingangsheng Fried Duck" of the identification information performs text semantic processing. The processor 121 can use word segmentation technology to obtain the word segmentation result as "duck meat" from the company name "Xingang Sheng Fried Duck Meat" in the occupation identification information. Next, the processor 1210 may use the word vector (word2vec) to perform semantic expansion on the word segmentation result "duck", and the result of the semantic expansion may be "beef and pork". The processor 121 can use machine learning to compare similar words of the semantically expanded result "beef and pork" with the customer information "Dinghao Barbecue" and "Franklin Language Center". The result of the comparison can be "customer information "Ding Hao Roast Meat" is similar to the occupation identification information "Xingang Sheng Fried Duck", so the processor 121 can determine that there is a category label corresponding to the occupation identification information "Xingang Sheng Fried Duck" in the database 122 .

當處理器121判斷對應職業識別資訊的類別標籤存在於資料庫122中時,伺服器120回傳職業類別判斷結果至110電子裝置。在本實施例中,職業類別判斷結果可以是處理器121所判斷已經存在於資料庫122中的類別標籤。When the processor 121 determines that the category tag corresponding to the occupation identification information exists in the database 122 , the server 120 returns the occupation category determination result to the electronic device 110 . In this embodiment, the occupation category determination result may be the category label determined by the processor 121 and already exists in the database 122 .

當處理器121判斷對應職業識別資訊的類別標籤不存在於資料庫中122時,電子裝置110經由網際網路連接至外部伺服器130進行搜尋,以取得對應於職業識別資訊的類別標籤。網際網路可包括區域網路(Local Area Network,LAN)或廣域網路(Wide Area Network,WAN),例如支援3G、4G或5G標準的無線廣域網路或支援Wifi標準的無線區域網路等等,但不以此為限制。在另一實施例中,外部伺服器130包括一外部資料庫,該外部資料庫一是上市櫃公司類別資訊和財政部營業稅籍登記資訊的其中之一。也就是說,本發明的職業類別判斷系統100在判斷對應職業識別資訊的類別標籤不存在於資料庫中122時,可以搜尋外部伺服器130以取得對應職業識別資訊的類別標籤。When the processor 121 determines that the category tag corresponding to the occupation identification information does not exist in the database 122, the electronic device 110 is connected to the external server 130 to search for the category tag corresponding to the occupation identification information. The Internet can include Local Area Network (LAN) or Wide Area Network (Wide Area Network, WAN), such as wireless wide area network supporting 3G, 4G or 5G standard or wireless local area network supporting Wifi standard, etc. But not as a limitation. In another embodiment, the external server 130 includes an external database, the external database is one of the category information of listed companies and the business tax registration information of the Ministry of Finance. That is to say, when the occupation category determination system 100 of the present invention determines that the category label corresponding to the occupation identification information does not exist in the database 122 , it can search the external server 130 to obtain the category label corresponding to the occupation identification information.

圖4是根據本發明的一實施例繪示的一種職業類別判斷方法的流程圖,其中職業類別判斷方法可由如圖1所示的職業類別判斷系統100實施。在步驟S401中,將至少一職業識別資訊上傳至伺服器。在步驟S402中,根據至少一職業識別資訊判斷在伺服器的資料庫中是否存在對應的類別資訊。在執行完步驟S402後,如果步驟S402的判斷結果為「是」,執行步驟S403,當對應至少一職業識別資訊的類別標籤存在於該資料庫中時,則回傳類別資訊至該電子裝置,如果步驟S402的判斷結果為「否」,執行步驟S404,當對應至少一職業識別資訊的類別標籤不存在於資料庫中時,則經由網際網路連接至一外部伺服器進行搜尋,以取得對應於至少一職業識別資訊的類別標籤。FIG. 4 is a flowchart illustrating a method for determining an occupation category according to an embodiment of the present invention, wherein the occupation category determination method can be implemented by the occupation category determination system 100 shown in FIG. 1 . In step S401, at least one piece of occupation identification information is uploaded to the server. In step S402, it is determined whether there is corresponding category information in the database of the server according to at least one occupation identification information. After step S402 is executed, if the judgment result of step S402 is "Yes", step S403 is executed, and when the category tag corresponding to at least one occupation identification information exists in the database, the category information is returned to the electronic device, If the judgment result of step S402 is "No", execute step S404. When the category tag corresponding to at least one occupation identification information does not exist in the database, connect to an external server via the Internet to search to obtain the corresponding A category label for at least one occupational identifier.

綜上所述,本發明的職業類別判斷方法和職業類別判斷系統,在判斷職業類別時,可以利用類別標籤的關鍵字以匹配職業識別資訊,也可利用職業識別資訊的公司所在地與客戶資訊進行比對,除此之外,更可對職業識別資訊中的公司名稱執行文字語義處理,以判斷在資料庫中是否存在對應的類別標籤。也就是說,本發明的職業類別判斷方法和職業類別判斷系統在輸入部份錯誤或不完整的職業識別資訊時,可以更有彈性地判斷職業類別。To sum up, the occupation category judgment method and occupation category judgment system of the present invention can use the keywords of the category label to match the occupation identification information when determining the occupation category, and can also use the company location and customer information of the occupation identification information to determine the occupation category. In addition to the comparison, text semantic processing can be performed on the company name in the occupation identification information to determine whether there is a corresponding category label in the database. That is to say, the occupational category determination method and occupational category determination system of the present invention can more flexibly determine the occupational category when partial wrong or incomplete occupational identification information is input.

雖然本發明已以實施例揭露如上,然其並非用以限定本發明,任何所屬技術領域中具有通常知識者,在不脫離本發明的精神和範圍內,當可作些許的更動與潤飾,故本發明的保護範圍當視後附的申請專利範圍所界定者為準。Although the present invention has been disclosed above with the embodiments, it is not intended to limit the present invention. Anyone with ordinary knowledge in the technical field may make some changes and modifications without departing from the spirit and scope of the present invention. The scope of protection of the present invention should be defined by the scope of the appended patent application.

100:職業類別判斷系統 110:電子裝置 120:伺服器 121:處理器 122:資料庫 130:外部伺服器 211、212、213、311、312、313:客戶資訊 220、320 :職業識別資訊 S401、S402、S403、S404:步驟100: Occupational Category Judgment System 110: Electronic device 120: server 121: Processor 122: database 130:External server 211, 212, 213, 311, 312, 313: customer information 220, 320: Occupational identification information S401, S402, S403, S404: steps

圖1是根據本發明的一實施例繪示的一種職業類別判斷系統的示意圖。 圖2是根據本發明一實施例繪示的一種判斷在資料庫122中是否存在對應的類別標籤的示意圖。 圖3是根據本發明一實施例繪示的一種判斷在資料庫122中是否存在對應的類別標籤的另一示意圖。 圖4是根據本發明的一實施例繪示的一種職業類別判斷方法的流程圖。FIG. 1 is a schematic diagram of an occupational category determination system according to an embodiment of the present invention. FIG. 2 is a schematic diagram of judging whether there is a corresponding category tag in the database 122 according to an embodiment of the present invention. FIG. 3 is another schematic diagram of determining whether a corresponding category tag exists in the database 122 according to an embodiment of the present invention. FIG. 4 is a flowchart of a method for determining an occupation category according to an embodiment of the present invention.

100:職業類別判斷系統100: Occupational Category Judgment System

110:電子裝置110: Electronic device

120:伺服器120: server

121:處理器121: Processor

122:資料庫122: database

130:外部伺服器130:External server

Claims (7)

一種職業類別判斷系統,包括:一電子裝置;以及一伺服器,耦接至該電子裝置,包括一資料庫及一處理器,該資料庫儲存多個類別標籤及客戶資訊,其中該電子裝置將至少一職業識別資訊上傳至該伺服器,並由該伺服器中的該處理器根據所述至少一職業識別資訊判斷在該伺服器的資料庫中是否存在對應的類別標籤,當對應所述至少一職業識別資訊的類別標籤存在於該資料庫中時,則該伺服器回傳一職業類別判斷結果至該電子裝置,當對應所述至少一職業識別資訊的類別標籤不存在於該資料庫中時,則該電子裝置經由網際網路連接至一外部伺服器進行搜尋,以取得對應於所述至少一職業識別資訊的類別標籤,其中所述至少一職業識別資訊包括公司名稱、公司所在地、職稱及年齡,其中該處理器根據所述至少一職業識別資訊中的該公司名稱與該客戶資訊進行比對,並且根據所述至少一職業識別資訊中的該公司所在地與該客戶資訊進行比對,以判斷在該資料庫中是否存在對應的類別標籤,其中若所述至少一職業識別資訊的該公司名稱與該客戶資訊非完全匹配,並且若所述至少一職業識別資訊的該公司所在地與該客戶資訊的其中之一為匹配,則該處理器判斷在該資料庫中存 在對應的類別標籤,其中該處理器根據所述至少一職業識別資訊中的年齡是否大於預設年齡門檻值,以判斷在該資料庫中是否存在對應的類別標籤。 A professional category judgment system, comprising: an electronic device; and a server, coupled to the electronic device, including a database and a processor, the database stores a plurality of category tags and customer information, wherein the electronic device will At least one occupational identification information is uploaded to the server, and the processor in the server judges whether there is a corresponding category tag in the database of the server according to the at least one occupational identification information, and when the at least one occupational identification information corresponds to When a category tag of occupational identification information exists in the database, the server returns an occupational category judgment result to the electronic device; when the category tag corresponding to the at least one occupational identification information does not exist in the database , then the electronic device is connected to an external server via the Internet to search to obtain a category tag corresponding to the at least one occupational identification information, wherein the at least one occupational identification information includes company name, company location, job title and age, wherein the processor compares the company name in the at least one occupation identification information with the customer information, and compares the company location in the at least one occupation identification information with the customer information, to determine whether there is a corresponding category label in the database, wherein if the company name of the at least one occupational identification information does not exactly match the customer information, and if the company location of the at least one occupational identification information does not match the One of the customer information is a match, the processor determines that there is a For the corresponding category tag, wherein the processor determines whether the corresponding category tag exists in the database according to whether the age in the at least one occupation identification information is greater than a preset age threshold. 如請求項1所述的職業類別判斷系統,其中該些類別標籤分別具有多個關鍵字,用以匹配所述至少一職業識別資訊。 The occupation category judgment system as claimed in claim 1, wherein the category tags each have a plurality of keywords for matching the at least one occupation identification information. 如請求項2所述的職業類別判斷系統,其中該處理器根據所述至少一職業識別資訊中的職稱與該些關鍵字進行比對,以判斷在該資料庫中是否存在對應的類別標籤。 The occupation category determination system according to claim 2, wherein the processor compares the job title in the at least one occupation identification information with the keywords to determine whether there is a corresponding category label in the database. 如請求項1所述的職業類別判斷系統,其中該處理器對所述至少一職業識別資訊中的公司名稱執行文字語義處理,以判斷在該資料庫中是否存在對應的類別標籤。 The occupation category determination system as claimed in claim 1, wherein the processor performs text semantic processing on the company name in the at least one occupation identification information to determine whether there is a corresponding category label in the database. 如請求項4所述的職業類別判斷系統,其中該文字語義處理包括根據斷詞技術、詞向量以及機器學習相似詞比對。 The occupational category judging system as described in Claim 4, wherein the text semantic processing includes comparison of similar words based on word segmentation technology, word vectors and machine learning. 如請求項1所述的職業類別判斷系統,其中該外部伺服器包括一外部資料庫,該外部資料庫是一上市櫃公司類別資訊和財政部營業稅籍登記資訊的其中之一。 The occupational category judgment system as described in Claim 1, wherein the external server includes an external database, and the external database is one of information on the category of listed companies and business tax registration information from the Ministry of Finance. 一種職業類別判斷方法,包括:將至少一職業識別資訊上傳至伺服器,其中所述伺服器的資料庫儲存多個類別標籤及客戶資訊,其中所述至少一職業識別資訊包括公司名稱、公司所在地、職稱及年齡, 根據所述至少一職業識別資訊判斷在該伺服器的該資料庫中是否存在對應的類別資訊,當對應所述至少一職業識別資訊的類別標籤存在於該資料庫中時,則回傳所述類別資訊至該電子裝置,當對應所述至少一職業識別資訊的類別標籤不存在於該資料庫中時,則經由網際網路連接至一外部伺服器進行搜尋,以取得對應於所述至少一職業識別資訊的類別標籤,其中根據所述至少一職業識別資訊判斷在該伺服器的該資料庫中是否存在對應的該類別資訊的步驟包括:根據所述至少一職業識別資訊中的該公司名稱與該客戶資訊進行比對,並且根據所述至少一職業識別資訊中的該公司所在地與該客戶資訊進行比對,以判斷在該資料庫中是否存在對應的類別標籤,若所述至少一職業識別資訊的該公司名稱與該客戶資訊非完全匹配,並且若所述至少一職業識別資訊的該公司所在地與該客戶資訊的其中之一為匹配,則判斷在該資料庫中存在對應的類別標籤,其中根據所述至少一職業識別資訊判斷在該伺服器的該資料庫中是否存在對應的該類別資訊的步驟包括:根據所述至少一職業識別資訊中的年齡是否大於預設年齡門檻值,以判斷在該資料庫中是否存在對應的類別標籤。 A method for judging an occupational category, comprising: uploading at least one piece of occupational identification information to a server, wherein a database of the server stores multiple category tags and customer information, wherein the at least one occupational identification information includes company name, company location , job title and age, Judging whether there is corresponding category information in the database of the server according to the at least one occupational identification information, and when the category label corresponding to the at least one occupational identification information exists in the database, return the category information to the electronic device, and when the category tag corresponding to the at least one occupation identification information does not exist in the database, it is connected to an external server through the Internet to search, so as to obtain the information corresponding to the at least one occupation The category label of the occupational identification information, wherein the step of judging whether the corresponding category information exists in the database of the server according to the at least one occupational identification information includes: according to the company name in the at least one occupational identification information comparing with the customer information, and comparing the location of the company in the at least one occupation identification information with the customer information to determine whether there is a corresponding category label in the database, if the at least one occupation The company name of the identification information does not completely match the customer information, and if the company location of the at least one professional identification information matches one of the customer information, it is determined that there is a corresponding category label in the database , wherein the step of judging whether there is corresponding information of the category in the database of the server according to the at least one occupation identification information includes: according to whether the age in the at least one occupation identification information is greater than a preset age threshold, to determine whether there is a corresponding category label in the database.
TW109123015A 2020-07-08 2020-07-08 Occupational category determination system and occupational category determination method TWI779311B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
TW109123015A TWI779311B (en) 2020-07-08 2020-07-08 Occupational category determination system and occupational category determination method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
TW109123015A TWI779311B (en) 2020-07-08 2020-07-08 Occupational category determination system and occupational category determination method

Publications (2)

Publication Number Publication Date
TW202203131A TW202203131A (en) 2022-01-16
TWI779311B true TWI779311B (en) 2022-10-01

Family

ID=80787632

Family Applications (1)

Application Number Title Priority Date Filing Date
TW109123015A TWI779311B (en) 2020-07-08 2020-07-08 Occupational category determination system and occupational category determination method

Country Status (1)

Country Link
TW (1) TWI779311B (en)

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TW200828963A (en) * 2006-12-19 2008-07-01 Kinpo Elect Inc Method for location based guide and system therefor
TW201500941A (en) * 2013-06-18 2015-01-01 Inst Information Industry Social data filtering system, method and non-transitory computer readable storage medium of the same
TWM605548U (en) * 2020-07-08 2020-12-21 合作金庫人壽保險股份有限公司 Occupational category determination system

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TW200828963A (en) * 2006-12-19 2008-07-01 Kinpo Elect Inc Method for location based guide and system therefor
TW201500941A (en) * 2013-06-18 2015-01-01 Inst Information Industry Social data filtering system, method and non-transitory computer readable storage medium of the same
TWM605548U (en) * 2020-07-08 2020-12-21 合作金庫人壽保險股份有限公司 Occupational category determination system

Also Published As

Publication number Publication date
TW202203131A (en) 2022-01-16

Similar Documents

Publication Publication Date Title
US11544578B2 (en) Method, device and equipment for fusing different instances describing same entity
US10095780B2 (en) Automatically mining patterns for rule based data standardization systems
JP7313069B2 (en) Search material information storage device
JP2009026195A (en) Article classification apparatus, article classification method and program
WO2020168839A1 (en) Item recall method and system, electronic device and readable storage medium
JP6664599B2 (en) Ambiguity evaluation device, ambiguity evaluation method, and ambiguity evaluation program
US20130204835A1 (en) Method of extracting named entity
US20160078121A1 (en) Method and apparatus of matching an object to be displayed
JP2016038658A (en) Supplier search device and search method
WO2017024878A1 (en) Object search method, apparatus and server
US20230222561A1 (en) Systems and methods for executing search queries based on dynamic tagging
WO2023236538A1 (en) Risky code pre-detection method and apparatus, electronic device, computer readable storage medium, and computer program product
WO2018070026A1 (en) Commodity information display system, commodity information display method, and program
CN114329189B (en) Recommendation method and device of content information, electronic equipment and readable medium
CN112926297B (en) Method, apparatus, device and storage medium for processing information
US10216792B2 (en) Automated join detection
TWM605548U (en) Occupational category determination system
TWI779311B (en) Occupational category determination system and occupational category determination method
US20220207507A1 (en) Automatic Creation of Master Catalog and Catalog Map for Reconciliation of Merchant Point-of-Sale Catalog and Third-Party Service Catalog
US20160267107A1 (en) Search system, search criteria setting device, control method for search criteria setting device, program, and information storage medium
US10176148B2 (en) Smart flip operation for grouped objects
JP6696344B2 (en) Information processing device and program
JP2022045416A (en) Data processing program, data processing device, and data processing method
CN112783410B (en) Information processing method, medium, device and computing equipment
WO2015159702A1 (en) Partial-information extraction system

Legal Events

Date Code Title Description
GD4A Issue of patent certificate for granted invention patent