TW202349325A

TW202349325A - A system of semantic analysis-based trademark class recommendation and the method thereof

Info

Publication number: TW202349325A
Application number: TW112120780A
Authority: TW
Inventors: 吳鵬君; 子裕陳; 張育睿
Original assignee: 睿加科技股份有限公司
Priority date: 2022-06-02
Filing date: 2023-06-02
Publication date: 2023-12-16

Abstract

The present invention provides an online application system with a classification recommend module. The system includes an electronic device operated by a user, which connects to a server through internet. The electronic device comprises a processor, a memory, and a network interface controller. The server includes an application program. The user operates the electronic device to execute the application program's trademark online application module, conducting a trademark application online. The user inputs the required data for the trademark application through the electronic device. The system utilizes the classification recommend module and the risk assessment module to generate classification recommendation reports and risk assessment reports, providing solutions to the user's difficulties in selecting classifications.

Description

A semantic analysis trademark category recommendation system and its method

本發明係關於線上商標申請及管理系統，特別是具有商標類別推薦以及風險評估的系統。The present invention relates to an online trademark application and management system, especially a system with trademark category recommendation and risk assessment.

在傳統的商標申請過程中，無論是國內還是國外，都需要印出紙本文件並填寫大量表格，這不符合環保原則，由於商標申請文件的正式性，如果申請人填寫不正確，整個過程可能需要重新進行，這導致了時間的浪費和人力資源的浪費，特別是在其他國家進行商標申請時，除了傳統的文書工作外，如果由人員進行溝通，由於他們具有不同的專業背景、不同的語言用法、文化差異和其他不可預測的因素，可能傳達不準確或被誤解的信息，這導致申請人與外國代理人和政府機構之間存在認知差異。因此，申請人可能無法獲得最初想要的結果。In the traditional trademark application process, whether domestic or foreign, it is necessary to print out paper documents and fill in a large number of forms, which is not in line with the principles of environmental protection. Due to the formality of trademark application documents, if the applicant fills them out incorrectly, the entire process will be disrupted. It may need to be done again, which results in a waste of time and human resources, especially when trademark applications are made in other countries. In addition to traditional paperwork, if communication is carried out by personnel, since they have different professional backgrounds, different Language usage, cultural differences, and other unpredictable factors can convey inaccurate or misunderstood information, which results in cognitive differences between applicants and foreign agents and government agencies. As a result, applicants may not obtain the results they originally desired.

另一方面，過去商標搜尋檢索的目標使用者是專業的從業人士，對於沒有相關專業背景的人來說，理解檢索邏輯非常困難，使用者無法直觀地理解他們想申請的商標到底其申請通過的風險為何，該如何評估是否可安心進行商標申請或需要做其申請文字或圖像上的設計調整。對於商標檢索，由各國政府機構建立的官方搜尋頁面或民間的商標搜尋平台在執行商標檢索過程中，往往只會列出相似的前案商標案例，並沒有提供任何結果分析、參考點或相關評估，這導致對知識產權一無所知的使用者可能難以判斷這些先前案例對他們自己商標申請案例的影響程度，使用者可能也難以估計風險程度。On the other hand, in the past, the target users of trademark search and retrieval were professional practitioners. For people without relevant professional background, it was very difficult to understand the search logic. Users could not intuitively understand whether the trademark they wanted to apply for was approved. What are the risks and how to evaluate whether you can safely apply for a trademark or whether you need to make design adjustments to the application text or images. For trademark searches, official search pages established by government agencies of various countries or private trademark search platforms often only list similar previous trademark cases during the trademark search process, and do not provide any result analysis, reference points or related evaluations. , which makes it difficult for users who know nothing about intellectual property rights to judge the impact of these previous cases on their own trademark applications, and it may also be difficult for users to estimate the degree of risk.

此外，為了因應各國文化與風土民情對於商品及服務有著不同的定義與解釋，然而目前選擇商標分類的方式是在各國分類標準與國際上通用的尼斯分類的，透過精確或模糊的詞語進行搜索，以找出相似的結果，然而，對於不了解各國官方機構公布分類標準或是國際間的尼斯分類描述方式的使用者來說，要找到正確和相應的結果會非常困難，使用者可能會遺漏一些他們真正需要的重要項目，例如，一家服裝店的店主可能並不清楚他真正需要的是「服裝的批發和零售」，而只是自行在分類中找到一些項目，如「女裝、男裝、襯衫」。從而，在這種狀態下，使用者只能透過傳統人工顧問諮詢的型態來獲取其商標申請上的商標分類與商品項目等挑選建議，而無法針對其自身狀況直接進行商標申請。In addition, in order to adapt to the different definitions and interpretations of goods and services in various countries, the culture and customs of each country are different. However, the current way to select trademark classification is to search through precise or vague words based on national classification standards and the internationally accepted Nice Classification. to find similar results. However, for users who do not understand the classification standards published by official agencies of various countries or the international Nice Classification description method, it will be very difficult to find correct and corresponding results, and users may miss some The important items they really need. For example, the owner of a clothing store may not know that what he really needs is "wholesale and retail clothing", but just finds some items in the category by himself, such as "women's clothing, men's clothing, shirts" ”. Therefore, in this state, users can only obtain selection suggestions on trademark classification and product items for their trademark applications through traditional manual consultation, but cannot directly apply for trademarks based on their own circumstances.

這樣一來，使得商家無法輕易在各國國中自行挑選商標分類，並提出商標申請，最終，使用者(商家)只能尋求本地律師事務所的幫助，在單一國家選擇適當的商標分類與商品項目挑選上尋求一些專業建議與申請案委託執行，一般來說從溝通到收到最後的類別建議在一到數周之間，若是多國或是建議錯誤那種多次的往返常常導致錯失先機，傳統的流程，不僅讓商標申請前的流程複雜性增加，大量使用商標專業人士的顧問時間也增加，進而導致商標申請的成本費用也居高不下，申請前的處理的流程與執行週期也較長，進而影響一般人在有新的品牌名稱、Slogan或商業(品)名稱發想時，容易因為商標申請的難度與費用門檻，而產生僥倖心態進而放棄進行商標申請，從而為後續可能發生的商標爭議事件埋下了伏筆與不可逆的風險。As a result, merchants cannot easily select trademark categories and file trademark applications in various countries. In the end, users (merchants) can only seek help from local law firms to select appropriate trademark categories and product items in a single country. Seek some professional advice and entrust the implementation of the application case. Generally speaking, it takes one to several weeks from communication to receiving the final category proposal. If it is multiple countries or the proposal is wrong, multiple back and forth often lead to missed opportunities. The traditional process not only increases the complexity of the pre-application process, but also increases the consulting time of a large number of trademark professionals, which in turn leads to high costs for trademark applications, and the pre-application process and execution cycle are also longer. , which in turn affects the general public, when they come up with ideas for a new brand name, slogan or business (product) name, it is easy for them to get lucky and give up on the trademark application due to the difficulty and cost threshold of trademark application, thereby setting the stage for possible subsequent trademark disputes. The incident laid the foundation and irreversible risks.

疫情後，適逢電商與AI發展革命時代，跨境電商交易與跨境服務更是蓬勃發展，有更多的商家需要跨境品牌的需求，對於商標申請人在商標申請有跨國佈局的需求時，通常也因為各國商標商品項目的內容不同，而無法順利將原始申請國的商標申請內容，直接轉換成另一國家的商標申請內容；此時，又需要再次委託專業人士依據原始案件的商品項目進到目標國家的商品項目資料庫中，進行人工比對與挑選；如此一來，又需要耗費大量的人力工時與來回確認，不僅需要更高的執行週期與費用，同時，透過人工的方式，商標專業人士常常因為不是該行業的專家，偶爾也會產生理解上的誤差或推薦操作上的失誤，進而導致後續的補正風險提高，使得商標申請的速度也會變慢，這導致在外國申請商標時執行成本的增加。另一方面對於商標從業人士也因為重複性的執行商標推薦的工作，導致工作滿意度下降與倦怠感提高，正常一個跨境商標推薦，從溝通、到理解、到判斷與製作出推薦意見書，期間處理動輒數小時到數天以上，往往推薦意見書完成後，提供給商標申請人，申請人最終又沒有並未申請，其中所消耗的心神也僅有商標從業人士才有感。After the epidemic, it coincides with the revolutionary era of e-commerce and AI development. Cross-border e-commerce transactions and cross-border services are booming. More merchants need cross-border brands. For trademark applicants who have a cross-border layout in trademark applications, When needed, it is usually because the content of trademarked goods items in different countries is different, and it is impossible to directly convert the trademark application content of the original application country into the trademark application content of another country; at this time, it is necessary to entrust professionals again based on the original case. Product items are entered into the target country's product item database for manual comparison and selection. This requires a lot of manpower and back-and-forth confirmation, which not only requires higher execution cycles and costs, but also requires manual In this way, trademark professionals often make mistakes in understanding or recommended operations because they are not experts in the industry, which increases the risk of subsequent corrections and slows down the speed of trademark applications. This results in Increased enforcement costs when applying for a trademark abroad. On the other hand, for trademark practitioners, the repetitive work of trademark recommendation leads to a decrease in job satisfaction and an increase in burnout. Normally, a cross-border trademark recommendation involves communication, understanding, judgment, and the creation of a recommendation letter. The processing can take several hours to several days. Often, after the recommendation letter is completed, it is provided to the trademark applicant, who ultimately fails to apply. Only trademark practitioners can appreciate the amount of energy consumed.

以目前來說，部份廠商或官方系統係有針對商標類別推薦與商標檢索等功能進行系統與上的功能開發，然而，其在實務使用上仍具有不等的缺點，導致用戶無法非常直觀地在使用者體驗上取得其商標申請建議與評估內容，以利用戶進行後續的(多國)商標申請流程；下列逐一列舉並說明其尚未解決的問題。At present, some manufacturers or official systems have developed systems and functions for trademark category recommendation and trademark retrieval. However, they still have various shortcomings in practical use, resulting in users being unable to intuitively Obtain trademark application suggestions and evaluation content based on user experience to facilitate users in the subsequent (multi-country) trademark application process; the following lists and explains the unresolved issues one by one.

首先，創作人在使用在美國LegalZoom.com, Inc.公司所提供的LegalZoom平台(https://www.legalzoom.com/business/intellectual-property/trademark-registration-overview.html)來說，LegalZoom的商標申請服務流程，雖已經採用線上化來實現，然而，更具體的說，其屬於一種單純的線上接單流程，即用發出商標申請需求時，需完成平台提供的商標申請表單，如圖1~3，包含提供欲申請的商標名稱、商標名稱使用的場景以及用戶的產業描述等資訊；而在完成上述資訊填寫與付費後，系統會將該資訊傳送至後台而透過LegalZoom指派的專員進行後續的人工線下與顧問服務，進而完成後續的商標申請服務與流程；以上述的服務流程中，LegalZoom平台僅係透過線上接單的方式來實現並取代傳統商標事務所需透過業務人員面對面或是進行線下聯繫與接收客戶申請需求的流程，然而，上述流程中，實質上用戶還是無法自行取得商標類別建議書與執行商標申請流程，還是需要透過冗長的訪談過程。First of all, when creators use the LegalZoom platform (https://www.legalzoom.com/business/intellectual-property/trademark-registration-overview.html) provided by LegalZoom.com, Inc. in the United States, LegalZoom’s Although the trademark application service process has been implemented online, more specifically, it is a simple online order-taking process, that is, when issuing a trademark application request, you need to complete the trademark application form provided by the platform, as shown in Figure 1 ~3, including providing information such as the trademark name to be applied for, the scenario in which the trademark name is used, and the user's industry description; after completing the above information filling and payment, the system will send the information to the backend and the specialist assigned by LegalZoom will follow up. Manual offline and consulting services, and then complete the subsequent trademark application services and processes; in the above service process, the LegalZoom platform only realizes and replaces the traditional trademark affairs through business personnel face-to-face or through online orders. The process of conducting offline contact and receiving customer application requirements. However, in the above process, users are still unable to obtain trademark category proposals and implement the trademark application process by themselves. They still need to go through a lengthy interview process.

如中國專利公開號CN109800340A所揭露，請參閱圖4及圖5，用戶需先在商標名稱欄位中輸入正確的文字或字詞，用戶最終根據商品標籤信息，推薦至少數一個商品類別中的每個商品類別對應的至少一個商品/服務項目，包括：用戶最終根據商品標籤信息，確定數據庫中至少一個商品標籤類別中的每個商標類別的商標與商標信息的第一個相似度；以及用戶最終根據第一個相似度，推薦至最小一個商標類別中每個商品類別別對應的至少一個商品/服務項目。確定至少一個群中的每個群的近似群組的商標與商標信息的第二相似度，其中，根據第一個相似度推薦至少一個商品類別中的每個商品類別對應的至少一個商品/服務項目，包括：根據第一個相似度和第二相對推薦至少一個商品類別中的每個商品類別對應的至少一個商品/服務項目。例如，用戶確定的商標類別為第7類，如果第7類中的第一群與第12類中的第二群為相似群，則去掉了將商標信消息與第7類商品標籤的商品標籤進行對比，確定第一相貌之外，還需要將商標信息與第12類商標別中的相貌群(即第二群)的商標前進對比，並確定第二相似度。進一步地，根據第一相似度和第二相似度，推薦商品/服務項目。例如，第一群無相同商標和相似商標，即第一相似度，註冊成功率高，但第二群有很多個相似商標和相同商標，即第二相似度比較高，註冊成功率比較低，因此，結合第一和第二相似度綜合考慮，第一群的註冊成功率比較低，則不推薦註冊第一群。As disclosed in Chinese Patent Publication No. CN109800340A, please refer to Figure 4 and Figure 5. The user needs to enter the correct text or words in the trademark name field first. The user finally recommends at least one product category based on the product label information. At least one product/service item corresponding to each product category includes: the user finally determines the first similarity between the trademark and trademark information of each trademark category in at least one product label category in the database based on the product label information; and the user Finally, based on the first similarity, at least one product/service item corresponding to each product category in the smallest trademark category is recommended. Determining a second degree of similarity between trademarks and trademark information of similar groups in at least one group, wherein at least one commodity/service corresponding to each commodity category in at least one commodity category is recommended based on the first similarity The items include: recommending at least one product/service item corresponding to each product category in at least one product category based on the first similarity and the second relative degree. For example, the trademark category determined by the user is Category 7. If the first group in Category 7 and the second group in Category 12 are similar groups, the product label that combines the trademark information with the product label in Category 7 is removed. In addition to comparing and determining the first appearance, it is also necessary to further compare the trademark information with the trademarks of the appearance group (i.e. the second group) in the 12th class trademark category and determine the second degree of similarity. Further, goods/services are recommended based on the first degree of similarity and the second degree of similarity. For example, the first group does not have the same trademark or similar trademarks, that is, the first degree of similarity, and the registration success rate is high, but the second group has many similar trademarks and the same trademark, that is, the second degree of similarity is relatively high, and the registration success rate is relatively low. Therefore, considering the first and second similarities, the registration success rate of the first group is relatively low, so it is not recommended to register the first group.

但上述中國專利(公開號CN109800340A)存在幾個待改善問題，例如：商標推薦比對主要是根據標籤訊息，倘若標籤設定不夠精確精準或同一商品項目可能可以同時符合多個標籤，在標籤比對的過程中因此可能失準或比對出大量不符合的項目，造成推薦效益降低。此外，上述中國專利是透過中國商標分類中的類別項目的群組關係作為比對時判斷的依據之一，商標類別中群組關係固然是官方在搜尋及檢索的依據，但在商家或是商標從業人士需要知道的推薦類別時，也可能會產生比對出不符合的項目造成資料數量過於龐大，降低推薦的功效。最終，上述中國專利在接收用戶的訊息資料時，用戶所輸入的必須時精準且精確的文字在對應的欄位，例如商標名稱，才有辦法進行標籤的比對進而產生推薦註冊或不推薦註冊的結果資訊。However, the above-mentioned Chinese patent (publication number CN109800340A) has several problems that need to be improved. For example, the trademark recommendation comparison is mainly based on label information. If the label setting is not precise enough or the same product item may match multiple labels at the same time, in the label comparison Therefore, the process may be inaccurate or a large number of unqualified items may be compared, resulting in reduced recommendation benefits. In addition, the above-mentioned Chinese patent uses the group relationship of category items in the Chinese trademark classification as one of the basis for comparison. Although the group relationship in the trademark category is the basis for official search and retrieval, it is not the case for merchants or trademarks. When practitioners need to know the recommended categories, it may also result in items that do not match the comparison, resulting in an excessive amount of data, reducing the effectiveness of recommendations. Ultimately, when the above-mentioned Chinese patent receives user information, the user must input precise and accurate text in the corresponding field, such as a trademark name, so that the tags can be compared and generated to recommend or not recommend registration. result information.

又如中國專利公告號CN109902196B所揭露，一種商標類別推薦方法，其中，該方法應用於商標類別推薦設備，具體的，該方法包括：即時構建更新資料；通過所述更新資料對歷史商標資料庫中的商標類別進行更新，其中，所述歷史商標資料庫中的商標類別為歷史推薦商標類別；將經更新得到的商標類別確定為推薦商標類別。就此，商標類別推薦設備通過即時構建更新資料，並且，該商標類別推薦設備依據該更新資料將歷史商標資料庫中的商標類別進行更新，其中，所述歷史商標資料庫中的商標類別為歷史推薦商標類別；接著，將經更新得到的商標類別確定為推薦商標類別，其中，該推薦商標類別為到此刻為止註冊頻率最小的商標類別，依此即可幫助用戶確定待註冊的商標類別，而用戶也可從該推薦商標類別中繼續選取註冊頻率最小的商標類別。As disclosed in Chinese Patent Announcement No. CN109902196B, a trademark category recommendation method is applied to a trademark category recommendation device. Specifically, the method includes: constructing updated data in real time; using the updated data to compare historical trademark databases The trademark category is updated, wherein the trademark category in the historical trademark database is a historical recommended trademark category; the updated trademark category is determined as a recommended trademark category. In this regard, the trademark category recommendation device constructs updated data in real time, and the trademark category recommendation device updates the trademark categories in the historical trademark database based on the updated data, wherein the trademark categories in the historical trademark database are historical recommendations. Trademark category; then, the updated trademark category is determined as the recommended trademark category, where the recommended trademark category is the trademark category with the smallest registration frequency so far, which can help the user determine the trademark category to be registered, and the user You can also continue to select the trademark category with the smallest registration frequency from the recommended trademark categories.

公告號CN109902196B的中國專利存在待改善的問題，其推薦原則是根據歷史推薦紀錄以及根據統計各個商標類別註冊頻率紀錄，透過不斷的更新數據資料確保資料庫中的資料為最新，但推薦系統中推薦次數少或官方註冊頻率少的商標類別，這樣的推薦模型非常明顯對於用戶並不一定是最適合的，因為最少人申請代表用戶也不會有興趣。再者，即使公告號CN109902196B的中國專利還會針對申請人行業領域與商標類別進行相似度比對，找出相近領域且註冊率少的商標類別，實際上對於申請人並沒有提供實質推薦類別甚至是商品項目的效益。最終，公告號CN109902196B的中國專利在接收用戶的訊息資料時，申請人所輸入的必須時精準且精確的文字在對應的欄位，例如商標名稱、行業領域，才有辦法進行後續的行業相似度比對以及找出推薦次數少、註冊頻率少的商標類別。The Chinese patent with announcement number CN109902196B has problems that need to be improved. The recommendation principle is based on historical recommendation records and statistical registration frequency records of each trademark category. It ensures that the data in the database is up-to-date by continuously updating the data. However, the recommendation system recommends For trademark categories with low frequency or official registration frequency, such a recommendation model is obviously not necessarily the most suitable for users, because even the smallest number of applications will not be of interest to the user. Furthermore, even though the Chinese patent with announcement number CN109902196B will conduct a similarity comparison between the applicant's industry field and the trademark category to find trademark categories in similar fields with low registration rates, in fact it does not provide the applicant with any substantive recommended categories or even categories. It is the benefit of commodity items. Finally, when the Chinese patent with announcement number CN109902196B receives user information, the applicant must enter accurate and accurate text in the corresponding fields, such as trademark name and industry field, so that subsequent industry similarity can be carried out. Compare and identify trademark categories with fewer recommendations and less frequent registrations.

又如中國專利公開號CN111898022A商標類別推薦方法和裝置、以及存儲介質和電子設備所揭露，接收用戶請求的行業；確定所述使用者請求的行業的標識資訊；基於所述使用者請求的行業的標識資訊，獲取所述使用者請求的行業對應的至少一個商標類別；向所述用戶推薦所述至少一個商標類別。可選地，在實施例中，基於所述使用者請求的行業的標識資訊，獲取所述使用者請求的行業對應的至少一個商標類別，包括：基於所述使用者請求的行業的標識資訊，從資料庫中查找所述使用者請求的行業的商標註冊類別統計資訊；其中，所述行業的商標註冊類別統計資訊用於表示對各行業註冊的每個商標類別數量的統計資訊。可選地，所述行業的商標註冊類別統計資訊，包括：基於所述資料庫中各公司的主體資訊，確定所述各公司所屬行業；基於所述資料庫中各公司的商標資訊，獲取所述各公司的商標註冊類別；基於所述各公司所屬行業和所述各公司的商標註冊類別，對各行業的商標註冊類別進行統計，獲得所述各行業的商標註冊類別統計資訊。向所述用戶推薦所述至少一個商標類別，包括：基於所述各行業的商標註冊類別統計資訊和所述各行業的預設的類別閾值資訊，確定所述各行業的至少一個商標註冊類別；基於所述各行業的至少一個商標註冊類別，向所述用戶推薦所述用戶請求的行業的至少一個商標類別。As disclosed in Chinese Patent Publication No. CN111898022A, trademark category recommendation method and device, as well as storage media and electronic equipment, the industry requested by the user is received; the identification information of the industry requested by the user is determined; and the industry based on the industry requested by the user is determined. Identify information, obtain at least one trademark category corresponding to the industry requested by the user, and recommend the at least one trademark category to the user. Optionally, in an embodiment, obtaining at least one trademark category corresponding to the industry requested by the user based on the identification information of the industry requested by the user includes: based on the identification information of the industry requested by the user, The trademark registration category statistical information of the industry requested by the user is searched from the database; where the trademark registration category statistical information of the industry is used to represent the statistical information on the number of each trademark category registered in each industry. Optionally, the statistical information of trademark registration categories in the industry includes: determining the industry to which each company belongs based on the subject information of each company in the database; obtaining all the trademark information based on the trademark information of each company in the database. Based on the industry to which each company belongs and the trademark registration category of each company, statistics are made on the trademark registration categories of each industry to obtain statistical information on the trademark registration categories of each industry. Recommending the at least one trademark category to the user includes: determining at least one trademark registration category for each industry based on statistical information on trademark registration categories for each industry and preset category threshold information for each industry; Based on at least one trademark registration category of each industry, at least one trademark category of the industry requested by the user is recommended to the user.

如上述中國專利(公開號CN111898022A)仍存在待改善的問題，單純使用統計方式，針對中國各公商局備案的公司服務(產品)項目與該公司的商標申請類別進行統計，並存儲成一數據庫，此一模型並不適用其他國家更不適用於跨境類別的推薦分析；當用戶選定某特定產業時，即跳出該產業對應的商標申請類別統計結果，並形成推薦結果；單純只進行商標類別推薦，並無實際商品項目內容推薦；實際上可用度不高，用戶即使勉強從系統提供的產業分類中找到並選出自己公司的產業，其也只能拿到很籠統的"商標類別"資訊，但實務上很高機率不可申請單類全商品項目申請，因此，推薦了但能協助客戶進行商標申請的實務效果有限。If the above-mentioned Chinese patent (publication number CN111898022A) still has problems that need to be improved, simply use statistical methods to conduct statistics on the company's service (product) items registered in various public and commercial bureaus in China and the company's trademark application categories, and store them into a database. This model is not applicable to other countries, let alone cross-border category recommendation analysis; when the user selects a specific industry, the statistical results of the trademark application category corresponding to the industry will be jumped out, and a recommendation result will be formed; it is simply recommended for trademark categories , there is no actual product item content recommendation; in fact, the usability is not high. Even if the user barely finds and selects his company's industry from the industry classification provided by the system, he can only get very general "trademark category" information, but In practice, there is a high probability that it is not possible to apply for a single category of all-product items. Therefore, it is recommended but has limited practical effect in assisting customers with trademark applications.

又如中國專利公開號CN107330109A所揭露，對樣本商標圖像及內容按預設的商標分卡標準進行商標分卡處理，具體處理過程包括：（1）建立由預設的形狀特徵、讀音特徵和含義特徵最小單元多種組合方案所構成的商標分卡標準，（2）對樣本商標是否由漢語文字、圖形、字母、數位或符號構成要素進行識別，獲取構成要素的內容，（3）樣本商標各構成要素的形狀特徵最小單元、讀音特徵最小單元和含義特徵最小單元；（4）根據已建立的商標分卡標準，提取每一組合方案所生成或轉換得到的各種文字、圖形的切分資訊，將這些切分資訊作為樣本商標分卡資訊，並設定每一預設的商標分卡標準的近似度評價分值。以輸入商標分卡資訊集合作為檢索關鍵字對存儲於商標記憶體的樣本商標分卡資訊進行檢索，獲取相關的結果商標的分卡資訊及分卡匹配資訊；按照預設的商標形近率、商標義近率、商標音近率和檢索關鍵字匹配得分率計算公式進行運算；計算獲取商標近似度綜合量化值，然後利用商標近似度綜合量化值的大小對結果商標進行排序。As disclosed in Chinese Patent Publication No. CN107330109A, sample trademark images and content are processed into trademark cards according to preset trademark carding standards. The specific processing process includes: (1) Establishing preset shape features, pronunciation features and A trademark classification standard composed of various combinations of the smallest unit of meaning characteristics, (2) identify whether the sample trademark is composed of Chinese characters, graphics, letters, digits or symbols, and obtain the content of the constituent elements, (3) each of the sample trademarks The smallest unit of shape characteristics, the smallest unit of pronunciation characteristics and the smallest unit of meaning characteristics of the constituent elements; (4) According to the established trademark classification standard, extract the segmentation information of various texts and graphics generated or converted by each combination scheme, Use this segmentation information as sample trademark classification information, and set the similarity evaluation score for each preset trademark classification standard. Use the input trademark card information set as the search keyword to retrieve the sample trademark card information stored in the trademark memory, and obtain the card card information and card matching information of the relevant result trademarks; according to the preset trademark similarity rate, The calculation formulas of trademark meaning similarity rate, trademark sound similarity rate and search keyword matching score rate are calculated; the comprehensive quantitative value of trademark similarity is calculated and obtained, and then the resultant trademarks are sorted by the comprehensive quantitative value of trademark similarity.

上述中國專利(公開號CN107330109A)仍存在待改善的問題，這案件是主要針對商標的檢索邏輯進行技術發展，透過型、音、義等不同角度針對用戶輸入的商標logo進行分析與比對、計算相似度與排序，即使幫用戶排序了，同時並列舉很多前案出來，但用戶還是不知道該不該申請，此外，商標的前案是建立在同類別，甚至是同"近似群組"的概念下才成立的，即使你的logo與別人類似，但你在賣食物對方在賣手機，這種狀態下，別人的logo應該不會影響到你的商標申請才對，此案僅是進行大量的檢索並排序出相似程度，實際上申請人仍不知道該申請哪一類別較適合，仍無從解決跨國商標推薦等相關的問題。The above-mentioned Chinese patent (publication number CN107330109A) still has problems that need to be improved. This case mainly focuses on the technical development of trademark search logic. It analyzes, compares and calculates the trademark logo input by the user from different angles such as type, sound and meaning. Similarity and ranking, even if the user is sorted and many previous cases are listed, the user still does not know whether to apply. In addition, the previous cases for the trademark are established in the same category or even the same "similar group" It is based on the concept that even if your logo is similar to someone else’s, but you are selling food and the other person is selling mobile phones, in this situation, someone else’s logo should not affect your trademark application. This case is just a large-scale Search and sort out the degree of similarity. In fact, the applicant still does not know which category of the application is more suitable, and there is still no way to solve related problems such as cross-border trademark recommendation.

又如美國專利公開號US20140280104A1所揭露，舉例來說，在本文所描述的解決方案的一個例子中，進行商標搜尋，將搜尋商標與許多潛在相關的參考資料進行比較，例如現有的商標註冊、普通法參考、域名等。搜尋結果被編譯成一個數據集，該數據集可以存儲在任何適當的數據存儲機制中。對該數據集進行分析，以確定與每個參考相關的多個類別的接近度分數。例如，這些類別可以包括：外觀相似性、音響相似性、內涵相似性、商業印象相似性、商品/服務相似性、交易渠道相似性、銷售條件相似性、先前商標的知名度等。相關的接近度分數是潛在商標風險和衝突的客觀衡量。例如，在一個例子中，接近度分數可以用0-5的數字表示，其中較高的分數表示較高的關注程度，在這樣的例子中，外觀相似性的較高接近度分數表示外觀相似性較高，先前商標的較高分數表示該商標較有名。換句話說，接近度分數越高，該因素越有可能指示潛在的侵權或衝突。As disclosed in U.S. Patent Publication No. US20140280104A1, for example, in one example of the solution described herein, a trademark search is performed to compare the search trademark to a number of potentially relevant references, such as existing trademark registrations, common trademarks, etc. Legal references, domain names, etc. The search results are compiled into a data set that can be stored in any suitable data storage mechanism. This dataset is analyzed to determine proximity scores for multiple categories associated with each reference. For example, these categories may include: similarity in appearance, similarity in sound, similarity in connotation, similarity in commercial impression, similarity in goods/services, similarity in transaction channels, similarity in sales conditions, and the popularity of previous trademarks, etc. The associated proximity score is an objective measure of potential trademark risk and conflict. For example, in one example, the proximity score may be represented by a number from 0-5, where a higher score indicates a higher level of concern. In such an example, a higher proximity score for appearance similarity indicates appearance similarity. Higher, a higher score for a previous mark indicates that the mark is more famous. In other words, the higher the proximity score, the more likely the factor is to indicate a potential violation or conflict.

上述美國專利(公開號US20140280104A1)雖然透過比對後進行評分來顯示近似程度，透過介面顯示近似前案落在哪些類別，但仍存在待改善的問題，例如使用者雖然可以知道避免在哪個類別申請，不過卻不知道適合申請哪些類別，若是進行跨國上的商標推薦更是無法協助完成，以及用戶需要輸入精準且精確的文字在對應的欄位中，例如商標名稱、商品領域，才能有效進行後續的比對和評分，在實際使用上效益也相當有限。Although the above-mentioned US patent (Publication No. US20140280104A1) displays the degree of similarity through scoring after comparison and displays through the interface which categories the similar previous patents fall into, there are still problems that need to be improved. For example, users can know which categories to avoid applying for. , but I don’t know which categories are suitable for application. If I want to recommend a cross-border trademark, I can’t help. And the user needs to enter accurate and accurate text in the corresponding field, such as trademark name and product field, in order to effectively proceed with the follow-up. The comparison and scoring are also very limited in actual use.

又如美國專利公開號US20170322983A1所揭露，用其特殊的UI來表示前案與檢索商標的相似程度，"顯示器"為透過前案於不同顯示器上顯示的位置來顯示該前案對於本案的相似(近似威脅)程度，用戶可透過顯示器的不同區塊位置來觀察各種不同程度"近似前案"的內容分類。As disclosed in US Patent Publication No. US20170322983A1, a special UI is used to indicate the degree of similarity between the previous case and the searched trademark. The "display" is to display the similarity of the previous case to the current case through the position of the previous case displayed on different monitors ( (Approximate threat) level, users can observe content classifications of various degrees of "similarity to previous cases" through different block positions of the display.

上述美國專利(公開號US20170322983A1)，其商標檢索邏輯與功能僅限於"文字商標"的近似比對與分數轉換，與本案差異還是在於出發點，本案是以商標申請分類推薦建議作為出發點來進行前案檢索與風險評估，並進行類別推薦，前案的目的是用來做高專業度的商標檢索為導向，並透過"顯示器"的畫面呈現與操作，讓用戶可針對不同程度的近似商標做分類查看。The trademark search logic and functions of the above-mentioned U.S. patent (publication number US20170322983A1) are limited to the approximate comparison and fraction conversion of "word trademarks". The difference from this case lies in the starting point. This case is based on the recommended recommendations for trademark application classification as the starting point for the previous case. Search and risk assessment, and category recommendation, the purpose of the previous case is to conduct highly professional trademark search-oriented, and through the "monitor" screen presentation and operation, users can classify and view similar trademarks to different degrees. .

又如新加坡智慧財產局(IPOS)官方網頁中的商標類別搜尋系統，請參閱圖6至圖7，提供使用者可以輸入商品服務的種類名稱例如car repair，該系統針對輸入的文字在資料庫中比對尋找出所有包含car或repair的商品項目，在搜尋結果中顯示所有包含car或repair的商品項目，並標示其是屬於哪一個類別。Another example is the trademark category search system on the official website of the Intellectual Property Office of Singapore (IPOS). Please refer to Figures 6 to 7. It allows users to enter the type name of goods and services, such as car repair. The system will search for the entered text in the database. Compare and find all product items containing car or repair, display all product items containing car or repair in the search results, and indicate which category they belong to.

上述IPOS的商標類別搜尋系統僅是一般的簡單的全文字比對，若有一兩字不同都無法搜尋完整，過程中會列出所有包含的商品項目，對於使用者來說並沒有達到推薦甚至是風險評估的效果，再者，使用者所輸入的文字也必須是明確的商品單字或是服務單字，否則系統無法執行比對，例如輸入I run a car repair store，其搜尋結果為0，在輸入上也較難讓一般申請人輕易地得知該申請的商標類別項目，因一般申請人並不一定知道商標商品項目的名稱或哪些文字才是符合搜尋的標的。The above-mentioned trademark category search system of IPOS is only a simple full-text comparison. If one or two words are different, the search cannot be complete. All included product items will be listed in the process, which does not reach the level of recommendation or even for users. The effect of risk assessment. Furthermore, the text entered by the user must also be a clear product word or service word, otherwise the system cannot perform the comparison. For example, if I run a car repair store is entered, the search result is 0. After entering It is also difficult for ordinary applicants to easily know the trademark category of the application, because ordinary applicants do not necessarily know the name of the trademarked product item or which words are suitable for the search subject.

由上述說明可以得知，實有必要對習知的技術進行改良或調整，藉以提升其使用上的便利性。有鑑於此，本發明之發明人係極力加以研究創作，而終於研發完成本發明之系統。From the above description, it can be understood that it is necessary to improve or adjust the conventional technology to enhance the convenience of use. In view of this, the inventor of the present invention worked hard on research and creation, and finally developed the system of the present invention.

本發明之目的在於提出具有商標類別推薦之線上申請系統，解決了上述現有技術中存在的問題。The purpose of the present invention is to provide an online application system with trademark category recommendation, which solves the above-mentioned problems existing in the prior art.

因此，為了達成上述本發明之目的，本發明係提供所述一種語意分析商標類別推薦系統，其包括：使用者操作的電子裝置，電子裝置透過網路資訊連接一伺服器，其中，該電子裝置包含一處理器、一記憶體及一網路介面控制器，伺服器包含一應用程式。Therefore, in order to achieve the above-mentioned purpose of the present invention, the present invention provides the semantic analysis trademark category recommendation system, which includes: an electronic device operated by a user, and the electronic device is connected to a server through network information, wherein the electronic device Containing a processor, a memory and a network interface controller, the server includes an application program.

該處理器透過網路介面控制器以連上伺服器並執行應用程式，從而配置啟用商標線上申請模組及類別推薦模組，更進一步該處理器配置啟用申請資料收集單元、類別項目選擇單元、資料傳輸單元、文字解析單元、類別項目比較單元以及報告產生單元。The processor connects to the server through the network interface controller and executes the application, thereby configuring and activating the trademark online application module and the category recommendation module. Furthermore, the processor configures and activates the application data collection unit, category item selection unit, Data transmission unit, text analysis unit, category item comparison unit and report generation unit.

此外，該處理器還配置啟用一資料收發模組，將多個資料庫中的資料載入記憶體中。In addition, the processor is also configured to enable a data sending and receiving module to load data from multiple databases into the memory.

處理器透過網路介面控制器進一步連上多個送件伺服器，使資料傳輸單元得以將使用者之商標申請文件上傳至選定國家之送件伺服器。The processor further connects to multiple delivery servers through the network interface controller, so that the data transmission unit can upload the user's trademark application documents to the delivery server in the selected country.

使用者操作電子裝置的處理器執行應用程式之商標線上申請模組進行商標線上申請，使用者藉由電子裝置輸入商標申請所需的資料，例如：商標名稱、商標圖樣、申請人、聯絡人、欲申請的商品項目或對商品服務的描述等，電子裝置的顯示螢幕會顯示相對應的欄位供使用者填入相對應的資料，處理器進而啟用申請資料收集單元，將使用者輸入在對應欄位之資料分類後存入記憶體，因後續會特別使用到商標申請資料中的商標名稱、商標圖樣、欲申請的商品項目以及對商品服務的描述這些欄位的資訊，遂先經由申請資料收集單元做分類。The user operates the processor of the electronic device to execute the trademark online application module of the application to apply for a trademark online. The user inputs the information required for trademark application through the electronic device, such as: trademark name, trademark pattern, applicant, contact person, The display screen of the electronic device will display the corresponding fields for the product items to be applied for or the description of the goods and services for the user to fill in the corresponding information. The processor then activates the application data collection unit to input the corresponding information to the user. The data in the fields are classified and stored in the memory. Since the information in the fields of the trademark name, trademark pattern, product items to be applied for, and description of the goods and services in the trademark application data will be particularly used later, the information in these fields is first processed through the application data. Collect units for classification.

較佳地，該處理器配置啟用該類別項目選擇單元，將已預先存入該記憶體的商標類別項目透過該電子裝置提供使用者選取。Preferably, the processor is configured to enable the category item selection unit to provide the user with the trademark category items that have been pre-stored in the memory for selection through the electronic device.

較佳地，該處理器配置啟用該資料傳輸單元，透過該網路介面控制器連上至少一國家的送件伺服器，上傳使用者完成之商標申請文件。Preferably, the processor is configured to activate the data transmission unit, connect to a delivery server in at least one country through the network interface controller, and upload the trademark application document completed by the user.

較佳地，該處理器配置啟用該文字解析單元，對使用者輸入的商標申請資料進行語意分析擷取出關鍵字。Preferably, the processor is configured to enable the text analysis unit to perform semantic analysis on the trademark application information input by the user to extract keywords.

較佳地，該處理器配置啟用該類別項目比對單元，將經過該文字解析單元擷取出之關鍵字與預先儲存在該記憶體中的商標類別項目進行比對，且將比對結果存回該記憶體中。Preferably, the processor is configured to activate the category item comparison unit, compare the keywords retrieved by the text analysis unit with the trademark category items pre-stored in the memory, and store the comparison results back in this memory.

較佳地，該處理器配置啟用該報告產生單元，將該記憶體中的比對結果依照字詞相似程度進行排列而產生該類別推薦報告。Preferably, the processor is configured to enable the report generation unit to arrange the comparison results in the memory according to word similarity to generate the category recommendation report.

較佳地，該處理器透過該網路介面控制器以連上該伺服器並執行該應用程式，從而進一步配置啟用一風險評估模組，對使用者輸入之商標申請資料中的商標名稱進行前案比對，且該處理器執行該應用程式中的該風險評估模組進一步配置啟用該文字解析單元、一檢索比對單元及該報告產生單元。Preferably, the processor connects to the server through the network interface controller and executes the application, thereby further configuring and activating a risk assessment module to perform pre-processing of the trademark name in the trademark application data input by the user. case comparison, and the processor executes the risk assessment module in the application to further configure and enable the text parsing unit, a search comparison unit and the report generation unit.

較佳地，該處理器配置啟用該文字解析單元，對使用者輸入的商標申請資料之商標名稱做文字排列組合並存入該記憶體中，該處理器進一步配置啟用該檢索比對單元，將該記憶體中之全部文字排列組合分別與預先儲存在該記憶體中之商標前案進行比對，產生比對結果存回該記憶體中。Preferably, the processor is configured to enable the text parsing unit to perform text arrangement and combination of the trademark names of the trademark application data input by the user and store them in the memory. The processor is further configured to enable the search comparison unit to All text combinations in the memory are compared with the trademark records pre-stored in the memory, and the comparison results are generated and stored back in the memory.

較佳地，該處理器配置啟用該報告產生單元，將該記憶體中的比對結果根據字詞近似程度排列，而產生該風險評估報告。Preferably, the processor is configured to enable the report generation unit to arrange the comparison results in the memory according to word similarity to generate the risk assessment report.

以下僅藉由具體實施例，且佐以圖式作詳細之說明。The following is a detailed description only through specific embodiments and drawings.

現在將參照其中示出本發明概念的示例性實施例的附圖在下文中更充分地闡述本發明概念。以下藉由參照附圖更詳細地闡述的示例性實施例，本發明概念的優點及特徵以及其達成方法將顯而易見。Inventive concepts will now be elucidated more fully hereinafter with reference to the accompanying drawings, in which exemplary embodiments of the inventive concepts are shown. The advantages and features of the inventive concept, as well as the methods for achieving them, will be apparent from the following exemplary embodiments, which are explained in more detail with reference to the accompanying drawings.

本文所用術語僅用於闡述特定實施例，而並非旨在限制本發明。除非上下文中清楚地另外指明，否則本文所用的單數形式的用語「一」及「該」旨在亦包括複數形式。本文所用的用語「及/或」包括相關所列項其中一或多者的任意及所有組合。應理解，當稱元件「連接」或「耦合」至另一元件時，所述元件可直接連接或耦合至所述另一元件或可存在中間元件。The terminology used herein is for describing particular embodiments only and is not intended to be limiting of the invention. As used herein, the singular forms "a", "an" and "the" are intended to include the plural forms as well, unless the context clearly indicates otherwise. As used herein, the term "and/or" includes any and all combinations of one or more of the associated listed items. It will be understood that when an element is referred to as being "connected" or "coupled" to another element, it can be directly connected or coupled to the other element or intervening elements may be present.

本文中參照圖來闡述示例性實施例，其中所述圖式是理想化示例性說明圖。因此，預期存在由例如製造技術及/或容差所造成的相對於圖示形狀的偏離。因此，圖中所示的區為示意性的，且其形狀並非旨在說明裝置的實際形狀、亦並非旨在限制示例性實施例的範圍。Example embodiments are described herein with reference to the drawings, which are idealized illustrations of the examples. Therefore, deviations from the shapes illustrated are expected to occur due, for example, to manufacturing techniques and/or tolerances. Accordingly, the regions shown in the figures are schematic and their shapes are not intended to illustrate the actual shape of the device or to limit the scope of the exemplary embodiments.

請結合參閱圖8、圖9，圖8為顯示本發明系統之架構示意圖；圖9為顯示本發明系統之另一架構示意圖。本發明系統藉由提供一使用者U操作的一電子裝置100來實現，電子裝置100透過網路200資訊連接一伺服器300，其中，該電子裝置100包含一處理器110、一記憶體120及一網路介面控制器130，該伺服器300包含一應用程式310，使得該處理器110透過網路介面控制器130以連上伺服器300並執行應用程式310，從而配置啟用商標線上申請模組311及類別推薦模組312，更進一步該處理器110配置啟用申請資料收集單元3111、類別項目選擇單元3112、資料傳輸單元3113、文字解析單元3121、類別項目比較單元3122以及報告產生單元3123，此外，該處理器110還配置啟用一資料收發模組313，將多個資料庫400中的資料載入記憶體120中。Please refer to Figures 8 and 9 in conjunction. Figure 8 is a schematic diagram showing the architecture of the system of the present invention; Figure 9 is a schematic diagram showing another architecture of the system of the present invention. The system of the present invention is implemented by providing an electronic device 100 operated by a user U. The electronic device 100 is connected to a server 300 through the network 200. The electronic device 100 includes a processor 110, a memory 120 and A network interface controller 130, the server 300 includes an application program 310, so that the processor 110 can connect to the server 300 through the network interface controller 130 and execute the application program 310, thereby configuring and enabling the trademark online application module 311 and category recommendation module 312. Further, the processor 110 is configured to enable an application data collection unit 3111, a category item selection unit 3112, a data transmission unit 3113, a text analysis unit 3121, a category item comparison unit 3122, and a report generation unit 3123. In addition, , the processor 110 is also configured to enable a data transceiver module 313 to load the data in the plurality of databases 400 into the memory 120 .

伺服器可為雲端伺服器或為架設為地端伺服器的架構。The server can be a cloud server or configured as an on-premises server.

於本實施例當中，創作人係採用以下規格的伺服器進行本技術之商標商品項目轉化模型訓練與推算執行；處理器(CPU)係採用高效能多核心處理器，尤其對於處理大量數據以及進行複雜計算時，至少使用一個具有16核或以上的CPU (例如AMD Ryzen Threadripper或Intel Xeon系列)。且記憶體(RAM)的大小可以為但不限於64GB或更高的RAM規格以便能夠處理的語料庫的大小以及詞向量模型的大小。網路介面控制器為一個高速穩定的網路連線硬體，特別是使用在雲端計算資源或者下載/上傳大量數據。其中，最重要的圖形處理單元(GPU)係使用高效能的GPU(如NVIDIA的RTX 30系列或Tesla系列)以降低模型訓練的時間；其中，於本案伺服器的訓練架構中，係採用如上較高規格的訓練用伺服器，而於模型訓練完成並進行推算處理時，係可採用較低規格的伺服器主機，並可同為部屬本系統的雲端或地端伺服器主機。於本案架構中，伺服器主機規格的採用係不影響本案所強調的技術特徵，因而任何伺服器主機規格應仍落入本案技術範圍當中。其中，該處理器100透過網路介面控制器130進一步連上多個送件伺服器500，使資料傳輸單元3113得以將使用者U之商標申請文件上傳至選定國家之送件伺服器500。In this embodiment, the author uses a server with the following specifications to perform the training and calculation execution of the trademark product item conversion model of this technology; the processor (CPU) uses a high-performance multi-core processor, especially for processing large amounts of data and performing For complex calculations, use at least a CPU with 16 cores or more (such as AMD Ryzen Threadripper or Intel Xeon series). And the size of the memory (RAM) can be but is not limited to a RAM specification of 64GB or higher to be able to process the size of the corpus and the size of the word vector model. The network interface controller is a high-speed and stable network connection hardware, especially used in cloud computing resources or downloading/uploading large amounts of data. Among them, the most important graphics processing unit (GPU) uses high-performance GPUs (such as NVIDIA's RTX 30 series or Tesla series) to reduce model training time; among them, in the training architecture of the server in this case, the above-mentioned A high-standard training server is used, but when the model training is completed and inference processing is performed, a lower-standard server host can be used, and it can be the same cloud or local server host deployed in the system. In the framework of this case, the adoption of server host specifications does not affect the technical features emphasized in this case, so any server host specifications should still fall within the technical scope of this case. Among them, the processor 100 is further connected to multiple delivery servers 500 through the network interface controller 130, so that the data transmission unit 3113 can upload the trademark application documents of the user U to the delivery servers 500 in the selected country.

具體地，使用者U操作電子裝置100的處理器110執行應用程式310之商標線上申請模組311進行商標線上申請，使用者U藉由電子裝置100輸入商標申請所需的資料，例如：商標名稱、商標圖樣、申請人、聯絡人、欲申請的商品項目或對商品服務的描述等，電子裝置100的顯示螢幕會顯示相對應的欄位供使用者U填入相對應的資料，處理器110進而啟用申請資料收集單元3111，將使用者U輸入在對應欄位之資料分類後存入記憶體120，因後續會特別使用到商標申請資料中的商標名稱、商標圖樣、欲申請的商品項目以及對商品服務的描述這些欄位的資訊，遂先經由申請資料收集單元3111做分類。Specifically, the user U operates the processor 110 of the electronic device 100 to execute the trademark online application module 311 of the application program 310 to apply for a trademark online. The user U inputs the information required for trademark application through the electronic device 100, such as the trademark name. , trademark pattern, applicant, contact person, product items to be applied for, or description of goods and services, etc., the display screen of the electronic device 100 will display corresponding fields for the user to fill in the corresponding information. The processor 110 Then the application data collection unit 3111 is activated to classify the data input by the user U in the corresponding fields and store it in the memory 120, because the trademark name, trademark pattern, product items to be applied for and the product items to be applied for in the trademark application data will be particularly used later. The information in these fields of product and service description is first classified through the application data collection unit 3111.

具體地，處理器110執行應用程式310啟用商標線上申請模組311的類別項目選擇單元3112，使用者U可以直接透過電子裝置100選擇欲申請商標的商品項目，處理器110存取記憶體120中預先存入的商標類別和項目列表，顯示於電子裝置100的顯示螢幕，提供有經驗或專業使用者直接尋找對應商品或服務的類別項目，並於選擇後接續完成商標線上申請。Specifically, the processor 110 executes the application program 310 to activate the category item selection unit 3112 of the trademark online application module 311. The user U can directly select the product item for which a trademark is to be applied for through the electronic device 100. The processor 110 accesses the memory 120. The pre-stored list of trademark categories and items is displayed on the display screen of the electronic device 100, allowing experienced or professional users to directly search for category items corresponding to goods or services, and then complete the online trademark application after selection.

倘若使用者U非有經驗或專業人士，可以透過電子裝置100的顯示螢幕選擇類別推薦選項，處理器100遂啟用類別推薦模組312進行商標商品項目推薦，讓使用者U可以參考推薦項目快速選擇適合的類別項目。If the user U is not an experienced or professional person, he can select the category recommendation option through the display screen of the electronic device 100, and the processor 100 activates the category recommendation module 312 to recommend trademark product items, so that the user U can quickly select by referring to the recommended items. Appropriate category items.

處理器110進一步啟用文字解析單元3121，讀取記憶體120中經過申請資料收集單元3111分類後之商標申請資料，特別是其中的商品項目及對商品服務的描述，對使用者U輸入的商品服務描述之文字做語意分析擷取出關鍵字，其中商品服務描述可以是例如：我要開一間賣手沖咖啡的咖啡廳、販賣飲料蛋糕手做餅乾與飾品等。The processor 110 further activates the text parsing unit 3121 to read the trademark application data classified by the application data collection unit 3111 in the memory 120, especially the product items and descriptions of the product services, and the product services input by the user U The text of the description is semantically analyzed to extract keywords. The description of the product and service can be, for example: I want to open a cafe selling hand-brewed coffee, sell drinks, cakes, hand-made biscuits and accessories, etc.

具體地，關鍵字擷取是一種自然語言處理技術，旨在從文本中自動提取出重要的關鍵字或詞組，方法可以為但是不限於統計方法，頻率法(Frequency-based methods)、文本統計法(Statistical methods)、文本向量化方法(Text vectorization methods)或機器學習方法(Machine learning methods)。Specifically, keyword extraction is a natural language processing technology that aims to automatically extract important keywords or phrases from text. The method can be but is not limited to statistical methods, frequency-based methods, and text statistics methods. (Statistical methods), text vectorization methods (Text vectorization methods) or machine learning methods (Machine learning methods).

頻率法基於詞語在文本中的頻率來判斷其重要性。常見的方法有TF-IDF(詞頻-逆文件頻率)和詞頻(Term Frequency)等。TF-IDF考慮了詞語在文本中的出現頻率以及在整個文集中的重要程度，詞頻則僅考慮了詞語在文本中的出現頻率。The frequency method determines the importance of words based on their frequency in the text. Common methods include TF-IDF (term frequency-inverse document frequency) and term frequency (Term Frequency). TF-IDF considers the frequency of words in the text and their importance in the entire corpus, while word frequency only considers the frequency of words in the text.

文本統計法基於統計模型來分析詞語在文本中的分布和關聯性。常見的方法有互信息(Mutual Information)、點互信息(Pointwise Mutual Information)和卡方檢驗(Chi-squared Test)等。這些方法通常需要建立一個詞語和文本之間的統計模型，並根據該模型計算詞語的重要性。Text statistics method is based on statistical models to analyze the distribution and correlation of words in text. Common methods include mutual information (Mutual Information), pointwise mutual information (Pointwise Mutual Information), and Chi-squared Test (Chi-squared Test). These methods usually require building a statistical model between words and text and calculating the importance of words based on this model.

文本向量化方法將文本轉換為向量表示，然後使用向量空間模型(Vector Space Model)來計算詞語的重要性。常見的方法有詞袋模型(Bag-of-Words Model)、詞向量(Word Embeddings)和文本向量化方法(如TF-IDF向量化)等。The text vectorization method converts text into vector representation, and then uses a vector space model to calculate the importance of words. Common methods include Bag-of-Words Model, Word Embeddings, and text vectorization methods (such as TF-IDF vectorization).

機器學習方法使用機器學習算法來訓練模型，從文本中學習詞語的重要性。常見的方法有文本分類、文本聚類和關鍵詞提取模型等。這些方法需要使用標註好的文本數據進行模型的訓練。Machine learning methods use machine learning algorithms to train models to learn the importance of words from text. Common methods include text classification, text clustering, and keyword extraction models. These methods require the use of annotated text data for model training.

上述也能透過使用腳本語言(例如Python)編寫一個程式來執行。The above can also be executed by writing a program using a scripting language (such as Python).

在文字解析單元3121擷取出關鍵字後，處理器110接著啟用類別項目比對單元3122，將記憶體120中使用者U輸入的商品項目、擷取出之關鍵字與商標類別項目進行比對，該商標類別項目已經預先透過資料收發模組313連接多個資料庫400將不同國家的商標類別項目存入記憶體120中，依據使用者U欲申請的國家選擇比對該國的商標類別項目。After the text analysis unit 3121 extracts the keywords, the processor 110 then activates the category item comparison unit 3122 to compare the product items input by the user U in the memory 120, the extracted keywords and the trademark category items. The trademark category items have been connected to multiple databases 400 through the data transceiver module 313 in advance and the trademark category items of different countries are stored in the memory 120. According to the country where the user U wants to apply, the trademark category items of the country are selected and compared.

具體而言，比對方法可以為但不限於使用腳本語言(例如Python)編寫一個程式，將每個類別名稱逐一讀入，再透過程式讀取記憶體120中的商標類別項目文字，以搜尋是否有符合或近似的項目，或是使用SQL語言進行比對，使用SELECT語句來從中選取符合特定條件的類別項目，以此找出是否存在於商標類別項目中。Specifically, the comparison method can be, but is not limited to, writing a program using a script language (such as Python), reading in each category name one by one, and then reading the trademark category item text in the memory 120 through the program to search whether There are matching or similar items, or you can use SQL language for comparison, and use the SELECT statement to select category items that meet specific conditions to find out whether they exist in the trademark category items.

若使用腳本語言(例如Python)編寫程式可以逐一讀入每個商標類別項目，接著，透過Python中的檔案處理功能，讀商標類別項目中的文字，以搜尋是否有符合或近似的文字，為了進行文字比對，可以使用Python中的字串處理方法，如字串比對、正規表達式等，透過這些方法，程式可以判斷商標類別項目中的文字是否包含與商品項目或關鍵字相符的字詞，以確定是否有符合的文字存在。If you use a scripting language (such as Python) to write a program, you can read each trademark category item one by one, and then use the file processing function in Python to read the text in the trademark category item to search for matching or similar words. In order to carry out For text comparison, you can use string processing methods in Python, such as string comparison, regular expressions, etc. Through these methods, the program can determine whether the text in the trademark category item contains words that match the product item or keyword to determine whether matching text exists.

另外，比對的方法可以是但不限於透過字串匹配算法(例如 Levenshtein Distance、Jaro-Winkler Distance 等)，Levenshtein Distance(也稱為 Edit Distance)是一種衡量兩個字串相似度的算法，其原理是計算將一個字串轉換為另一個字串所需的最少操作數，這些操作可以是插入一個字符、刪除一個字符或替換一個字符。In addition, the comparison method can be but is not limited to string matching algorithms (such as Levenshtein Distance, Jaro-Winkler Distance, etc.). Levenshtein Distance (also known as Edit Distance) is an algorithm that measures the similarity of two strings. The principle is to calculate the minimum number of operations required to convert one string to another. These operations can be to insert a character, delete a character, or replace a character.

舉例而言，假設有兩個字串"kitten"和"sitting"，要計算它們之間的 Levenshtein Distance，我們將 "kitten" 轉換為 "sitting"，需要經過以下幾個步驟：把 "k" 替換為 "s"，成為 "sitten"；把 "e" 替換為 "i"，成為 "sittin"；插入一個 "g"，成為 "sitting"，因此，這兩個字串之間的 Levenshtein Distance 為3，即需要3次操作才能把一個字串轉換為另一個字串。透過這種算法，我們可以計算出任意兩個字串之間的距離，並用它來比較它們之間的相似度。For example, suppose there are two strings "kitten" and "sitting". To calculate the Levenshtein Distance between them, we convert "kitten" to "sitting" and need to go through the following steps: Replace "k" is "s", it becomes "sitten"; replace "e" with "i", it becomes "sittin"; insert a "g", it becomes "sitting", therefore, the Levenshtein Distance between the two strings is 3 , that is, it takes 3 operations to convert one string to another. Through this algorithm, we can calculate the distance between any two strings and use it to compare the similarity between them.

此外，Jaro-Winkler Distance是一種衡量兩個字串相似度的算法，它是基於Jaro Distance的改進版本，通常用於比較較短的字串，比如人名、地址等，Jaro-Winkler Distance會根據兩個字串的相同字元和字元位置之間的距離，計算出一個0到1之間的相似度值，其中0表示完全不相似，1表示完全相同。Jaro-Winkler Distance算法的主要步驟如下：計算Jaro距離，計算兩個字串之間的相似度，它主要考慮了字串中字符的順序，以及兩個字串中共有的字符數量；計算Winkler修正因素，主要用於處理相似字串的情況，它通過計算共同前綴的長度以及一個常數p的值，來調整Jaro距離的值；計算Jaro-Winkler Distance，Jaro-Winkler Distance = Jaro距離 + Winkler修正因素。In addition, Jaro-Winkler Distance is an algorithm that measures the similarity of two strings. It is an improved version of Jaro Distance and is usually used to compare shorter strings, such as names, addresses, etc. Jaro-Winkler Distance will measure the similarity between two strings based on The distance between the same character and the character position of each string is calculated as a similarity value between 0 and 1, where 0 means completely dissimilar and 1 means exactly the same. The main steps of the Jaro-Winkler Distance algorithm are as follows: Calculate the Jaro distance and calculate the similarity between two strings. It mainly considers the order of characters in the strings and the number of characters in the two strings; calculate the Winkler correction Factor, mainly used to deal with similar strings. It adjusts the value of Jaro distance by calculating the length of the common prefix and the value of a constant p; calculates Jaro-Winkler Distance, Jaro-Winkler Distance = Jaro distance + Winkler correction factor .

在類別項目比對單元3122產生比對結果之後存入記憶體120中，同時處理器110啟用報告產生單元3123，將比對結果依照字詞相似程度進行排列，完全相同或近似程度高列為優先推薦的類別項目，具體而言，在比對的過程中可以依不同的方式計算近似程度，例如編輯距離(Edit Distance)、餘弦相似度(Cosine Similarity)或Jaccard相似度(Jaccard Similarity)，經過近似程度排列後之類產生一推薦報告，使用者U透過推薦報告可以清楚的知道如何選擇商標類別項目。After the comparison results are generated by the category item comparison unit 3122, they are stored in the memory 120. At the same time, the processor 110 activates the report generation unit 3123 to arrange the comparison results according to the degree of word similarity, with identical or high similarity being prioritized. Recommended category items, specifically, the degree of approximation can be calculated in different ways during the comparison process, such as Edit Distance, Cosine Similarity or Jaccard Similarity. After approximation After sorting by degree, a recommendation report is generated. Through the recommendation report, the user U can clearly know how to select trademark category items.

進一步地，處理器110透過該網路介面控制器130以連上該伺服器300並執行該應用程式310，從而配置啟用一風險評估模組314，對使用者U輸入之商標申請資料中的商標名稱進行前案比對，且處理器110執行該應用程式310中的風險評估模組314進一步配置啟用文字解析單元3121、檢索比對單元3142及報告產生單元3123。Further, the processor 110 connects to the server 300 through the network interface controller 130 and executes the application program 310, thereby configuring and enabling a risk assessment module 314 to evaluate the trademark in the trademark application data input by the user U. The name is compared with previous records, and the processor 110 executes the risk assessment module 314 in the application 310 to further configure and enable a text parsing unit 3121, a search comparison unit 3142, and a report generation unit 3123.

文字解析單元3121對存在記憶體120中的商標申請資料中的商標名稱進行文字排列組合，具體而言，若要將一個字詞進行單一文字的排列組合，可以使用以下步驟進行：首先，將字詞的第一個字元取出，作為起始字元；接著，將剩餘的字元進行排列組合，這可以透過遞迴或迭代的方式進行，遞迴是一種自我調用的過程，將問題分解成更小的子問題並遞迴處理，直到達到終止條件，迭代則是使用迴圈來重複執行相同的操作，直到達到終止條件；將第一個字元插入到每個排列組合的不同位置，形成新的排列組合，插入的位置可以從頭到尾逐一嘗試，這樣可以保證生成所有可能的排列組合；重複上述步驟，直到處理完所有的字元。The text analysis unit 3121 performs text arrangement and combination on the trademark names in the trademark application materials stored in the memory 120. Specifically, if you want to arrange and combine a word into a single text, you can use the following steps: First, put the words into a single text combination. The first character of the word is taken out as the starting character; then, the remaining characters are arranged and combined. This can be done through recursion or iteration. Recursion is a self-invoking process that decomposes the problem into Smaller sub-problems are processed recursively until the termination condition is reached. Iteration uses loops to repeatedly perform the same operation until the termination condition is reached; inserting the first character into different positions of each permutation and combination, forming New permutations and insertion positions can be tried one by one from beginning to end, so as to ensure that all possible permutations and combinations are generated; repeat the above steps until all characters have been processed.

要注意的是，上述排列組合方法僅為示範，並無加以限定，其主要作用在有利於後續比對過程中，盡可能找出相似的名稱。It should be noted that the above permutation and combination methods are only examples and are not limited. Their main function is to facilitate the subsequent comparison process to find similar names as much as possible.

在文字解析單元3121完成將使用者U輸入的商標名稱之所有排列組合後存入記憶體120中，且處理器110同時啟用檢索比對單元3142，在此之前已預先透過資料收發模組313連接資料庫400而儲存所有已申請的商標資訊在記憶體120中，這些已申請的商標資訊即為商標前案，做為後續比對的基礎。After the text analysis unit 3121 completes storing all the permutations and combinations of the brand name input by the user U into the memory 120, and the processor 110 simultaneously activates the search comparison unit 3142, which has been connected through the data sending and receiving module 313 before The database 400 stores all applied trademark information in the memory 120. These applied trademark information are trademark precedents and serve as the basis for subsequent comparisons.

具體地，檢索比對單元3142將記憶體120中所有排列組合與記憶體120中的商標前案進行比對，比對的方法如上述的類別項目比對單元3122近似，因此不再贅述。Specifically, the retrieval comparison unit 3142 compares all the permutations and combinations in the memory 120 with the trademark records in the memory 120. The comparison method is similar to the above-mentioned category item comparison unit 3122, and therefore will not be described again.

檢索比對單元3142比對後產生的比對結果同樣會存回記憶體120中，同時處理器110啟用報告產生單元3123，將比對結果進行近似程度排列，其近似程度計算方式如同上述類別推薦報告，近似程度越高排列順序越前或越上，而產生風險評估報告，該報告呈現的方式可為線上圖像、可下載之文書檔案、可轉發至通訊軟體之檔案格式、可分享至社群軟體之檔案格式、電子布告欄或其組合。The comparison results generated by the retrieval and comparison unit 3142 will also be stored back in the memory 120. At the same time, the processor 110 activates the report generation unit 3123 to arrange the comparison results by degree of similarity. The calculation method of the degree of similarity is the same as the above category recommendation. In the report, the higher the degree of similarity, the higher the ranking order, and a risk assessment report is generated. The report can be presented as an online image, a downloadable document file, a file format that can be forwarded to communication software, or can be shared with the community. Software file format, electronic bulletin board or combination thereof.

在另一實施例中，處理器110可以同時啟用類別推薦模組312與風險評估模組314，在報告產生單元3123所產生的則會是結合商標類別推薦與風險評估的綜合報告。In another embodiment, the processor 110 can enable the category recommendation module 312 and the risk assessment module 314 at the same time, and the report generation unit 3123 generates a comprehensive report that combines trademark category recommendation and risk assessment.

具體地，文字解析單元3121同時存取記憶體120中使用者U輸入之商標名稱、商品項目及商品服務描述，並對商標名稱進行文字排列組合、對商品服務描述進行關鍵字擷取，且存回記憶體120中，接著處理器110啟用類別項目比對單元3122與檢索比對單元3142，分別對關鍵字與所有文字排列組合進行對應的比對，產生的比對結果存回記憶體120。Specifically, the text analysis unit 3121 simultaneously accesses the brand name, product items and product service descriptions input by the user U in the memory 120, arranges and combines text on the brand name, extracts keywords from the product and service descriptions, and stores The processor 110 then activates the category item comparison unit 3122 and the retrieval comparison unit 3142 to perform corresponding comparisons between the keywords and all text permutations and combinations, and the generated comparison results are stored back in the memory 120 .

接著，處理器110啟用報告產生單元3123讀取記憶體120中的比對結果，綜合商標類別項目與商標名稱近似程度排列，具體而言，先排列出商標名稱近似程度，再分析計算該商標屬於的類別項目與推薦的商標類別項目近似程度，若商標名稱高度近似且該商標屬於的類別項目符合高度推薦的商標類別項目則代表此商標前案為高度近似前案，在綜合報告中會以明顯顏色來呈現。Next, the processor 110 activates the report generation unit 3123 to read the comparison results in the memory 120, and arranges the similarity between the trademark category items and the trademark names. Specifically, the similarity of the trademark names is first arranged, and then the trademark is analyzed and calculated. The degree of similarity between the category items of the trademark and the recommended trademark category items. If the trademark name is highly similar and the category item of the trademark belongs to the highly recommended trademark category item, it means that the previous trademark case is a highly similar previous case, and it will be clearly marked in the comprehensive report. Color is presented.

綜合報告中所顯示的顏色用以明顯區分使用者U盡量避開的商標類別項目，該商標類別項目具有較多高度近似的商標前案，提供使用者U清楚的選擇適合且風險較低的類別項目。The colors displayed in the comprehensive report are used to clearly distinguish the trademark category items that the user U tries to avoid. This trademark category item has many highly similar trademark cases, providing the user U with a clear choice of a suitable and low-risk category. project.

本發明的另一實施例請參閱圖10，本系統用以接收一使用者端藉由操作一電子裝置所提供之字串與欲查詢的至少一個目標國家，經該電子裝置的一處理器透過一網路介面控制器連上一伺服器並執行一應用程式，運算產出商標類別推薦報告，該系統至少包括：輸入模組600、多語翻譯模型700、類別推薦模組312、風險評估模組314及資料庫400，其中資料庫400可以為尼斯商標類別資料庫及多個國家的商標類別資料庫、多國商標資料庫。Please refer to Figure 10 for another embodiment of the present invention. The system is used to receive a string provided by a user by operating an electronic device and at least one target country to be queried, through a processor of the electronic device. A network interface controller is connected to a server and executes an application program to calculate and generate a trademark category recommendation report. The system at least includes: an input module 600, a multilingual translation model 700, a category recommendation module 312, and a risk assessment module. Group 314 and database 400, where the database 400 can be a Nice trademark category database, a trademark category database of multiple countries, or a multinational trademark database.

其中，尼斯商標類別資料庫及多個國家的商標類別資料庫主要是分別儲存有尼斯國際商標類別的分類與商品項目資料，以及各個對應國家的商標類別分類與商品項目資料，而多國商標資料庫則是儲存有各個對應國家的商標申請資料。Among them, the Nice trademark category database and the trademark category databases of multiple countries mainly store the classification and product item data of the Nice international trademark category, as well as the trademark category classification and product item data of each corresponding country. The multi-country trademark data The database stores trademark application data for each corresponding country.

使用者端可以為品牌業主、商標申請人、商標從業人士、法律相關從業人士的組合中任意選擇。Users can choose from any combination of brand owners, trademark applicants, trademark practitioners, and legal practitioners.

輸入模組600用以接收使用者端所輸入的字串，並將該字串進行標籤化處理，並發送一字串資訊，具體地，字串標籤化（Tokenization）是將一個句子或文件拆分成個別的詞彙（tokens）的過程，以下舉例一個簡單的字串標籤化處理方法：The input module 600 is used to receive a string input by the user, perform tokenization processing on the string, and send a string of information. Specifically, string tokenization (Tokenization) is to decompose a sentence or document. The process of dividing into individual words (tokens), here is a simple string labeling processing method:

去除標點符號：使用正則表達式或預先定義的標點符號列表，將字串中的標點符號去除，例如句點、逗號、問號等。拆分字詞：將字串按照空格或其他特定的分隔符號進行拆分，每個拆分的部分即為一個詞彙。處理特殊情況：處理特殊情況，例如縮寫詞、連字符、數字等。可以使用正則表達式或特定的規則來處理這些情況，將它們拆分成合適的詞彙。轉換為小寫(若為具有大小寫區分的語言)：將所有詞彙轉換為小寫形式，以統一詞彙的表示方式。Remove punctuation marks: Use regular expressions or a predefined list of punctuation marks to remove punctuation marks from the string, such as periods, commas, question marks, etc. Split words: Split the string according to spaces or other specific delimiters, and each split part is a word. Handle special cases: Handle special cases such as abbreviations, hyphens, numbers, etc. Regular expressions or specific rules can be used to handle these situations, splitting them into appropriate words. Convert to lowercase (if it is a case-sensitive language): Convert all words to lowercase to unify the representation of words.

多語翻譯模型700係透過大型語言翻譯訓練出之一運算模型，在接收到該字串資訊時，判斷輸入的字串語言，與使用者選擇的目標國家之官方語言是否一致，若非一致則對使用者輸入的字串進行翻譯，遂將字串翻譯為符合該目標國家之官方語言，並將翻譯後的字串資訊進行發送。The multilingual translation model 700 is a computing model trained through large-scale language translation. When receiving the string information, it determines whether the input string language is consistent with the official language of the target country selected by the user. If it is not consistent, the The string entered by the user is translated, the string is translated into the official language of the target country, and the translated string information is sent.

類別推薦模組312係透過自然語言模型與商標分類表及細目所訓練出之運算模型，用以將模糊語意(或非精準敘述)的字串資訊解析為商標類別推薦資訊，其中更包含：文字解析單元3121、類別項目比對單元3122以及報告產生單元3123。The category recommendation module 312 is a computational model trained through natural language models and trademark classification tables and details to parse ambiguous semantic (or imprecise description) string information into trademark category recommendation information, which also includes: text Analysis unit 3121, category item comparison unit 3122, and report generation unit 3123.

自然語言模型是可以根據已知的文本資料來預測下一個詞彙或生成符合文法和語意的句子，可以為但不限於：Natural language models can predict the next word or generate sentences that conform to grammar and semantics based on known text data. They can be but are not limited to:

N-gram模型：N-gram模型是一種基於機率的語言模型，它假設詞彙出現的機率只與前面N-1個詞彙有關。例如，在二元（bigram）模型中，對於給定的前一個詞彙，預測下一個詞彙的機率。N-gram model: The N-gram model is a language model based on probability. It assumes that the probability of a word appearing is only related to the previous N-1 words. For example, in a bigram model, predict the probability of the next word given the previous word.

遞歸神經網路（RNN）模型：RNN是一種適合處理序列數據的神經網路，它可以捕捉詞彙間的時間相依性。在自然語言處理中，RNN常用於語言模型的建構，其中每個詞彙被視為一個時間步。Recurrent Neural Network (RNN) model: RNN is a neural network suitable for processing sequence data, which can capture the temporal dependence between words. In natural language processing, RNN is often used in the construction of language models, where each word is regarded as a time step.

預訓練語言模型（例如BERT）：預訓練語言模型是通過大規模無監督訓練而得到的模型，可以理解和生成自然語言。BERT模型利用Transformer網路架構，在大量的文本資料上進行預訓練，然後進行微調以適應特定的任務，如文本分類、命名實體識別等。Pre-trained language model (such as BERT): A pre-trained language model is a model obtained through large-scale unsupervised training that can understand and generate natural language. The BERT model uses the Transformer network architecture to be pre-trained on a large amount of text data, and then fine-tuned to adapt to specific tasks, such as text classification, named entity recognition, etc.

商標分類表及細目為從各國的商標別資料庫中擷取出的分類，以及分類中的所有商品項目，將其擷取出來並形成列表，提供自然語言模型進行訓練與運算。The trademark classification table and details are the classifications extracted from the trademark classification databases of various countries, as well as all the product items in the classifications. They are extracted and formed into a list, and a natural language model is provided for training and calculation.

文字解析單元3121用以將該翻譯後字串資訊進行解析，擷取出關鍵字以及與商業行為、產品、服務等統稱為產業資訊之描述文字。The text parsing unit 3121 is used to parse the translated string information and extract keywords and descriptive text collectively referred to as industrial information such as business activities, products, services, etc.

類別項目比對單元3122，係用以接收該等關鍵字及產業資訊的相關描述文字，並將其與該目標國家之商標類別資料庫進行比對，並運算產生最接近至少一個類別資訊與該類別之一產品項目資訊與一服務項目資訊，即為推薦類別與推薦商品項目，一個推薦類別搭配至少一個推薦商品項目。The category item comparison unit 3122 is used to receive the relevant description text of the keywords and industry information, compare it with the trademark category database of the target country, and calculate to generate at least one category information that is closest to the target country. One category of product item information and one category of service item information are recommended categories and recommended product items, and one recommended category is matched with at least one recommended product item.

報告產生單元3123，係用以接收該推薦類別、該推薦商品項目與該目標國家，用該輸入語言透過一模板化格式產出一商標類別推薦報告，具體地，若最初使用者所輸入的字串之語言與目標國家的官方語言不相同，則產生之商標類別推薦報告會再透過多語翻譯模型700將推薦類別與推薦商品項目等文字翻譯回輸入字串時所使用的語言。The report generation unit 3123 is used to receive the recommended category, the recommended product item and the target country, and use the input language to generate a trademark category recommendation report through a templated format. Specifically, if the word input by the initial user If the language of the string is different from the official language of the target country, the generated trademark category recommendation report will then use the multilingual translation model 700 to translate the recommended category and recommended product items back to the language used when the string was input.

具體地，輸入的字串可以包含商標文字、商標描述、商業行為、公司名稱、股票代碼、產品名稱、服務名稱或其組合資訊。Specifically, the input string may include trademark text, trademark description, business behavior, company name, stock code, product name, service name or combination information thereof.

使用者端輸入字串的方式可以是文字輸入、語音輸入或視頻輸入。The method of inputting strings on the user side can be text input, voice input or video input.

該輸入模組600更包含一資訊擷取單元601，係用以將標籤化後的字串資訊主動至網路空間擷取相關資訊，並可將該資訊傳送至多語翻譯模型700進行翻譯。The input module 600 further includes an information retrieval unit 601, which is used to actively retrieve the tagged string information to the network space to retrieve relevant information, and can transmit the information to the multilingual translation model 700 for translation.

具體地，資訊擷取單元601是依據使用者所輸入的字串，在經過文字解析單元3121解析後所產生的產業相關資訊，在網路空間進行爬取資訊，在此所爬取的資訊可以為使用者的官方網站、包含擷取出的關鍵字的新聞或其他網路文章。Specifically, the information retrieval unit 601 crawls information in the cyberspace based on the industry-related information generated after the text string input by the user is parsed by the text parsing unit 3121. The information crawled here can be For the user's official website, news or other online articles containing the extracted keywords.

風險評估模組314係用以將自輸入的字串中所擷取出之關鍵字與該目標國家商之商標資料庫進行比對，並產出一風險資訊，風險資訊更可透過該報告產生單元3123整併至該商標類別推薦報告進而產生風險評估報告。The risk assessment module 314 is used to compare the keywords extracted from the input string with the trademark database of the target country, and generate a risk information. The risk information can also be generated through the report generation unit. 3123 is integrated into the trademark category recommendation report to generate a risk assessment report.

具體地，是先經由類別推薦模組312運算後產生推薦商標類別與推薦商品項目，再經由風險評估模組在該些推薦類別中進行檢索比對，首先是比對關鍵字與目標國家商標資料庫中是否存在近似的名稱，若在推薦類別中存在近似前案，則再比對商品項目的近似程度，最終交叉分析之後產生風險評估報告。Specifically, the recommended trademark categories and recommended product items are first generated through calculation by the category recommendation module 312, and then the risk assessment module is used to search and compare these recommended categories. First, the keywords are compared with the trademark information of the target country. Whether there is a similar name in the library, and if there is a similar previous name in the recommended category, then compare the similarity of the product items, and finally generate a risk assessment report after cross analysis.

請接續參閱圖11，本系統之類別推薦模組312更包含知識圖譜比對單元3124，具體地，該資訊擷取單元601除了可以爬取與使用者本身官網或相關的網路資訊以外，還可以依據解析後的產業資訊去爬取相似產業或相似背景的公司資訊，並再將這些資訊藉由知識圖譜比對單元3124進行比對，將相似產業或相似背景公司申請過的商標前案比對搜尋出來，並且擷取出該些案件所申請的類別與其中的商品項目併入由類別項目比對單元3122比對出的推薦類別及推薦商品項目中。Please continue to refer to Figure 11. The category recommendation module 312 of this system further includes a knowledge graph comparison unit 3124. Specifically, the information retrieval unit 601 can not only crawl the user's own official website or related network information, but also Based on the parsed industrial information, company information of similar industries or similar backgrounds can be crawled, and then the information can be compared through the knowledge graph comparison unit 3124 to compare previous trademark cases applied by companies in similar industries or similar backgrounds. The categories applied for in these cases and the product items are retrieved and merged into the recommended categories and recommended product items compared by the category item comparison unit 3122.

知識圖譜（Knowledge Graph），其中的每個節點代表不同公司的商標申請案件紀錄資料，邊則代表各公司之間的產業相關性或相似性。這種方法可以直觀地顯示出不同公司間所申請的商標中類別/商品項目與產業之間關係，而此知識圖普的建構係透過時間進行蒐集，當中，係包括但不限定於如下的資訊：已完成註冊之商標公告資訊(商標家族)、透過字義擴充與翻譯比對所建構的前案對照表、以及部分國家智財局公開之商標申請案件。Knowledge Graph (Knowledge Graph), in which each node represents the trademark application case record data of different companies, and the edges represent the industrial correlation or similarity between the companies. This method can visually display the relationship between categories/product items and industries in trademarks applied for by different companies, and the construction of this knowledge map is collected over time, which includes but is not limited to the following information : Completed registration trademark announcement information (trademark family), a comparison table of previous cases constructed through word meaning expansion and translation comparison, and some trademark application cases disclosed by the National Intellectual Property Administration.

並且，其詳細的建構流程如下：Moreover, its detailed construction process is as follows:

數據蒐集：收集已有的商標資料，包括公司申請過的商標、商標名稱、類別等相關資訊。這些資料可以從商標資料庫、專利局等來源獲取。Data collection: Collect existing trademark information, including trademarks, trademark names, categories and other related information that the company has applied for. This information can be obtained from sources such as trademark databases and patent offices.

知識圖譜建構：將商標資料與其他相關資訊，例如公司信息、產業類別等，結合在一起，建構一個知識圖譜。知識圖譜可以使用圖形數據庫或圖形表示法來表示不同實體（例如公司、商標、產業）之間的關係。Knowledge graph construction: Combine trademark information with other related information, such as company information, industry categories, etc., to construct a knowledge graph. Knowledge graphs can use graph databases or graphical representations to represent relationships between different entities (e.g. companies, trademarks, industries).

相似度計算：根據知識圖譜中的關係和屬性，計算不同商標之間的相似度。這可以基於不同的特徵，例如商標名稱的相似度、所屬類別的相似度等。Similarity calculation: Calculate the similarity between different trademarks based on the relationships and attributes in the knowledge graph. This can be based on different characteristics, such as similarity of the brand name, similarity of the category it belongs to, etc.

本發明的另一實施例請參閱圖12，顯示本發明之方法流程圖，其包含步驟S101~S108。For another embodiment of the present invention, please refer to FIG. 12 , which shows a flow chart of the method of the present invention, which includes steps S101 to S108.

在步驟S101中用戶輸入介紹文字，主要是透過操作電子裝置100輸入任意的文字，並沒有限定一定要是商標名稱或制式的字詞；在步驟S102中文字解析單元3121對用戶輸入的介紹文字進行語意分析，且判斷輸入的文字中是否包含有用戶的品牌名稱、公司名稱、公司介紹、商品介紹等資訊；若是則進行步驟S103，文字解析單元3121將用戶輸入的介紹文字中的品牌名稱與公司名稱擷取出來；並接著進行步驟S104，對擷取出的文字做識別性的判別，主要是因為商標的核准條件之一為需具有識別性，在步驟S105中透過文字解析單元3121先排除不具識別性文字並擷取出識別性文字，例如ABC股份有限公司中的”股份有限公司”即為不具識別性文字；前述的判斷若為否與經過步驟S105後進行步驟S106，將用戶輸入的介紹文字或具識別性文字再次透過文字解析單元3121進行產業分析，同時也進行產品及服務內容的解析，在步驟S107中，還會啟用類別項目比對單元3122，將步驟S106的識別性文字、經解析分析後的介紹文字與資料庫400中的商標類別項目進行比對，且在步驟S108中將比對結果透過報告產生單元3123形成類別推薦報告，進一步附上推薦理由。In step S101, the user inputs the introduction text, mainly by operating the electronic device 100 to input any text, which is not limited to trademark names or standard words; in step S102, the text analysis unit 3121 performs semantic analysis on the introduction text input by the user. Analyze and determine whether the input text contains the user's brand name, company name, company introduction, product introduction and other information; if so, proceed to step S103, the text analysis unit 3121 compares the brand name and company name in the introduction text input by the user Extract it; and then proceed to step S104 to determine the identification of the extracted text. This is mainly because one of the approval conditions for a trademark is that it must be identifiable. In step S105, the text analysis unit 3121 is used to first exclude non-identifiable characters. text and extract the identifiable text. For example, the "Incorporated Company" in ABC Co., Ltd. is non-identifiable text; if the aforementioned judgment is no, step S106 is performed after step S105, and the introduction text or specific information input by the user is The identifying text is again analyzed through the text analysis unit 3121 for industry analysis and product and service content analysis. In step S107, the category item comparison unit 3122 is also activated to parse and analyze the identifying text in step S106. The introduction text is compared with the trademark category items in the database 400, and in step S108, the comparison result is used to form a category recommendation report through the report generation unit 3123, and the recommendation reasons are further attached.

本發明之另一實施例，與圖12之實施例的差異在於步驟S106之後，在步驟S106中將用戶輸入的介紹文字或具識別性文字再次透過文字解析單元3121進行產業分析，同時也進行產品及服務內容的解析，進一步，類別項目比對單元3122先對經解析後的產業敘述進行比對，進而產生推薦的商標類別，接著，類別項目比對單元3122對經解析後的商品及服務內容描述在已經產生的推薦的商標類別中進行比對，進而產生該商標類別的推薦商品項目，最後再進到步驟S108，將比對結果透過報告產生單元3123形成類別推薦報告，進一步附上推薦理由。Another embodiment of the present invention differs from the embodiment of FIG. 12 in that after step S106, in step S106, the introductory text or identifying text input by the user is again analyzed through the text analysis unit 3121, and the product is also analyzed. and service content analysis. Furthermore, the category item comparison unit 3122 first compares the analyzed industry descriptions, and then generates recommended trademark categories. Then, the category item comparison unit 3122 compares the analyzed product and service content. Describes comparing the already generated recommended trademark categories, and then generating recommended product items of the trademark category, and finally proceeding to step S108, where the comparison results are formed into a category recommendation report through the report generation unit 3123, and the reasons for the recommendation are further attached. .

請接續參閱圖13，顯示本發明之另一實施例方法流程圖，其中包含步驟S201~S208。Please continue to refer to FIG. 13 , which shows a method flow chart of another embodiment of the present invention, which includes steps S201 to S208.

在此實施例中，步驟S201用戶透過操作電子裝置100複製公司介紹資訊的文字描述後貼入，特別的是，用戶於步驟S201輸入的是一段文字敘述並非單純字詞、名詞；在步驟S202中，文字解析單元3121對用戶輸入的文字進行語意分析、擷取關鍵字；並判斷輸入的文字描述中是否具有用戶的品牌名稱、公司名稱等資訊，而後續的步驟S203~S208與前述的步驟S103~S108相同，因此不再贅述。In this embodiment, in step S201, the user copies and pastes the text description of the company introduction information by operating the electronic device 100. In particular, what the user inputs in step S201 is a text description and not just words or nouns; in step S202 , the text analysis unit 3121 performs semantic analysis on the text input by the user, extracts keywords; and determines whether the input text description contains the user's brand name, company name and other information, and the subsequent steps S203 to S208 are the same as the aforementioned step S103 ~S108 is the same, so it will not be described again.

請接續參閱圖14，顯示本發明之另一實施例方法流程圖，其中包含步驟S301~S309。Please continue to refer to FIG. 14 , which shows a method flow chart of another embodiment of the present invention, which includes steps S301 to S309.

在步驟S301中用戶透過操作電子裝置100輸入介紹文字，並經由文字解析單元3121判斷輸入的語言是否為預設接收的語言是否需進行翻譯；若是需進行翻譯則先進行步驟S302進行語言翻譯，若不需翻譯或完成步驟S302後進行步驟S303，文字解析單元3121對用戶輸入的介紹文字進行語意分析、擷取關鍵字；而後續的步驟S304~S309與前述的步驟S103~S108相同，因此不再贅述。In step S301, the user inputs introduction text by operating the electronic device 100, and determines through the text analysis unit 3121 whether the input language is the default received language and whether translation needs to be performed; if translation is required, step S302 is first performed for language translation. If There is no need to translate or complete step S302 before proceeding to step S303. The text analysis unit 3121 performs semantic analysis on the introduction text input by the user and extracts keywords; and subsequent steps S304 to S309 are the same as the aforementioned steps S103 to S108, so no further steps are required. Repeat.

請接續參閱圖15，顯示本發明之另一實施例方法流程圖，其中包含步驟S401~S408。Please continue to refer to FIG. 15 , which shows a method flow chart of another embodiment of the present invention, which includes steps S401 to S408.

在步驟S401中用戶透過操作電子裝置100利用語音方式敘述輸入說明文字，且透過文字解析單元3121將語音轉換為文字儲存於記憶體120中，其中的語音識別技術可以基於模型和算法，將音訊數據與語音模型進行比對，識別出對應的文字內容，並進行文字的後處理，例如語法校正、拼寫修正和語義分析；接著對文字進行判斷，判斷是否具有用戶的品牌名稱、公司名稱等資訊；而後續的步驟S403~S408與前述的步驟S103~S108相同，因此不再贅述。In step S401, the user narrates and inputs descriptive text by operating the electronic device 100, and converts the speech into text through the text analysis unit 3121 and stores it in the memory 120. The speech recognition technology can be based on models and algorithms to convert the audio data into text. Compare with the speech model to identify the corresponding text content, and perform post-processing of the text, such as grammar correction, spelling correction and semantic analysis; then judge the text to determine whether it contains the user's brand name, company name and other information; The subsequent steps S403 to S408 are the same as the aforementioned steps S103 to S108, and therefore will not be described again.

請接續參閱圖16，顯示本發明之另一實施例方法流程圖，其中包含步驟S501~S509。Please continue to refer to FIG. 16 , which shows a method flow chart of another embodiment of the present invention, which includes steps S501 to S509.

在此實施例中之步驟S501~S507與前述的步驟S101~S107相同，因此不再贅述。在步驟S508中啟用檢索比對單元3142，將擷取出之具有識別性文字與資料庫400中的已申請的商標前案進行比對，且將比對結果存入記憶體120；在步驟509中，透過報告產生單元3123將比對結果經過排列後產生類別推薦報告以及近似前案列表，且同時附上類別推薦的理由。Steps S501 to S507 in this embodiment are the same as the aforementioned steps S101 to S107, and therefore will not be described again. In step S508, the search comparison unit 3142 is enabled to compare the retrieved identifying words with the applied trademark records in the database 400, and store the comparison results in the memory 120; in step 509 , the report generation unit 3123 sorts the comparison results to generate a category recommendation report and a list of similar cases, and at the same time attaches the reasons for the category recommendation.

請接續參閱圖17，顯示本發明之另一實施例方法流程圖，其中包含步驟S601~S611。Please continue to refer to FIG. 17 , which shows a method flow chart of another embodiment of the present invention, which includes steps S601 to S611.

在此實施例中主要是提供用戶不同國家(跨國)的商標類別推薦，在步驟S601~S608與前述的步驟S101~S108相同，因此不再贅述。在步驟S608後，電子裝置100的顯示螢幕顯示是否進行跨國商品項目轉換提供用戶選擇，若否則結束，若是將進行步驟S609，提供資料庫400中存在的多個國家給用戶選擇，有別於原始申請的國家，再選擇至少一第二申請國；在步驟S610中用戶選定國家後，處理器110讀取記憶體120中該國的商標類別項目並啟用類別項目比對單元3122進行比對；在步驟S611中將比對結果顯示於電子裝置100的顯示螢幕，藉以達到跨國的商標類別推薦功效。In this embodiment, the main purpose is to provide users with trademark category recommendations from different countries (cross-border countries). Steps S601 to S608 are the same as the aforementioned steps S101 to S108, and therefore will not be described again. After step S608, the display screen of the electronic device 100 displays whether to perform cross-border commodity item conversion to provide the user with a choice. If not, the end is completed. If so, step S609 is performed to provide multiple countries in the database 400 for the user to choose. Different from the original The country of application, and then select at least one second application country; after the user selects the country in step S610, the processor 110 reads the trademark category items of the country in the memory 120 and activates the category item comparison unit 3122 for comparison; In step S611, the comparison result is displayed on the display screen of the electronic device 100, thereby achieving a cross-border trademark category recommendation function.

請接續參閱圖18，顯示本發明之另一實施例方法流程圖，其中包含步驟S701~S709。Please continue to refer to FIG. 18 , which shows a method flow chart of another embodiment of the present invention, which includes steps S701 to S709.

在此實施例中步驟S701用戶透過操作電子裝置100輸入介紹文字後，遂先進行申請國家的選擇，在確定商標申請國家後進行步驟S702，文字解析單元3121對用戶輸入的介紹文字做語意分析，並判斷介紹文字中是否具有品牌名稱、公司名稱等資訊，而後續的步驟S703~S707與前述的步驟S103~S107相同，因此不再贅述。在步驟S708中，將步驟S707所產生的比對結果依據用戶在步驟S701所選擇的國家進行跨國轉換，將原始的推薦類別項目轉換為該選擇國家的商品項目；在步S709中報告產生單元3123將轉換結果輸出產生跨國的類別推薦報告。In this embodiment, in step S701, after the user inputs the introduction text by operating the electronic device 100, he first selects the country for application. After determining the country for trademark application, step S702 is performed. The text analysis unit 3121 performs semantic analysis on the introduction text input by the user. And determine whether the introduction text contains information such as brand name, company name, etc., and subsequent steps S703 to S707 are the same as the aforementioned steps S103 to S107, so they will not be described again. In step S708, the comparison results generated in step S707 are converted across countries according to the country selected by the user in step S701, and the original recommended category items are converted into product items of the selected country; in step S709, the report generation unit 3123 The conversion results are output to generate a cross-border category recommendation report.

請接續參閱圖19，顯示本發明之另一實施例方法流程圖，其中包含步驟S801~S813。Please continue to refer to FIG. 19 , which shows a method flow chart of another embodiment of the present invention, which includes steps S801 to S813.

在此實施例中也是提供可以跨國轉換的類別推薦及風險評估，且步驟S801~S806與前述的步驟S101~S106相同，因此不再贅述。在步驟S807中將已經擷取出的具有識別性文字及解析後的文字透過文字解析單元3121進行語言轉換，在此實施例中預設將所有文字統一轉換為英文；在步驟S808中，類別項目比對單元3122將已轉換為英文的文字與在資料庫400中已經預先轉換為英文的資訊資料進行比對；在步驟S809、S810中，分別是比對商品項目以及比對已申請的案件名稱，比對已申請的案件名稱是透過檢索比對單元3142；在步驟S811中，將步驟S809、S810的比對結果藉由報告產生單元3123進行綜合分析；在步驟S812中報告產生單元3123在比對結果中以顏色區分類別推薦的優先順位；在步驟S813中報告產生單元3123綜合分析類別推薦與檢索結果後整合產生跨國的風險評估報告。In this embodiment, category recommendations and risk assessments that can be converted across countries are also provided, and steps S801 to S806 are the same as the aforementioned steps S101 to S106, so they will not be described again. In step S807, the extracted identifiable text and the parsed text are language converted through the text parsing unit 3121. In this embodiment, all text is preset to be uniformly converted into English; in step S808, the category item ratio is The comparison unit 3122 compares the text that has been converted into English with the information data that has been converted into English in advance in the database 400; in steps S809 and S810, it compares the product items and the applied case names, respectively. The applied case names are compared through the search comparison unit 3142; in step S811, the comparison results of steps S809 and S810 are comprehensively analyzed by the report generation unit 3123; in step S812, the report generation unit 3123 performs the comparison The priority of category recommendations is distinguished by color in the results; in step S813, the report generation unit 3123 comprehensively analyzes the category recommendations and search results and then integrates and generates a transnational risk assessment report.

請接續參閱圖20，顯示本發明之另一實施例方法流程圖，其中包含步驟S901~S914。Please continue to refer to FIG. 20 , which shows a method flow chart of another embodiment of the present invention, which includes steps S901 to S914.

此實施例也是提供跨國的類別推薦與風險評估方法，且步驟S901~S906與前述的步驟S101~S106相同、步驟S907與前述步驟S807相同，因此不再贅述。在步驟S908中啟用文字解析單元3121將擷取出的識別性文字與解析後的文字進行同義詞或近似詞生成，產出近似詞或同義詞的方法可以為但不限於文字解析單元3121利用已有的詞彙表，將詞彙的同義詞或相似詞納入詞彙表中，透過匹配詞彙表中的詞彙，生成近似詞或同義詞。例如WordNet就是一個基於詞彙表擴充法的同義詞生成系統，或是利用大量的語料庫來學習單詞之間的關係，包括詞彙共現和上下文相似度等，生成詞彙之間的相似性分數，進而生成近似詞或同義詞。例如LSI(Latent Semantic Indexing)和LDA(Latent Dirichlet Allocation)。This embodiment also provides a cross-border category recommendation and risk assessment method, and steps S901 to S906 are the same as the aforementioned steps S101 to S106, and step S907 is the same as the aforementioned step S807, so they will not be described again. In step S908, the text parsing unit 3121 is enabled to generate synonyms or similar words between the extracted identifying text and the parsed text. The method of generating similar words or synonyms may be, but is not limited to, the text parsing unit 3121 using existing vocabulary. Table, include synonyms or similar words of words into the vocabulary list, and generate similar words or synonyms by matching the words in the vocabulary list. For example, WordNet is a synonym generation system based on the vocabulary expansion method, or uses a large number of corpora to learn the relationship between words, including word co-occurrence and contextual similarity, etc., to generate similarity scores between words, and then generate approximations words or synonyms. For example, LSI (Latent Semantic Indexing) and LDA (Latent Dirichlet Allocation).

WordNet是一個英文詞彙的電腦化詞彙庫，包含大量的英文單詞，並以詞彙的語意和語法關係作為基礎組織和管理單詞，WordNet的目的是提供一個可靠的語言資源，用於自然語言處理和語義分析。WordNet is a computerized vocabulary database of English vocabulary, which contains a large number of English words and uses the semantic and grammatical relationships of words as a basis to organize and manage words. The purpose of WordNet is to provide a reliable language resource for natural language processing and semantics. analyze.

LSI(Latent Semantic Indexing)是一種自然語言處理技術，用於識別同義詞和相似詞，它的基本原理是通過將文本轉換為向量空間模型，然後進行奇異值分解(Singular Value Decomposition，SVD)來識別詞語之間的語義關係，具體而言，LSI運用了文本中的詞頻統計學方法，將文本中的詞彙進行處理，將它們轉換為向量空間模型。在進行SVD之前，LSI通常使用TF-IDF(Term Frequency-Inverse Document Frequency)權重來加權詞彙，以消除一些常用詞語的影響，從而更好地捕捉詞彙之間的關聯性，透過這樣的處理，LSI可以生成一個詞彙-文本矩陣，其中每個詞彙對應於一個向量。通過奇異值分解，LSI可以分解出這個詞彙-文本矩陣的奇異值和奇異向量，從而捕捉文本中隱含的語義信息。基於這些語義信息，LSI可以計算兩個詞彙之間的相似度，從而找出同義詞或相似詞。LSI (Latent Semantic Indexing) is a natural language processing technology used to identify synonyms and similar words. Its basic principle is to identify words by converting text into a vector space model and then performing Singular Value Decomposition (SVD) Specifically, LSI uses word frequency statistics in the text to process the words in the text and convert them into a vector space model. Before performing SVD, LSI usually uses TF-IDF (Term Frequency-Inverse Document Frequency) weights to weight words to eliminate the influence of some commonly used words, thereby better capturing the correlation between words. Through this processing, LSI A word-text matrix can be generated in which each word corresponds to a vector. Through singular value decomposition, LSI can decompose the singular values and singular vectors of this vocabulary-text matrix, thereby capturing the semantic information implicit in the text. Based on this semantic information, LSI can calculate the similarity between two words to find synonyms or similar words.

此外，LDA(Latent Dirichlet Allocation)是一種常用的主題模型，可以將一個文集中的文本分配到多個主題中，同時可以找到每個主題所代表的詞彙分布，這個詞彙分布可以用來尋找同義詞或近似詞，首先需要準備一個文本集，這個文本集可以是任何包含目標詞彙的文本集合。接著，使用LDA將這個文本集分配到多個主題中，同時可以找到每個主題所代表的詞彙分布，對於一個目標詞彙，可以使用其在LDA模型中對應的詞彙分布來尋找同義詞或近似詞。具體來說，可以計算目標詞彙的詞彙分布和其他詞彙的詞彙分布之間的相似度，找到與目標詞彙相似的詞彙作為同義詞或近似詞。In addition, LDA (Latent Dirichlet Allocation) is a commonly used topic model that can allocate texts in a collection to multiple topics, and at the same time find the vocabulary distribution represented by each topic. This vocabulary distribution can be used to find synonyms or To approximate words, you first need to prepare a text set. This text set can be any text set that contains the target vocabulary. Then, use LDA to distribute this text set into multiple topics. At the same time, you can find the vocabulary distribution represented by each topic. For a target vocabulary, you can use its corresponding vocabulary distribution in the LDA model to find synonyms or similar words. Specifically, the similarity between the vocabulary distribution of the target vocabulary and the vocabulary distribution of other vocabulary can be calculated, and vocabulary similar to the target vocabulary can be found as synonyms or approximate words.

文字解析單元3121同樣可使用腳本語言(例如Python)編寫一個程式來執行WordNet、LSI(Latent Semantic Indexing)、LDA(Latent Dirichlet Allocation)進行文字解析並產出同義詞或近似詞。The text parsing unit 3121 can also use a script language (such as Python) to write a program to execute WordNet, LSI (Latent Semantic Indexing), and LDA (Latent Dirichlet Allocation) to perform text parsing and generate synonyms or similar words.

其中，值得提出說明的是，於本發明技術中的機器翻譯係採用直接的方法並基於深度學習的機器翻譯模型，例如Google的Transformer模型或Facebook的FAIR SEQ模型，將一國的商標商品項目語句直接翻譯成另一國的語言。從而進入另一國的商品項目資料庫中進行直接的文字比對。Among them, it is worth mentioning that the machine translation in the technology of the present invention adopts a direct method and a machine translation model based on deep learning, such as Google's Transformer model or Facebook's FAIR SEQ model, to translate a country's trademark product item sentences into Direct translation into another country's language. Thereby entering another country's commodity item database for direct text comparison.

此外，於本發明技術中建立一個跨國商標商品項目的知識圖譜（Knowledge Graph），知識圖譜用以建立節點矩陣關聯，其中的每個節點代表不同國家的商品項目，邊則代表商品項目之間的相關性或相似性。並依各國之間商品項目與各國商標項目之間的等價關係根據時間演進的動態學習模式，該知識圖譜單元轉換至少包含以下步驟：將多國商標商品項目與商標專案關係資料收集與動態模型學習，透過圖譜查詢與自然與研磨行推理，推算出最佳匹配、最短路徑、最大流量，並根據最終決策結果進行優化商品項目之等價關係。這種方法可以直觀地顯示出不同國家間的商品項目轉換關係，而此知識圖普的建構係透過時間進行蒐集，當中，係包括但不限定於如下的資訊：已完成註冊之跨國商標公告資訊(商標家族)、先前用戶完成轉換的結果、透過字義擴充與翻譯比對所建構的商品項目對照表、以及部分國家智財局公開之商品項目翻譯對照表。In addition, a knowledge graph (Knowledge Graph) of multinational trademark product items is established in the technology of the present invention. The knowledge graph is used to establish a node matrix association, in which each node represents a product item in a different country, and the edges represent the connections between the product items. Relevance or similarity. Based on the dynamic learning model that evolves over time, the equivalence relationship between product items in various countries and trademark items in each country, the knowledge graph unit conversion at least includes the following steps: Collecting data on the relationship between multi-country trademark product items and trademark projects and using a dynamic model Learning, through graph query and natural and grinding line reasoning, calculate the best match, shortest path, maximum flow, and optimize the equivalence relationship of product items based on the final decision results. This method can visually display the conversion relationship between commodity items in different countries, and the construction of this knowledge map is collected over time, which includes but is not limited to the following information: Transnational trademark announcement information that has completed registration (trademark family), the conversion results completed by previous users, the product item comparison table constructed through word meaning expansion and translation comparison, and some product item translation comparison tables disclosed by the National Intellectual Property Administration.

其詳細的建構流程如下：在商標商品項目(goods or services)的跨國申請轉換技術中，知識圖譜可能包括以下元素：The detailed construction process is as follows: In the cross-border application conversion technology of trademark goods items (goods or services), the knowledge graph may include the following elements:

實體：各國的商標商品項目。Entity: Trademark product items in various countries.

關係：各商品項目之間的相似性或等價關係，例如，美國的某一商標商品項目可能與德國的某一商品項目具有相同或相似的含義使用知識圖譜進行跨國商標申請轉換，係遵循以下步驟：Relationship: similarity or equivalence between various product items. For example, a certain trademark product item in the United States may have the same or similar meaning as a certain product item in Germany. Using the knowledge graph to convert cross-border trademark applications follows the following Steps:

資料收集：首先，收集各國商標商品專案的資料，以及這些商品專案之間的關係。這部份需要大量的人工努力，部份也需要自動化的資料抓取和處理技術。並使用收集到的資料構建知識圖譜，當中每個商品項目都是一個節點，每個節點之間的關係是一條邊。Data collection: First, collect data on trademark product projects in various countries and the relationships between these product projects. This part requires a lot of manual effort, and part also requires automated data capture and processing technology. And use the collected data to build a knowledge graph, in which each product item is a node, and the relationship between each node is an edge.

圖譜查詢和推理：使用查詢和推理技術，係基於知識圖譜找出原始國家的商標商品專案與目標國家的商品專案之間的最佳匹配，當中涉及到圖論中的最短路徑、最大流等問題。最後係執行結果評估和回饋：對查詢和推理的結果進行評估，如果結果不準確或不滿意，可以根據回饋更新知識圖譜。Graph query and reasoning: Using query and reasoning technology, it is based on the knowledge graph to find the best match between the trademark product project of the original country and the product project of the target country, which involves the shortest path, maximum flow and other issues in graph theory. . The last step is execution result evaluation and feedback: the results of query and reasoning are evaluated. If the results are inaccurate or unsatisfactory, the knowledge graph can be updated based on the feedback.

另一方面，於本發明技術的詞向量分析在商標商品項目的跨國申請轉換技術中，係被運用來捕捉和理解不同國家商標分類系統中商品項目的語意相關性，並實現轉換的過程。On the other hand, word vector analysis based on the technology of the present invention is used in the transnational application conversion technology of trademark product items to capture and understand the semantic correlation of product items in trademark classification systems of different countries, and realize the conversion process.

以下提供本發明技術的其中一種應用方法和建立資料庫模型與訓練方式：The following provides one of the application methods of the technology of the present invention and the establishment of database models and training methods:

建立多語言詞向量模型：首先，我們將不同國家的商標商品項目敘述轉換成詞向量。這邊係通過訓練一個多語言詞向量模型實現，例如，我們可使用公開的大規模多語言文本數據集來訓練這個模型，或者使用已經訓練好的多語言詞向量模型，如 Facebook 的 FastText但不限定上述語言模型。Establish a multilingual word vector model: First, we convert the descriptions of trademarked product items in different countries into word vectors. This is achieved by training a multilingual word vector model. For example, we can use a public large-scale multilingual text data set to train this model, or use an already trained multilingual word vector model, such as Facebook's FastText but not Limit the above language model.

轉換商品項目敘述：將原始國家的商標商品項目敘述轉換成詞向量，然後再將這些詞向量轉換成目標國家的語言。這一步係可通過詞向量之間的相似性實現，例如，我們可以找出原始語言中的詞向量與目標語言中詞向量的最近鄰居，並用這些最近鄰居的詞語來組成新的商品項目敘述。Convert product item descriptions: Convert the original country's trademark product item descriptions into word vectors, and then convert these word vectors into the language of the target country. This step can be achieved through the similarity between word vectors. For example, we can find the nearest neighbors of the word vectors in the original language and the word vectors in the target language, and use the words of these nearest neighbors to form a new product item description.

人工審核與調整修正：由於詞向量模型雖然強大，但仍然無法完全理解語言的細微差別和文化差異，因此在轉換完成後，可透過人工審核和修正來強化此訓練效果。例如，透過專家團隊來審核和修正轉換結果，並將這些修正的數據再次用於訓練模型，以此來不斷提高模型的轉換精度。並且持續優化模型：隨著時間的推移，可以根據實際的需求和效果，不斷優化模型，例如，增加更多語言的支持，或者改善處理特定種類商品項目的能力。Manual review and adjustment: Although the word vector model is powerful, it still cannot fully understand the nuances of language and cultural differences. Therefore, after the conversion is completed, this training effect can be enhanced through manual review and correction. For example, a team of experts review and correct the conversion results, and these corrected data are reused to train the model to continuously improve the conversion accuracy of the model. And continue to optimize the model: Over time, the model can be continuously optimized based on actual needs and effects, for example, adding support for more languages, or improving the ability to handle specific types of commodity items.

以上的方法只是一種可能的方式，實際的實施可能會根據具體的需求和條件進行調整。以本案的實際轉化案例來說，係透過以下的”詞向量”轉化模型而完成，當中係包含：The above method is only a possible way, and the actual implementation may be adjusted according to specific needs and conditions. Taking the actual conversion case of this case as an example, it is completed through the following "word vector" conversion model, which includes:

詞向量訓練：利用大規模語料庫訓練詞向量模型，並基於語境將每個詞映射到一個高維空間，其中近似詞會在這個空間內彼此靠近。其轉化公式為以下目標函數： Word vector training: Use a large-scale corpus to train a word vector model, and map each word to a high-dimensional space based on context, where similar words will be close to each other in this space. Its transformation formula is the following objective function:

其中 T 是語料庫的詞數，c 是一個選擇的窗口大小，θ 是模型參數，是給定中心詞 w_t 的情況下，上下文詞的條件概率。 where T is the number of words in the corpus, c is a selected window size, θ is the model parameter, is the context word given the center word w_t conditional probability.

轉換函數：利用已訓練的詞向量模型，我們可以為A國和B國的商品項目產生對應的詞向量，並且，我們建立一個轉換函數來將A國的商品項目詞向量映射到B國的商品項目詞向量當中。並最小化A國的詞向量與其對應的B國詞向量之間的距離。假設我們有一組A國與B國的商品項目配對 (x_i, y_i)，其中i = 1, ..., N，N是配對數量，那最小化目標的公式為： Conversion function: Using the trained word vector model, we can generate corresponding word vectors for product items in country A and country B, and we establish a conversion function to map the word vectors of product items in country A to products in country B. item word vector. And minimize the distance between the word vector of country A and its corresponding word vector of country B. Suppose we have a set of product item pairings (x_i, y_i) from country A and country B, where i = 1, ..., N, N is the number of pairs, then the formula for minimizing the objective is:

其中 W 是我們希望學習的轉換矩陣。假設我們要轉換的美國商標商品項目是 "computer software"，我們首先將這個片語轉換為詞向量，並使用已經訓練好的詞向量模型，我們可以得到 "computer software" 的詞向量表示 v_US。然後，我們應用轉換矩陣W到 v_US，即 W * v_US，得到的結果就是轉換後的詞向量 v_TW。接下來，我們需要找到最接近 v_TW 的臺灣商標商品項目。這可以通過計算 v_TW 與臺灣商標商品項目詞向量的相似度來實現，相似度一般使用余弦相似度來衡量。計算公式如下：其中 a 和 b 是兩個詞向量，a·b 表示向量點積，||a|| 和 ||b|| 分別表示向量 a 和 b 的範數。 where W is the transformation matrix we wish to learn. Assume that the US trademark product item we want to convert is "computer software". We first convert this phrase into a word vector, and using the already trained word vector model, we can get the word vector representation v_US of "computer software". Then, we apply the transformation matrix W to v_US, that is, W * v_US, and the result is the converted word vector v_TW. Next, we need to find the Taiwanese trademark product item closest to v_TW. This can be achieved by calculating the similarity between v_TW and the word vector of the Taiwan trademark product item. The similarity is generally measured using cosine similarity. The calculation formula is as follows: where a and b are two word vectors, a·b represents the vector dot product, ||a|| and ||b|| represent the norms of vectors a and b respectively.

最後系統查詢所有的臺灣商標商品項目詞向量，找到與 v_TW 余弦相似度最高的項目，這個項目就是轉換後的結果。當中，系統可判斷出找到的最相似的臺灣商標商品項目是 "電腦軟體"，並將其設定為轉換的結果。Finally, the system queries the word vectors of all Taiwan trademark product items and finds the item with the highest cosine similarity to v_TW. This item is the converted result. Among them, the system can determine that the most similar Taiwanese trademark product item found is "computer software" and set it as the result of the conversion.

其中，於本系統與方法架構中，確保各國商品項目資料庫的更新係為核心重點，由於每個國家的商品項目資料庫於每年可能進行一至兩次不定時的更新，當中，有可能淘汰掉舊的商品項目名稱，也會新增新的商品項目；其中，新增的商品項目如官方有告知其對應到原始舊版的某一商品項目時，則需一同將此類訊息更新於系統的知識圖譜當中；反之，若官方未告知對應關係，則可透過詞像量比對或人工等方式來進行對照關係的維護。Among them, in the framework of this system and method, ensuring the update of each country's commodity item database is the core focus. Since each country's commodity item database may be updated from time to time once or twice a year, some of the items may be eliminated. Old product item names will also be added with new product items; among them, if the newly added product item is officially notified that it corresponds to a product item of the original old version, such information must be updated in the system at the same time. In the knowledge graph; on the contrary, if the official does not inform the corresponding relationship, the comparison relationship can be maintained through word-image comparison or manual methods.

在步驟S909中將步驟S907及S908的文字與資料庫400中已經語言轉換過的商標類別項目透過類別項目比對單元3122進行比對；在步驟S910及S911中，依據文字進行解析產生類別推薦，同時依識別性文字透過檢索比對單元3142進行前案檢索；在步驟S912中報告產生單元3123將檢索結果與比對結果進行綜合分析，並於步驟S913中以顏色方式區分推薦類別，再於步驟S914中透過報告產生單元3123進行推薦類別中的檢索前案比對，產生風險評估報告。In step S909, the text in steps S907 and S908 is compared with the language-converted trademark category items in the database 400 through the category item comparison unit 3122; in steps S910 and S911, category recommendations are generated based on analysis of the text. At the same time, the search and comparison unit 3142 performs a previous case search based on the identifying text; in step S912, the report generation unit 3123 conducts a comprehensive analysis of the search results and comparison results, and distinguishes recommended categories by color in step S913, and then in step S913 In S914, the report generating unit 3123 performs a comparison of the search results in the recommended category to generate a risk assessment report.

接著請參閱圖21A及圖21B，係顯示本發明的應用實例示意圖。如圖所示，使用者在本系統的網頁中(https://inta.aiplux.com/)任意輸入描述文字或介紹文字，其文字內容可以包含但不限於品牌名稱、公司名稱、商品描述、服務描述、公司理念、公司介紹等，在輸入完描述文字後即可按下分析按鈕。Next, please refer to FIG. 21A and FIG. 21B, which are schematic diagrams showing application examples of the present invention. As shown in the figure, the user can enter any descriptive text or introductory text into the system's web page (https://inta.aiplux.com/). The text content can include but is not limited to brand name, company name, product description, Service description, company philosophy, company introduction, etc. After entering the description text, you can press the analysis button.

經由本系統的類別推薦模組運算之後即立即產生結果，其結果包含從描述文字中擷取出的品牌名稱、公司名稱、商標名稱，以及從描述文字中解析出的產業資訊，最重要的是，依據解析出的產業資訊經過比對後產生的推薦商標類別，以及該商標類別中的推薦商品項目。The results are generated immediately after the operation of the category recommendation module of this system. The results include the brand name, company name, and trademark name extracted from the description text, as well as the industry information parsed from the description text. The most important thing is, Recommended trademark categories generated after comparison based on the parsed industrial information, as well as recommended product items in the trademark category.

接下來，本案將透過具體流程說明與展示，進而說明當用戶輸入任一段敘述進入本發明系統時，其系統的後台運作流程具體分為以下五個主要動作並使用其對應技術據以實現：實體識別（NER）、主題建模或文本分類、識別性文字判斷、商標類別推薦以及商標商品項目(goods and services)推薦。其中，舉例當用戶輸入系統訊息為『AIPLUX科技股份有限公司發表台灣第一個大語言模型，該模型透過超級電腦建立出的高達1,760 億參數，結合語意理解與文本生成能力，推出企業級生成式AI解決方案。』為例，則於實體識別（NER）階段：系統使用NER模型進而識別出文本中的組織名稱（例如"AIPLUX科技股份有限公司"）。Next, this case will be explained and displayed through the specific process, and then explain that when the user enters any narrative to enter the system of the present invention, the background operation process of the system is specifically divided into the following five main actions and is implemented using corresponding technologies: Entity Recognition (NER), topic modeling or text classification, recognition text judgment, trademark category recommendation, and trademark product item (goods and services) recommendation. Among them, for example, when the user inputs system information, "AIPLUX Technology Co., Ltd. released Taiwan's first large-scale language model. This model uses up to 176 billion parameters created by supercomputers, combines semantic understanding and text generation capabilities, and launches enterprise-level generative formulas." AI solutions. 』For example, in the entity recognition (NER) stage: the system uses the NER model to identify the organization name in the text (such as "AIPLUX Technology Co., Ltd.").

並且，於主題建模或文本分類階段，則協助系統理解文本中的主要話題並將它們分類到相應的產業，進而使用主題建模來識別出“人工智能”、“自然語言處理”和“大數據”等關鍵詞，並將其歸類到相應的產業類別。此外，於識別性文字判斷階段中，系統需要找出文本中的關鍵字，這些詞彙能夠代表該公司的主要名稱，在這個例子中，系統即將"AIPLUX"視為識別性的詞彙。Moreover, in the topic modeling or text classification stage, it assists the system to understand the main topics in the text and classify them into corresponding industries, and then uses topic modeling to identify "artificial intelligence", "natural language processing" and "big words". "data" and other keywords, and classify them into the corresponding industry categories. In addition, in the recognition text judgment stage, the system needs to find keywords in the text. These words can represent the main name of the company. In this example, the system regards "AIPLUX" as a recognition word.

於商標類別推薦階段中，系統係依據以上的信息來推薦商標的國際類別。當中，若以一家公司主要從事人工智能、自然語言處理和大數據相關的業務，系統將推薦9類（科學儀器）、42類（科學和技術服務）和41類（教育和娛樂服務）等相關的商標類別。最後，於商標商品項目(goods and services)推薦階段中，系統根據公司的產業類別和主要業務來推薦相關的商品和服務項目。當中，若以一家公司主要從事人工智能的研發，我們可以推薦與人工智能相關的商品和服務項目，如“人工智能顧問服務”、“人工智能技術研究”等。In the trademark category recommendation stage, the system recommends international categories of trademarks based on the above information. Among them, if a company is mainly engaged in artificial intelligence, natural language processing and big data related businesses, the system will recommend categories 9 (scientific instruments), 42 categories (scientific and technical services) and 41 categories (educational and entertainment services) and other related categories trademark category. Finally, in the trademark product item (goods and services) recommendation stage, the system recommends related goods and services based on the company's industry category and main business. Among them, if a company is mainly engaged in the research and development of artificial intelligence, we can recommend products and services related to artificial intelligence, such as "artificial intelligence consulting services", "artificial intelligence technology research", etc.

更詳細地說明，其中，實體識別 (NER) 屬於一種自然語言處理 (NLP)技術，並用於識別文本中的具有特定意義的詞語，如人名、組織名、地名等。其不限定但可使用以下模型來完成: 隱馬爾科夫模型 (HMM)、最大熵馬爾科夫模型 (MEMM)、條件隨機域 (CRF)、或基於神經網路的模型如BiLSTM-CRF。其中，於本案展示中，係透過BiLSTM-CRF模型來執行，其具體步驟係對每個單詞進行詞嵌入並將單詞轉換為密集的向量表示，詞嵌入可以利用如word2vec、GloVe(Global Vectors for Word Representation)、CKIP(Chinese Knowledge and Information Processing)或者BERT(Bidirectional Encoder Representations from Transformers)等預訓練模型獲得。並使用雙向長短期記憶 (BiLSTM) 網絡對詞嵌入進行處理，其中，於這一階段係可獲取每個詞在其語境中的表示。接著，將BiLSTM的輸出送入CRF層：CRF層能考慮標籤之間的依賴性，使得序列標記結果更加準確。In more detail, entity recognition (NER) is a natural language processing (NLP) technology and is used to identify words with specific meanings in text, such as names of people, organizations, places, etc. It is not limited but can be accomplished using the following models: Hidden Markov Model (HMM), Maximum Entropy Markov Model (MEMM), Conditional Random Field (CRF), or neural network-based models such as BiLSTM-CRF. Among them, in this case, it is executed through the BiLSTM-CRF model. The specific steps are to perform word embedding on each word and convert the word into a dense vector representation. Word embedding can use words such as word2vec, GloVe (Global Vectors for Word Representation), CKIP (Chinese Knowledge and Information Processing) or BERT (Bidirectional Encoder Representations from Transformers) and other pre-trained models are obtained. The word embedding is processed using a Bidirectional Long Short-Term Memory (BiLSTM) network, where the representation of each word in its context is obtained at this stage. Then, the output of BiLSTM is sent to the CRF layer: the CRF layer can consider the dependencies between tags, making the sequence labeling results more accurate.

以本展示內容中，使用者輸入『AIPLUX科技股份有限公司發表台灣第一個企業用大語言模型，該模型透過超級電腦建立出的高達1,760 億參數，結合語意理解與文本生成能力，推出企業級生成式AI解決方案。』為例，將句子切分為單詞序列：["AIPLUX科技股份有限公司", "發表", "台灣", "第一個", "企業用", "大", "語言模型", "該模型", "透過", "超級", "電腦", "建立", "出", "的", "高達", "1,760", "億", "參數", "結合", "繁中", "的", "語意理解", "與", "文本生成能力", "推出", "企業級", "生成式", "AI", "解決方案"]，進一步地使用預訓練的使用預訓練的詞嵌入模型 (例如word2vec、GloVe或BERT) 將每個詞轉換為一個向量。In this display, the user inputs "AIPLUX Technology Co., Ltd. released Taiwan's first large language model for enterprise use. The model uses up to 176 billion parameters created by supercomputers, combines semantic understanding and text generation capabilities, and launches enterprise-level Generative AI solutions. 』For example, the sentence is divided into word sequences: ["AIPLUX Technology Co., Ltd.", "published", "Taiwan", "first", "enterprise use", "big", "language model", " The model", "through", "super", "computer", "build", "out", "of", "Gundam", "1,760", "billion", "parameters", "combination", "traditional "in", "of", "semantic understanding", "and", "text generation capability", "launch", "enterprise level", "generative", "AI", "solution"], further use pre- Trained uses a pretrained word embedding model (such as word2vec, GloVe, or BERT) to convert each word into a vector.

例如，我們假設"AIPLUX科技股份有限公司"的詞向量為[0.1, 0.2, ..., 0.5] (實際上，一個詞向量通常有幾百到幾千個維度)，"發表"的詞向量為[0.2, 0.3, ..., 0.6]，等等。將詞向量送入BiLSTM網路。BiLSTM可以獲取每個詞的上下文資訊。例如，給定"AIPLUX科技股份有限公司"的詞向量，BiLSTM可以根據其前後的詞("發表"和"台灣")來生成一個新的向量，這個向量包含了"AIPLUX科技股份有限公司"在句子中的上下文資訊。將BiLSTM的輸出送入CRF層。CRF層會給出每個詞的標籤概率。例如，CRF可能會判斷"AIPLUX科技股份有限公司"的標籤為"ORG" (代表組織名)，"發表"的標籤為"O" (代表非實體詞)，等等。For example, we assume that the word vector of "AIPLUX Technology Co., Ltd." is [0.1, 0.2, ..., 0.5] (in fact, a word vector usually has hundreds to thousands of dimensions), and the word vector of "published" is [0.2, 0.3, ..., 0.6], etc. Feed the word vector into the BiLSTM network. BiLSTM can obtain contextual information for each word. For example, given the word vector of "AIPLUX Technology Co., Ltd.", BiLSTM can generate a new vector based on the words before and after it ("published" and "Taiwan"). This vector contains the words "AIPLUX Technology Co., Ltd." Contextual information in sentences. Feed the output of BiLSTM into the CRF layer. The CRF layer will give the label probability of each word. For example, CRF may determine that the tag of "AIPLUX Technology Co., Ltd." is "ORG" (representing the organization name), the tag of "published" is "O" (representing non-entity words), and so on.

最後，我們選擇概率最大的標籤序列作為NER的結果。例如，對於上述句子，NER的結果可能為：["ORG", "O", "O", "O", "O", "O", "O", "O", "O", "O", "O", "O", "O", "O", "O", "O", "O", "O", "O", "O", "O", "O", "O", "O", "O", "O", "O", "O", "O"]Finally, we select the label sequence with the highest probability as the result of NER. For example, for the above sentence, the result of NER might be: ["ORG", "O", "O", "O", "O", "O", "O", "O", "O", " O", "O", "O", "O", "O", "O", "O", "O", "O", "O", "O", "O", "O" , "O", "O", "O", "O", "O", "O", "O"]

從而，以這個結果表示，"AIPLUX科技股份有限公司"被識別為一個組織名，其餘的詞都被識別為非實體詞。Therefore, this result indicates that "AIPLUX Technology Co., Ltd." is recognized as an organization name, and the rest of the words are recognized as non-entity words.

接著，以主題建模或文本分類階段中，主題建模通常使用無監督學習演算法，如Latent Dirichlet Allocation (LDA)，通過分析文檔的詞頻，識別出文檔的主題。而文本分類則使用有監督學習演算法，需要預先標注好的資料，於本案中使用的演算法技術包含但不限定為以下演算法：樸素貝葉斯(Naive Bayes)、支持向量機(SVM)、或者深度學習方法如卷積神經網路(CNN)、長短期記憶網路(LSTM)等。而於本次演示過程中，所採用的主題建模LDA演算法主要技術係將每個文檔可以看作是一個主題的混合體，而每個主題則可以看作是詞語的概率分佈。具體實現上，LDA使用Dirichlet分佈來建模文檔對主題的分佈和主題對詞語的分佈，然後通過反覆運算優化，得到每個文檔的主題分佈和每個主題的詞語分佈。在經過分詞後，我們得到文檔的詞項表示，並在LDA的計算過程中，系統可得到每個主題的詞項分佈和每篇文檔的主題分佈後，透過檢查每個主題的詞項分佈，找出每個主題的關鍵字。當中，即將"人工智慧"、"自然語言處理"和"大資料"等判定為同一主題的關鍵字，進而進行摘錄。Next, in the topic modeling or text classification stage, topic modeling usually uses unsupervised learning algorithms, such as Latent Dirichlet Allocation (LDA), to identify the topic of the document by analyzing the word frequency of the document. Text classification uses supervised learning algorithms, which require pre-labeled data. The algorithm technologies used in this case include but are not limited to the following algorithms: Naive Bayes (Naive Bayes), Support Vector Machine (SVM) , or deep learning methods such as convolutional neural network (CNN), long short-term memory network (LSTM), etc. In this demonstration, the main technology of the topic modeling LDA algorithm used is that each document can be regarded as a mixture of topics, and each topic can be regarded as a probability distribution of words. In terms of specific implementation, LDA uses Dirichlet distribution to model the distribution of documents to topics and the distribution of topics to words, and then optimizes it through repeated operations to obtain the topic distribution of each document and the word distribution of each topic. After word segmentation, we obtain the term representation of the document. During the LDA calculation process, the system can obtain the term distribution of each topic and the topic distribution of each document. By checking the term distribution of each topic, Find the keywords for each topic. Among them, "artificial intelligence", "natural language processing" and "big data" are determined to be keywords of the same theme, and then excerpted.

以識別性文字判斷階段中，關鍵字提取係可透過TF-IDF或TextRank等方法完成，其中，TF-IDF考慮了詞頻（TF）和逆文檔頻率（IDF）來評估一個詞的重要性。識別性文字判斷的主要任務是找出那些對商標有獨特標識性的部分，也就是那些能夠説明消費者識別並區分商品來源的文字。在你提供的例子中，“AIPLUX”就可能被判斷為識別性文字，因為它可能是一個特定的品牌名稱或公司名稱，而“科技股份有限公司”則可能被判斷為不具識別性的文字，因為它是一個通用的詞彙，無法幫助消費者區分商品來源。這個任務可以通過以下步驟實現：首先，構建一個包含所有已注冊商標的詞典，這個詞典可以從各智財局的商標資料庫中獲取；其次，構建一個包含所有常見詞彙的詞典，這個詞典可以從大規模的文本資料中獲取，例如Wiki百科或其他大型語料庫。最後，透過上述兩個詞典來識別出文本中的識別性文字和非識別性文字。In the recognition text judgment stage, keyword extraction can be completed through methods such as TF-IDF or TextRank. Among them, TF-IDF considers word frequency (TF) and inverse document frequency (IDF) to evaluate the importance of a word. The main task of identifying text judgment is to find those parts that are uniquely identifiable to the trademark, that is, those words that can help consumers identify and distinguish the source of the product. In the example you provided, "AIPLUX" may be judged as an identifying word because it may be a specific brand name or company name, while "Technology Co., Ltd." may be judged as a non-identifying word. Because it is a general term, it cannot help consumers distinguish the source of goods. This task can be achieved through the following steps: first, build a dictionary containing all registered trademarks, which can be obtained from the trademark databases of various intellectual property bureaus; second, build a dictionary containing all common words, which can be obtained from Obtained from large-scale text materials, such as Wikipedia or other large corpora. Finally, the above two dictionaries are used to identify the identifying words and non-identifying words in the text.

其中，單純透過上述兩個詞典的評估方式，仍存有不足與判斷失真風險，識別性文字通常具有一定的獨特性，因此不太可能出現在常見詞彙詞典中。然而，這個假設並不總是成立，有些識別性文字可能也是常見詞彙。因此，在實際應用中，我們可能需要結合其他資訊來提高識別性文字判斷的準確性，例如使用機器學習模型來預測一個詞是否具有識別性，或者使用知識圖譜來理解一個詞在特定上下文中的含義。Among them, there are still shortcomings and risks of judgment distortion simply through the evaluation methods of the above two dictionaries. Recognizable words are usually unique to a certain extent, so they are unlikely to appear in common vocabulary dictionaries. However, this assumption does not always hold, and some identifying words may also be common words. Therefore, in practical applications, we may need to combine other information to improve the accuracy of identifying text judgments, such as using machine learning models to predict whether a word is identifiable, or using knowledge graphs to understand the meaning of a word in a specific context. meaning.

承上述，本發明實施例中，係進一步地，系統使用TF-IDF的方法來提取關鍵字。進而用以評估一個詞在一個文檔集合中的重要性。TF-IDF的計算公式為： Following the above, in the embodiment of the present invention, further, the system uses the TF-IDF method to extract keywords. It is then used to evaluate the importance of a word in a collection of documents. The calculation formula of TF-IDF is:

其中：t 是一個詞，d 是一個文檔，D 是所有文檔的集合，TF(t, d) 是詞 t 在文檔 d 中的詞頻（term frequency），通常可以通過計數詞 t 在文檔 d 中出現的次數並除以文檔 d 中的總詞數來計算；IDF(t, D) 是詞 t 的逆文檔頻率（inverse document frequency），可以通過計算文檔總數除以包含詞 t 的文檔數目的對數來計算。Among them: t is a word, d is a document, D is the set of all documents, TF(t, d) is the term frequency of word t in document d, which can usually be calculated by counting the occurrence of word t in document d. times and divided by the total number of words in document d; IDF(t, D) is the inverse document frequency of word t, which can be calculated by dividing the total number of documents by the logarithm of the number of documents containing word t calculate.

以“AIPLUX科技股份有限公司”當中，系統判定其為包含兩個詞的文檔，其中“AIPLUX”出現了一次，“科技股份有限公司”也出現了一次，所以他們的 TF 值都是 1/2 = 0.5。接著我們需要計算 IDF 值，假設我們的文件庫包含了10000個文檔，“AIPLUX”只在10個文檔中出現過，而“科技股份有限公司”在1000個文檔中出現過，所以他們的 IDF 值分別是 log(10000/10) = 3 和 log(10000/1000) = 1。最後，我們將 TF 值和 IDF 值相乘，得到“AIPLUX”的 TF-IDF 值是 0.5 * 3 = 1.5，而“科技股份有限公司”的 TF-IDF 值是 0.5 * 1 = 0.5。由此我們可以看出，“AIPLUX”在這個文檔中的重要性大於“科技股份有限公司”。For "AIPLUX Technology Co., Ltd.", the system determines that it is a document containing two words. "AIPLUX" appears once and "AIPLUX Technology Co., Ltd." also appears once, so their TF values are both 1/2 = 0.5. Next we need to calculate the IDF value. Assume that our file library contains 10,000 documents. "AIPLUX" only appears in 10 documents, and "Technology Co., Ltd." appears in 1,000 documents, so their IDF values They are log(10000/10) = 3 and log(10000/1000) = 1 respectively. Finally, we multiply the TF value and the IDF value, and the TF-IDF value of "AIPLUX" is 0.5 * 3 = 1.5, while the TF-IDF value of "Technology Co., Ltd." is 0.5 * 1 = 0.5. From this we can see that "AIPLUX" is more important than "Technology Co., Ltd." in this document.

更進一步地，本發明係採用知識圖譜（Knowledge Graph）技術以更加優化本技術中的識別性文字判斷效果，其中，知識圖譜能夠存儲大量的實體及其屬性和關係的資訊。在這個問題中，我們可以將“AIPLUX”和“科技股份有限公司”作為實體，他們的屬性可以包括他們在文檔中出現的次數，他們的 TF-IDF 值等，他們的關係可以是他們在同一文檔中共同出現等。此外，本案知識圖譜整合和並存儲智財局審查商標歷程中的資料。舉例在審查的歷程中，“科技股份有限公司”或”股份有限公司”被多次提到並被視為不具有識別性的文字，那麼我們可以將這個資訊添加到知識圖譜中，“科技股份有限公司”的屬性中就可以添加一個“識別性：低”的標籤；如此，在後續的關鍵字提取過程中，我們就可以利用這個資訊，為“科技股份有限公司”這個詞分配一個較低的權重，使得模型更偏向於將“AIPLUX”識別為關鍵字。進而更好地理解文本中的實體及其屬性和關係，從而更準確地提取關鍵字與進行識別性文字的判斷。Furthermore, the present invention uses knowledge graph (Knowledge Graph) technology to further optimize the recognition text judgment effect in this technology. The knowledge graph can store a large amount of information about entities and their attributes and relationships. In this problem, we can use "AIPLUX" and "Technology Co., Ltd." as entities. Their attributes can include the number of times they appear in the document, their TF-IDF values, etc., and their relationship can be that they are in the same co-occur in documents, etc. In addition, the knowledge graph in this case integrates and stores data from the intellectual property bureau’s trademark review process. For example, during the review process, "Technology Co., Ltd." or "Technology Co., Ltd." were mentioned many times and were regarded as non-identifying words. Then we can add this information to the knowledge graph, "Technology Co., Ltd." Co., Ltd." can add a "identification: low" tag to the attribute; in this way, in the subsequent keyword extraction process, we can use this information to assign a lower value to the word "Technology Co., Ltd." The weight makes the model more inclined to recognize "AIPLUX" as a keyword. Then we can better understand the entities in the text and their attributes and relationships, so as to more accurately extract keywords and judge identifying text.

而在商標類別推薦的階段中，在這個階段中，系統利用之前從文本中提取出的關鍵字與主題，以及他們對應的產業分類，來為用戶推薦相應的商標類別。例如，對於一家主要從事人工智慧、自然語言處理和大資料相關業務的公司，系統將推薦9類（科學儀器）、42類（科學和技術服務）和41類（教育和娛樂服務）等相關的商標類別。其過程中係透過查閱資料表與推薦系統演算法來實現。其中，系統後台創建一個查閱資料表，將關鍵字或主題映射到他們對應的商標類別，如下表所示：關鍵字/主題 | 對應的商標類別 ----------------|------------------ 人工智慧 | 9, 42 自然語言處理 | 9, 42, 41 大數據 | 9, 42 In the trademark category recommendation stage, in this stage, the system uses the keywords and topics previously extracted from the text, as well as their corresponding industry classifications, to recommend corresponding trademark categories to users. For example, for a company mainly engaged in artificial intelligence, natural language processing and big data related businesses, the system will recommend related categories 9 (scientific instruments), 42 categories (scientific and technical services) and 41 categories (educational and entertainment services). Trademark category. The process is achieved by consulting data tables and recommendation system algorithms. Among them, the system creates a lookup table in the background to map keywords or topics to their corresponding trademark categories, as shown in the following table: Keyword/topic | Corresponding trademark category ----------------|----------------- Artificial Intelligence | 9, 42 Natural Language Processing | 9, 42, 41 Big Data | 9, 42

接著，系統後台可以根據使用者輸入的文本中出現的關鍵字或主題，來查找他們對應的商標類別，並將這些類別推薦給用戶。同時, 系統後台係合併透過該用戶的產業資訊結果, 進入商業信息查詢平台中, 找到類似同產業的其他公司, 再從該些其他公司所以經申請的商標前案, 去進行統整與歸納後,最終以詞向量比對分析推薦結果, 結合產業資訊比對分析推薦結果, 來統整推薦商標類別與商品項目(goods and services)的挑選建議。Then, the system background can find their corresponding trademark categories based on the keywords or topics that appear in the text entered by the user, and recommend these categories to the user. At the same time, the system's backend system merges the user's industrial information results into the business information query platform to find other companies in the same industry, and then consolidates and summarizes the trademark applications filed by these other companies. , Finally, the recommendation results are compared and analyzed with word vectors, and the recommendation results are compared and analyzed with industry information to integrate the selection suggestions for recommended trademark categories and product items (goods and services).

其中，系統後台係不僅可透過通用Nice分類內容進行商品項目(goods and services)挑選，其更可依據用戶提供的產品訊息，進行各國的商品項目(goods and services)內容中挑選，具體上系統後台係提供各國加商品項目的詞向量分析, 之後再依據用戶的介紹訊息, 進行詞向量分析與比對, 進而最終挑選出對應的推薦"商品項目"內容。Among them, the system backend can not only select goods and services through the general Nice classification content, but also select goods and services from each country based on the product information provided by the user. Specifically, the system backend The system provides word vector analysis of product items in various countries, and then conducts word vector analysis and comparison based on the user's introduction information, and finally selects the corresponding recommended "product items" content.

其中，主要包含了兩個階段：一是用詞向量(Word Vector)比對來推薦商品和服務項目，二是用產業資訊比對來進一步優化推薦結果。當系統後台已經具備了各國的商品和服務項目的詞向量，那麼系統可以先將用戶介紹信息中的產品名稱轉換為詞向量。例如，假設用戶介紹的產品是"AI智能機器人"，於本演示過程中，係採用Word2Vec的模型來轉換這個詞語為一個詞向量v。Among them, it mainly includes two stages: one is to use word vector (Word Vector) comparison to recommend goods and services, and the other is to use industry information comparison to further optimize the recommendation results. When the system background already has word vectors for goods and services in various countries, the system can first convert the product names in the user introduction information into word vectors. For example, assume that the product introduced by the user is "AI intelligent robot". During this demonstration, the Word2Vec model is used to convert this word into a word vector v.

然後，系統後台可以遍歷各國的商品和服務項目的詞向量，並計算它們與v的相似性，並使用餘弦相似度來計算詞向量之間的相似性，其公式如下： Then, the system background can traverse the word vectors of goods and services in various countries, calculate their similarity with v, and use cosine similarity to calculate the similarity between word vectors. The formula is as follows:

其中v和w分別是兩個詞向量，"||"表示向量的長度。最後，系統可以選擇相似度最高的那些商品和服務項目作為推薦結果。where v and w are two word vectors respectively, and "||" represents the length of the vector. Finally, the system can select those goods and services with the highest similarity as recommended results.

另一方面，透過產業資訊比對推薦商品和服務項目過程中，系統後台係利用用戶的產業資訊，找到同產業的其他公司，然後從這些公司的商標申請中找出最常見的商品和服務項目。具體來說，首先在商業信息查詢平台上搜索同產業的公司，然後收集這些公司的商標申請。我們可以將每個申請中的商品和服務項目看作一個詞，並計算每個詞的頻率。接著，系統後台可以選擇頻率最高的詞語作為推薦的商品和服務項目。On the other hand, in the process of recommending goods and services through industry information comparison, the system background uses the user's industry information to find other companies in the same industry, and then finds the most common goods and services from the trademark applications of these companies. . Specifically, we first search for companies in the same industry on the business information query platform, and then collect the trademark applications of these companies. We can treat the goods and services in each application as a word and calculate the frequency of each word. Then, the system background can select the words with the highest frequency as recommended products and services.

如此一來，系統後台目前已經得到了兩個推薦結果：一個是基於詞向量比對的結果，另一個是基於產業資訊比對的結果。接著，系統後台可將這兩個結果結合起來，形成最終的推薦結果。具體來說，為每個商品和服務項目分配一個分數，這個分數是該項目在詞向量比對結果中的排名和在產業資訊比對結果中的排名的加權平均。然後，我們可以選擇分數最高的項目作為最終的推薦結果。As a result, the system background has currently obtained two recommendation results: one is based on the result of word vector comparison, and the other is based on the result of industry information comparison. Then, the system background can combine these two results to form the final recommendation result. Specifically, each product and service item is assigned a score, which is the weighted average of the item's ranking in the word vector comparison results and its ranking in the industry information comparison results. Then, we can select the item with the highest score as the final recommendation result.

以下是該步驟的公式： Here is the formula for this step:

其中，rank_vector(item)是項目在詞向量比對結果中的排名，rank_industry(item)是項目在產業資訊比對結果中的排名，α和β是兩種方法的權重，它們的和為1。在實際應用中，α和β的值可以通過交叉驗證來確定，以優化推薦結果的質量，因此α和β屬於可調式參數，於本案介紹中，即不再加以贅述其權重調整方式與過程。可調式參數係根據該使用者端進行客製化設定，該可調式參數可由使用者端喜好、使用者端習慣、使用者端可承受風險的能力的選項中任意選擇。Among them, rank_vector(item) is the ranking of the item in the word vector comparison results, rank_industry(item) is the ranking of the item in the industry information comparison results, α and β are the weights of the two methods, and their sum is 1. In practical applications, the values of α and β can be determined through cross-validation to optimize the quality of recommendation results. Therefore, α and β are adjustable parameters. In the introduction of this case, the weight adjustment method and process will not be described in detail. The adjustable parameters are customized according to the user. The adjustable parameters can be selected from the options of user preferences, user habits, and the user's ability to tolerate risks.

倘若使用者輸入的描述文字中經本系統解析後找不到公司名稱、品牌名稱或商標名稱，也不影響後續的類別推薦，本系統仍可以對描述文字進行解析產生產業資訊，並且再依據產業資訊比對後產生推薦的商標類別，以及該商標類別中的推薦商品項目。If the company name, brand name or trademark name cannot be found in the description text input by the user after being parsed by the system, it will not affect subsequent category recommendations. The system can still parse the description text to generate industry information, and then based on the industry information After comparison, a recommended trademark category is generated, as well as recommended product items in the trademark category.

請再接續參閱圖23及圖24，係為本發明之實施例流程圖。Please continue to refer to Figure 23 and Figure 24, which are flow charts of embodiments of the present invention.

使用者透過電子裝置的輸入模組任意輸入一段描述文字，並進一步選擇欲產生商標推薦的目標國家，本系統先判斷描述文字語言與目標國家的官方語言是否相同，若不同則會先對描述文字進行翻譯，若相同則直接進行後續流程，再對描述文字進行解析與擷取，產出關鍵字和產業相關文字，產業相關文字即為產業資訊。The user inputs a piece of description text arbitrarily through the input module of the electronic device, and further selects the target country for which the trademark recommendation is to be generated. The system first determines whether the language of the description text is the same as the official language of the target country. If it is different, it will first modify the description text. Translate, and if they are the same, proceed directly to the follow-up process, and then analyze and extract the description text to produce keywords and industry-related text. The industry-related text is industry information.

對關鍵字部分進行識別性文字判斷與擷取，主要是透過與非識別性文字資料庫進行比對，確認關鍵字中是否有包含非識別性文字，若有則將非識別性文字剔除，僅保留識別性文字，具體地，非識別性文字資料庫建立的方式可以為但不限於利用大型語言模型資料庫進行訓練，將常用字或語助詞、介係詞等判別為不具識別性。The identification and extraction of the keyword part is mainly done by comparing it with the non-recognizable text database to confirm whether the keyword contains non-recognizable text. If so, the non-recognizable text will be removed and only Retaining identifiable text, specifically, the non-recognizing text database can be established by, but is not limited to, using a large language model database for training, and identifying commonly used words, particles, prepositions, etc. as non-recognizable.

另一方面對經過解析後產生的產業資訊(產業相關文字)透過經訓練的語言生成模型產生對應於使用者輸入的描述文字的產業類別。On the other hand, the industry information (industry-related text) generated after analysis is used to generate an industry category corresponding to the description text input by the user through a trained language generation model.

藉由類別推薦模組依據上述產生的產業類別進行商標的類別與商品項目比對，具體而言，比對的資料庫會依據使用者選擇的目標國家不同而不同，例如使用者目標國家選擇美國，則會與美國官方的商標類別資料庫進行比對，若選擇台灣，會與台灣官方的商標類別資料庫進行比對，目的在於因為不同國家的商標類別中的商品項目會有所不同，因為有時可能發生在台灣有的商品項目在美國同一類別中卻找不到，因此依據目標國家的商標類別資料庫比對很重要，才不會產生的推薦商品項目實際上在該國卻不存在，本系統會根據可調式參數利用且例如詞向量的近似度比對產生包含至少一推薦類別與至少一推薦商品項目的至少一商標類別推薦資訊之比對結果，尋找接近的商標類別與商品項目，達到精準推薦的功效。可調式參數係根據該使用者端進行客製化設定，該可調式參數可由使用者端喜好、使用者端習慣、使用者端可承受風險的能力的選項中任意選擇。 The category recommendation module is used to compare trademark categories and product items based on the above-generated industry categories. Specifically, the comparison database will be different depending on the target country selected by the user. For example, the user selects the United States as the target country. , it will be compared with the official trademark category database of the United States. If Taiwan is selected, it will be compared with the official trademark category database of Taiwan. The purpose is because the product items in the trademark categories of different countries will be different. Sometimes it may happen that some product items in Taiwan cannot be found in the same category in the United States. Therefore, it is important to compare the trademark category database of the target country so that the recommended product items that are generated do not actually exist in that country. , this system will generate a comparison result of at least one trademark category recommendation information including at least one recommended category and at least one recommended product item based on the use of adjustable parameters and, for example, the similarity comparison of word vectors, to find similar trademark categories and product items. , to achieve the effect of accurate recommendation. The adjustable parameters are customized according to the user. The adjustable parameters can be selected from the options of user preferences, user habits, and the user's ability to tolerate risks.

比對的結果再經由報告產生單元進行排列並填入模組化的格式中，進而產生商標類別推薦報告。其中風險評估模組，係用以將自字串中擷取出之關鍵字與該目標國家之商標資料庫進行比對，並產出一風險資訊，該類別推薦模組更包含一知識圖譜單元，係用以建立節點矩陣關聯，並依各國之間商品項目與各國商標項目之間的等價關係根據時間演進的動態學習模式，該知識圖譜單元轉換至少包含以下步驟：將多國商標商品項目與商標專案關係資料收集與動態模型學習，透過圖譜查詢與自然與研磨行推理，推算出最佳匹配、最短路徑、最大流量，並根據最終決策結果進行優化商品項目之等價關係。The comparison results are then sorted by the report generation unit and filled in a modular format, thereby generating a trademark category recommendation report. Among them, the risk assessment module is used to compare the keywords extracted from the string with the trademark database of the target country and produce a risk information. This category recommendation module also includes a knowledge map unit. It is a dynamic learning model used to establish node matrix associations and evolve over time based on the equivalence relationship between product items in various countries and trademark items in various countries. The knowledge graph unit conversion at least includes the following steps: Convert multi-country trademark product items to Trademark project relationship data collection and dynamic model learning, through graph query and natural and grinding line reasoning, calculate the best match, shortest path, maximum flow, and optimize the equivalence relationship of product items based on the final decision results.

報告呈現的方式可為線上圖像、可下載之文書檔案、可轉發至通訊軟體之檔案格式、可分享至社群軟體之檔案格式、電子布告欄或其組合。The report can be presented in the form of an online image, a downloadable document file, a file format that can be forwarded to communication software, a file format that can be shared to social software, an electronic bulletin board, or a combination thereof.

在圖24中，與圖23的差異在於產生目標國家的商標類別推薦資訊之後，風險評估模組進一步在該商標類別推薦資訊中透過檢索比對單元進行前案比對，該目標國家之商標資料庫中搜尋比對出近似具識別性文字的前案商標，檢索比對單元是在該推薦類別中尋找是否有與上述從關鍵字中擷取出的識別性文字相似或近似的申請在前的商標案件，並將檢索結果與推薦類別與推薦商品項目進行整合分析，例如在推薦類別為第9類，檢索比對發現第9類有近似前案，這時檢索比對單元會再詳細比對分析推薦商品項目與前案商品項目，若商品項目完全無重疊則判定的近似度會比部分商品項目重疊的近似度低。將檢索結果與該識別性文字進行近似度運算及排列，同時運算檢索結果中的前案申請類別與該至少一商標類別推薦資訊的近似度，進而產生評估結果。In Figure 24, the difference from Figure 23 is that after generating the trademark category recommendation information of the target country, the risk assessment module further performs a comparison of previous cases in the trademark category recommendation information through the search comparison unit. The trademark data of the target country Search and compare previous trademarks with similar identifying words in the database. The search and comparison unit is to find whether there are previously filed trademarks in the recommended category that are similar or similar to the identifying words extracted from the keywords. cases, and integrate and analyze the search results with recommended categories and recommended product items. For example, when the recommended category is Category 9, and the search comparison finds that Category 9 has similar previous cases, then the search comparison unit will compare and analyze the recommendations in detail. If there is no overlap between the product items and the product items in the previous case, the degree of similarity will be lower than if there is some overlap between the product items. The similarity calculation and arrangement of the search results and the identifying words are performed, and the similarity between the previous application category in the search results and the recommended information of at least one trademark category is calculated to generate an evaluation result.

最後將整合分析結果透過報告產生單元同樣套入模組化格式產生風險評估報告。Finally, the integrated analysis results are also put into a modular format through the report generation unit to generate a risk assessment report.

進一步地，本系統還建立一個馳名商標資料庫，在上述的比對中，即使使用者與檢索比對出的前案在不同商標類別，但若前案是屬於馳名商標資料庫中的其中之一，則其檢索比對的結果仍會顯示此前案，且會優先排列，並顯示為高風險。Furthermore, this system also establishes a well-known trademark database. In the above comparison, even if the previous case compared by the user and the search is in a different trademark category, if the previous case belongs to one of the well-known trademark databases, 1, the search and comparison results will still show the previous case, and will be prioritized and displayed as high risk.

進一步地，類別推薦模組對描述文字進行解析並擷取出關鍵字和產業相關文字形成產業資訊之後，還可以透過資訊擷取單元依據擷取出的產業相關文字，主動至外部網路資訊中搜尋比對具有相似產業相關文字描述的公司，並將搜尋比對出的公司資訊傳送至該類別推薦模組的一知識圖譜比對單元，再透過該知識圖譜比對單元進行商標類別比對，並產生至少一商標類別推薦資訊，商標類別推薦資訊至少包含：一推薦類別與至少一推薦商品項目。Furthermore, after the category recommendation module parses the description text and extracts keywords and industry-related text to form industry information, it can also use the information acquisition unit to actively search external network information based on the extracted industry-related text. For companies with similar industry-related text descriptions, the company information resulting from the search and comparison is sent to a knowledge graph comparison unit of the category recommendation module, and then the trademark category comparison is performed through the knowledge graph comparison unit, and a generated At least one trademark category recommendation information, and the trademark category recommendation information at least includes: one recommended category and at least one recommended product item.

請接續參閱圖25至圖27，係顯示本發明系統之使用情境示意圖。Please continue to refer to Figures 25 to 27, which are schematic diagrams showing usage scenarios of the system of the present invention.

如圖25所顯示，在此使用情境中，使用者的角色為一般商家、品牌方或商標申請人，透過載具例如手機、電腦等電子裝置，連上網路後啟用伺服器並藉由網頁的欄位任意輸入描述文字的字串，該些字串透過雲端部分的伺服器進行語意分析，具體地，先判斷該字串的語言以及目標國家的官方語言是否相同，若不同藉由多語翻譯模型進行翻譯，並接著進行字串解析，擷取關鍵字和產生產業資訊，接著經由類別項目比對單元與目標國家的商標分類資料庫進行比對，而產生商標類別推薦報告反饋回使用者的載具。As shown in Figure 25, in this usage scenario, the user plays the role of a general merchant, brand owner or trademark applicant. Through electronic devices such as mobile phones and computers, he connects to the Internet and activates the server through the web page. Enter any string describing the text in the field. These strings are semantically analyzed through the server in the cloud part. Specifically, it is first determined whether the language of the string and the official language of the target country are the same. If they are different, multilingual translation is used. The model is translated, and then performs string parsing to extract keywords and generate industry information, and then compares it with the trademark classification database of the target country through the category item comparison unit, and generates a trademark category recommendation report to feed back to the user vehicle.

進一步地，再由推薦的商標類別中檢索商標前案，尋找在推類別中是否存在商標前案，以及其商標與商品項目近似程度，檢索結果之後結合推薦類別與推薦商品項目產生風險評估資訊，最終產生風險評估報告反饋回使用者的載具。Furthermore, the previous trademark cases are searched from the recommended trademark categories to find out whether there are previous trademark cases in the recommended categories and the similarity between the trademark and the product items. The search results are then combined with the recommended categories and recommended product items to generate risk assessment information. Finally, a risk assessment report is generated and fed back to the user's vehicle.

進一步地，該商標類別推薦報告的方式可為線上圖像、可下載之文書檔案、可轉發至通訊軟體之檔案格式、可分享至社群軟體之檔案格式、電子布告欄或其組合。Furthermore, the trademark category recommendation report may be in the form of an online image, a downloadable document file, a file format that can be forwarded to communication software, a file format that can be shared to social software, an electronic bulletin board, or a combination thereof.

且還可以依據輸入的字串，在地端伺服器例如各國家的官方商標申請資料庫或任意網路資訊中進行爬取資訊，搜尋其他相似產業公司申請之商標紀錄，從中獲取商標類別及商品項目也一同作為推薦的選項。You can also crawl information from local servers such as official trademark application databases of various countries or any online information based on the input string, search for trademark records applied by other companies in similar industries, and obtain trademark categories and products. Items are also included as recommended options.

最終使用者可以依據此報告提供給專業的事務所或直接整理後自行將申請文件遞交給官方機構，進行商標申請。End users can provide this report to a professional firm or directly compile and submit the application documents to the official agency for trademark application.

如圖26所顯示，在此使用情境中，使用者的角色為專業商標從業人士(Trademark Practitioners)或事務所(Trademark or law firm)，使用者透過載具通常是例如電腦等電子裝置，連上網路後啟用伺服器並藉由網頁的欄位任意輸入描述文字的字串，透過本發明之系統取得商標類別推薦報告甚至是風險評估報告，使用者可以直接或再次經過模組化固定格式整理報告後產生商標建議書，經過自動套用商標從業人士或是法律事務所自己的格式後，將提供給使用者的客戶，這裡的客戶即為品牌業者、商家或商標申請人，經客戶確認之後即可將申請文件遞交至官方機構完成案件的處理。As shown in Figure 26, in this usage scenario, the user's role is a professional trademark practitioner (Trademark Practitioners) or a firm (Trademark or law firm). The user connects to the Internet through a vehicle, usually an electronic device such as a computer. After that, activate the server and enter any descriptive text string through the field of the web page, and obtain the trademark category recommendation report or even the risk assessment report through the system of the present invention. The user can organize the report directly or again through the modular fixed format. A trademark proposal is then generated, and after automatically applying the format of the trademark practitioner or the law firm, it will be provided to the user's client. The client here is the brand owner, merchant or trademark applicant, and it can be confirmed by the client. Submit the application documents to the official agency to complete the case processing.

風險評估報告的示意圖可以參閱圖27，要注意的是，圖27僅為使用情境中的示範例，並沒有加以限定其報告的顯示方式與格式。如圖27所顯示，在風險評估報告中，會列出在推薦類別中具有的商標近似前案，以及其基本資訊例如商標名稱/圖片、國家、申請日、申請人、分類與商品項目、官方號碼、風險等級等等，透過顯示介面的方式讓使用者可以清楚知道欲申請的商標在對應的商標類別中申請的風險高低，更進一步還可以透過切換目標國家進而顯示針對不同目標國家的風險評估報告，不同目標國家其推薦類別高機率會相同，但商品項目則會有些許不同，且具有的近似商標前案也會有所不同，藉由本發明之系統的語意分析、自動翻譯跟判斷功能可以快速產生多份不同目標國家的報告，大幅減少時間成本以及過往的類別分析、檢索的門檻。The schematic diagram of the risk assessment report can be found in Figure 27. It should be noted that Figure 27 is only an example of a usage scenario and does not limit the display method and format of the report. As shown in Figure 27, in the risk assessment report, similar trademark cases in the recommended categories will be listed, as well as their basic information such as trademark name/picture, country, filing date, applicant, classification and product items, official number, risk level, etc., through the display interface, the user can clearly know the risk level of the trademark to be applied for in the corresponding trademark category. Furthermore, the risk assessment for different target countries can be displayed by switching the target country. According to the report, the recommended categories in different target countries will most likely be the same, but the product items will be slightly different, and the similar trademark cases will also be different. Through the semantic analysis, automatic translation and judgment functions of the system of the present invention, it can Quickly generate multiple reports on different target countries, significantly reducing time costs and past category analysis and search thresholds.

由圖25及圖26可以看出，無論使用者為終端客戶或是中間的從業人士，皆可以大幅縮短搜尋與商標推薦分類的時間成本，並且透過本發明之系統的特徵：輸入任意的描述文字即可以透過多語翻譯模型、文字解析單元，進行語意分析以及文字的生成，在短時間內理解描述文字並且進行類別的比對和近似排列而生成推薦類別和商品項目，在跨國申請時更是發揮到更大的功效。It can be seen from Figure 25 and Figure 26 that whether the user is an end customer or an intermediate practitioner, the time cost of searching and trademark recommendation classification can be greatly shortened, and through the features of the system of the present invention: input any description text That is, you can use the multilingual translation model and text parsing unit to perform semantic analysis and text generation, understand the description text in a short time, and compare and approximate categories to generate recommended categories and product items, especially when applying for a cross-border application. play a greater role.

最後，再將本發明的技術特徵及其可達成之技術功效彙整如下：Finally, the technical features of the present invention and its achievable technical effects are summarized as follows:

其一，藉由本發明之一種語意分析商標類別推薦系統，解決一般使用者在線上申請商標的時候，不知道販售的商品或提供的服務是屬於哪一個商標類別，而不知道該如何為自己的商標選擇適合的類別項目，透過本發明之商標類別推薦報告提供使用者快速的選擇。First, through the semantic analysis trademark category recommendation system of the present invention, it can solve the problem that when ordinary users apply for a trademark online, they do not know which trademark category the goods sold or services provided belong to, and they do not know how to apply for a trademark for themselves. Select appropriate category items for the trademark, and provide users with quick choices through the trademark category recommendation report of the present invention.

其二，藉由本發明之一種語意分析商標類別推薦系統，解決使用者在申請商標前不知道自己的商標是否已存在類似的前案，透過本發明之風險評估報告提供使用者立即得且淺顯易懂的排列方式了解檢索結果。Secondly, the semantic analysis trademark category recommendation system of the present invention solves the problem that users do not know whether there is a similar previous case for their trademark before applying for a trademark. The risk assessment report of the present invention provides users with immediate and easy-to-understand information. Understand the search results in an understandable way.

其三，藉由本發明之一種語意分析商標類別推薦系統所產生的綜合報告，結合商標類別推薦與前案檢索，明確標示出在哪些類別項目具有較多或較少的商標前案，可以用顏色明顯區分或是其他任何可標註的方式呈現，提供使用者更明確的商標類別項目選擇建議。Third, through the comprehensive report generated by the semantic analysis trademark category recommendation system of the present invention, combined with trademark category recommendation and prior case retrieval, it is clearly marked in which category items have more or less trademark prior cases, and the colors can be used. Clearly distinguish or present it in any other markable way to provide users with clearer suggestions for selecting trademark category items.

必須加以強調的是，上述之詳細說明係針對本發明可行實施例之具體說明，惟該實施例並非用以限制本發明之專利範圍，凡未脫離本發明技藝精神所為之等效實施或變更，均應包含於本案之專利範圍中。It must be emphasized that the above detailed description is a specific description of possible embodiments of the present invention. However, the embodiments are not intended to limit the patent scope of the present invention. Any equivalent implementation or modification that does not deviate from the technical spirit of the present invention will All should be included in the patent scope of this case.

100:電子裝置 110:處理器 120:記憶體 130:網路介面控制器 200:網路 300:伺服器 310:應用程式 311:商標線上申請模組 3111:申請資料收集單元 3112:類別項目選擇單元 3113:資料傳輸單元 312:類別推薦模組 3121:文字解析單元 3122:類別項目比對單元 3123:報告產生單元 3124:知識圖譜比對單元 313:資料收發模組 314:風險評估模組 3142:檢索比對單元 400:資料庫 500:送件伺服器 600:輸入模組 601:資訊擷取單元 700:多語翻譯模型 U:使用者 S101、S102、S103、S104、S105、S106、S107、S108:步驟 S201、S202、S203、S204、S205、S206、S207、S208:步驟 S301、S302、S303、S304、S305、S306、S307、S308、S309:步驟 S401、S402、S403、S404、S405、S406、S407、S408:步驟 S501、S502、S503、S504、S505、S506、S507、S508、S509:步驟 S601、S602、S603、S604、S605、S606、S607、S608、S609、S610、S611:步驟 S701、S702、S703、S704、S705、S706、S707、S708、S709:步驟 S801、S802、S803、S804、S805、S806、S807、S808、S809、S810、S811、S812、S813:步驟 S901、S902、S903、S904、S905、S906、S907、S908、S909、S910、S911、S912、S913；S914:步驟 100: Electronic devices 110: Processor 120:Memory 130:Network interface controller 200:Internet 300:Server 310:Application 311:Trademark online application module 3111: Application data collection unit 3112:Category item selection unit 3113: Data transmission unit 312:Category recommendation module 3121: Text analysis unit 3122: Category item comparison unit 3123: Report generation unit 3124: Knowledge graph comparison unit 313: Data sending and receiving module 314:Risk Assessment Module 3142: Search comparison unit 400:Database 500:Sending server 600:Input module 601: Information retrieval unit 700: Multilingual Translation Model U:User S101, S102, S103, S104, S105, S106, S107, S108: steps S201, S202, S203, S204, S205, S206, S207, S208: steps S301, S302, S303, S304, S305, S306, S307, S308, S309: steps S401, S402, S403, S404, S405, S406, S407, S408: steps S501, S502, S503, S504, S505, S506, S507, S508, S509: steps S601, S602, S603, S604, S605, S606, S607, S608, S609, S610, S611: Steps S701, S702, S703, S704, S705, S706, S707, S708, S709: Steps S801, S802, S803, S804, S805, S806, S807, S808, S809, S810, S811, S812, S813: Steps S901, S902, S903, S904, S905, S906, S907, S908, S909, S910, S911, S912, S913; S914: Step

圖1係先前技術的示意圖；圖2係先前技術的另一示意圖；圖3係先前技術的另一示意圖；圖4係先前技術的另一示意圖；圖5係先前技術的另一示意圖；圖6係先前技術的另一示意圖；圖7係先前技術的另一示意圖；圖8係顯示本發明之系統的示意圖；圖9係顯示本發明之系統的電子裝置的示意圖；圖10係顯示本發明之系統的伺服器的示意圖；圖11係顯示本發明之系統的應用程式的示意圖；圖12係顯示本發明之系統的商標線上申請模組的示意圖；圖13係顯示本發明之系統的類別推薦模組的示意圖；圖14係顯示本發明之系統的應用程式的另一示意圖；圖15係顯示本發明之系統的風險評估模組的示意圖；圖16係顯示本發明之系統的另一示意圖；圖17係顯示本發明之系統的另一示意圖；圖18係顯示本發明之方法的流程圖；圖19係顯示本發明之方法的另一實施例流程圖；圖20係顯示本發明之方法的另一實施例流程圖；圖21係顯示本發明之方法的另一實施例流程圖；圖22係顯示本發明之方法的另一實施例流程圖；圖23係顯示本發明之方法的另一實施例流程圖；圖24係顯示本發明之方法的另一實施例流程圖；圖25係顯示本發明之方法的另一實施例流程圖；圖26係顯示本發明之方法的另一實施例流程圖；圖27係顯示本發明之方法的另一實施例流程圖；圖28A係顯示本發明之方法的應用實例示意圖；圖28B係顯示本發明之方法的應用實例示意圖；圖29係顯示本發明之方法的另一實施例流程圖；圖30係顯示本發明之方法的另一實施例流程圖；圖31係顯示本發明之方法的使用情境示意圖；圖32係顯示本發明之方法的使用情境示意圖；以及圖33係顯示本發明之方法的使用情境示意圖。 Figure 1 is a schematic diagram of the prior art; Figure 2 is another schematic diagram of the prior art; Figure 3 is another schematic diagram of the prior art; Figure 4 is another schematic diagram of the prior art; Figure 5 is another schematic diagram of the prior art; Figure 6 is another schematic diagram of the prior art; Figure 7 is another schematic diagram of the prior art; Figure 8 is a schematic diagram showing the system of the present invention; Figure 9 is a schematic diagram showing the electronic device of the system of the present invention; Figure 10 is a schematic diagram showing the server of the system of the present invention; Figure 11 is a schematic diagram showing the application program of the system of the present invention; Figure 12 is a schematic diagram showing the trademark online application module of the system of the present invention; Figure 13 is a schematic diagram showing the category recommendation module of the system of the present invention; Figure 14 is another schematic diagram showing the application program of the system of the present invention; Figure 15 is a schematic diagram showing the risk assessment module of the system of the present invention; Figure 16 is another schematic diagram showing the system of the present invention; Figure 17 is another schematic diagram showing the system of the present invention; Figure 18 is a flow chart showing the method of the present invention; Figure 19 is a flow chart showing another embodiment of the method of the present invention; Figure 20 is a flow chart showing another embodiment of the method of the present invention; Figure 21 is a flow chart showing another embodiment of the method of the present invention; Figure 22 is a flow chart showing another embodiment of the method of the present invention; Figure 23 is a flow chart showing another embodiment of the method of the present invention; Figure 24 is a flow chart showing another embodiment of the method of the present invention; Figure 25 is a flow chart showing another embodiment of the method of the present invention; Figure 26 is a flow chart showing another embodiment of the method of the present invention; Figure 27 is a flow chart showing another embodiment of the method of the present invention; Figure 28A is a schematic diagram showing an application example of the method of the present invention; Figure 28B is a schematic diagram showing an application example of the method of the present invention; Figure 29 is a flow chart showing another embodiment of the method of the present invention; Figure 30 is a flow chart showing another embodiment of the method of the present invention; Figure 31 is a schematic diagram showing the usage scenario of the method of the present invention; Figure 32 is a schematic diagram showing the usage scenario of the method of the present invention; and Figure 33 is a schematic diagram showing a usage scenario of the method of the present invention.

100:電子裝置 100: Electronic devices

200:網路 200:Internet

300:伺服器 300:Server

400:資料庫 400:Database

500:送件伺服器 500:Sending server

U:使用者 U:User

Claims

A semantic analysis trademark category recommendation system for receiving a string provided by a user by operating an electronic device and at least one target country to be queried, through a processor of the electronic device through a network interface controller Connect to a server and execute an application to calculate and generate a trademark category recommendation report. The system at least includes: An input module is used to receive the string input by the user, label the string, and send a string of information; A category recommendation module is a computational model trained through natural language models and trademark classification tables and details to parse string information into trademark category recommendation information, which also includes: A text parsing unit used to parse the translated string information, extract keywords and industry information, and send them; A category item comparison unit is used to receive the keywords and industry information, compare it with the trademark category database of the target country, and calculate to generate at least one recommended category and at least one recommended product item; and A report generating unit is used to receive the recommended category, the recommended product item and the target country, and use the input language to generate a trademark category recommendation report through a templated format.

A semantic analysis trademark category recommendation system as described in claim 1, wherein the system also includes a multilingual translation model, which is a computing model trained through large-scale language translation to receive the string information and determine the user input Whether the string is the same as the official language of the target country. If it is different, the string will be translated into the official language of the target country and all the translated string information will be sent.

For example, in the semantic analysis trademark category recommendation system described in request item 1, the string may further include trademark text, trademark description, business behavior, company name, stock code, product name, service name, or a combination thereof.

The semantic analysis trademark category recommendation system as described in claim 1, wherein the way the user inputs the string may be text input, voice input or video input.

The semantic analysis trademark category recommendation system described in claim 1, wherein the input module further includes an information retrieval unit for actively retrieving relevant information from the tagged string information to the cyberspace, and This information can be sent to the multilingual translation model for translation.

In the semantic analysis trademark category recommendation system described in claim 1, the trademark category database described in the category item comparison unit further includes the trademark category database of the target country and the international Nice trademark category database.

The semantic analysis trademark category recommendation system as described in claim 1, wherein the system further includes a risk assessment module for comparing keywords extracted from the string with the trademark database of the target country, And produce a risk information.

For the semantic analysis trademark category recommendation system described in claim 6, the risk information can be further integrated into the trademark category recommendation report through the report generation unit.

The semantic analysis trademark category recommendation system as described in request item 1, wherein the report can be presented in an online image, a downloadable document file, a file format that can be forwarded to communication software, or a file format that can be shared to social software. , electronic bulletin board or combination thereof.

For the semantic analysis trademark category recommendation system described in claim 1, the user can be selected from any combination of brand owners, trademark applicants, trademark practitioners, and legal practitioners.

The semantic analysis trademark category recommendation system as described in claim 1, wherein the category recommendation module further includes a knowledge graph unit, which is used to establish a node matrix association, and according to the equality between product items and trademark items in each country. A dynamic learning model in which price relationships evolve over time. The knowledge graph unit conversion at least includes the following steps: collecting and learning dynamic model data on the relationship between trademark product items and trademark projects in multiple countries, and deducing the final result through graph query and natural and grinding reasoning. The best match, shortest path, maximum flow, and optimize the equivalence relationship of product items based on the final decision results.

A semantic analysis trademark category recommendation method. The user operates an electronic device to query at least one target country, connects to a server through a network interface controller through a processor of the electronic device, and executes an application program to calculate The output method includes at least the following steps: (1) The user inputs a piece of description text arbitrarily through an input module of the electronic device; (2) A category recommendation module analyzes the description text and extracts keywords and industry-related text to form industry information; (3) Compare the parsed industry-related text with the target country’s trademark category database through a category recommendation module, output the comparison result based on an adjustable parameter, and generate At least one trademark category recommendation information of at least one recommended category and at least one recommended product item; and (4) using a report generation unit to generate a trademark category recommendation report from the trademark category recommendation information of step (3) in a templated format.

The semantic analysis trademark category recommendation method described in claim 11, wherein step (2) may further include identifying text extraction of the extracted keywords through a text analysis unit , which can be divided into: steps (21) The text parsing unit generates and extracts identifiable text by comparing it with the non-identifying text database; and step (22) the text parsing unit connects the language model of the category item comparison unit to compare industry-related text Convert to generate industry categories.

The semantic analysis trademark category recommendation method as described in claim 11, wherein step (1) further includes: Step (11) The user selects at least one target country for the trademark category recommendation to be generated; And further includes in step (3): Step (31) The category recommendation module compares the categories in the trademark category database of the target country based on the country selected by the user.

The semantic analysis trademark category recommendation method as described in request item 11, which further includes after step (3): Step (301) uses a risk assessment module to perform a risk assessment on the identifying words, and uses a search and comparison unit to compare the identifying words with the target country in at least one trademark category recommendation information generated in step (3). The trademark database is compared to generate search results; Step (302) The report generation unit integrates and analyzes the recommended information of at least one trademark category and the search results, performs similarity calculation and arrangement on the search results and the identifying words, and simultaneously calculates the categories of previous applications in the search results and The similarity of the recommended information for at least one trademark category, thereby generating an evaluation result; and Step (303) The report generating unit combines the at least one recommended category and the assessment result to generate a risk assessment report through the templated format.

The semantic analysis trademark category recommendation method as described in request item 12, which further includes after step (2): Step (21): Based on the industry-related text extracted in step (2), the information retrieval unit actively searches for and compares companies with similar industry-related text descriptions in external network information, and searches for the compared company information. A knowledge graph comparison unit sent to the recommendation module of the category; and Step (22) performs trademark category comparison through the knowledge graph comparison unit and generates at least one trademark category recommendation information.

The semantic analysis trademark category recommendation method as described in request item 14, which after step (11) also includes: Step (12) The input module determines whether the input string language is the same as the official language of the target country. If not, it will first be translated through the multilingual translation model to translate the string language into the official language of the target country. Language

The semantic analysis trademark category recommendation method described in claim 14, wherein the adjustable parameters are customized according to the user terminal, and the adjustable parameters can be determined by the user terminal preferences, user terminal habits, user terminal capabilities Choose from any of the options based on your ability to tolerate risk.

The semantic analysis trademark category recommendation method described in claim 14, wherein the trademark category recommendation information at least includes: one recommended category and at least one recommended product item.