TW201322018A - Method of analyzing personalized input automatically - Google Patents

Method of analyzing personalized input automatically Download PDF

Info

Publication number
TW201322018A
TW201322018A TW100143642A TW100143642A TW201322018A TW 201322018 A TW201322018 A TW 201322018A TW 100143642 A TW100143642 A TW 100143642A TW 100143642 A TW100143642 A TW 100143642A TW 201322018 A TW201322018 A TW 201322018A
Authority
TW
Taiwan
Prior art keywords
word
input
content
automatically analyzing
words
Prior art date
Application number
TW100143642A
Other languages
Chinese (zh)
Other versions
TWI477996B (en
Inventor
Ray Chao
Original Assignee
Iq Technology Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Iq Technology Inc filed Critical Iq Technology Inc
Priority to TW100143642A priority Critical patent/TWI477996B/en
Priority to CN2012102519841A priority patent/CN103136319A/en
Publication of TW201322018A publication Critical patent/TW201322018A/en
Application granted granted Critical
Publication of TWI477996B publication Critical patent/TWI477996B/en

Links

Abstract

A computer-implemented method for analyzing personalized input automatically. The method uses one or more processors to capture an input content in the background. The input content is analyzed by said one or more processors to generate an output content, which is a personalized key word, phrase, or a combination of them. The output content is sent to a receiving terminal to retrieve personalized information.

Description

自動分析個人化輸入之方法Automatic analysis of personalized input methods

本發明係關於一種自動分析個人化輸入之方法,特別指一種在背景下自動分析個人化輸入,以產生輸出內容,進而取得一接收端之個人化訊息的方法。The present invention relates to a method for automatically analyzing personalized input, and more particularly to a method for automatically analyzing personalized input in the background to generate output content, thereby obtaining a personalized message at the receiving end.

二十一世紀為資訊科技的世紀,隨著電子裝置及資訊系統的成長及網路資訊的多元化,透過這三者將虛擬與現實社會緊密結合。在網際網路已成為資訊傳播的主要途徑之一後,網際網路使用者可以不受地理位置及時間的限制,在24小時全年無休的網際網路世界接收各式各樣的訊息。In the 21st century, the century of information technology, with the growth of electronic devices and information systems and the diversification of online information, the virtual and real society are closely integrated through these three. After the Internet has become one of the main ways of information dissemination, Internet users can receive a variety of messages in the 24-hour Internet world, regardless of location and time.

隨著資訊成長速度以等比級數增加,舉凡網際網路上的新聞、社群、部落格、微網誌等等的資訊正以極快的速度更新成長。由於這些資訊快速且大量的充斥於網際網路上,從這些天文數字般的資訊中去過濾並取得所需要的訊息變成使用者的負擔,因為過濾花費的時間比取得大量訊息所花費時間更多。As the growth rate of information increases in equal proportions, information on news, communities, blogs, microblogs, etc. on the Internet is growing at an extremely fast rate. Since this information is quickly and abundantly flooded on the Internet, filtering and obtaining the required information from these astronomical information becomes a burden on the user, because filtering takes more time than obtaining a large amount of information.

遂有網路服務者提供搜索或訂定關鍵字詞的服務供使用者訂閱資訊,從天文數字般的資訊中發掘符合使用者需要且有用的資訊,惟這類的服務在其實際執行上有不便與缺陷存在如下:Internet service providers provide services for searching or setting keyword terms for users to subscribe to information, and to find useful and useful information from astronomical information. However, such services have their actual implementation. The inconvenience and defects exist as follows:

1. 需使用者按照其需求自行提供關鍵字或詞,若使用者未提供關鍵字就無法獲得符合需求的資訊。1. Users are required to provide keywords or words according to their needs. If users do not provide keywords, they will not be able to obtain information that meets their needs.

2. 這些關鍵字或詞對使用者而言只是在某個時期(time period)符合其需求或意義,過了這個時期或滿足需求後這些關鍵字或詞所提供的資訊就變成毫無意義,使用者需要自行去更新關鍵字或詞,否者將持續接收到已變成不符合需求的訊息,造成困擾。2. These keywords or words are only relevant to the user's needs or meanings during a time period. After this period or after meeting the requirements, the information provided by these keywords or words becomes meaningless. Users need to update their keywords or words on their own, or they will continue to receive messages that have become non-compliant, causing problems.

緣是,為解決上述的不便的問題,相關領域莫不費盡心思來謀求解決之道,但長久以來未見適用的方式被發展完成。因此如何更有效率的在網際網路上取得符合個人需求的訊息,實屬當前重要研發課題之一,亦成為當前相關領域亟需改進的目標。The reason is that in order to solve the above-mentioned inconvenience, the relevant fields have not tried their best to find a solution, but the method that has not been applied for a long time has been developed. Therefore, how to obtain information that meets individual needs on the Internet more effectively is one of the current important research and development topics, and it has become an urgent target for improvement in related fields.

爰此,有鑑於上述各項問題,本發明之主要目的在提供一種方法係在背景下執行擷取輸入內容,並判讀分析該輸入內容,以產生一輸出內容,透過該輸出內容從一接收端取得符合該輸出內容之資訊。Accordingly, in view of the above various problems, the main object of the present invention is to provide a method for performing capture of input content in the background, and interpreting and analyzing the input content to generate an output content through which the output content is received from a receiving end. Get information that matches the output.

本發明之另一目的在提供一種判讀個人平日輸入之內容,找出個人常用的字或詞,作為關鍵字或詞以取得符合個人特定需求的服務資訊之方法。Another object of the present invention is to provide a method for interpreting content input by a person on a weekday basis, and finding a word or word commonly used by a person as a keyword or word to obtain service information that meets a specific needs of the individual.

本發明之另一目的在提供一種無須自行提供關鍵字,即可從一接收端自動且動態地取得個人化資訊。Another object of the present invention is to provide an automatic and dynamic acquisition of personalized information from a receiving end without providing a keyword.

為達上述目的,本發明提出一較佳實施係為一種自動分析個人化輸入之方法,用以取得一接收端的個人化訊息,該方法係利用一或多個處理器執行下列步驟:利用該處理器在一背景下自動擷取一輸入內容;藉由該處理器判讀該輸入內容;產生一輸出內容;以及將該輸出內容傳送至該接收端。In order to achieve the above object, the present invention provides a preferred embodiment for automatically analyzing a personalized input for obtaining a personalized message at a receiving end, the method of using the one or more processors to perform the following steps: Automatically capturing an input content in a background; interpreting the input content by the processor; generating an output content; and transmitting the output content to the receiving end.

上述判讀該輸入內容步驟包括:載入一原始字詞庫;比對該輸入內容與該原始字詞庫,以判斷是否可找出對應的原始字詞;若是,將該原始字詞之輸入頻率與一第一設定值比對,以判斷該輸入頻率是否高於該第一設定值;若否,則結束該判讀步驟。上述產生一輸出內容步驟係包括將輸入頻率高於該第一設定值之原始字詞從該原始字詞庫中提取,所提取之原始字詞係包括單一字或詞或複數個字或詞之集合。The step of interpreting the input content includes: loading an original word library; comparing the input content with the original word library to determine whether the corresponding original word can be found; and if so, inputting the original word frequency Comparing with a first set value to determine whether the input frequency is higher than the first set value; if not, ending the interpreting step. The step of generating an output content includes extracting an original word whose input frequency is higher than the first set value from the original word dictionary, and extracting the original word system includes a single word or a word or a plurality of words or words. set.

上述判讀該輸入內容之步驟更包括:載入一學習字詞庫;將不包含在原始字詞庫內的輸入內容與該學習字詞庫比對,以判斷是否可找出對應的學習字詞;若是,將該學習字詞之輸入頻率與一第二設定值比對,以判斷該輸入頻率是否高於該第二設定值;若否,則將該輸入內容中不包含在該原始字詞庫之字或詞,儲存於該學習字詞庫,並將該儲存之學習字詞之輸入頻率與該第二設定值比對,以判斷該輸入頻率是否高於該第二設定值。The step of interpreting the input content further includes: loading a learning word library; comparing the input content not included in the original word library with the learning word database to determine whether the corresponding learning word can be found. If yes, comparing the input frequency of the learning word with a second set value to determine whether the input frequency is higher than the second set value; if not, the input content is not included in the original word The word or word of the library is stored in the learning word library, and the input frequency of the stored learning word is compared with the second set value to determine whether the input frequency is higher than the second set value.

上述產生一輸出內容步驟更包括將輸入頻率高於該第二設定值之學習字詞從該學習字詞庫中提取,所提取之學習字詞包括單一字或詞或複數個字或詞之集合,該輸出內容係為單一關鍵字或詞或複數個關鍵字或詞之集合。The step of generating an output content further includes extracting a learning word whose input frequency is higher than the second set value from the learning word dictionary, and the extracted learning word includes a single word or a word or a plurality of words or a collection of words. The output is a single keyword or a collection of words or a plurality of keywords or words.

上述判讀該輸入內容之步驟更包括:將該學習字詞庫中之學習字詞的輸入次數與一第三設定值比對,以判斷該輸入次數是否高於該第三設定值;以及將輸入次數高於該第三設定值之學習字詞存入該原始字詞庫,所述的輸入內容係包括單一字或詞或複數個字或詞之集合。The step of interpreting the input content further includes: comparing the input number of the learning words in the learning word database with a third set value to determine whether the input number is higher than the third set value; and inputting The learning words whose number is higher than the third set value are stored in the original word dictionary, and the input content includes a single word or a word or a plurality of words or a collection of words.

上述方法更包括一過濾步驟,係對該輸入內容進行分析,以過濾贅字或詞,且該贅字或詞係包括感官動詞、程度副詞、介系詞、連接詞、代名詞或虛詞。或另外包括一關聯性分析步驟,係對該輸出內容進行關聯性分析。上述自動擷取係進一步指定一或多個時間範圍,並在該時間範圍內擷取該輸入內容。上述方法更包括:從該接收端取得一反饋資訊,並進而呈現該反饋資訊。The above method further comprises a filtering step of analyzing the input content to filter the 赘 word or the word, and the 赘 word or word system includes a sensory verb, a degree adverb, a mediation word, a conjunction word, a pronoun or a function word. Or additionally including an association analysis step, the association analysis is performed on the output content. The automatic capture system further specifies one or more time ranges and captures the input content within the time range. The method further includes: obtaining a feedback information from the receiving end, and further presenting the feedback information.

為使更進一步瞭解本發明之特徵及技術內容。以下舉出較佳實施例以詳細說明本發明之內容,並以圖示作為輔助說明。說明中提及之符號係參照圖式符號。In order to further understand the features and technical contents of the present invention. The preferred embodiments are described below in order to explain the details of the present invention, and are illustrated by the accompanying drawings. The symbols mentioned in the description refer to the schema symbols.

本發明係為一種自動分析個人化輸入之方法,係應用於取得一接收端的個人化訊息。The invention is a method for automatically analyzing personalized input, which is applied to obtain a personalized message of a receiving end.

請參閱第1及2圖係為本發明之第一較佳實施例。如第1圖所示係為本發明之自動分析個人化輸入之方法流程圖,該方法係利用一或多個處理器執行下列步驟:Please refer to Figures 1 and 2 for a first preferred embodiment of the present invention. FIG. 1 is a flow chart of a method for automatically analyzing personalized input according to the present invention, which uses one or more processors to perform the following steps:

步驟110:利用該處理器在一背景下自動擷取一輸入內容;Step 110: automatically extract an input content in a background by using the processor;

步驟120:藉由該處理器判讀該輸入內容;Step 120: The input content is interpreted by the processor;

步驟130:產生一輸出內容;Step 130: Generate an output content.

步驟140:將該輸出內容傳送至該接收端;Step 140: transmitting the output content to the receiving end;

步驟150:從該接收端取得一反饋資訊Step 150: Obtain a feedback information from the receiving end

步驟160:呈現該反饋資訊。Step 160: Present the feedback information.

前述步驟110中的輸入內容為一個字或詞或複數個字或詞之集合,且該輸入內容在一具體的實施方式中係可利用各種輸入方法達成,例如以硬鍵盤、虛擬鍵盤輸入字元、用觸控螢幕以手寫輸入文字或透過麥克風以語音輸入,惟本發明不限於此處所列舉。The input content in the foregoing step 110 is a word or a word or a plurality of words or a collection of words, and the input content can be achieved by using various input methods in a specific implementation manner, for example, inputting characters by using a hard keyboard or a virtual keyboard. The touch screen is used to input text by hand or voice input through a microphone, but the invention is not limited to the ones listed herein.

在本步驟110中,背景下自動擷取係指在使用者不知情的狀況下利用本方法紀錄使用者的輸入內容。在一具體可行實施中係藉由一硬鍵盤執行經常性的輸入動作,例如在文書作業系統上輸入內容時,可經由一處理器在該文書作業系統處理該硬鍵盤輸入序列的過程中,同時主動記錄該硬鍵盤輸入序列。In this step 110, the automatic capture in the background uses the method to record the user's input content without the user's knowledge. In a specific implementation, a regular input action is performed by a hard keyboard, for example, when inputting content on a paperwork system, the processor can process the hard keyboard input sequence through the paperwork system while simultaneously Actively record the hard keyboard input sequence.

前述的實施其包含下列三種方式:第一種係啟動一管理程序(Hypervisor),並利用中斷與設陷機制,取得鍵盤輸入序列;第二種係在作業系統核心階層,實作一新的設備驅動程式,取代原有設備驅動程式功能,以達到擷取鍵盤輸入序列之目的;第三種係利用作業系統(OS)或外掛輸入法程式所提供之程式介面,取得鍵盤輸入序列。The foregoing implementation includes the following three methods: the first system starts a hypervisor, and uses the interrupt and trap mechanism to obtain a keyboard input sequence; the second system implements a new device at the core level of the operating system. The driver replaces the original device driver function to achieve the purpose of capturing the keyboard input sequence; the third system uses the program interface provided by the operating system (OS) or the plug-in input method to obtain the keyboard input sequence.

另外利用觸控螢幕以手寫輸入文字、虛擬鍵盤輸入字元或透過麥克風以語音輸入,亦可藉由上述三種實施方式中的至少一種方式達成。以上所描述之方法,可應用於各種作業系統(OS),如Microsoft Windows系列、Linux系列、Unix系列、Android系列或Mac OS X系列等,惟本發明不限於此處所列舉。In addition, using the touch screen to input text by hand, virtual keyboard input characters or voice input through a microphone may also be achieved by at least one of the above three embodiments. The method described above can be applied to various operating systems (OS) such as the Microsoft Windows series, the Linux series, the Unix series, the Android series, or the Mac OS X series, but the present invention is not limited to the ones listed herein.

再者步驟110係可限定在一或多個特定時段內進行之。例如使用者僅希望將下班時間之個人化輸入內容作為自動分析之對象,則可將該特定時段設定為晚上七點至早上九點。其具體實現方式藉由執行一程式碼,由使用者指定一或多個特定時段,並經由擷取系統時間與該特定時段比對,判斷是否呼叫自動擷取輸入內容之程式碼。Further, step 110 can be performed within one or more specific time periods. For example, if the user only wants to personalize the input of the off-duty time as the object of automatic analysis, the specific time period can be set to 7:00 to 9:00 in the morning. The specific implementation manner determines whether the call automatically captures the code of the input content by executing a code, the user specifies one or more specific time periods, and compares the system time with the specific time period.

請續參閱第2圖所示係更詳細說明上述步驟120及步驟130之流程圖。如第2圖所示前述步驟120的具體執行包括下列步驟:Please refer to FIG. 2 for a more detailed description of the above steps 120 and 130. The specific execution of the foregoing step 120 as shown in FIG. 2 includes the following steps:

步驟210:載入一原始字詞庫。此原始字詞庫係儲存一個或多個原始字詞,該原始字詞係為一般人們習知或於字辭典中明確定義之字或詞或複數個字或詞之集合;Step 210: Load a raw word library. The original word library stores one or more original words, which are words or words or a plurality of words or words that are generally known or commonly defined in a word dictionary;

步驟220:比對該輸入內容與該原始字詞庫,以判斷是否可找出對應的原始字詞,若是則執行步驟230,若否則執行步驟240;Step 220: Comparing the input content with the original word library to determine whether the corresponding original word can be found, and if so, executing step 230, if otherwise, performing step 240;

步驟230:則將該原始字詞之輸入頻率與一第一設定值比對,以判斷該輸入頻率是否高於該第一設定值,若否則執行步驟240,若是則執行步驟250;Step 230: The input frequency of the original word is compared with a first set value to determine whether the input frequency is higher than the first set value, if otherwise step 240 is performed, if yes, step 250 is performed;

步驟240:結束該判讀步驟;Step 240: End the interpretation step;

步驟250:將輸入頻率高於該第一設定值之原始字詞從該原始字詞庫中提取,以產生該輸出內容(步驟130),該輸出內容係對該輸入內容(步驟120)進行分析而產生單一關鍵字或詞或複數關鍵字或詞之集合。Step 250: Extract the original word whose input frequency is higher than the first set value from the original word library to generate the output content (step 130), and the output content analyzes the input content (step 120) Produce a single keyword or a collection of words or plural keywords or words.

復參閱第1及2圖所示,前述步驟140將該輸出內容傳送至該接收端。此接收端可為一資料庫、一新聞訂閱服務伺服器或是一多媒體訂閱服務伺服器,惟本發明不限於此處所列舉。Referring to Figures 1 and 2, the foregoing step 140 transmits the output to the receiving end. The receiving end can be a database, a news subscription service server or a multimedia subscription service server, but the invention is not limited to the ones listed herein.

在本步驟中,該輸出內容係透過串列介面、藍芽介面、網際網路或無線行動網路等的電信連結,以傳送至該接收端所提供之服務介面,如資料庫關鍵字詞查詢介面、新聞訂閱服務之關鍵字詞輸入介面或多媒體訂閱服務之關鍵字詞輸入介面,惟本發明不限於此處所列舉。In this step, the output content is transmitted to the service interface provided by the receiving end through a telecommunication link such as a serial interface, a Bluetooth interface, an internet network or a wireless mobile network, such as a database keyword query. The keyword input interface of the interface, the news subscription service, or the keyword input interface of the multimedia subscription service, but the invention is not limited to the ones listed herein.

另外前述之步驟120,步驟130及步驟140這三個步驟中至少有一個步驟係可於於背景下進行,亦即當該步驟進行時不主動告知使用者。In addition, at least one of the three steps of step 120, step 130 and step 140 may be performed in the background, that is, the user is not actively informed when the step is performed.

前述步驟150從該接收端取得一反饋資訊。在本步驟中該反饋資訊係由該接收端提供,且其具體實施係為該接收端根據該輸出內容(即關鍵字或詞或其結合)到前述的服務介面搜尋符合該輸出內容的訊息,所符合的訊息即為個人化訊息,其包括文字訊息、網頁連結或多媒體資料等,但不限於此處所列舉。再透過例如電子郵件、短訊息、多媒體訊息、簡易資訊聚合文件、網頁或上述之組合的方式等(惟本發明不限於此處所列舉)取得個人化訊息。The foregoing step 150 obtains a feedback information from the receiving end. In this step, the feedback information is provided by the receiving end, and the specific implementation is that the receiving end searches for the message conforming to the output content according to the output content (ie, the keyword or the word or a combination thereof) to the foregoing service interface. The message that is met is a personalized message, including text messages, web links or multimedia materials, but is not limited to the ones listed here. The personalized message is obtained through, for example, an e-mail, a short message, a multimedia message, a simple information aggregation file, a web page, or a combination thereof, but the invention is not limited to the ones listed herein.

前述步驟160呈現該反饋資訊。在本步驟中具體的實現方式係以具有顯示功能的電子設備例如電腦、手持式裝置、媒體播放裝置等(惟本發明不限於此處所列舉),去顯現從接收端的傳回的個人化訊息。The aforementioned step 160 presents the feedback information. The specific implementation in this step is an electronic device having a display function such as a computer, a handheld device, a media playback device, etc. (but the invention is not limited to the ones listed herein) to visualize the personalized message transmitted from the receiving end.

請參閱第3A及3B圖所示,係為本發明第二較佳實施例,如第3A圖所示,本較佳實施例的步驟大部分係與前述之第一較佳實施例相同,在此即不再對相同的步驟及符號贅述,惟本較佳實施例與前述較佳實施例不同之處係為本較佳實施例步驟包括:Referring to Figures 3A and 3B, which are the second preferred embodiment of the present invention, as shown in Figure 3A, the steps of the preferred embodiment are mostly the same as the first preferred embodiment described above. Therefore, the same steps and symbols are not described again, but the preferred embodiment differs from the foregoing preferred embodiment in that the steps of the preferred embodiment include:

步驟220:比對該輸入內容與該原始字詞庫,以判斷是否可找出對應的原始字詞,若是則執行步驟230,若否則執行步驟310;Step 220: Comparing the input content with the original word library to determine whether the corresponding original word can be found, if yes, executing step 230, if otherwise, performing step 310;

步驟310:載入一學習字詞庫。此學習字詞庫係儲存一個或多個學習字詞,該學習字詞基本上不存在於該原始字詞庫,通常為人名、公司名稱、產品型號或是自創之字詞,但不限於此處所列舉;Step 310: Load a learning term library. The learning word library stores one or more learning words, the learning words are basically not present in the original word library, usually a person name, a company name, a product model or a self-made word, but are not limited thereto. Listed here;

步驟320:將不包含在原始字詞庫的輸入內容與該學習字詞庫比對,以判斷是否可找出對應的學習字詞,若否則執行步驟330,若是則執行步驟340;Step 320: Comparing the input content not included in the original word dictionary with the learning word library to determine whether the corresponding learning word can be found, if otherwise, performing step 330, if yes, executing step 340;

步驟330:將該輸入內容中不包含在該原始字詞庫之字或詞,儲存於該學習字詞庫,成為新增的學習字詞;Step 330: Store the words or words in the input content that are not included in the original word dictionary in the learning word database, and become a new learning word;

步驟340:將該學習字詞之輸入頻率與一第二設定值比對,以判斷該輸入頻率是否高於該第二設定值,若否則執行前述步驟240結束該判讀步驟,若是則執行步驟350;Step 340: The input frequency of the learning word is compared with a second set value to determine whether the input frequency is higher than the second set value. Otherwise, the step 240 is performed to end the determining step, and if yes, step 350 is performed. ;

步驟350:將輸入頻率高於該第二設定值之學習字詞從該學習字詞庫中提取以產生該輸出內容(步驟130)。Step 350: Extract a learning word whose input frequency is higher than the second set value from the learning word library to generate the output content (step 130).

續如第3B圖所示,在本較佳實施例中,係可根據該學習字詞被取用的次數,將該學習字詞轉成慣知或慣用的原始字詞,其具體實現方式係在前述步驟310後執行下列步驟:As shown in FIG. 3B, in the preferred embodiment, the learning word can be converted into a conventional or customary original word according to the number of times the learning word is taken, and the specific implementation manner is After the foregoing step 310, the following steps are performed:

步驟360:將該學習字詞庫中之學習字詞的輸入次數與一第三設定值比對,以判斷該輸入次數是否高於該第三設定值;Step 360: Align the input number of the learning words in the learning word database with a third set value to determine whether the input number is higher than the third set value;

步驟370:將輸入次數高於該第三設定值之學習字詞存入該原始字詞庫。Step 370: Deposit the learning words whose input times are higher than the third set value into the original word dictionary.

請參閱第4A圖,係為本發明第三較佳實施例,如圖所示,本較佳實施例的步驟大部分係與前述之第一及第二較佳實施例相同,在此即不再對相同的步驟及符號贅述,惟本較佳實施例與第一及第二較佳實施例不同之處係為本較佳實施例的步驟更包括:Please refer to FIG. 4A, which is a third preferred embodiment of the present invention. As shown in the figure, most of the steps of the preferred embodiment are the same as the first and second preferred embodiments described above, and The same steps and symbols are repeated, but the preferred embodiment differs from the first and second preferred embodiments in that the steps of the preferred embodiment further include:

步驟410a:對該輸入內容進行分析,以過濾贅字或詞;再經由前述步驟120,判讀過濾後之輸入內容。Step 410a: analyzing the input content to filter the 赘 word or word; and then, through the foregoing step 120, interpreting the filtered input content.

所述之步驟410a其具體實現方式係可建立一贅字詞資料庫,再藉由比對該輸入內容與贅字詞資料庫以進行分析,進而過濾掉贅字或詞。所述之贅字詞包括感官動詞、程度副詞、介系詞、連接詞、代名詞或虛詞等等,惟本發明不限於此處所列舉。The specific implementation of the step 410a can establish a database of word words, and then analyze the input content and the vocabulary database to filter out the 赘 words or words. The above-mentioned words include sensory verbs, degree adverbs, mediators, conjunctions, pronouns or function words, etc., but the invention is not limited to the ones listed herein.

請參閱第4B圖,係為本發明第三較佳實施例另一實施之示意圖,如第4B圖所示,其與第4A圖的差異為:另包括一步驟410b對該輸出內容進行分析,以過濾贅字或詞;再經由前述之步驟140,將過濾後之輸出內容傳送至該接收端。Please refer to FIG. 4B, which is a schematic diagram of another embodiment of the third preferred embodiment of the present invention. As shown in FIG. 4B, the difference from FIG. 4A is: another step 410b is performed to analyze the output content. Filtering the word or word; and transmitting the filtered output to the receiving end via the foregoing step 140.

所述之步驟410b的具體實現方式係與前述步驟410a相同,茲不再贅述。The specific implementation of the step 410b is the same as the foregoing step 410a, and details are not described herein again.

請參閱第5圖,係為本發明第四較佳實施例,如圖所示,本較佳實施例的步驟大部分係與前述之第一及第二較佳實施例相同,在此即不再對相同的步驟及符號贅述,惟本較佳實施例與前述第一及第二較佳實施例不同之處係為本較佳實施例的步驟更包括:Referring to FIG. 5, which is a fourth preferred embodiment of the present invention, as shown in the figure, most of the steps of the preferred embodiment are the same as the first and second preferred embodiments described above, and The same steps and symbols are described, but the preferred embodiment is different from the foregoing first and second preferred embodiments in that the steps of the preferred embodiment further include:

步驟510:對該輸出內容進行關聯性分析。Step 510: Perform correlation analysis on the output content.

本步驟510的具體實現,例如其中一個輸出之關鍵字詞為某車輛品牌,另一關鍵字詞為「耗油」,藉由該關聯性分析,可得知此二關鍵字詞具有相當之關聯性,因此可將其視為同一組輸出內容,傳送至該接收端,以取得個人化訊息。該關聯性分析實施方式可利用資料探勘之關聯式規則學習(Association Rule Learning),預先建立字詞內容之相互關聯性,以作為提取關鍵字詞後進行關聯性分析之依據。The specific implementation of the step 510, for example, one of the output keyword words is a vehicle brand, and the other keyword word is “fuel consumption”. According to the correlation analysis, the two keyword words have a considerable correlation. Sex, so it can be treated as the same set of output and sent to the receiver to get personalized messages. The association analysis implementation method can use the Association Rule Learning of data exploration to pre-establish the correlation of the word content as the basis for the association analysis after extracting the keyword words.

請參閱第6圖,係為本發明第五較佳實施例,如圖所示,本較佳實施例的步驟大部分係與前述之第一較佳實施例相同,在此即不再對相同的步驟及符號贅述,惟本較佳實施例與前述第一較佳實施例不同之處係為本較佳實施例的步驟更包括:Referring to FIG. 6, which is a fifth preferred embodiment of the present invention, as shown in the figure, most of the steps of the preferred embodiment are the same as the first preferred embodiment described above, and the same is no longer the same. The steps and symbolic descriptions of the preferred embodiment are different from the foregoing first preferred embodiment. The steps of the preferred embodiment include:

步驟610:根據該輸出內容之每一個字或詞之輸入頻率給予該字或詞之一分數;Step 610: Give a score of the word or word according to the input frequency of each word or word of the output content;

步驟620:將該輸出內容之每一個字或詞關聯於多個類別中之至少一個類別;Step 620: Associate each word or word of the output content with at least one of a plurality of categories;

步驟630:將該輸出內容、其對應之分數、以及其關聯之類別傳送至該接收端。Step 630: Transfer the output content, its corresponding score, and its associated category to the receiving end.

其中步驟610中,通常輸入頻率愈高則分數亦愈高。在一實施例中,更可根據不同輸入時段計算輸入頻率,並對各輸入時段指定對應之權重,進而計算出該輸出內容之每一個字或詞之加權分數,並將該加權分數傳送至該接收端。再者,該輸入時段之定義,除了可依絕對時間劃分之,亦可針對個人相對時間區段進行定義,例如將早上九點至晚上六點定義為工作時段,或將晚上七點至十一點定義為休閒時段,惟其定義方式不限於此處所列舉。而步驟620,則可藉由與一關聯字分類資料庫進行比對,以決定所屬類別(如科學、娛樂、美食等)。In step 610, the higher the input frequency is, the higher the score is. In an embodiment, the input frequency is further calculated according to different input time periods, and corresponding weights are assigned to each input time period, thereby calculating a weighted score of each word or word of the output content, and transmitting the weighted score to the Receiving end. Furthermore, the definition of the input time period, in addition to being divided according to absolute time, can also be defined for the individual relative time zone, for example, 9:00 to 6:00 am is defined as the working time, or 7:00 to 11:00. Points are defined as leisure time, but the way they are defined is not limited to the ones listed here. Step 620, by comparing with a related word classification database, to determine the category (such as science, entertainment, food, etc.).

在本實施例中,待接收端取得輸出內容相關資料後,即可對其進行統計與分類,進而產生對應的使用者資料庫,該資料庫除可應用於前述各項服務外,亦可應用於社群服務。舉例來說,一社群網站可藉由上述資料庫,提取各個使用者高分之輸入內容,並進行分析比對,進而判定使用者間之關聯性大小,以提供交友撮合建議之反饋資訊。其中使用者間關聯性之判定,可進一步限定於一個或多個特定類別。In this embodiment, after the receiving end obtains the relevant content of the output content, it can be statistically and classified, and then generate a corresponding user database, which can be applied to the foregoing services, and can also be applied. For community services. For example, a social networking website can extract the input content of each user's high scores through the above-mentioned database, and perform analysis and comparison, thereby determining the correlation between the users, so as to provide feedback information of the friend's suggestion. The determination of the inter-user relevance may be further limited to one or more specific categories.

此外,在另一實施例中,該輸出內容,可藉由其他不同之使用者輸入方法與分析方法,而不限於由前述實施例所產生。In addition, in another embodiment, the output content may be generated by other different user input methods and analysis methods, and is not limited to the foregoing embodiment.

藉由本發明上述的實施,使用者毋需針對相關資訊服務自行輸入或更新關鍵字詞,而是利用一自動分析個人化輸入之方法,蒐集個人化的關鍵字詞,以動態地取得對使用者有意義之個人化訊息。With the above implementation of the present invention, the user does not need to input or update the keyword words for the relevant information service, but uses an automatic analysis method of personalization to collect personalized keyword words to dynamically obtain the user. Meaningful personalized messages.

雖然本發明以前述之較佳實施例揭露如上,然其並非用以限定本發明,任何熟習相像技藝者,在不脫離本發明之精神和範圍內,當可作些許之更動與潤飾,因此本發明之專利保護範圍須視本說明書所附之申請專利範圍所界定者為準。While the present invention has been described above in terms of the preferred embodiments thereof, it is not intended to limit the invention, and the invention may be modified and modified without departing from the spirit and scope of the invention. The patent protection scope of the invention is subject to the definition of the scope of the patent application attached to the specification.

110~160...步驟110~160. . . step

210~250...步驟210~250. . . step

310~370...步驟310~370. . . step

410a及410b...步驟410a and 410b. . . step

510...步驟510. . . step

610~630...步驟610~630. . . step

第1圖係為本發明第一較佳實施例主要步驟之流程圖;Figure 1 is a flow chart showing the main steps of the first preferred embodiment of the present invention;

第2圖係為本發明第一較佳實施例詳細步驟之流程圖;Figure 2 is a flow chart showing the detailed steps of the first preferred embodiment of the present invention;

第3A及3B係為本發明第二較佳實施例增加學習字詞庫操作步驟之流程圖;3A and 3B are flowcharts showing steps of adding a learning word dictionary to the second preferred embodiment of the present invention;

第4A圖係為本發明第三較佳實施例增加贅字詞過濾步驟之流程圖;Figure 4A is a flow chart showing the steps of filtering the word filtering according to the third preferred embodiment of the present invention;

第4B圖係為本發明第三較佳實施例增加另一贅字詞過濾步驟之流程圖;Figure 4B is a flow chart showing the steps of adding another 赘 word filtering step according to the third preferred embodiment of the present invention;

第5圖係為本發明第四較佳實施例增加關聯性分析步驟之流程圖;Figure 5 is a flow chart showing the steps of adding correlation analysis according to the fourth preferred embodiment of the present invention;

第6圖係為本發明第五較佳實施例增加輸出內容屬性紀錄步驟之流程圖。Figure 6 is a flow chart showing the steps of adding the output content attribute recording step in the fifth preferred embodiment of the present invention.

110~160...步驟110~160. . . step

Claims (22)

一種自動分析個人化輸入之方法,係應用於取得一接收端的個人化訊息,該方法係利用一或多個處理器執行下列步驟:利用該處理器在一背景下自動擷取一輸入內容;藉由該處理器判讀該輸入內容;產生一輸出內容;以及將該輸出內容傳送至該接收端。A method for automatically analyzing personalized input is applied to obtain a personalized message of a receiving end, wherein the method uses one or more processors to perform the following steps: automatically extracting an input content in a background by using the processor; Determining the input content by the processor; generating an output content; and transmitting the output content to the receiving end. 如申請專利範圍第1項所述之自動分析個人化輸入之方法,其中該輸入內容係包括單一字或詞或複數個字或詞之集合。A method of automatically analyzing personalized input as described in claim 1 wherein the input includes a single word or a plurality of words or a collection of words. 如申請專利範圍第2項所述之自動分析個人化輸入之方法,更包括:根據該輸出內容之每一個字或詞之輸入頻率給予該字或詞一分數,並將該分數傳送至該接收端。The method for automatically analyzing personalized input as described in claim 2, further comprising: giving the word or word a score according to an input frequency of each word or word of the output content, and transmitting the score to the receiving end. 如申請專利範圍第3項所述之自動分析個人化輸入之方法,其中根據該輸出內容之每一個字或詞之輸入頻率給予該字或詞一分數,並將該分數傳送至該接收端之步驟,該輸入頻率係根據不同輸入時段分別計算,並對各輸入時段指定對應之權重,進而計算出該輸出內容之加權分數,並將該加權分數傳送至該接收端。A method for automatically analyzing personalized input as described in claim 3, wherein the word or word score is given according to an input frequency of each word or word of the output content, and the score is transmitted to the receiving end. In step, the input frequency is separately calculated according to different input periods, and corresponding weights are assigned to each input period, and then the weighted score of the output content is calculated, and the weighted score is transmitted to the receiving end. 如申請專利範圍第2項所述之自動分析個人化輸入之方法,更包括:將該輸出內容中之每一個字或詞關聯於多個類別中之至少一個類別,並將該關聯之類別傳送至該接收端。The method for automatically analyzing personalized input as described in claim 2, further comprising: associating each word or word in the output content with at least one of the plurality of categories, and transmitting the category of the association To the receiving end. 如申請專利範圍第1項所述之自動分析個人化輸入之方法,其中判讀該輸入內容步驟包括:載入一原始字詞庫;比對該輸入內容與該原始字詞庫,以判斷是否可找出對應之原始字詞;若是,將該對應之原始字詞從該原始字詞庫中提取。The method for automatically analyzing personalized input as described in claim 1, wherein the step of interpreting the input includes: loading an original word library; comparing the input content with the original word library to determine whether Find the corresponding original word; if so, extract the corresponding original word from the original word library. 如申請專利範圍第6項所述之自動分析個人化輸入之方法,其中比對該輸入內容與該原始字詞庫,以判斷是否可找出對應之原始字詞之步驟更包括將該對應之原始字詞之輸入頻率與一第一設定值比對,以判斷該輸入頻率是否高於該第一設定值。The method for automatically analyzing personalized input as described in claim 6, wherein the step of comparing the input content with the original word library to determine whether the corresponding original word can be found includes the corresponding The input frequency of the original word is compared with a first set value to determine whether the input frequency is higher than the first set value. 如申請專利範圍第6項所述之自動分析個人化輸入之方法,其中該提取之原始字詞係包括單一字或詞或複數個字或詞之集合。A method for automatically analyzing personalized input as described in claim 6 wherein the extracted original word comprises a single word or a plurality of words or a collection of words. 如申請專利範圍第6或7項所述之自動分析個人化輸入之方法,其中判讀該輸入內容之步驟更包括:載入一學習字詞庫;將該輸入內容中不包含在該原始字詞庫之字或詞,儲存於該學習字詞庫;將該學習字詞之輸入頻率與一第二設定值比對,以判斷該輸入頻率是否高於該第二設定值;若是,將該對應之學習字詞從該學習字詞庫中提取。The method for automatically analyzing personalized input as described in claim 6 or 7, wherein the step of interpreting the input further comprises: loading a learning word library; the input content is not included in the original word. The word or word of the library is stored in the learning word library; the input frequency of the learning word is compared with a second set value to determine whether the input frequency is higher than the second set value; if so, the corresponding The learning words are extracted from the learning word library. 如申請專利範圍第9項所述之自動分析個人化輸入之方法,其中該提取之學習字詞包括單一字或詞或複數個字或詞之集合。A method of automatically analyzing personalized input as described in claim 9 wherein the extracted learning term comprises a single word or a plurality of words or a collection of words. 如申請專利範圍第9項所述之自動分析個人化輸入之方法,其中判讀該輸入內容之步驟更包括:將該學習字詞庫中之學習字詞的輸入次數與一第三設定值比對,以判斷該輸入次數是否高於該第三設定值;以及將該輸入次數高於該第三設定值之學習字詞存入該原始字詞庫。The method for automatically analyzing personalized input according to claim 9 of the patent application, wherein the step of reading the input content further comprises: comparing the input number of the learning words in the learning word database with a third set value And determining whether the input number is higher than the third set value; and storing the learning words whose input times are higher than the third set value into the original word dictionary. 如申請專利範圍第1項所述之自動分析個人化輸入之方法,更包括一過濾步驟,係對該輸入內容進行分析,以過濾贅字或詞。The method for automatically analyzing personalized input as described in claim 1 of the patent application further includes a filtering step of analyzing the input content to filter the 赘 word or word. 如申請專利範圍第1項所述之自動分析個人化輸入之方法,更包括一過濾步驟,係對該輸出內容進行分析,以過濾贅字或詞。The method for automatically analyzing personalized input as described in claim 1 of the patent scope further includes a filtering step of analyzing the output content to filter the word or word. 如申請專利範圍第12或13項所述之自動分析個人化輸入之方法,其中該贅字或詞係包括感官動詞、程度副詞、介系詞、連接詞、代名詞或虛詞。The method for automatically analyzing personalized input as described in claim 12 or 13, wherein the 赘 word or word system includes a sensory verb, a degree adverb, a preposition, a conjunction, a pronoun or a vocabulary. 如申請專利範圍第1項所述之自動分析個人化輸入之方法,更包括一關聯性分析步驟,係對該輸出內容進行關聯性分析。The method for automatically analyzing personalized input as described in claim 1 of the patent application further includes an association analysis step for performing correlation analysis on the output content. 如申請專利範圍第1項所述之自動分析個人化輸入之方法,其中該自動擷取係進一步指定一或多個時間範圍,並在該時間範圍內擷取該輸入內容。The method of automatically analyzing personalized input as described in claim 1, wherein the automatic capture further specifies one or more time ranges and captures the input content within the time range. 如申請專利範圍第1項所述之自動分析個人化輸入之方法,更包括:從該接收端取得一反饋資訊;以及呈現該反饋資訊。The method for automatically analyzing personalized input as described in claim 1 further includes: obtaining a feedback information from the receiving end; and presenting the feedback information. 如申請專利範圍第1項所述之自動分析個人化輸入之方法,其中判讀該輸入內容;產生一輸出內容,以及將該輸出內容傳送至該接收端這三步驟中至少一步驟係於一背景進行操作。The method for automatically analyzing personalized input as described in claim 1, wherein the input content is interpreted; an output content is generated, and at least one of the three steps of transmitting the output content to the receiving end is tied to a background. Take action. 如申請專利範圍第1項所述之自動分析個人化輸入之方法,其中該輸出內容係為單一關鍵字或詞或複數個關鍵字或詞之集合。The method of automatically analyzing personalized input as described in claim 1, wherein the output content is a single keyword or a plurality of keywords or a collection of words. 一種自動分析個人化輸入之方法,係應用於分析一個以上之使用者間輸入內容之關聯性,該方法係利用一或多個處理器執行下列步驟:提供一個以上之使用者之輸入內容;對每一個使用者,根據其輸入內容中之每一個字或詞之輸入頻率,給予該字或詞一對應分數;對每一個使用者之輸入內容,提取分數高於一預設值之字或詞;以及根據該提取之字或詞,分析不同使用者間輸入內容之關聯性。A method for automatically analyzing personalized input is used to analyze the relevance of input content between more than one user. The method uses one or more processors to perform the following steps: providing input content of more than one user; Each user gives a corresponding score to the word or word according to the input frequency of each word or word in the input content; for each user input content, extracts a word or word whose score is higher than a preset value And analyzing the relevance of input between different users based on the extracted words or words. 如申請專利範圍第20項所述之自動分析個人化輸入之方法,其中對每一個使用者,根據其輸入內容中之每一個字或詞之輸入頻率,給予該字或詞一對應分數之步驟,該輸入頻率係根據不同輸入時段分別計算,並對各輸入時段指定對應之權重,進而計算出該字或詞之加權分數,並根據該加權分數進行提取。The method for automatically analyzing personalized input as described in claim 20, wherein each user is given a corresponding score for the word or word according to the input frequency of each word or word in the input content. The input frequency is separately calculated according to different input periods, and corresponding weights are assigned to each input period, and then the weighted score of the word or word is calculated, and extracted according to the weighted score. 如申請專利範圍第20項所述之自動分析個人化輸入之方法,更包括:對每一個使用者,將該輸入內容中之每一個字或詞,分別關聯於多個類別中之至少一個類別;且該提取步驟,係針對該多個類別中之至少一個類別進行之。The method for automatically analyzing personalized input as described in claim 20, further comprising: for each user, each word or word in the input content is associated with at least one of the plurality of categories. And the extracting step is performed for at least one of the plurality of categories.
TW100143642A 2011-11-29 2011-11-29 Method of analyzing personalized input automatically TWI477996B (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
TW100143642A TWI477996B (en) 2011-11-29 2011-11-29 Method of analyzing personalized input automatically
CN2012102519841A CN103136319A (en) 2011-11-29 2012-07-19 Method for automatically analyzing personalized input

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
TW100143642A TWI477996B (en) 2011-11-29 2011-11-29 Method of analyzing personalized input automatically

Publications (2)

Publication Number Publication Date
TW201322018A true TW201322018A (en) 2013-06-01
TWI477996B TWI477996B (en) 2015-03-21

Family

ID=48496150

Family Applications (1)

Application Number Title Priority Date Filing Date
TW100143642A TWI477996B (en) 2011-11-29 2011-11-29 Method of analyzing personalized input automatically

Country Status (2)

Country Link
CN (1) CN103136319A (en)
TW (1) TWI477996B (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR101787248B1 (en) * 2016-04-14 2017-10-18 라인 가부시키가이샤 Method and system for keyword search using messaging service
TWI699674B (en) * 2019-07-26 2020-07-21 致伸科技股份有限公司 Independent inputting device with self-learning function

Family Cites Families (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPWO2006048998A1 (en) * 2004-11-05 2008-05-22 株式会社アイ・ピー・ビー Keyword extractor
US7849079B2 (en) * 2006-07-31 2010-12-07 Microsoft Corporation Temporal ranking of search results
CN101420313B (en) * 2007-10-22 2011-01-12 北京搜狗科技发展有限公司 Method and system for clustering customer terminal user group
TW201020817A (en) * 2008-11-19 2010-06-01 Inventec Corp Processing method for word(s) and electric device using the same
CN101739847A (en) * 2008-11-24 2010-06-16 英业达股份有限公司 Word processing method and electronic device adopting same
TW201118619A (en) * 2009-11-30 2011-06-01 Inst Information Industry An opinion term mining method and apparatus thereof
TWI443529B (en) * 2010-04-01 2014-07-01 Inst Information Industry Methods and systems for automatically constructing domain phrases, and computer program products thereof

Also Published As

Publication number Publication date
CN103136319A (en) 2013-06-05
TWI477996B (en) 2015-03-21

Similar Documents

Publication Publication Date Title
US10339161B2 (en) Expanding network relationships
US8972498B2 (en) Mobile-based realtime location-sensitive social event engine
US20160314393A1 (en) Clarification of Submitted Questions in a Question and Answer System
CN108898362B (en) Populating user contact entries
US11354340B2 (en) Time-based optimization of answer generation in a question and answer system
JP6404106B2 (en) Computing device and method for connecting people based on content and relationship distance
JP6538277B2 (en) Identify query patterns and related aggregate statistics among search queries
US20110125759A1 (en) Method and system to contextualize information being displayed to a user
US20150066968A1 (en) Authorship Enhanced Corpus Ingestion for Natural Language Processing
US10108698B2 (en) Common data repository for improving transactional efficiencies of user interactions with a computing device
US9069880B2 (en) Prediction and isolation of patterns across datasets
WO2021139343A1 (en) Data analysis method and apparatus based on natural language processing, and computer device
US10002187B2 (en) Method and system for performing topic creation for social data
US10073828B2 (en) Updating language databases using crowd-sourced input
CN110232126B (en) Hot spot mining method, server and computer readable storage medium
CN107330079B (en) Method and device for presenting rumor splitting information based on artificial intelligence
WO2016137690A1 (en) Efficient retrieval of fresh internet content
CN110019763B (en) Text filtering method, system, equipment and computer readable storage medium
TWI477996B (en) Method of analyzing personalized input automatically
US20160246794A1 (en) Method for entity-driven alerts based on disambiguated features
US20220292127A1 (en) Information management system
Jee et al. Potential of patent image data as technology intelligence source
RU2698916C1 (en) Method and system of searching for relevant news
CN110347818B (en) Word segmentation statistical method and device, electronic equipment and computer readable storage medium