TWI434024B - System for querying geographic information system latitude and longitude coordinates using speech recognition input and multi-mode output technology

Info

Publication number
TWI434024B
Authority
TW
Taiwan
Prior art keywords
latitude
mode output
voice recognition
querying
geographic information
Prior art date
Application number
TW96125945A
Other languages
Chinese (zh)
Other versions
TW200905167A (en)
Original Assignee
Chunghwa Telecom Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Chunghwa Telecom Co Ltd
Priority to TW96125945A
Publication of TW200905167A
Application granted
Publication of TWI434024B

Landscapes

  • Telephonic Communication Services (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Description

A system for querying geographic information system (GIS) latitude and longitude coordinates using speech recognition input and multi-mode output technology

The present invention relates to a system and method for querying GIS latitude and longitude coordinates using speech recognition input and multi-mode output technology. More particularly, it relates to a system and method in which a spoken query from a remote user is processed by large-vocabulary speech recognition and converted into GIS latitude and longitude coordinate information, and the query result is delivered to the user's terminal device by multi-mode output technology (for example by voice, text message, or real-time data transmission).

As satellite navigation devices become more widespread, many newly manufactured vehicles already offer satellite navigation as an optional feature, and it is expected to become standard factory equipment. When a navigation system is used to find an inconspicuous or remote destination, the common practice is to navigate first to a prominent landmark near the destination (such as a train station) and, on arriving there, to ask local people for the exact location. Used this way, satellite navigation delivers only part of its function.

User terminal devices have also diversified as technology has advanced, from the traditional landline telephone to today's multimedia mobile phones with navigation and PDA functions. Remote message output has likewise evolved from voice or fax to delivery by sound, text, pictures, and video. The coordinate query methods above have each been applied to various navigation devices, and some patents mention having a remote server process a destination query (for example the name, address, or landline telephone number of a business, institution, scenic spot, or residence) and return the destination's name, address, landline telephone number, and GIS latitude and longitude coordinates to the user. None of them, however, proposes a reasonably complete system and method for querying GIS latitude and longitude coordinate information using speech recognition input and multi-mode output technology.

The conventional approaches described above therefore still have many shortcomings; they are not well designed and are in need of improvement.

In view of the shortcomings of the conventional approaches, the inventors sought to improve on them and, after years of dedicated research, completed the present system and method for querying GIS latitude and longitude coordinates using speech recognition input and multi-mode output technology.

[Object of the Invention]

The object of the present invention is to provide a system and method for querying GIS latitude and longitude coordinates using speech recognition input and multi-mode output technology. Large-vocabulary speech recognition is applied to the audio signal input by a remote user, and the confirmed recognition result, namely the name, address, or landline telephone number of the destination (a business, institution, scenic spot, or residence), is converted into the corresponding GIS latitude and longitude coordinates. For example, the coordinates of the Taipei 101 building are 121°33'51.1" E, 25°02'01.6" N, a format usable with the Global Positioning System (GPS). The query result (the destination's name, address, landline telephone number, and GIS latitude and longitude coordinates) can be sent to the user's terminal device by voice, text message, or real-time data transmission, so that the user can enter the coordinates, address, telephone number, or other related information into an in-vehicle or portable navigation device and reach the destination quickly and correctly.
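Many navigation devices accept coordinates as decimal degrees rather than degrees, minutes, and seconds. As an illustration only, the Taipei 101 example above can be converted as sketched below; the function name `dms_to_decimal` and the single-letter hemisphere codes are assumptions made for this sketch and do not appear in the patent.

```python
def dms_to_decimal(degrees, minutes, seconds, hemisphere):
    """Convert degrees/minutes/seconds to signed decimal degrees.

    hemisphere is one of "N", "S", "E", "W" (an assumed convention);
    southern and western values are returned as negative numbers.
    """
    if hemisphere not in ("N", "S", "E", "W"):
        raise ValueError(f"unknown hemisphere: {hemisphere!r}")
    value = degrees + minutes / 60 + seconds / 3600
    return -value if hemisphere in ("S", "W") else value


# Taipei 101 coordinates as quoted in the text above.
longitude = dms_to_decimal(121, 33, 51.1, "E")
latitude = dms_to_decimal(25, 2, 1.6, "N")
print(f"{latitude:.6f}, {longitude:.6f}")
```

The conversion is plain arithmetic: one minute is 1/60 of a degree and one second is 1/3600 of a degree.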

The system and method for querying GIS latitude and longitude coordinates using speech recognition input and multi-mode output technology that achieve the above object comprise the following four modules: (1) A dialogue module, which receives the voice query sent by a remote user and passes it to the speech recognition module of the system for processing. Query results can also be sent to the user's terminal device through this module.

(2) A speech recognition module, which processes the voice query received by the dialogue module. This module contains a large-vocabulary speech recognition unit that recognizes the content of the speech signal input by the user.

(3) A database module, containing a database unit that stores the address, name, landline telephone number, speech recognition grammar, and GIS latitude and longitude coordinate fields of query destinations.

(4) A multi-mode output module, which delivers the query result to the user by speech synthesis, text message, or data transmission through the dialogue module of the invention. This module contains a speech synthesis unit, a multimedia message system (MMS) unit, and a real-time data transmission unit: (1) the speech synthesis unit generates speech announcing the query result, which can be sent through the dialogue module to the user's mobile phone or landline telephone; (2) the MMS unit sends the destination's name, address, landline telephone number, and GIS latitude and longitude coordinates as a message through the dialogue module to the user's mobile phone; (3) the real-time data transmission unit uses IP (Internet Protocol) packets to send the destination's name, address, landline telephone number, and GIS latitude and longitude coordinates through the dialogue module to the user's mobile phone, PDA, or personal computer.

Dividing the system into these four modules makes it easy to scale the system flexibly according to traffic volume.

Referring to FIG. 1, which is the system architecture diagram of the system and method for querying GIS latitude and longitude coordinates using speech recognition input and multi-mode output technology, the system comprises a dialogue module 101; a speech recognition module 102, containing a large-vocabulary speech recognition unit 105; a database module 103, containing a database unit 109; and a multi-mode output module 104, containing a speech synthesis unit 106, a multimedia message system unit 107, and a real-time data transmission unit 108. A user 122, a customer service agent 119, and a satellite navigation device 120 connect to the system using telephone CPE (Customer Premises Equipment), such as a mobile phone 115 and a landline telephone 116, or data CPE, such as a personal digital assistant 117 and a personal computer 118. The connection runs through either a telecommunication network access interface, comprising a digital switch 110, a 3G/GPRS gateway 111, and a mobile phone base station 121, or an Internet access interface, comprising a router 112, a wireless network base station 113, and a firewall 114.

The dialogue module 101 is a computer device comprising a telephone interface card, a network interface card, and user dialogue interface software. The type of telephone interface card and the communication protocol it uses must match the protocol settings of the switch, and the card can recognize DTMF signal content. The dialogue module 101 receives and processes the voice query sent by the remote user 122 and sends the query result to the user's terminal device through the telephone interface card or the network interface card.

The speech recognition module 102 processes the voice query received by the dialogue module 101 using the large-vocabulary speech recognition unit 105. The result is passed to the database module 103 for retrieval and conversion, which yields the destination's name, address, landline telephone number, and GIS latitude and longitude coordinates.

The large-vocabulary speech recognition unit 105 is a software program that recognizes the speech signal input by the user 122 and converts it into text. The speech input may be the name, address, or landline telephone number of the destination (a business, institution, scenic spot, or residence). Recognition produces one or more best candidates, and the user 122 selects the correct one through the dialogue module 101 by key press or voice input.
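The candidate selection step described above can be sketched as follows. This is a minimal illustration, assuming that DTMF key "1" selects the first candidate, "2" the second, and so on; the patent does not specify the key mapping, and `choose_candidate` is a hypothetical name.

```python
def choose_candidate(candidates, dtmf_digit):
    """Return the candidate selected by a single DTMF key press.

    Key "1" maps to the first candidate (an assumed mapping). Returns
    None when the key is not a digit 1-9 or is beyond the list length.
    """
    if len(dtmf_digit) != 1 or dtmf_digit not in "123456789":
        return None
    index = int(dtmf_digit) - 1
    if index >= len(candidates):
        return None
    return candidates[index]


# Hypothetical recognition candidates for one utterance.
n_best = ["Taipei 101", "Taipei Main Station"]
print(choose_candidate(n_best, "2"))
```

Returning `None` for an invalid key leaves it to the caller to decide whether to re-prompt the user or start a new recording.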

The database module 103 is a storage device, which may be a memory such as a flash disk or a hard disk, or the hardware storage of a remote server. This module contains a database unit 109 that stores and processes the address, landline telephone number, speech recognition grammar, and GIS latitude and longitude coordinate fields of destinations (businesses, institutions, scenic spots, or residences). The formatted text produced by the speech recognition module 102 (a destination name, address, or landline telephone number) is sent to the database for retrieval and conversion, which yields the destination's name, address, landline telephone number, and GIS latitude and longitude coordinates. The query result is then passed to the multi-mode output module 104, which sends it in one of several media formats through the dialogue module 101 to the user's terminal device.
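The retrieval step can be pictured as a keyed lookup from recognized text to a destination record. The sketch below uses Python's standard `sqlite3` module with an in-memory database purely for illustration; the table name, column names, and the placeholder address and telephone number are assumptions, and the patent does not name a database product or schema. Only the coordinates are taken from the Taipei 101 example in the text.

```python
import sqlite3


def build_demo_db():
    """Create an in-memory table with one illustrative destination record."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE destinations "
        "(name TEXT, address TEXT, phone TEXT, lat REAL, lon REAL)"
    )
    conn.execute(
        "INSERT INTO destinations VALUES (?, ?, ?, ?, ?)",
        # Address and phone are placeholders, not real data.
        ("Taipei 101", "(placeholder address)", "02-0000-0000",
         25.033778, 121.564194),
    )
    return conn


def lookup(conn, field, value):
    """Look up a destination by name, address, or phone; return a row or None."""
    if field not in ("name", "address", "phone"):
        raise ValueError(f"unsupported query field: {field!r}")
    # The column name comes from the whitelist above; the value is bound safely.
    sql = (
        "SELECT name, address, phone, lat, lon "
        f"FROM destinations WHERE {field} = ?"
    )
    return conn.execute(sql, (value,)).fetchone()


conn = build_demo_db()
print(lookup(conn, "name", "Taipei 101"))
print(lookup(conn, "phone", "02-0000-0000"))
```

Whichever of the three fields the user speaks, the lookup returns the same full record, which matches the text's description of one query yielding name, address, telephone number, and coordinates together.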

The multi-mode output module 104 sends the destination's name, address, landline telephone number, and GIS latitude and longitude coordinates in one of several media formats through the dialogue module 101 to the user's terminal device. The media format is voice, text message, or real-time data. The user's terminal device may be a mobile phone 115, a landline telephone 116, a PDA 117, or a personal computer 118.

The speech synthesis unit 106 handles the voice output of the system. It converts the text query result obtained from the database module 103 into the corresponding speech signal and plays it to the user 122 over the PSTN (Public Switched Telephone Network) or PLMN (Public Land Mobile Network). The speech signal output by this unit may be pre-recorded or generated by text-to-speech technology.

The multimedia message system unit 107 sends the destination's name, address, landline telephone number, and GIS latitude and longitude coordinates to the user's mobile phone as a message. This information may be presented as plain text or as a message attachment, so that the user 122 can import the coordinates into a satellite navigation device. Message delivery is asynchronous, so the query result reaches the user's mobile phone only after a short delay.

The real-time data transmission unit 108 uses IP packets to deliver the destination's name, address, landline telephone number, and GIS latitude and longitude coordinates to the user's terminal device in real time. The user's terminal device may be a mobile phone 115, a PDA 117, or a personal computer 118.

The digital switch 110, the 3G/GPRS gateway 111, and the mobile phone base station 121 are telecommunication equipment. They switch the query sent by telephone CPE over the PSTN or PLMN and deliver it to the dialogue module 101 of the invention for processing. Telephone CPE comprises the mobile phone 115 and the landline telephone 116.

The router 112, the wireless network base station 113, and the firewall 114 are Internet equipment. They carry the packetized voice query (voice over a packet-switched network) sent by data CPE over wired or wireless network access to the dialogue module 101 of the invention for processing. Data CPE comprises the mobile phone 115, the personal digital assistant 117, or the personal computer 118.

The GIS latitude and longitude query result can be sent to the user's terminal device as voice, a text message, or real-time data through the dialogue module 101 and the telecommunication network or the Internet. The speech signal is generated by the speech synthesis unit 106 and sent over the telecommunication network to the mobile phone 115 or landline telephone 116. The message is sent by the multimedia message system unit 107 through the dialogue module 101 and the telecommunication network to the mobile phone 115. For real-time data, the real-time data transmission unit 108 sends the query result using a network protocol (such as HTTP) through the dialogue module 101 and the Internet to the mobile phone 115, personal digital assistant 117, or personal computer 118. After receiving the query result, the user 122 can enter it into the satellite navigation device 120.
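The pairing of output formats with terminal types described above can be summarized in a small table. The sketch below encodes that pairing and picks an output mode for a given terminal; the identifiers (`SUPPORTED_MODES`, `pick_mode`, the mode and terminal strings) and the fallback to the first supported mode are assumptions for illustration, not behavior stated in the patent.

```python
# Output modes each terminal type can receive, per the paragraph above:
# voice -> mobile phone or landline; MMS -> mobile phone;
# real-time data -> mobile phone, PDA, or personal computer.
SUPPORTED_MODES = {
    "mobile_phone": ("voice", "mms", "data"),
    "landline": ("voice",),
    "pda": ("data",),
    "pc": ("data",),
}


def pick_mode(terminal, preferred):
    """Return the preferred mode if the terminal supports it.

    Otherwise fall back to the terminal's first supported mode
    (an assumed policy). Unknown terminal types raise KeyError.
    """
    modes = SUPPORTED_MODES[terminal]
    return preferred if preferred in modes else modes[0]


print(pick_mode("mobile_phone", "mms"))
print(pick_mode("landline", "mms"))
```

A landline telephone can only play synthesized speech, so a request for a message on that terminal falls back to voice in this sketch.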

Referring to FIG. 2, which is the method flow chart of the system and method for querying GIS latitude and longitude coordinates using speech recognition input and multi-mode output technology: on receiving the query sent by the user 122, the dialogue module 101 passes it to the speech recognition module. After the large-vocabulary speech recognition unit 105 has recognized the voice message and the data retrieval and conversion have been performed, the multi-mode output module 104 sends the query result in one of several media formats through the dialogue module 101 and the telecommunication network or the Internet to the user's CPE device.

Referring to FIG. 3, which is the flow chart of the large-vocabulary speech recognition unit 105 of the system and method for querying GIS latitude and longitude coordinates using speech recognition input and multi-mode output technology: this flow involves the dialogue module 101 and the large-vocabulary speech recognition unit 105 of FIG. 1. After the user 122 dials into the query server system, the dialogue module 101 records the user's speech and passes the recording to the large-vocabulary speech recognition unit 105. Once the user 122 confirms the recognition result, the database module 103 and the multi-mode output module 104 send the query result through the dialogue module 101 and the telecommunication network or the Internet to the user's CPE device. If the recognition result is incorrect, the user 122 can record and recognize again, or speak directly with a customer service agent 119 through the dialogue module 101.
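The record, recognize, confirm loop of FIG. 3 can be sketched as a small control flow. In the sketch below, `recognize` and `confirm` are stand-in callables for the recording/recognition step and the user's confirmation, and the attempt limit before handing over to a customer service agent is an assumption; the patent only says the user may retry or speak with an agent.

```python
def run_query(recognize, confirm, max_attempts=3):
    """Repeat record-and-recognize until the user confirms a result.

    recognize: callable returning the recognized text for a new recording.
    confirm:   callable taking the text and returning True if it is correct.
    Returns ("confirmed", text), or ("agent", None) once max_attempts
    (an assumed limit) is exhausted and the call goes to an agent.
    """
    for _ in range(max_attempts):
        text = recognize()
        if confirm(text):
            return ("confirmed", text)
    return ("agent", None)


# Simulated session: the first recognition is wrong, the second is right.
utterances = iter(["Taipei 10", "Taipei 101"])
outcome = run_query(lambda: next(utterances), lambda text: text == "Taipei 101")
print(outcome)
```

Passing the two steps in as callables keeps the loop independent of any particular telephony or recognition software.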

[Features and Effects]

Compared with conventional techniques, the system and method for querying GIS latitude and longitude coordinates using speech recognition input and multi-mode output technology provided by the invention offer the following advantages: 1. The invention lets the user enter a query by voice. The query is sent over wired or wireless access to a remote server, where it is processed and converted into the destination's name, address, landline telephone number, and GIS latitude and longitude coordinates, and the result is returned to the user's terminal device by speech synthesis, text message, or real-time data transmission.

2. The voice input uses the destination's name, address, or landline telephone number to query GIS latitude and longitude coordinates, which shortens the time needed to find destination coordinates for a satellite navigation system.

The above detailed description concerns one feasible embodiment of the invention. The embodiment is not intended to limit the patent scope of the invention; any equivalent implementation or modification that does not depart from the spirit of the invention shall fall within its patent scope.

In summary, the invention is innovative in its technical concept and provides the effects described above, which conventional methods lack. It satisfies the statutory requirements of novelty and inventive step for an invention patent, and this application is filed accordingly with the request that the Office approve it.

101: Dialogue module
102: Speech recognition module
103: Database module
104: Multi-mode output module
105: Large-vocabulary speech recognition unit
106: Speech synthesis unit
107: Multimedia message system unit
108: Real-time data transmission unit
109: Database unit
110: Digital switch
111: 3G/GPRS gateway
112: Router
113: Wireless network base station
114: Firewall
115: Mobile phone
116: Landline telephone
117: Personal digital assistant (PDA)
118: Personal computer
119: Customer service agent
120: Satellite navigation device
121: Mobile phone base station
122: User

The technical content, objects, and effects of the invention can be further understood from the detailed description and the accompanying drawings, in which: FIG. 1 is the system architecture diagram of the system and method for querying GIS latitude and longitude coordinates using speech recognition input and multi-mode output technology; FIG. 2 is the method flow chart of the system and method; and FIG. 3 is the flow chart of the large-vocabulary speech recognition unit of the system and method.

Claims (23)

一種利用語音辨識輸入及多模式輸出技術來查詢地理資訊系統經緯度座標的系統,其中包括:a、一對話模組,係接收遠端使用者所傳送來之目的地查詢訊息,將該查詢訊息送交至語音辨識模組處理,並從資料庫模組之查詢結果透過該對話模組及電信網路或網際網路傳送至使用者終端設備上;b、一語音辨識模組,係處理對話模組所接收的目的地語音查詢訊息;c、一資料庫模組,係用以儲存從多模式輸出模組要求之查詢目的物經緯度座標相關資料;d、一多模式輸出模組,係將從資料庫模組之查詢結果利用不同媒體型式,透過對話模組傳送至使用者終端設備上;其中該語音辨識模組係包含一大字彙語音辨識單元,用以辨識使用者所輸入的語音信號,並將辨識結果轉換成文字訊息,且該大字彙語音辨識單元可以產生一個或多個辨識結果,該大字彙語音辨識單元產生多個辨識結果時,由使用者選擇一個正確辨識結果。 A system for querying latitude and longitude coordinates of a geographic information system by using a voice recognition input and a multi-mode output technology, comprising: a, a dialog module, receiving a destination query message transmitted by a remote user, sending the query message Handed over to the speech recognition module for processing, and the query result from the database module is transmitted to the user terminal device through the dialog module and the telecommunication network or the internet; b, a speech recognition module, which processes the dialog mode The destination voice query message received by the group; c. a database module for storing the latitude and longitude coordinate related information of the target object requested by the multi-mode output module; d, a multi-mode output module, The query result of the database module is transmitted to the user terminal device through the dialogue module by using different media types; wherein the voice recognition module includes a large vocabulary speech recognition unit for recognizing the voice signal input by the user. Converting the identification result into a text message, and the large vocabulary speech recognition unit can generate one or more recognition results, and the large vocabulary speech recognition When the identification unit generates multiple identification results, the user selects a correct identification result. 
如申請專利範圍第1項所述之利用語音辨識輸入及多模式輸出技術來查詢地理資訊系統經緯度座標的 系統,其中該對話模組依系統規模及容量,包含一部以上電腦設備。 The use of speech recognition input and multi-mode output technology to query the latitude and longitude coordinates of the geographic information system as described in claim 1 The system, wherein the dialog module includes more than one computer device depending on the size and capacity of the system. 如申請專利範圍第2項所述之利用語音辨識輸入及多模式輸出技術來查詢地理資訊系統經緯度座標的系統,其中該電腦設備,包含電話介面卡、網路介面卡以及使用者對話介面軟體程式。 A system for querying a latitude and longitude coordinate of a geographic information system using a voice recognition input and a multi-mode output technology as described in claim 2, wherein the computer device includes a telephone interface card, a network interface card, and a user interface software program. . 如申請專利範圍第3項所述之利用語音辨識輸入及多模式輸出技術來查詢地理資訊系統經緯度座標的系統,其中該電話介面卡的通訊協定需與交換機的通訊協定相匹配並信號可相互溝通,該電話介面卡中的軔體(firmware)程式並可辨識DTMF(Dual-Tone Multi-Frequency)信號內容。 The system for querying the latitude and longitude coordinates of the geographic information system by using the voice recognition input and the multi-mode output technology as described in claim 3, wherein the communication protocol of the telephone interface card needs to match the communication protocol of the switch and the signals can communicate with each other. The firmware program in the phone interface card can recognize the DTMF (Dual-Tone Multi-Frequency) signal content. 如申請專利範圍第2項所述之利用語音辨識輸入及多模式輸出技術來查詢地理資訊系統經緯度座標的系統,其中該對話模組包含多台電腦設備時,該電腦設備之間以區域網路(LAN)相互連結。 For example, the system for querying the latitude and longitude coordinates of a geographic information system using voice recognition input and multi-mode output technology as described in claim 2, wherein the dialog module includes multiple computer devices, the regional network between the computer devices (LAN) is connected to each other. 
如申請專利範圍第1項所述之利用語音辨識輸入及多模式輸出技術來查詢地理資訊系統經緯度座標的系統,其中該使用者用DTMF(Dual-Tone Multi-Frequency)按鍵或語音輸入方式,選擇出一個正確辨識結果。 The system for querying the latitude and longitude coordinates of the geographic information system by using the voice recognition input and the multi-mode output technology as described in claim 1, wherein the user selects a DTMF (Dual-Tone Multi-Frequency) button or a voice input method. A correct identification result is produced. 如申請專利範圍第1項所述之利用語音辨識輸入及多模式輸出技術來查詢地理資訊系統經緯度座標的系統,其中該資料庫模組係包含一資料庫單元,儲存並處理包含查詢目的地之住址、市話電話號碼、語音辨識文法及地理資訊系統經緯度座標欄位。 The system for querying latitude and longitude coordinates of a geographic information system by using a voice recognition input and a multi-mode output technology as described in claim 1, wherein the database module includes a database unit for storing and processing the query destination. Address, local phone number, voice recognition grammar and GIS latitude and longitude coordinates. 如申請專利範圍第7項所述之利用語音辨識輸入及多模式輸出技術來查詢地理資訊系統經緯度座標的系統,其中該資料庫模組之儲存裝置為本端之快閃記憶體(Flash)磁碟或儲存硬碟(Hard Disk)或一遠端伺服器(Server)。 The system for querying the latitude and longitude coordinates of the geographic information system by using the voice recognition input and the multi-mode output technology as described in claim 7 of the patent application scope, wherein the storage device of the database module is the flash memory of the local end (Flash) A disk or a Hard Disk or a remote server (Server). 如申請專利範圍第1項所述之利用語音辨識輸入及多模式輸出技術來查詢地理資訊系統經緯度座標的系統,其中該多模式輸出模組係包含一語音合成單元、一多媒體簡訊系統(MMS,Multimedia Message System)單元、以及一即時數據傳送單元。 The system for querying the latitude and longitude coordinates of the geographic information system by using the voice recognition input and the multi-mode output technology, as described in claim 1, wherein the multi-mode output module comprises a voice synthesis unit and a multimedia message system (MMS, The Multimedia Message System unit and an instant data transfer unit. 
10. The system for querying GIS latitude and longitude coordinates using speech recognition input and multi-mode output technology as described in claim 9, wherein the speech synthesis unit is used to announce the query result by voice.
11. The system for querying GIS latitude and longitude coordinates using speech recognition input and multi-mode output technology as described in claim 10, wherein the speech signal output by the speech synthesis unit is pre-recorded or generated by text-to-speech technology.
12. The system for querying GIS latitude and longitude coordinates using speech recognition input and multi-mode output technology as described in claim 9, wherein the Multimedia Message System unit can transmit, in the form of a text message, the address, name, local telephone number, and GIS latitude and longitude coordinate information of the query destination.
13. The system for querying GIS latitude and longitude coordinates using speech recognition input and multi-mode output technology as described in claim 9, wherein the Multimedia Message System (MMS) unit transmits, in the form of a multimedia message, the address, name, local telephone number, and GIS latitude and longitude coordinate information of the query destination.
14. The system for querying GIS latitude and longitude coordinates using speech recognition input and multi-mode output technology as described in claim 13, wherein the multimedia message carries the address, name, local telephone number, and GIS latitude and longitude coordinate information of the query destination as a message attachment.
15. The system for querying GIS latitude and longitude coordinates using speech recognition input and multi-mode output technology as described in claim 9, wherein the real-time data transmission unit transmits the address, name, local telephone number, and GIS latitude and longitude coordinate information of the query destination in IP packets.
16. The system for querying GIS latitude and longitude coordinates using speech recognition input and multi-mode output technology as described in claim 1, wherein the user terminal device is a mobile phone, a landline telephone, a personal digital assistant (PDA), or a personal computer.
17. A method for querying geographic information system (GIS) latitude and longitude coordinates using speech recognition input and multi-mode output technology, comprising the following steps:
a. transmitting, via a dialog module, a speech signal input by a remote user to the server side of the system for large-vocabulary speech recognition processing;
b. searching a database with the recognition result and outputting the name, address, local telephone number, and GIS latitude and longitude coordinate information of the query destination, wherein the database stores, for each destination (a business, institution, scenic spot, or residence), fields containing the address, local telephone number, speech recognition grammar, and GIS latitude and longitude coordinates;
c. transmitting the query result, via the dialog module, to the user's terminal device.
18. The method for querying GIS latitude and longitude coordinates using speech recognition input and multi-mode output technology as described in claim 17, wherein the content of the speech signal in step a is the name, address, local telephone number, and GIS latitude and longitude coordinate information of the destination to be queried.
19. The method for querying GIS latitude and longitude coordinates using speech recognition input and multi-mode output technology as described in claim 17, wherein the input speech signal in step a is a speech signal from a mobile phone or a landline telephone.
20. The method for querying GIS latitude and longitude coordinates using speech recognition input and multi-mode output technology as described in claim 17, wherein the speech signal input by the remote user in step a is transmitted to the server side of the system through a PSTN (Public Switched Telephone Network) or PLMN (Public Land Mobile Network) telecommunications network.
21. The method for querying GIS latitude and longitude coordinates using speech recognition input and multi-mode output technology as described in claim 17, wherein the speech recognition result in step b is used by a database unit to retrieve and output the name, address, local telephone number, and GIS latitude and longitude coordinate information of the query destination.
22. The method for querying GIS latitude and longitude coordinates using speech recognition input and multi-mode output technology as described in claim 17, wherein the user's terminal device in step c is a mobile phone, a landline telephone, a personal digital assistant, or a personal computer.
23. The method for querying GIS latitude and longitude coordinates using speech recognition input and multi-mode output technology as described in claim 17, wherein in step c the query result is transmitted to the user's terminal device as synthesized speech over the PSTN or PLMN telecommunications network.
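
As an illustration only, the selection, retrieval, and output steps recited in the claims above might be sketched in Python as follows. This is a minimal sketch under stated assumptions, not the patented implementation: the speech recognizer is replaced by a fixed candidate list, the record layout, function names, and output formats are hypothetical, and the sample destinations, telephone numbers, and coordinates are invented.

```python
import json
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Destination:
    """One database record holding the fields named in the claims (layout assumed)."""
    name: str
    address: str
    phone: str         # local (landline) telephone number
    latitude: float    # GIS latitude in decimal degrees (assumed representation)
    longitude: float   # GIS longitude in decimal degrees (assumed representation)


# Hypothetical sample data: every value below is invented for illustration.
DATABASE = {
    "example bookstore": Destination(
        "Example Bookstore", "1 Sample Road, Taipei", "02-0000-0000", 25.0330, 121.5654),
    "example museum": Destination(
        "Example Museum", "2 Sample Road, Taipei", "02-0000-0001", 25.1024, 121.5485),
}


def select_candidate(candidates: list[str], dtmf_digit: str) -> Optional[str]:
    """Pick one recognition candidate from a DTMF key press ("1" selects the first)."""
    if len(dtmf_digit) != 1 or dtmf_digit not in "123456789":
        return None
    index = int(dtmf_digit) - 1
    return candidates[index] if index < len(candidates) else None


def lookup(name: str) -> Optional[Destination]:
    """Step b: retrieve the record for a recognized destination name."""
    return DATABASE.get(name.strip().lower())


def render(dest: Destination, mode: str) -> str:
    """Step c: format the query result for one output mode."""
    coords = f"{dest.latitude:.4f},{dest.longitude:.4f}"
    if mode == "voice":  # text that would be handed to a speech synthesizer
        return (f"{dest.name}, {dest.address}, telephone {dest.phone}, "
                f"latitude {dest.latitude:.4f}, longitude {dest.longitude:.4f}")
    if mode == "sms":    # body of a text message
        return f"{dest.name}|{dest.address}|{dest.phone}|{coords}"
    if mode == "data":   # payload that would be carried in IP packets
        return json.dumps({"name": dest.name, "address": dest.address,
                           "phone": dest.phone, "lat": dest.latitude,
                           "lon": dest.longitude})
    raise ValueError(f"unknown output mode: {mode}")


if __name__ == "__main__":
    candidates = ["example museum", "example bookstore"]  # stand-in for recognizer output
    chosen = select_candidate(candidates, "2")
    record = lookup(chosen) if chosen else None
    if record is not None:
        print(render(record, "sms"))
```

Keeping the output formatting in a single function, separate from retrieval, mirrors the separation the claims draw between the database module and the multi-mode output module; the three mode names are placeholders for the speech synthesis, message, and real-time data transmission units.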
TW96125945A 2007-07-17 2007-07-17 The use of voice recognition input and multi-mode output technology to query the geographic information system latitude and longitude coordinates of the system TWI434024B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
TW96125945A TWI434024B (en) 2007-07-17 2007-07-17 The use of voice recognition input and multi-mode output technology to query the geographic information system latitude and longitude coordinates of the system

Publications (2)

Publication Number Publication Date
TW200905167A TW200905167A (en) 2009-02-01
TWI434024B true TWI434024B (en) 2014-04-11

Family

ID=44722628

Country Status (1)

Country Link
TW (1) TWI434024B (en)

Legal Events

Date Code Title Description
MM4A Annulment or lapse of patent due to non-payment of fees