WO2007070013A1 - A method and apparatus for accessing a digital file from a collection of digital files - Google Patents
A method and apparatus for accessing a digital file from a collection of digital files Download PDFInfo
- Publication number
- WO2007070013A1 WO2007070013A1 PCT/SG2006/000384 SG2006000384W WO2007070013A1 WO 2007070013 A1 WO2007070013 A1 WO 2007070013A1 SG 2006000384 W SG2006000384 W SG 2006000384W WO 2007070013 A1 WO2007070013 A1 WO 2007070013A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- file
- language
- information
- speech input
- speech
- Prior art date
Links
- 238000000034 method Methods 0.000 title claims abstract description 23
- 230000001419 dependent effect Effects 0.000 claims description 3
- 238000012545 processing Methods 0.000 description 6
- 230000006870 function Effects 0.000 description 3
- 238000004519 manufacturing process Methods 0.000 description 3
- 230000000881 depressing effect Effects 0.000 description 2
- 230000000994 depressogenic effect Effects 0.000 description 2
- 238000005516 engineering process Methods 0.000 description 2
- 238000013519 translation Methods 0.000 description 2
- 230000014616 translation Effects 0.000 description 2
- 238000004891 communication Methods 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 238000001914 filtration Methods 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- 230000005055 memory storage Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000002360 preparation method Methods 0.000 description 1
- 230000005236 sound signal Effects 0.000 description 1
- 230000000007 visual effect Effects 0.000 description 1
Classifications
-
- G—PHYSICS
- G11—INFORMATION STORAGE
- G11B—INFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
- G11B27/00—Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
- G11B27/10—Indexing; Addressing; Timing or synchronising; Measuring tape travel
- G11B27/102—Programmed access in sequence to addressed parts of tracks of operating record carriers
- G11B27/105—Programmed access in sequence to addressed parts of tracks of operating record carriers of operating discs
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/40—Information retrieval; Database structures therefor; File system structures therefor of multimedia data, e.g. slideshows comprising image and additional audio data
- G06F16/43—Querying
- G06F16/432—Query formulation
- G06F16/433—Query formulation using audio data
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/263—Language identification
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/26—Speech to text systems
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
Definitions
- This invention relates to a method and apparatus for accessing a digital file from a collection of digital files, and particularly relates to the accessing of files using speech input.
- Such “smart” devices do not have multiple language recognition capabilities at this moment. As such, the makers of such devices are required to make different versions of the same product for markets with language capabilities other than English, and this inadvertently increases the cost of manufacturing each device, since either a dedicated production line/facility is required, or a production line/facility for the English version needs to be modified as and when required to produce the other versions.
- a method for accessing at least one digital file from a collection comprising one than one digital file in an electronic device including: generating one index comprising of information entries obtained from each of the more than one digital file in the collection, with each digital file in the collection information being linked to at least one information entry; receiving a speaker independent speech input in at least one language during a speech reception mode; determining a language of the speech input; and setting the speech reception mode to the language of the speech input; comparing the speech input received during the speech reception mode with the entries in the index.
- the file may advantageously be accessed when the speech input coincides with at least one of the information entries in the index.
- the digital files may be stored in the electronic device, any device functionally connected to the electronic device or a combination of the aforementioned.
- the at least one digital file may be received from a source selected from: a memory device, a wired computer network or a wireless computer network.
- the digital file may be of the type such as documents, spreadsheets, playlists, folders, music files, image files and video files.
- the information entry comprises at least one word and obtains information from the digital file such as, for example, file name, file extension, song title from file metadata, artiste name from file metadata, truncated song title from file metadata, truncated artiste name from file metadata, translated song title or alternative song title.
- the information entry may be in any language.
- the speech input may be either in one language or a phrase of at least one language.
- the speech reception mode may be set either manually or automatically.
- the electronic device may be a desktop computer, a notebook computer, a PDA, a portable media player, or a mobile phone.
- the facility of accessing at least one digital file in the electronic device may be by depressing a pre-determined button at least once.
- an apparatus for accessing at least one digital file from a collection comprising more than one digital file stored within the apparatus.
- the apparatus includes: an indexer for generating an index comprising of information entries obtained from each of the more than one digital file in the collection, with each digital file in the collection information being linked to at least one information entry; a speech reception means for receiving a speaker independent speech input in at least one language during a speech reception mode; a processor to determine a language of the speech input; and the processor being able to compare the speech input received during the speech reception mode with the entries in the index.
- the file is accessed when the speech input coincides with at least one of the information entries in the index.
- the apparatus may be selected from the group comprising: desktop computer, notebook computer, PDA, portable media player and mobile phone.
- the speech reception means is a microphone.
- the language of the speech input may be selected either automatically or manually.
- the speech input may be in one language or a phrase of at least one language.
- the information entries may preferably comprise at least one word in any language.
- the information entry may obtains information from the digital file such as, for example, file name, file extension, song title from file metadata, artiste name from file metadata, truncated song title from file metadata, truncated artiste name from file metadata, translated song title and alternative song title.
- the apparatus may including at least one button to activate a facility to access a digital file by depressing the at least one button at least once.
- the apparatus may preferably include a display.
- Figure 1 shows a flow chart of a process of a preferred embodiment of the present invention.
- Figure 2 shows a schematic diagram of an apparatus of a preferred embodiment of the present invention.
- Figure 3 shows an enlargement of the speech processing process 36 in Figure 1.
- program modules include routines, programs, characters, components, data structures, that perform particular tasks or implement particular abstract data types.
- program modules include routines, programs, characters, components, data structures, that perform particular tasks or implement particular abstract data types.
- program modules may be practiced with other computer system configurations, including hand-held devices, multiprocessor systems, microprocessor-based or programmable consumer electronics, network PCs, minicomputers, mainframe computers, and the like.
- the invention may also be practiced in distributed computing environments where tasks are performed by remote processing devices that are linked through a communications network.
- program modules may be located in both local and remote memory storage devices.
- the electronic device may be for example, a desktop computer, a notebook computer, a PDA, a portable media player, or a mobile phone.
- the digital files in the collection may include: documents, spreadsheets, playlists, folders, music files, and video files.
- the digital files stored in the collection are media files (image, music and video files).
- the at least one digital file may be received from a source such as, for example, a memory device, a wired computer network or a wireless computer network.
- the collection of digital files may reside in the memory device in the electronic device or a memory device that is connectable to the electronic device.
- the memory devices may be non-volatile memory and may be either flash memory or a hard disk drive.
- a facility to enable a speech reception mode is activated 20 in the electronic device.
- the electronic device may have a display showing a menu from which this facility is selectable, or the device may have a shortcut switch/button that is depressed at least once to activate the speech reception mode.
- a user may be able to manually select a specific language or multiple languages for speech input 22. This aids the device in processing the speech input.
- each dialect of a particular language such as, for example, the Chinese dialects of Cantonese, Teochew and Hokkien among others is considered to be a different language.
- a system for accessing a digital file from a collection in an electronic device is initialised 24 in preparation of incoming speech inputs for accessing the files in the collection.
- the information extracted 25 and indexed from each file may include at least one of the following: file name, file extension, song title from file metadata, artiste name from file metadata, truncated song title from file metadata, truncated artiste name from file metadata, and alternative song title.
- the aforementioned information may also be obtained from alternative sources 29, such as, for example, the internet or a host if the electronic device is connected to the alternative source.
- Each information entry should comprise at least one word.
- the extracted information may be in any language and need not be Anglo alphanumeric alphabet based.
- each digital file has a plurality of information entries in the information index so as to enable the file to be accessed via various paths such as, for example, by artiste name, by song title, by file name and so forth. In the case where fewer files have been detected, the information entries of non-existent files are removed when creating the index.
- a user may also give a particular song an alternative title and this alternative title may also be included in the index.
- a character codeset identification function 27 analyzes the information of each media file and identifies the codeset or codesets used in each file.
- An index of all the information entries from each digital file in the electronic device and any functionally connected memory device together with the character codeset information is then formed in the electronic device 30, and subsequent to the building of the index, the index is loaded in the electronic device 32 such that all the information entries in the index are accessible.
- the information index may also be loaded 32 after confirming the existence of an information index 26, if no new digital files have been detected and if no digital files are deleted from when the information index was built.
- duration of time required for the aforementioned steps is dependent on data processing speed, memory I/O speed and network/remote server latency. It is apparent that the greater the digital files, the longer the duration required for the aforementioned steps due to the volume of data to be processed.
- the electronic device is ready to receive a speech input.
- the electronic device may either sound an audible alert or show a visual alert to prompt a user that it is ready to receive speech input in a sound reception mode.
- the speech input is speaker independent. No pre-recording is required and the electronic device is basically "pick-and-use". Speech processing in the method may be sufficiently robust to be able to distinguish the speech input in spite of any particularly strong accents or mumbling.
- the speech is input into the electronic device 34.
- the speech input may be in one language.
- the speech input may also be a phrase comprising more than one language. For example, a song title like " ⁇ ⁇ c flower” may be acceptable and able to be processed.
- For digital files with translated titles in their metadata use of either the original or translated title allows access to the same digital file. For example, "Sl'FSS" or "No Reserve In Love" allows access to the same digital file.
- the speech is processed 36. If language selection was not done 22 earlier manually, the language of the speech input is determined and the appropriate speech reception mode correlating to the language of the speech input is automatically set. If the language selection is set manually, then a language model specified by the user will be loaded correspondingly. This allows for an accurate determination of the speech input. Referring to Figure 3, there is described the sequence that the speech is processed automatically.
- the media header information 361 obtained in 25, the character codeset 362 obtained in 27 and media information 363 gathered from remote sources in 29 are entered into a language recognition identification function 364 to enable the most appropriate speech recognition language model(s) to be loaded 365.
- the language recognition identification function determines that the codesets used in the media files are ASCII and GB while the country of origins are the United States of America (USA) and the Peoples' Republic of China (PRC), both the USA English language model and the PRC Putonghua language model will be loaded for voice recognition.
- the speech input is further "filtered” 366 where meaningful media information such as song titles, artist and album is extracted from the speech input and provided to a speech recognizer as subjects for speech recognition. For example, with a speech input of "Play H M. ⁇ ft U? by Sharon Lau", "U)H 5 F ft Hf " will be extracted as song title information while “Sharon Lau” will be intelligently extracted as artiste information. This extracted information is then added to the speech recognition pool 367. Filtering is also done on the speech input to determine the entries into the recognised speech pool when the manual selection of language 22 is done.
- the input is compared with the information entries in the index 38.
- the digital file(s) linked to the information entry(s) are displayed 40 for the user's selection.
- the digital file(s) shown may be a result list and the user may be able to select a desired song 42, a desired playlist 44 or songs from a desired artiste 46. These options are merely for illustrative purpose and are not limiting.
- an apparatus 50 for accessing at least one digital file from a collection comprising more than one digital file stored within the apparatus 50.
- the apparatus 50 may be a device such as, for example, a desktop computer, a notebook computer, a PDA, a portable media player, or a mobile phone.
- the digital files may be files such as, for example, documents, spreadsheets, playlists, folders, music files, or video files.
- the at least one digital file may be received from a source such as, for example, a memory device, a wired computer network, or a wireless computer network.
- the collection of digital files may reside in a memory device 58 included in the apparatus 50 or the digital files may reside in a separate memory device that may be connectable to the apparatus 50.
- the memory device may be non-volatile memory and may be either flash memory or a hard disk drive.
- the apparatus 50 may have a display 54 showing a menu that allows this facility to be enabled, or the apparatus 50 may have a shortcut switch/button (not shown) that is depressed at least once to activate the facility.
- the apparatus 50 may have a housing 52 to contain its various components.
- the apparatus 50 may have a display 54 for displaying information of the apparatus 50, including information about the files stored in the apparatus 50 or accessible to the apparatus 50.
- There may be an indexer 56 for generating an index comprising of information entries obtained from each of the more than one digital file in the collection.
- Each digital file in the collection information may be linked to at least one information entry.
- the information entry may comprise at least one word and may be in any langauge.
- the information extracted and indexed from each file may include at least one of the following: file name, file extension, song title from file metadata, artiste name from file metadata, truncated song title from file metadata, truncated artiste name from file metadata, truncated song title and alternative song title.
- Each information entry should comprise at least one word.
- the extracted information may be in any language and need not be Anglo alphanumeric alphabet based.
- the various forms of Chinese characters (simplified and traditional), various forms of Japanese characters (kanji, hiragana and katakana), Korean characters, Islamic characters and the like may all be extractable.
- Transliteration of the aforementioned non-English characters into English may also be stored in the information index.
- Translations of the aforementioned non-English characters to English may also be stored in the information index if such information is found in the file metadata. It may be possible that each digital file has a plurality of information entries in the information index so as to enable the file to be accessed via various paths such as, for example, by artiste name, by song title, by file name and so forth. The user may also give a particular song an alternative title and this alternative title may also be included in the index.
- the apparatus 50 may include a speech reception means 60 for receiving a speech input in at least one language during a speech reception mode.
- the speech reception means may be a microphone or any other device that allows for the input of audio signals.
- the speech reception means 60 passes on speech input to a processor 62.
- the speech input may be in one language.
- the speech input may also be a phrase comprising more than one language. For example, a song title like " ⁇ deflower" may be understood.
- use of either title allows access to the same digital file. For example, "it TE. ⁇ U. ⁇ ? " or "No Reserve in Love” allows access to the same digital file.
- the processor 62 may be able to determine a language of the speech input automatically.
- the apparatus 50 may also be able to manually set the language of the speech input such that the processor 62 does not need to carry out the task automatically.
- the processor 62 may also be used to compare the speech input received during the speech reception mode with the entries in the index.
- the speech input is speaker independent. No pre-recording is required and the apparatus 50 is basically "pick-and-use".
- the speech recognition module in the apparatus 50 may be sufficiently robust to be able to distinguish the speech input in spite of any particularly strong accents and mumbling.
- the digital file(s) linked to the information entry(s) are shown on the display 54 for the user's selection.
- the digital file(s) shown may be a result list and the user may be able to select a desired song, a desired playlist or songs from a desired artiste. These options are merely for illustrative purpose and are not limiting.
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- Multimedia (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Artificial Intelligence (AREA)
- Acoustics & Sound (AREA)
- Human Computer Interaction (AREA)
- General Health & Medical Sciences (AREA)
- Mathematical Physics (AREA)
- Data Mining & Analysis (AREA)
- Databases & Information Systems (AREA)
- Machine Translation (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Indexing, Searching, Synchronizing, And The Amount Of Synchronization Travel Of Record Carriers (AREA)
- Management Or Editing Of Information On Record Carriers (AREA)
Abstract
Description
Claims
Priority Applications (7)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
NZ569291A NZ569291A (en) | 2005-12-12 | 2006-12-11 | A method and apparatus for accessing a digital speech file from a collection of digital files |
CA002633505A CA2633505A1 (en) | 2005-12-12 | 2006-12-11 | A method and apparatus for accessing a digital file from a collection of digital files |
BRPI0619607-1A BRPI0619607A2 (en) | 2005-12-12 | 2006-12-11 | method and apparatus for accessing a digital file from a set of digital files |
AU2006325555A AU2006325555B2 (en) | 2005-12-12 | 2006-12-11 | A method and apparatus for accessing a digital file from a collection of digital files |
EP06835979A EP1969590A4 (en) | 2005-12-12 | 2006-12-11 | A method and apparatus for accessing a digital file from a collection of digital files |
JP2008545547A JP2009519538A (en) | 2005-12-12 | 2006-12-11 | Method and apparatus for accessing a digital file from a collection of digital files |
NO20083087A NO20083087L (en) | 2005-12-12 | 2008-07-09 | Method and apparatus for accessing a digital file from a collection of digital files |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
SG200508000-7 | 2005-12-12 | ||
SG200508000-7A SG133419A1 (en) | 2005-12-12 | 2005-12-12 | A method and apparatus for accessing a digital file from a collection of digital files |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2007070013A1 true WO2007070013A1 (en) | 2007-06-21 |
Family
ID=38140537
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/SG2006/000384 WO2007070013A1 (en) | 2005-12-12 | 2006-12-11 | A method and apparatus for accessing a digital file from a collection of digital files |
Country Status (15)
Country | Link |
---|---|
US (1) | US8015013B2 (en) |
EP (1) | EP1969590A4 (en) |
JP (1) | JP2009519538A (en) |
KR (1) | KR20080083290A (en) |
CN (1) | CN101341531A (en) |
AU (1) | AU2006325555B2 (en) |
BR (1) | BRPI0619607A2 (en) |
CA (1) | CA2633505A1 (en) |
NO (1) | NO20083087L (en) |
NZ (1) | NZ569291A (en) |
RU (1) | RU2008128440A (en) |
SG (1) | SG133419A1 (en) |
TW (1) | TW200805251A (en) |
WO (1) | WO2007070013A1 (en) |
ZA (1) | ZA200805567B (en) |
Families Citing this family (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20100197255A1 (en) * | 2009-02-05 | 2010-08-05 | Panasonic Automotive Systems Company Of America, Division Of Panasonic Corporation Of North America | Method and apparatus for dynamic station preset configuration in a radio |
US10140320B2 (en) | 2011-02-28 | 2018-11-27 | Sdl Inc. | Systems, methods, and media for generating analytical data |
US20120221319A1 (en) * | 2011-02-28 | 2012-08-30 | Andrew Trese | Systems, Methods and Media for Translating Informational Content |
US20120284276A1 (en) * | 2011-05-02 | 2012-11-08 | Barry Fernando | Access to Annotated Digital File Via a Network |
US8983963B2 (en) * | 2011-07-07 | 2015-03-17 | Software Ag | Techniques for comparing and clustering documents |
US9984054B2 (en) | 2011-08-24 | 2018-05-29 | Sdl Inc. | Web interface including the review and manipulation of a web document and utilizing permission based control |
KR102081925B1 (en) * | 2012-08-29 | 2020-02-26 | 엘지전자 주식회사 | display device and speech search method thereof |
US9916306B2 (en) | 2012-10-19 | 2018-03-13 | Sdl Inc. | Statistical linguistic analysis of source content |
KR102115397B1 (en) | 2013-04-01 | 2020-05-26 | 삼성전자주식회사 | Portable apparatus and method for displaying a playlist |
Citations (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20020184003A1 (en) * | 2001-03-28 | 2002-12-05 | Juha Hakkinen | Determining language for character sequence |
US20020193989A1 (en) * | 1999-05-21 | 2002-12-19 | Michael Geilhufe | Method and apparatus for identifying voice controlled devices |
US20030050779A1 (en) * | 2001-08-31 | 2003-03-13 | Soren Riis | Method and system for speech recognition |
US20030177013A1 (en) * | 2002-02-04 | 2003-09-18 | Falcon Stephen Russell | Speech controls for use with a speech system |
US20040249635A1 (en) * | 1999-11-12 | 2004-12-09 | Bennett Ian M. | Method for processing speech signal features for streaming transport |
US20050033575A1 (en) * | 2002-01-17 | 2005-02-10 | Tobias Schneider | Operating method for an automated language recognizer intended for the speaker-independent language recognition of words in different languages and automated language recognizer |
US20060149548A1 (en) * | 2004-12-31 | 2006-07-06 | Delta Electronics, Inc. | Speech input method and system for portable device |
US20060206331A1 (en) * | 2005-02-21 | 2006-09-14 | Marcus Hennecke | Multilingual speech recognition |
Family Cites Families (17)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4833714A (en) * | 1983-09-30 | 1989-05-23 | Mitsubishi Denki Kabushiki Kaisha | Speech recognition apparatus |
JPH0594512A (en) * | 1991-10-02 | 1993-04-16 | Kobe Nippon Denki Software Kk | Electronic filing device |
CA2115088A1 (en) | 1993-02-08 | 1994-08-09 | David Michael Boyle | Multi-lingual voice response unit |
CA2091658A1 (en) * | 1993-03-15 | 1994-09-16 | Matthew Lennig | Method and apparatus for automation of directory assistance using speech recognition |
US6081774A (en) * | 1997-08-22 | 2000-06-27 | Novell, Inc. | Natural language information retrieval system and method |
JP4036528B2 (en) * | 1998-04-27 | 2008-01-23 | 富士通株式会社 | Semantic recognition system |
JP4292646B2 (en) * | 1999-09-16 | 2009-07-08 | 株式会社デンソー | User interface device, navigation system, information processing device, and recording medium |
JP2001285759A (en) * | 2000-03-28 | 2001-10-12 | Pioneer Electronic Corp | Av information processor and information recording medium having program for av information processing computer readably recorded thereon |
US20020099533A1 (en) * | 2001-01-23 | 2002-07-25 | Evan Jaqua | Data processing system for searching and communication |
US6952691B2 (en) * | 2002-02-01 | 2005-10-04 | International Business Machines Corporation | Method and system for searching a multi-lingual database |
US6907397B2 (en) * | 2002-09-16 | 2005-06-14 | Matsushita Electric Industrial Co., Ltd. | System and method of media file access and retrieval using speech recognition |
US7046984B2 (en) * | 2002-11-28 | 2006-05-16 | Inventec Appliances Corp. | Method for retrieving vocabulary entries in a mobile phone |
US7321852B2 (en) * | 2003-10-28 | 2008-01-22 | International Business Machines Corporation | System and method for transcribing audio files of various languages |
US7725318B2 (en) * | 2004-07-30 | 2010-05-25 | Nice Systems Inc. | System and method for improving the accuracy of audio searching |
US7711542B2 (en) * | 2004-08-31 | 2010-05-04 | Research In Motion Limited | System and method for multilanguage text input in a handheld electronic device |
US7376648B2 (en) * | 2004-10-20 | 2008-05-20 | Oracle International Corporation | Computer-implemented methods and systems for entering and searching for non-Roman-alphabet characters and related search systems |
US7840399B2 (en) * | 2005-04-07 | 2010-11-23 | Nokia Corporation | Method, device, and computer program product for multi-lingual speech recognition |
-
2005
- 2005-12-12 SG SG200508000-7A patent/SG133419A1/en unknown
-
2006
- 2006-12-11 KR KR1020087015673A patent/KR20080083290A/en not_active Application Discontinuation
- 2006-12-11 WO PCT/SG2006/000384 patent/WO2007070013A1/en active Application Filing
- 2006-12-11 RU RU2008128440/09A patent/RU2008128440A/en not_active Application Discontinuation
- 2006-12-11 CN CNA2006800468015A patent/CN101341531A/en active Pending
- 2006-12-11 JP JP2008545547A patent/JP2009519538A/en active Pending
- 2006-12-11 EP EP06835979A patent/EP1969590A4/en not_active Ceased
- 2006-12-11 BR BRPI0619607-1A patent/BRPI0619607A2/en not_active IP Right Cessation
- 2006-12-11 AU AU2006325555A patent/AU2006325555B2/en not_active Ceased
- 2006-12-11 NZ NZ569291A patent/NZ569291A/en not_active IP Right Cessation
- 2006-12-11 CA CA002633505A patent/CA2633505A1/en not_active Abandoned
- 2006-12-11 US US11/637,357 patent/US8015013B2/en active Active
- 2006-12-12 TW TW095146399A patent/TW200805251A/en unknown
-
2008
- 2008-06-25 ZA ZA200805567A patent/ZA200805567B/en unknown
- 2008-07-09 NO NO20083087A patent/NO20083087L/en not_active Application Discontinuation
Patent Citations (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20020193989A1 (en) * | 1999-05-21 | 2002-12-19 | Michael Geilhufe | Method and apparatus for identifying voice controlled devices |
US20040249635A1 (en) * | 1999-11-12 | 2004-12-09 | Bennett Ian M. | Method for processing speech signal features for streaming transport |
US20020184003A1 (en) * | 2001-03-28 | 2002-12-05 | Juha Hakkinen | Determining language for character sequence |
US20030050779A1 (en) * | 2001-08-31 | 2003-03-13 | Soren Riis | Method and system for speech recognition |
US20050033575A1 (en) * | 2002-01-17 | 2005-02-10 | Tobias Schneider | Operating method for an automated language recognizer intended for the speaker-independent language recognition of words in different languages and automated language recognizer |
US20030177013A1 (en) * | 2002-02-04 | 2003-09-18 | Falcon Stephen Russell | Speech controls for use with a speech system |
US20060149548A1 (en) * | 2004-12-31 | 2006-07-06 | Delta Electronics, Inc. | Speech input method and system for portable device |
US20060206331A1 (en) * | 2005-02-21 | 2006-09-14 | Marcus Hennecke | Multilingual speech recognition |
Non-Patent Citations (5)
Title |
---|
BAUMANN S. ET AL.: "Super-convenience for Non-musicians: Querying MP3 and the Semantic Web", PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON MUSIC INFORMATION RETRIEVAL, 2002, XP003014548 * |
SCHULZ C.H. ET AL.: "A Spoken Language Front-end for a Multilingual Music Data Base", PROCEEDINGS OF THE BERLINER XML-TAGE, 13 March 2004 (2004-03-13), XP003014547, Retrieved from the Internet <URL:http://www.dfki.de/~romanell/BerlinerXMLTage2004.pdf> * |
See also references of EP1969590A4 * |
SIE H-W. ET AL.: "A multilingual automatic speech recognition (ASR) engine embedded on personal digital assistant (PDA)", 9TH INTERNATIONAL WORKSHOP ON CELLULAR NEURAL NETWORKS AND THEIR APPLICATIONS, 28 May 2005 (2005-05-28) - 30 May 2005 (2005-05-30), pages 174 - 177, XP010855292 * |
WANG Y-F.H. ET AL.: "Speech-controlled Media File Selection on Embedded Systems", 6TH SIGDIAL WORKSHOP ON DISCOURSE AND DIALOGUE, 2 September 2005 (2005-09-02) - 3 September 2005 (2005-09-03), XP002417506 * |
Also Published As
Publication number | Publication date |
---|---|
RU2008128440A (en) | 2010-01-20 |
JP2009519538A (en) | 2009-05-14 |
CN101341531A (en) | 2009-01-07 |
CA2633505A1 (en) | 2007-06-21 |
US8015013B2 (en) | 2011-09-06 |
EP1969590A4 (en) | 2010-01-06 |
EP1969590A1 (en) | 2008-09-17 |
US20070136065A1 (en) | 2007-06-14 |
TW200805251A (en) | 2008-01-16 |
SG133419A1 (en) | 2007-07-30 |
NZ569291A (en) | 2010-03-26 |
AU2006325555B2 (en) | 2012-03-08 |
AU2006325555A1 (en) | 2007-06-21 |
BRPI0619607A2 (en) | 2011-10-11 |
KR20080083290A (en) | 2008-09-17 |
ZA200805567B (en) | 2009-06-24 |
NO20083087L (en) | 2008-09-01 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
AU2006325555B2 (en) | A method and apparatus for accessing a digital file from a collection of digital files | |
US7870142B2 (en) | Text to grammar enhancements for media files | |
US8355919B2 (en) | Systems and methods for text normalization for text to speech synthesis | |
US8396714B2 (en) | Systems and methods for concatenation of words in text to speech synthesis | |
US8583418B2 (en) | Systems and methods of detecting language and natural language strings for text to speech synthesis | |
US8712776B2 (en) | Systems and methods for selective text to speech synthesis | |
EP2005319B1 (en) | System and method for extraction of meta data from a digital media storage device for media selection in a vehicle | |
KR101586890B1 (en) | Input processing method and apparatus | |
US20100082344A1 (en) | Systems and methods for selective rate of speech and speech preferences for text to speech synthesis | |
US20060143007A1 (en) | User interaction with voice information services | |
US20100082327A1 (en) | Systems and methods for mapping phonemes for text to speech synthesis | |
KR20140047633A (en) | Speech recognition repair using contextual information | |
KR20080000203A (en) | Method for searching music file using voice recognition | |
US20080162472A1 (en) | Method and apparatus for voice searching in a mobile communication device | |
WO2010036486A2 (en) | Systems and methods for speech preprocessing in text to speech synthesis | |
CN103384290A (en) | Mobile terminal with positioning and navigation functions and fast positioning and navigation method of mobile terminal | |
US8484582B2 (en) | Entry selection from long entry lists | |
CN101415259A (en) | System and method for searching information of embedded equipment based on double-language voice enquiry | |
US20080243281A1 (en) | Portable device and associated software to enable voice-controlled navigation of a digital audio player | |
CN114297143A (en) | File searching method, file displaying device and mobile terminal | |
JP7297266B2 (en) | SEARCH SUPPORT SERVER, SEARCH SUPPORT METHOD, AND COMPUTER PROGRAM | |
KR20050071237A (en) | Image searching apparatus and method using voice recognition technic | |
KR20090062548A (en) | Method for searching contents and mobile communication terminal using the same | |
KR20090054616A (en) | Search method of a voice terminal using guide word | |
WO2005055077A2 (en) | Predictive input |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
WWE | Wipo information: entry into national phase |
Ref document number: 200680046801.5 Country of ref document: CN |
|
121 | Ep: the epo has been informed by wipo that ep was designated in this application | ||
DPE1 | Request for preliminary examination filed after expiration of 19th month from priority date (pct application filed from 20040101) | ||
WWE | Wipo information: entry into national phase |
Ref document number: 2633505 Country of ref document: CA Ref document number: 2008545547 Country of ref document: JP |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2006325555 Country of ref document: AU |
|
WWE | Wipo information: entry into national phase |
Ref document number: 569291 Country of ref document: NZ |
|
WWE | Wipo information: entry into national phase |
Ref document number: 1020087015673 Country of ref document: KR |
|
WWE | Wipo information: entry into national phase |
Ref document number: 5845/DELNP/2008 Country of ref document: IN |
|
ENP | Entry into the national phase |
Ref document number: 2006325555 Country of ref document: AU Date of ref document: 20061211 Kind code of ref document: A |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2006835979 Country of ref document: EP |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2008128440 Country of ref document: RU |
|
ENP | Entry into the national phase |
Ref document number: PI0619607 Country of ref document: BR Kind code of ref document: A2 Effective date: 20080612 |