WO2007079357A2 - Voice controlled portable memory storage device - Google Patents

Voice controlled portable memory storage device

Info

Publication number
WO2007079357A2
Authority
WO
WIPO (PCT)
Prior art keywords
user
template
voice
access
portable memory
Prior art date
Application number
PCT/US2006/062336
Other languages
French (fr)
Other versions
WO2007079357A3 (en
Inventor
Kevin M. Conley
Original Assignee
Sandisk Corporation
Priority date
Filing date
Publication date
Priority claimed from US11/314,476 external-priority patent/US20070143117A1/en
Priority claimed from US11/313,841 external-priority patent/US20070143111A1/en
Application filed by Sandisk Corporation filed Critical Sandisk Corporation
Publication of WO2007079357A2 publication Critical patent/WO2007079357A2/en
Publication of WO2007079357A3 publication Critical patent/WO2007079357A3/en

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/22 Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/26 Speech to text systems

Definitions

  • the present invention relates to portable devices, and more particularly, to voice activated and controlled, portable non-volatile memory storage devices.
  • Non-volatile semiconductor memory devices such as flash memory storage drives are commonly used to store digital information in various applications, for example, digital cameras, cell phones, MP3 or other audio/video players, notebook computers, desktop computers and other applications. These memory devices are small, portable, and reliable with a large capacity to store data.
  • the memory devices can be connected to the foregoing using standard interfaces, for example, the Universal Serial Bus (USB) port or an IEEE 1394 (“Firewire”) port.
  • USB Universal Serial Bus
  • IEEE 1394 IEEE 1394
  • biometric parameters like, fingerprints
  • the biometric solution has shortcomings as well. For example, fingerprints can change over time or become unrecognizable. Also, fingerprint sensors are complex, sometimes unreliable, and expensive.
  • Portable devices for example, an MP3 player or any other type of audio/video player
  • buttons to control various functions.
  • portable audio/video players use plural buttons for recording, playback and other functions. These buttons are expensive and occupy real estate on portable devices that are small in size to begin with. The buttons are inconvenient to use, for example, at night or while exercising.
  • a portable memory device (“device”)
  • the device includes a microphone for receiving a voice command from a user; and a device controller that creates a voice based template for the voice command and stores the voice based template in a plurality of non-volatile memory cells, wherein the voice based template is associated with one or more button control actions entered by the user for certain device functionality.
  • a portable memory device (“device”) is provided. The device includes a microphone for receiving a voice command with a file name from a user; and a device controller that creates a phonetic pattern for the received file name and compares the received file name phonetic pattern with a phonetic pattern for files stored in the memory device.
  • a portable memory storage device (“device”) includes a microphone for receiving a user voice input; a controller that receives the voice input and creates a first template; and a plurality of non-volatile memory cells for storing the first template, wherein the template is used to authenticate the user for any subsequent user request for accessing the device and an application is launched when the device interfaces with a host system to enroll the user as an authorized user to access device functionality and access host system functionality and the controller creates a second template for a voice command and stores the second template in a plurality of non-volatile memory cells, wherein the second template is associated with one or more button control actions entered by the user for certain device functionality. [0014] In another aspect, a method for a portable memory device is provided.
  • the method includes, recording a keyword and creating a voice based template for the keyword, wherein a processor for the portable memory device creates the voice based template and stores the voice based template in non-volatile memory cells; prompting the user to capture button control actions related to a portable memory device functionality; and associating the button control actions to the voice based template.
  • a method for a portable memory device includes, receiving a voice command from a user with a file name; creating a phonetic pattern for the received file name; comparing the phonetic pattern for the received file name with phonetic patterns for file names for files stored in the memory device; and executing a function received with the voice command if the phonetic pattern for the received file name matches with a phonetic pattern for file names for files stored in the memory device.
  • a method for a portable memory storage device is provided.
  • the method includes, prompting a user to capture button control actions related to the portable memory device functionality; associating the button control actions to a pre-loaded voice command that is stored in a plurality of non-volatile memory cells; and executing a function when a user issues a voice command.
  • Figure 1A shows a top-level block diagram of a portable memory device coupled to a host system, according to one aspect of the present invention
  • Figure 1B shows a block diagram of the internal architecture of the host system in Figure 1A;
  • Figure 1C shows a block diagram of a memory controller in Figure 1A, according to one aspect of the present invention.
  • Figure 1D shows a top-level block diagram of an audio/video player with voice control, according to one aspect of the present invention
  • Figure 1E shows an example of storing information in non-volatile memory cells, according to one aspect of the present invention
  • Figure 2 shows a process flow diagram for using a voice controlled portable memory device, according to one aspect of the present invention
  • Figure 3A shows a process flow diagram for performing enrollment of a portable memory device, according to one aspect of the present invention
  • Figure 3B shows an example of a voice print created according to one aspect of the present invention
  • Figure 4 shows a process flow diagram for authenticating a portable memory device, according to one aspect of the present invention
  • Figure 5 shows a process flow diagram for creating a password bank, according to one aspect of the present invention
  • Figure 6 shows a process flow diagram for reinitializing a portable device, according to one aspect of the present invention
  • Figure 7A shows an example for storing keywords in a portable device, according to one aspect of the present invention.
  • Figure 7B shows an example of phonetic patterns associated with file names, according to one aspect of the present invention
  • Figure 8 shows a process flow diagram for creating a macro in a portable memory device, according to one aspect of the present invention
  • Figure 9 shows an example of a macro, according to one aspect of the present invention.
  • Figure 10 shows a process flow diagram for associating a function to a file name, according to one aspect of the present invention.
  • FIG. 1A shows a functional block diagram of a portable memory device (may also be referred to as a "flash device" or "flash memory device") 105 coupled to a host device (or system, used interchangeably) 100 via a bus 100A.
  • flash device or flash memory device
  • a microphone 106B is provided to capture a user's voice (shown as input 106D) that is then sent to an analog/digital (A/D) converter 106A.
  • a digital signal 106C is received and processed by controller 106 (may also be referred to as "memory controller" or "controller"), as described below.
  • Controller 106 interfaces with host system 100 via a bus interface 100A.
  • controller 106 may be a part of an integrated circuit (for example, an application specific integrated circuit (ASIC)) or any other circuit.
  • ASIC application specific integrated circuit
  • Flash memory device 105 includes solid-state memory modules/cells 107-108 (shown as Memory Module #1 and Memory Module #N). Memory cells 107/108 are used to store data, applications and other information.
  • flash memory cards There are currently many different types of flash memory cards that are commercially available, examples being the CompactFlash (CF), the MultiMediaCard (MMC), Secure Digital (SD), miniSD, Memory Stick, SmartMedia and TransFlash cards. Although each of these cards has a unique mechanical and/or electrical interface according to its standardized specifications (for example, the Universal Serial Bus (USB) specification, incorporated herein by reference in its entirety), the flash memory included in each is very similar.
  • CF CompactFlash
  • MMC MultiMediaCard
  • SD Secure Digital
  • miniSD miniSD
  • Memory Stick SmartMedia
  • TransFlash cards Flash memory cards
  • SanDisk also provides a line of flash drives under its Cruzer trademark, which are hand held memory systems in small packages that have a Universal Serial Bus (USB) plug for connecting with a host by plugging into the host's USB receptacle.
  • USB Universal Serial Bus
  • Each of these memory cards and flash drives includes controllers that interface with the host and control operation of the flash memory within them.
  • Host devices for example, 100
  • PCs personal computers
  • laptop and other portable computers cellular telephones
  • PDAs personal digital assistants
  • digital still cameras digital movie cameras and portable audio players.
  • the host typically includes a built-in receptacle for one or more types of memory cards or flash drives but some require adapters into which a memory card is plugged.
  • a NAND architecture of the memory cell arrays 107-108 is currently preferred, although other architectures, such as NOR, can also be used instead. Examples of NAND flash memories and their operation as part of a memory system may be had by reference to United States patents nos. 5,570,315, 5,774,397, 6,046,935, 6,373,746, 6,456,528, 6,522,580, 6,771,536 and 6,781,877 and United States patent application publication no. 2003/0147278.
  • Figure 1B shows a block diagram of a typical host system 100 that includes a central processing unit (“CPU”) (or microprocessor) 101 connected to a system bus 101B.
  • CPU central processing unit
  • 101B system bus
  • Host system 100 is coupled with flash device 105 via a bus interface 104.
  • Random access main memory (“RAM”) 103 is coupled to system bus 101B and provides CPU 101 with access to memory storage. When executing program instructions, CPU 101 stores those process steps in RAM 103 and executes the stored process steps out of RAM 103.
  • Host system 100 connects to a computer network (not shown) via network interface 101A (and through a network connection (not shown)).
  • a computer network not shown
  • network interface 101A and through a network connection (not shown)
  • One such network is the Internet that allows host system 100 to download applications, code, documents and other electronic information.
  • ROM 102 Read only memory 102 is provided to store invariant instruction sequences such as start-up instruction sequences or basic input/output operating system (BIOS) sequences.
  • invariant instruction sequences such as start-up instruction sequences or basic input/output operating system (BIOS) sequences.
  • I/O device interface 102A allows host 100 to connect to various input/output devices, for example, a keyboard, a pointing device ("mouse"), a monitor, printer, a modem and the like.
  • I/O device interface 102A is shown as a single block for simplicity and may include plural interfaces to interface with different types of I/O devices.
  • Figure 1C shows a block diagram of the internal architecture of controller module 106. Controller module 106 includes a microcontroller 109 that interfaces with various other components via interface logic 111.
  • Memory 110 stores firmware and software instructions that are used by microcontroller 109 to control the operation of flash device 105.
  • Memory 110 may be volatile re-programmable random access memory (“RAM”), a non-volatile memory that is not re-programmable (“ROM”), a one-time programmable memory or a re-programmable flash electrically-erasable and programmable read-only memory (“EEPROM”).
  • RAM re-programmable random access memory
  • ROM non-volatile memory that is not re-programmable
  • EEPROM electrically-erasable and programmable read-only memory
  • a host interface 113 interfaces with host system 100, while a flash interface 112 interfaces with memory modules
  • Microphone 106B is used to capture user voice input (106D) .
  • the analog voice data is then converted into digital data by A/D converter 106A and the digital signal 106C is then processed by microcontroller 109. It is noteworthy that the digital signal may be accessed by microcontroller 109 via interface logic 111.
  • Enrollment module 109A is provided so that a user can trigger the enrollment process, described below, according to one aspect of the present invention.
  • enrollment module includes a "button" or a physical interface that the user activates to start the enrollment process, according to one aspect of the present invention.
  • Figure 1D shows another example of a portable device that is voice controlled, according to one aspect of the present invention.
  • Portable device in Figure 1D is an audio/video player 115 (may be referred to as Player 115) that can play an audio file (for example, an MP3 file) stored in memory cells 107/108. Flash device 105 in this aspect is a part of Player 115. Player 115 is also capable of playing a video file or displaying an image. [0055] It is noteworthy that the present invention is not limited to any particular audio/video file format.
  • Player 115 includes a player controller 117 that controls overall functionality. Player controller 117 interfaces with a display module 123 via a LCD module I/F 124 to display information to a user. Typically, the information relates to the music that is being played. [0057] Player controller 117 also interfaces with a host system via a host interface 118 via port 126. Port 126 may be USB, parallel port, RS232, SCSI or any other type of port.
  • Decoder 120 decodes audio files and sends the decoded signal to an audio signal generator 121.
  • the audio signal generator outputs the audio, for example, to ear phones 122.
  • Player 115 also includes a button interface 119 that receives input from button 125. To request certain functionality the user uses Button 125. It is noteworthy that block 125 is intended to simply provide an example and is not intended to limit the present invention to any particular number/type of buttons or physical interface that is used by the user to request functionality. Button 125 can be used by the user to begin the enrollment/training process, according to one aspect of the present invention, as described below in detail.
  • Figure 1E shows a block diagram for flash device 105 that interfaces with host system 100 via a USB interface.
  • Flash device 105 conforms to the USB specification (i.e. can be accessed via a USB interface) and appears to host 100 having plural Logical Units (LUNs) of storage space and each LUN may appear to be of a different class of storage device.
  • LUNs Logical Units
  • flash device 105 may appear to have both a standard Mass Storage Class volume (LUN 0, 106E), which imitates the behavior of a SCSI Hard Disk Drive, and a MMC Class volume, which imitates the behavior of a CD-ROM (LUN 1, 106F).
  • LUN 0 (106E) as a mass storage device for storing data and other information
  • LUN 1 (106F) as a CD-ROM that can store an auto-run application code for launching an application.
  • Hidden area 106G is secured and may be used to store a voice print template, as discussed below.
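The storage layout described for Figure 1E can be pictured as a small table of logical units. The sketch below is illustrative only: the `LogicalUnit` class, the `visible_to_host` and `read_only` flags, and the `autorun_app` key are assumptions made for the example and do not come from the source.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class LogicalUnit:
    """One storage region of the flash device (hypothetical model)."""
    label: str                    # reference numeral used in Figure 1E
    appears_as: str               # how the region presents itself to the host
    read_only: bool               # assumption: the CD-ROM-like unit is read-only
    visible_to_host: bool = True  # assumption: the hidden area is not enumerated
    contents: Dict[str, bytes] = field(default_factory=dict)


def build_layout() -> List[LogicalUnit]:
    """Build the three regions named in the text: LUN 0, LUN 1 and the hidden area."""
    lun0 = LogicalUnit("106E", "mass storage (SCSI disk-like)", read_only=False)
    lun1 = LogicalUnit("106F", "CD-ROM-like", read_only=True)
    lun1.contents["autorun_app"] = b"<auto-run application image>"
    hidden = LogicalUnit("106G", "secured hidden area", read_only=False,
                         visible_to_host=False)
    return [lun0, lun1, hidden]


def host_view(layout: List[LogicalUnit]) -> List[str]:
    """Return the labels of the regions a host would see."""
    return [unit.label for unit in layout if unit.visible_to_host]
```

In this model a host enumerating the device would see 106E and 106F, while the voice print template would live in 106G, out of the host's view.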
  • Figure 2 shows a top-level flow diagram for using a flash device 105 (or Player 115, used interchangeably throughout this specification and may also be referred to as a "device"), according to one aspect of the present invention.
  • Flash device 105 is initialized in step S200.
  • step S201 the process determines if the device needs to be enrolled.
  • step S202 If enrollment is needed, then the process moves to step S202, described below in detail with respect to Figure 3.
  • step S203 If enrollment is not needed, then the user is authenticated in step S203, described below with respect to Figure 4. After authentication, in step S204, the user is granted access to the device, described below in detail with respect to Figure 5.
  • step S205 If the user cannot be authenticated in step S203, then the device is re-initialized in step S205, described below with respect to Figure 6. The process ends in step S206. [0066] Enrollment:
  • the enrollment process captures a user's voice input 106D and stores it in flash memory cells 107/108 (preferably in a secured hidden area, for example, 106G, Figure 1E), according to one aspect of the present invention.
  • a device user may be asked to repeat a password/phrase more than once to capture an accurate voice print profile in flash device 105. Multiple password phrases may be stored allowing more than one user to access flash device 105 or if a user is concerned about remembering a specific phrase, according to one aspect of the present invention.
  • Controller 106 receives the voice input (106D) and stores it as a template in memory cells 107/108.
  • the enrollment process can be performed in two ways: An application (Figure 1E) is launched (in step S301) when flash device 105 interfaces with a host system (or when Player 115 is powered on for use for the first time).
  • Enrollment can also be initiated manually, as shown in step S302. In this case a user manually launches an application by selecting an application shown in Figure 1E or by pressing a button (125, as shown in Figure 1D).
  • the application prompts the user to repeat a phrase and in step S303, the user voice input is received by flash device 105.
  • the voice input is stored in non-volatile memory cells 107/108. Controller 106 stores the voice input.
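The enrollment steps above (prompt for a repeated phrase, receive the voice input, store a template) might be sketched as follows. The source does not say how a template is computed, so the feature extractor here (mean absolute amplitude per frame) and the averaging of repetitions are stand-in assumptions, as are all function and parameter names.

```python
from statistics import fmean
from typing import Dict, List, Sequence

FeatureVector = List[float]


def extract_features(samples: Sequence[float], frames: int = 4) -> FeatureVector:
    """Placeholder feature extractor: mean absolute amplitude per frame.

    A real device would use a proper voice print algorithm; this is only
    a stand-in so that the enrollment flow can be shown end to end.
    """
    size = max(1, len(samples) // frames)
    features = []
    for start in range(0, size * frames, size):
        frame = samples[start:start + size]
        if frame:
            features.append(fmean([abs(s) for s in frame]))
    return features


def enroll(user: str,
           repetitions: Sequence[Sequence[float]],
           hidden_area: Dict[str, FeatureVector]) -> FeatureVector:
    """Average the features of several repetitions and store the result.

    `hidden_area` models the secured region (106G) as a plain dict; several
    users can be enrolled by calling this once per user.
    """
    if not repetitions:
        raise ValueError("at least one voice sample is required")
    vectors = [extract_features(r) for r in repetitions]
    template = [fmean(column) for column in zip(*vectors)]
    hidden_area[user] = template
    return template
```

Averaging several repetitions reflects the point that the user may be asked to repeat the phrase more than once to get an accurate profile; the exact combination rule is a guess.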
  • FIG. 3B shows an example of a voice template (is also referred to as a "template") 305.
  • Template 305 is used to authenticate a user's request to access flash device 105. Separate templates can be stored so that multiple users can securely use flash device 105.
  • Template 305 is also associated with other passwords (referred to as a password bank 313) . For example, a user may store a password 307 that allows the user to access and use application 306. Password 307 is associated with template 305.
  • a password similar to 307 may also be used to access a host system 100 or to connect to a network via network interface 101A.
  • a data file 308 (that may be protected by encryption 309) can be protected by a voice-based password 311.
  • Password 311 is also associated with template 305.
  • Password 312 used by a user to access a web site 310 (for example, an online banking website) can also be associated with template 305. When the user wants to access website 310, password 312 is automatically filled in because it is linked to template 305.
  • the password bank features are further described in detail below.
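The relationship in Figure 3B, where one voice template is linked to several stored credentials, can be modeled as a small lookup structure. This is a minimal sketch under stated assumptions: the `PasswordBank` class, its method names, and the string identifiers are hypothetical, and credentials are kept in plain form only for brevity.

```python
from dataclasses import dataclass, field
from typing import Dict, Optional, Tuple

Credentials = Tuple[str, str]  # (username, password)


@dataclass
class PasswordBank:
    """Credentials linked to one voice template (for example, template 305)."""
    template_id: str
    entries: Dict[str, Credentials] = field(default_factory=dict)

    def store(self, target: str, username: str, password: str) -> None:
        """Associate credentials for an application, file or website with the template."""
        # A real device would presumably protect these values; stored as-is here.
        self.entries[target] = (username, password)

    def autofill(self, target: str,
                 authenticated_template: Optional[str]) -> Optional[Credentials]:
        """Release credentials only after a successful match against the linked template."""
        if authenticated_template != self.template_id:
            return None
        return self.entries.get(target)
```

For example, after `bank.store("website-310", "alice", "secret")`, a call to `bank.autofill("website-310", "305")` returns the stored pair only when the bank was created for template `"305"`.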
  • step S400 the authentication process begins in step S400. This may occur when flash device 105 interfaces with a host system (or when Player 115 is powered up) and an application is launched.
  • step S401 the user is requested for a voice input sample.
  • step S402 the user voice input 106D is captured by the microphone 106B and converted into a digital signal by an A/D converter 106A.
  • step S403 the captured voice sample is compared to a voice template stored in flash memory cells 107/108 (for example, 305).
  • step S404 flash device microcontroller 109 determines if the voice input matches with stored voice templates.
  • the comparison is performed on the flash device 105 for security reasons.
  • a software module (not shown) running on the host system and/or a hardware circuit (e.g. an ASIC) can be used to perform the comparison.
  • the user is granted access to flash device 105 in Step S405.
  • the level of access may depend on the type of user. For example, certain users may be granted only "read-only" privilege, i.e., the user can only view information and is not allowed to modify stored content, while others are allowed to read and write. This level is set during enrollment.
  • the user If the user cannot be authenticated, then the user is given an option in Step S406 to re-initialize flash device 105 as discussed below in Figure 6.
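The authentication steps (S401 to S406), together with the top-level decision of Figure 2, could be sketched as below. The source does not specify how a sample is matched against a template; cosine similarity with a fixed threshold is used here purely as a stand-in, and `MATCH_THRESHOLD`, the function names, and the returned strings are all assumptions.

```python
import math
from typing import Dict, List, Optional, Sequence, Tuple

FeatureVector = List[float]
# Maps user name -> (stored template, access level set during enrollment).
TemplateStore = Dict[str, Tuple[FeatureVector, str]]

MATCH_THRESHOLD = 0.95  # hypothetical value, not taken from the source


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Similarity in [-1, 1]; returns 0.0 if either vector is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def authenticate(sample: Sequence[float],
                 templates: TemplateStore) -> Optional[Tuple[str, str]]:
    """Steps S403 to S405: return (user, access level) for the best match, else None."""
    best_user, best_score = None, MATCH_THRESHOLD
    for user, (template, _level) in templates.items():
        score = cosine_similarity(sample, template)
        if score >= best_score:
            best_user, best_score = user, score
    if best_user is None:
        return None
    return best_user, templates[best_user][1]


def use_device(sample: Sequence[float], templates: TemplateStore) -> str:
    """Figure 2 in miniature: enroll, grant access, or offer re-initialization."""
    if not templates:
        return "enroll"                 # S201 -> S202
    match = authenticate(sample, templates)
    if match is None:
        return "offer-reinitialize"     # S406 / S205
    user, level = match
    return f"access:{user}:{level}"     # S405 / S204
```

Keeping `authenticate` free of any host-side dependency mirrors the remark that the comparison is performed on the flash device for security reasons.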
  • Password Bank/Application Access/File Access [0084] Figure 5 shows various examples of using flash device 105 with template 305.
  • step S501 the user accesses a website (for example, 310) using a computing system that interfaces with flash device 105.
  • step S502 the user enters a password and user name to control access to the website.
  • step S503 the password and user name is associated with a voice-based template (for example, 305) .
  • the password and username associated with the template are filled in automatically (in S504) .
  • step S505 a user accesses a computer application (306), for example, a Windows-based application and then protects access to the application by storing an application specific password/username (307).
  • step S506 the password and username are associated with template 305.
  • step S507 when the user subsequently wants to access the application again, the password/username is automatically retrieved because they are linked with the voice print template 305.
  • steps S505-S507 can be used to access a host system 100 or access a network via network interface 101A.
  • a user encrypts a data file that is stored in memory cells 107/108.
  • a file specific voice based passphrase (keyword) is used to secure file data.
  • the user voice input is a passphrase that is associated with a particular file/directory/sub-directory.
  • the voice-based passphrase provides additional protection to secure data, according to one aspect of the present invention. For example, template 305 limits access to flash device 105, the encryption protects the file data at the next level, and then the voice based passphrase 310 limits access to file data in step S510.
  • FIG. 6 shows a block diagram for re-initializing flash device 105.
  • step S601 the previous voice based templates are erased
  • step S602 data associated with the user may also be deleted.
  • the user again goes through the enrollment process (i.e. a template or "new image" is reloaded) described above and the re-enrollment is completed in step S604.
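The re-initialization steps (S601 to S604) amount to erasing the stored templates, optionally erasing the associated user data, and returning the device to a state where enrollment is required. A minimal sketch, assuming the device state is held in a dictionary with hypothetical keys `templates`, `user_data` and `needs_enrollment`:

```python
from typing import Any, Dict


def reinitialize(device: Dict[str, Any], wipe_user_data: bool = True) -> Dict[str, Any]:
    """Erase voice templates and, optionally, the user's data, then require enrollment.

    The dictionary keys are illustrative; the source only says that templates
    are erased (S601) and that user data "may also be deleted" (S602).
    """
    device["templates"].clear()          # S601
    if wipe_user_data:
        device["user_data"].clear()      # S602 (optional per the text)
    device["needs_enrollment"] = True    # the user must enroll again (S604)
    return device
```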
  • Macros
  • a voice-based template is associated with a control button of a portable device.
  • the user can record the word "play” and the keyword play is associated with the functionality of the "play” button.
  • the adaptive aspects of the present invention also allow a user to create "macros" for certain functions for which there are no control buttons or for which more than one button needs to be pressed.
  • One example of such a macro is for the mute function for an audio/video player. The mute function allows a user to mute/silence the player.
  • Typically, one either has a dedicated button or has to press more than one button to mute the player.
  • the user stores activation keywords and assigns the keywords to various functions.
  • the keywords are captured via microphone 106B and once captured, a template is created and stored in memory cells 107/108.
  • Controller 106 saves the template.
  • the user then captures one or more button control functions (for example, "play", rewind, fast forward, pause, and others) and the button control functions are associated with the keywords and stored in non-volatile memory cells 107/108.
  • button control functions for example, "play", rewind, fast forward, pause, and others
  • Figure 7A shows an example of how keywords stored in memory cells are related to functions.
  • Plural keywords shown as 1 to N may be stored to perform plural functions (1 to N) .
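The keyword-to-function table of Figure 7A is essentially a lookup from a recognized keyword to a device action. In the sketch below the keyword is assumed to have already been recognized and reduced to text, which sidesteps the template matching; the `Player` class, its methods and the `dispatch` helper are hypothetical.

```python
from typing import Callable, Dict


class Player:
    """Tiny stand-in for Player 115 with a few controllable functions."""

    def __init__(self) -> None:
        self.state = "stopped"
        self.volume = 5

    def play(self) -> None:
        self.state = "playing"

    def pause(self) -> None:
        self.state = "paused"

    def mute(self) -> None:
        self.volume = 0


def build_keyword_table(player: Player) -> Dict[str, Callable[[], None]]:
    """Keywords 1..N mapped to functions 1..N, as in Figure 7A."""
    return {"play": player.play, "pause": player.pause, "mute": player.mute}


def dispatch(keyword: str, table: Dict[str, Callable[[], None]]) -> bool:
    """Run the function stored for a keyword; return False if none is stored."""
    action = table.get(keyword.lower())
    if action is None:
        return False
    action()
    return True
```

On an actual device the table key would be a stored voice template rather than a string, so the lookup would be a nearest-template search instead of a dictionary access.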
  • FIG 8 shows a process flow diagram for training and using a portable device so that device functions can be performed based on voice input.
  • a user records a specific keyword.
  • the user is enrolled and authenticated by the portable device as explained above.
  • An application is launched to train and store the keywords.
  • the keywords can also be pre-loaded in memory cells 107/108.
  • step S802 controller 106 stores a voiceprint template for the keyword.
  • step S803 the template is stored in non-volatile memory cells.
  • step S804 the user captures a button control sequence for a function that the user intends to associate with the stored keyword.
  • the button sequence can be for a function which has a dedicated button (for example, the play function), or for which a user has to perform a button sequence (for example, to achieve the mute function, in various audio/video players one has to press more than one button/key).
  • step S805 the button control action is associated with the stored keyword.
  • Controller 106 performs this function.
  • a host processor may also perform this function.
  • step S806, the user terminates the button sequence. Termination of a button sequence is signaled by an action that normally does not take place, for example, by holding a specific button for a pre-determined period.
  • the foregoing process steps are used to store plural keywords that are associated to plural device functions.
  • Figure 9 shows an example of associating the mute function to user keyword "Mute". Each device has a "Menu" option and a user selects the "Menu" option to begin training the device. From the Menu option, the user chooses the "Setting" option. The user then selects the "Voice Command" option that allows the user to move to the Train option.
  • the user selects the Train option and is prompted to enter a voice command.
  • the user says "Mute” and device 105 creates a Mute template.
  • the user is then prompted to enter a button sequence (for example, Menu>Volume>Level 0) that can be associated with the voice command "Mute”. Pressing certain buttons for certain duration (for example, the A/B repeat button for 4 seconds) terminates the sequence.
  • the spoken word can be used to activate the function for which it is programmed. For example, when the user says Mute, the device (Player 115) becomes mute.
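The macro training flow of Figures 8 and 9 (record a keyword, capture a button sequence, end the sequence with an unusual long press, then store the association) might look like the sketch below. The terminator follows the example in the text (holding the A/B repeat button for 4 seconds); the data types, function names, and the use of a text keyword instead of a voice template are assumptions.

```python
from typing import Dict, Iterable, List, Tuple

ButtonPress = Tuple[str, float]  # (button name, seconds held)

# From the Figure 9 example: holding A/B repeat for 4 seconds ends the sequence.
TERMINATOR_BUTTON = "A/B"
TERMINATOR_HOLD_SECONDS = 4.0


def capture_sequence(presses: Iterable[ButtonPress]) -> List[str]:
    """Steps S804 and S806: collect button presses until the terminating long hold."""
    sequence: List[str] = []
    for button, held in presses:
        if button == TERMINATOR_BUTTON and held >= TERMINATOR_HOLD_SECONDS:
            return sequence
        sequence.append(button)
    raise ValueError("button sequence was never terminated")


def record_macro(keyword: str,
                 presses: Iterable[ButtonPress],
                 macros: Dict[str, List[str]]) -> None:
    """Step S805: associate the captured button sequence with the keyword."""
    macros[keyword.lower()] = capture_sequence(presses)


def run_macro(keyword: str, macros: Dict[str, List[str]]) -> List[str]:
    """Return the button sequence to replay for a spoken keyword (empty if unknown)."""
    return list(macros.get(keyword.lower(), []))
```

With the Figure 9 example, recording "Mute" with the presses Menu, Volume, Level 0 and then a 4-second A/B hold stores the three-button sequence, which `run_macro("mute", ...)` returns for replay.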
  • FIG. 10 shows a process flow diagram for executing device 105 functions when a user states a command with a file name for a file stored in memory cells 107/108, according to one aspect of the present invention.
  • the process begins in step S1000, when player 115 receives a voice command with a file name from a user. For example, the user states "Play Beethoven", where "play" is a command to play an audio file named "Beethoven".
  • step S1002 player 115 parses the file name and creates a phonetic pattern. For example, "Beethoven" is reduced to a pattern "bee", "tho" and "ven".
  • step S1004 player 115 searches plural files that are stored in a directory in memory cells 107/108 to determine if the phonetic pattern in step S1002 matches the phonetic pattern for the stored files.
  • Player 115 creates a phonetic pattern for the stored file names either in real time when it receives a command in step S1000 or maintains a list of phonetic patterns that is updated every time a file is added.
  • the received file name phonetic pattern (for example, bee, tho, ven) is compared with the phonetic patterns of the stored files. If there is a match, the function is executed in step S1005. In this example, the file named "Beethoven" is played.
  • Figure 7B shows an example of how file names with associated phonetic patterns are stored in memory cells 107/108.
  • the files can be for audio, video or any other information.
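The file-name matching of Figure 10 (steps S1000 to S1005) can be sketched as building a pattern for the spoken name and looking it up in an index of patterns for the stored files. The source does not define how a phonetic pattern is derived; fixed three-letter chunks are used here only because they reproduce the "bee", "tho", "ven" example, and the command is assumed to have already been converted to text. All function names are hypothetical.

```python
import re
from typing import Dict, Iterable, Optional, Tuple

Pattern = Tuple[str, ...]


def phonetic_pattern(name: str, chunk: int = 3) -> Pattern:
    """Naive stand-in for a phonetic pattern: lowercase letters in fixed-size chunks."""
    letters = re.sub(r"[^a-z0-9]", "", name.lower())
    return tuple(letters[i:i + chunk] for i in range(0, len(letters), chunk))


def build_index(file_names: Iterable[str]) -> Dict[Pattern, str]:
    """Pattern list kept for stored files (Figure 7B); rebuild when a file is added."""
    index: Dict[Pattern, str] = {}
    for file_name in file_names:
        stem = file_name.rsplit(".", 1)[0]  # ignore the extension
        index[phonetic_pattern(stem)] = file_name
    return index


def handle_command(command: str, index: Dict[Pattern, str]) -> Optional[Tuple[str, str]]:
    """Split "<function> <file name>" and return (function, matching file) or None."""
    parts = command.split(maxsplit=1)
    if len(parts) != 2:
        return None
    function, spoken_name = parts
    match = index.get(phonetic_pattern(spoken_name))
    if match is None:
        return None
    return function.lower(), match
```

Maintaining the index ahead of time corresponds to the second option in the text (a list of patterns updated whenever a file is added); computing `phonetic_pattern` for each stored name on every command would correspond to the real-time option.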
  • buttons are needed to operate a device like an audio/video player (for example, an MP3 player) .
  • the user is given an option to create voice commands for standard functions as well as custom functions.
  • the device is user friendly and cheaper because fewer buttons are needed.

Abstract

A portable memory device (105) is provided. The device includes a microphone (106B) for receiving a voice command from a user; and a device controller (106) that creates a voice based template for the voice command and stores the voice based template in a plurality of non-volatile memory cells (107, 108), wherein the voice based template is associated with one or more button control actions entered by the user for certain device functionality. A method for a portable memory device (105) is provided. The method includes, recording a keyword and creating a voice based template for the keyword, wherein a processor for the portable memory device (105) creates the voice based template and stores the voice based template in non-volatile memory cells (107, 108); prompting the user to capture button control actions related to a portable memory device (105) functionality; and associating the button control actions to the voice based template.

Description

VOICE CONTROLLED PORTABLE MEMORY STORAGE DEVICE
Inventor(s): Kevin M. Conley
CROSS REFERENCE TO RELATED APPLICATIONS
[0001] This patent application is related to the following applications, the disclosure of which is incorporated herein by reference in its entirety:
[0002] Serial Number 11/314,933, filed on December 21, 2005, entitled "VOICE CONTROLLED PORTABLE MEMORY STORAGE DEVICE"; and
[0003] Serial Number 11/314,522, filed on December 21, 2005, entitled "VOICE CONTROLLED PORTABLE MEMORY STORAGE DEVICE".
BACKGROUND OF THE INVENTION
1. Field of the Invention
[0004] The present invention relates to portable devices, and more particularly, to voice activated and controlled, portable non-volatile memory storage devices.
2. Background
[0005] Non-volatile semiconductor memory devices, such as flash memory storage drives, are commonly used to store digital information in various applications, for example, digital cameras, cell phones, MP3 or other audio/video players, notebook computers, desktop computers and other applications. These memory devices are small, portable, and reliable with a large capacity to store data. The memory devices can be connected to the foregoing using standard interfaces, for example, the Universal Serial Bus (USB) port or an IEEE 1394 ("Firewire") port.
[0006] The rapid popularity of flash memory devices also poses security risks and challenges. Access to stored data and to device functionality needs to be authorized and secure.
[0007] One common way to control access to such devices has been via a traditional password and a PIN (personal identification information). The password/PIN solution is not very effective, because the password can be hacked or forgotten.
[0008] Another solution has been to use biometric parameters, like, fingerprints, to control access to such devices. The biometric solution has shortcomings as well. For example, fingerprints can change over time or become unrecognizable. Also, fingerprint sensors are complex, sometimes unreliable, and expensive.
[0009] Portable devices (for example, an MP3 player or any other type of audio/video player) also use different buttons to control various functions. For example, portable audio/video players use plural buttons for recording, playback and other functions. These buttons are expensive and occupy real estate on portable devices that are small in size to begin with. The buttons are inconvenient to use, for example, at night or while exercising.
[0010] Therefore, there is a need for a portable device that can efficiently provide secured access to a user and also minimize the use of buttons.
SUMMARY OF THE INVENTION
[0011] In one aspect, a portable memory device ("device") is provided. The device includes a microphone for receiving a voice command from a user; and a device controller that creates a voice based template for the voice command and stores the voice based template in a plurality of non-volatile memory cells, wherein the voice based template is associated with one or more button control actions entered by the user for certain device functionality.
[0012] In another aspect, a portable memory device ("device") is provided. The device includes a microphone for receiving a voice command with a file name from a user; and a device controller that creates a phonetic pattern for the received file name and compares the received file name phonetic pattern with a phonetic pattern for files stored in the memory device.
[0013] In yet another aspect, a portable memory storage device ("device") is provided. The device includes a microphone for receiving a user voice input; a controller that receives the voice input and creates a first template; and a plurality of non-volatile memory cells for storing the first template, wherein the template is used to authenticate the user for any subsequent user request for accessing the device and an application is launched when the device interfaces with a host system to enroll the user as an authorized user to access device functionality and access host system functionality and the controller creates a second template for a voice command and stores the second template in a plurality of non-volatile memory cells, wherein the second template is associated with one or more button control actions entered by the user for certain device functionality.
[0014] In another aspect, a method for a portable memory device is provided. The method includes recording a keyword and creating a voice based template for the keyword, wherein a processor for the portable memory device creates the voice based template and stores the voice based template in non-volatile memory cells; prompting the user to capture button control actions related to a portable memory device functionality; and associating the button control actions to the voice based template.
[0015] In yet another aspect, a method for a portable memory device is provided. The method includes receiving a voice command from a user with a file name; creating a phonetic pattern for the received file name; comparing the phonetic pattern for the received file name with phonetic patterns for file names for files stored in the memory device; and executing a function received with the voice command if the phonetic pattern for the received file name matches with a phonetic pattern for file names for files stored in the memory device.
[0016] In another aspect, a method for a portable memory storage device is provided. The method includes prompting a user to capture button control actions related to the portable memory device functionality; associating the button control actions to a pre-loaded voice command that is stored in a plurality of non-volatile memory cells; and executing a function when a user issues a voice command.
[0017] This brief summary has been provided so that the nature of the invention may be understood quickly. A more complete understanding of the invention can be obtained by reference to the following detailed description of the preferred embodiments thereof, in connection with the attached drawings.
BRIEF DESCRIPTION OF THE DRAWINGS
[0018] The foregoing features and other features of the present invention will now be described with reference to the drawings of a preferred embodiment. In the drawings, the same components have the same reference numerals. The illustrated embodiment is intended to illustrate, but not to limit, the invention. The drawings include the following Figures:
[0019] Figure 1A shows a top-level block diagram of a portable memory device coupled to a host system, according to one aspect of the present invention;
[0020] Figure 1B shows a block diagram of the internal architecture of the host system in Figure 1A;
[0021] Figure 1C shows a block diagram of a memory controller in Figure 1A, according to one aspect of the present invention;
[0022] Figure 1D shows a top-level block diagram of an audio/video player with voice control, according to one aspect of the present invention;
[0023] Figure 1E shows an example of storing information in non-volatile memory cells, according to one aspect of the present invention;
[0024] Figure 2 shows a process flow diagram for using a voice controlled portable memory device, according to one aspect of the present invention;
[0025] Figure 3A shows a process flow diagram for performing enrollment of a portable memory device, according to one aspect of the present invention;
[0026] Figure 3B shows an example of a voice print created according to one aspect of the present invention;
[0027] Figure 4 shows a process flow diagram for authenticating a portable memory device, according to one aspect of the present invention;
[0028] Figure 5 shows a process flow diagram for creating a password bank, according to one aspect of the present invention;
[0029] Figure 6 shows a process flow diagram for reinitializing a portable device, according to one aspect of the present invention;
[0030] Figure 7A shows an example for storing keywords in a portable device, according to one aspect of the present invention;
[0031] Figure 7B shows an example of phonetic patterns associated with file names, according to one aspect of the present invention;
[0032] Figure 8 shows a process flow diagram for creating a macro in a portable memory device, according to one aspect of the present invention;
[0033] Figure 9 shows an example of a macro, according to one aspect of the present invention; and
[0034] Figure 10 shows a process flow diagram for associating a function to a file name, according to one aspect of the present invention.
DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS
[0035] To facilitate an understanding of the preferred embodiment, the general architecture and operation of a computing system/portable non-volatile memory storage device will first be described. The specific architecture and operation of the preferred embodiment will then be described with reference to the general architecture.
[0036] Computing System/Portable Memory Device
[0037] Figure 1A shows a functional block diagram of a portable memory device (may also be referred to as a "flash device" or "flash memory device") 105 coupled to a host device (or system, used interchangeably) 100 via a bus 100A. The term portable memory device as used throughout this specification is intended to include a portable flash drive, a portable audio/video player (including an MP3 player) and other similar devices.
[0038] A microphone 106B is provided to capture a user's voice (shown as input 106D) that is then sent to an analog/digital (A/D) converter 106A. A digital signal 106C is received and processed by controller 106 (may also be referred to as "memory controller" or "controller"), as described below. Controller 106 interfaces with host system 100 via a bus interface 100A.
[0039] It is noteworthy that controller 106 may be a part of an integrated circuit (for example, an application specific integrated circuit (ASIC)) or any other circuit.
[0040] Flash memory device 105 includes solid-state memory modules/cells 107-108 (shown as Memory Module #1 and Memory Module #N). Memory cells 107/108 are used to store data, applications and other information.
[0041] There are currently many different types of flash memory cards that are commercially available, examples being the CompactFlash (CF), the MultiMediaCard (MMC), Secure Digital (SD), miniSD, Memory Stick, SmartMedia and TransFlash cards. Although each of these cards has a unique mechanical and/or electrical interface according to its standardized specifications (for example, the Universal Serial Bus (USB) specification, incorporated herein by reference in its entirety), the flash memory included in each is very similar. These cards are all available from SanDisk Corporation, assignee of the present application.
[0042] SanDisk also provides a line of flash drives under its Cruzer trademark, which are hand held memory systems in small packages that have a Universal Serial Bus (USB) plug for connecting with a host by plugging into the host's USB receptacle. Each of these memory cards and flash drives includes controllers that interface with the host and control operation of the flash memory within them.
[0043] Host devices (for example, 100) that use such memory cards and flash drives are many and varied. They include personal computers (PCs), laptop and other portable computers, cellular telephones, personal digital assistants (PDAs), digital still cameras, digital movie cameras and portable audio players. The host typically includes a built-in receptacle for one or more types of memory cards or flash drives but some require adapters into which a memory card is plugged.
[0044] A NAND architecture of the memory cell arrays 107-108 is currently preferred, although other architectures, such as NOR, can be used instead. Examples of NAND flash memories and their operation as part of a memory system may be had by reference to United States patents nos. 5,570,315, 5,774,397, 6,046,935, 6,373,746, 6,456,528, 6,522,580, 6,771,536 and 6,781,877 and United States patent application publication no. 2003/0147278.
[0045] Figure 1B shows a block diagram of a typical host system 100 that includes a central processing unit ("CPU") (or microprocessor) 101 connected to a system bus 101B. Host system 100 is coupled with flash device 105 via a bus interface 104.
[0046] Random access main memory ("RAM") 103 is coupled to system bus 101B and provides CPU 101 with access to memory storage. When executing program instructions, CPU 101 stores those process steps in RAM 103 and executes the stored process steps out of RAM 103.
[0047] Host system 100 connects to a computer network (not shown) via network interface 101A (and through a network connection (not shown)). One such network is the Internet, which allows host system 100 to download applications, code, documents and other electronic information.
[0048] Read only memory ("ROM") 102 is provided to store invariant instruction sequences such as start-up instruction sequences or basic input/output operating system (BIOS) sequences.
[0049] Input/Output ("I/O") device interface 102A allows host 100 to connect to various input/output devices, for example, a keyboard, a pointing device ("mouse"), a monitor, printer, a modem and the like. I/O device interface 102A is shown as a single block for simplicity and may include plural interfaces to interface with different types of I/O devices.
[0050] Figure 1C shows a block diagram of the internal architecture of controller module 106. Controller module 106 includes a microcontroller 109 that interfaces with various other components via interface logic 111. Memory 110 stores firmware and software instructions that are used by microcontroller 109 to control the operation of flash device 105. Memory 110 may be volatile re-programmable random access memory ("RAM"), a non-volatile memory that is not re-programmable ("ROM"), a one-time programmable memory or a re-programmable flash electrically-erasable and programmable read-only memory ("EEPROM").
[0051] A host interface 113 interfaces with host system 100, while a flash interface 112 interfaces with memory modules 107-108.
[0052] Microphone 106B is used to capture user voice input (106D). The analog voice data is then converted into digital data by A/D converter 106A and the digital signal 106C is then processed by microcontroller 109. It is noteworthy that digital signal 106C may be accessed by microcontroller 109 via interface logic 111.
[0053] Enrollment module 109A is provided so that a user can trigger the enrollment process, described below, according to one aspect of the present invention. In one aspect, enrollment module 109A includes a "button" or a physical interface that the user activates to start the enrollment process.
[0054] Figure 1D shows another example of a portable device that is voice controlled, according to one aspect of the present invention. The portable device in Figure 1D is an audio/video player 115 (may be referred to as Player 115) that can play an audio file (for example, an MP3 file) stored in memory cells 107/108. Flash device 105 in this aspect is a part of Player 115. Player 115 is also capable of playing a video file or displaying an image.
[0055] It is noteworthy that the present invention is not limited to any particular audio/video file format.
[0056] Player 115 includes a player controller 117 that controls overall functionality. Player controller 117 interfaces with a display module 123 via an LCD module I/F 124 to display information to a user. Typically, the information relates to the music that is being played.
[0057] Player controller 117 also interfaces with a host system via a host interface 118 via port 126. Port 126 may be USB, parallel port, RS232, SCSI or any other type of port.
[0058] Decoder 120 decodes audio files and sends the decoded signal to an audio signal generator 121. The audio signal generator outputs the audio, for example, to ear phones 122.
[0059] Player 115 also includes a button interface 119 that receives input from button 125. To request certain functionality the user uses Button 125. It is noteworthy that block 125 is intended to simply provide an example and is not intended to limit the present invention to any particular number/type of buttons or physical interface that is used by the user to request functionality. Button 125 can be used by the user to begin the enrollment/training process, according to one aspect of the present invention, as described below in detail.
[0060] Figure 1E shows a block diagram for flash device 105 that interfaces with host system 100 via a USB interface. Flash device 105 conforms to the USB specification (i.e. can be accessed via a USB interface) and appears to host 100 as having plural Logical Units (LUNs) of storage space, and each LUN may appear to be of a different class of storage device. For example, flash device 105 may appear to have both a standard Mass Storage Class volume (LUN 0, 106E), which imitates the behavior of a SCSI Hard Disk Drive, and an MMC Class volume, which imitates the behavior of a CD-ROM (LUN 1, 106F).
[0061] Host system 100, having its own operating system, views LUN 0 (106E) as a mass storage device for storing data and other information; and LUN 1 (106F) as a CD-ROM that can store an auto-run application code for launching an application. Hidden area 106G is secured and may be used to store a voice print template, as discussed below.
[0062] Process Flow:
[0063] Figure 2 shows a top-level flow diagram for using a flash device 105 (or Player 115, used interchangeably throughout this specification and may also be referred to as a "device"), according to one aspect of the present invention. Flash device 105 is initialized in step S200. In step S201, the process determines if the device needs to be enrolled. If enrollment is needed, then the process moves to step S202, described below in detail with respect to Figure 3A.
[0064] If enrollment is not needed, then the user is authenticated in step S203, described below with respect to Figure 4. After authentication, in step S204, the user is granted access to the device, described below in detail with respect to Figure 5.
[0065] If the user cannot be authenticated in step S203, then the device is re-initialized in step S205, described below with respect to Figure 6. The process ends in step S206.
[0066] Enrollment:
[0067] The enrollment process captures a user's voice input 106D and stores it in flash memory cells 107/108 (preferably in a secured hidden area, for example, 106G, Figure 1E), according to one aspect of the present invention. A device user may be asked to repeat a password/phrase more than once to capture an accurate voice print profile in flash device 105. Multiple password phrases may be stored, either to allow more than one user to access flash device 105 or for a user who is concerned about remembering a specific phrase, according to one aspect of the present invention. Controller 106 receives the voice input (106D) and stores it as a template in memory cells 107/108.
[0068] Turning now in detail to Figure 3A, the enrollment process begins in step S300. The enrollment process can be performed in two ways. In the first, an application (Figure 1E) is launched (in step S301) when flash device 105 interfaces with a host system (or when Player 115 is powered on for use for the first time).
[0069] Enrollment can also be initiated manually, as shown in step S302. In this case a user manually launches an application by selecting an application shown in Figure 1E or by pressing a button (125, as shown in Figure 1D).
[0070] The application prompts the user to repeat a phrase and in step S303, the user voice input is received by flash device 105. In step S304, controller 106 stores the voice input in non-volatile memory cells 107/108. The voice input is stored as a template that is used in subsequent authentication when a user wants to access flash device 105 functionality. In one aspect, controller 106 stores and maintains the template.
[0071] Figure 3B shows an example of a voice template (also referred to as a "template") 305. Template 305 is used to authenticate a user's request to access flash device 105. Separate templates can be stored so that multiple users can securely use flash device 105.
[0072] Template 305 is also associated with other passwords (referred to as a password bank 313). For example, a user may store a password 307 that allows the user to access and use application 306. Password 307 is associated with template 305. It is noteworthy that a password similar to 307 may also be used to access a host system 100 or to connect to a network via network interface 101A.
[0073] A data file 308 (that may be protected by encryption 309) can be protected by a voice-based password 311. Password 311 is also associated with template 305.
[0074] Password 312 used by a user to access a web site 310 (for example, an online banking website) can also be associated with template 305. When the user wants to access website 310, password 312 is automatically filled in because it is linked to template 305.
[0075] The password bank features are further described in detail below.
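The association between template 305 and password bank 313 described in paragraphs [0072] to [0074] can be pictured as a small data structure. The following is a minimal sketch only, not the device's actual storage format: the class name `VoiceTemplate`, its fields, the resource keys such as `"web:310"`, and the sample credentials are all hypothetical and chosen for illustration.

```python
from dataclasses import dataclass, field
from typing import Dict, Optional, Tuple


@dataclass
class VoiceTemplate:
    """Hypothetical stand-in for template 305: one enrolled user's voice print."""

    user: str
    voice_print: bytes  # opaque enrollment data; the real format is not specified
    # Password bank (313): resource identifier -> (username, password).
    password_bank: Dict[str, Tuple[str, str]] = field(default_factory=dict)

    def link(self, resource: str, username: str, password: str) -> None:
        """Associate a credential (e.g. 307 or 312) with this template."""
        self.password_bank[resource] = (username, password)

    def lookup(self, resource: str) -> Optional[Tuple[str, str]]:
        """Return the credential linked to this template, or None if absent."""
        return self.password_bank.get(resource)


# Illustrative values only; the keys echo the reference numerals in Figure 3B.
template = VoiceTemplate(user="alice", voice_print=b"\x00\x01")
template.link("app:306", "alice", "app-secret")
template.link("web:310", "alice01", "bank-secret")
```

Keeping one bank per template is one simple way to keep the credentials of different enrolled users separate, since a lookup is only possible through the template that authenticated the user.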
[0076] Authentication:
[0077] When flash device 105 has been secured through the enrollment process, secured authentication is used to allow access to a user. The level of access will depend on the stored passwords.
[0078] Turning in detail to Figure 4, the authentication process begins in step S400. This may occur when flash device 105 interfaces with a host system (or when Player 115 is powered up) and an application is launched. In step S401, the user is requested for a voice input sample. In step S402, the user voice input 106D is captured by the microphone 106B and converted into a digital signal by an A/D converter 106A.
[0079] In step S403, the captured voice sample is compared to a voice template stored in flash memory cells 107/108 (for example, 305).
[0080] In step S404, flash device microcontroller 109 determines if the voice input matches with stored voice templates. The comparison is performed on the flash device 105 for security reasons. However, a software module (not shown) running on the host system and/or a hardware circuit (e.g. an ASIC) can be used to perform the comparison.
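The comparison in steps S403 and S404 can be illustrated with a short sketch. The specification does not say how a voice sample is compared with a template, so everything here is an assumption made for illustration: samples and templates are modeled as equal-length feature vectors, the match score is cosine similarity, and the `threshold` value of 0.95 is arbitrary. The function names are hypothetical.

```python
import math
from typing import Dict, Optional, Sequence


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine of the angle between two feature vectors (assumed match score)."""
    if len(a) != len(b):
        raise ValueError("feature vectors must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def authenticate(
    sample: Sequence[float],
    templates: Dict[str, Sequence[float]],
    threshold: float = 0.95,  # assumed value, not taken from the specification
) -> Optional[str]:
    """Return the enrolled user whose template best matches, or None (S404)."""
    best_user, best_score = None, threshold
    for user, template in templates.items():
        score = cosine_similarity(sample, template)
        if score >= best_score:
            best_user, best_score = user, score
    return best_user


# Illustrative enrolled templates for two users.
enrolled = {"alice": [1.0, 0.0, 0.5], "bob": [0.0, 1.0, 0.2]}
```

A return value of `None` corresponds to the case where the user cannot be authenticated.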
[0081] If the user input matches with the stored template then the user is granted access to flash device 105 in step S405. In one aspect, the level of access may depend on the type of user. For example, certain users may be granted only "read-only" privilege, i.e., the user can only view information and is not allowed to modify stored content, while others are allowed to read and write. This level is set during enrollment.
[0082] If the user cannot be authenticated, then the user is given an option in step S406 to re-initialize flash device 105 as discussed below in Figure 6.
[0083] Password Bank/Application Access/File Access:
[0084] Figure 5 shows various examples of using flash device 105 with template 305. The user is first authenticated in step S500, as described above with respect to Figure 4.
[0085] Steps S501-S504 relate to websites, steps S505-S507 relate to applications and steps S508-S510 relate to files.
[0086] In step S501, the user accesses a website (for example, 310) using a computing system that interfaces with flash device 105.
[0087] In step S502, the user enters a password and user name to control access to the website. In step S503, the password and user name are associated with a voice-based template (for example, 305). When the user subsequently tries to access the same website, then the password and username associated with the template are filled in automatically (in S504).
[0088] It is noteworthy that if a host system stores "cookies" containing user names/passwords from previous logins then the password bank based on voice input takes precedence. Furthermore, if multiple users are enrolled for flash device 105, then passwords for different users are kept separate and access is only granted to authenticated users. If a single user has multiple passwords enrolled, then the user stores the passwords/usernames multiple times based on the number of passwords/usernames.
[0089] In step S505, a user accesses a computer application (306), for example, a Windows® based application and then protects access to the application by storing an application specific password/username (307). In step S506 the password and username are associated with template 305. In step S507, when the user subsequently wants to access the application again, the password/username is automatically retrieved because they are linked with the voice print template 305.
[0090] It is noteworthy that steps S505-S507 can be used to access a host system 100 or access a network via network interface 101A.
[0091] In step S508, a user encrypts a data file that is stored in memory cells 107/108. In step S509 a file specific voice based passphrase (keyword) is used to secure file data. The user voice input is a passphrase that is associated with a particular file/directory/sub-directory. The voice-based passphrase provides additional protection to secure data, according to one aspect of the present invention. For example, template 305 limits access to flash device 105, the encryption protects the file data at the next level, and then the voice based passphrase 311 limits access to file data in step S510.
[0092] Re-initialization:
[0093] Figure 6 shows a block diagram for re-initializing flash device 105. In step S601, the previous voice based templates are erased. In step S602, data associated with the user may also be deleted. In one aspect, if a user is given a certain partition (segment) of storage space, then the data in that partition is also deleted.
[0094] In step S603, the user again goes through the enrollment process (i.e. a template or "new image" is reloaded) described above and the re-enrollment is completed in step S604.
[0095] Macros:
[0096] In one aspect of the present invention, a voice-based template is associated with a control button of a portable device. For example, for a Player (115), the user can record the word "play" and the keyword play is associated with the functionality of the "play" button. Hence, when the user says the word "play", Player 115 plays music/video.
[0097] The adaptive aspects of the present invention also allow a user to create "macros" for certain functions for which there are no control buttons or for which more than one button needs to be pressed. One example of such a macro is for the mute function for an audio/video player. The mute function allows a user to mute/silence the player. Typically, one either has a dedicated button or has to press more than one button to mute the player.
[0098] Device Training:
[0099] For a new portable device, the user stores activation keywords and assigns the keywords to various functions. The keywords are captured via microphone 106B and once captured, a template is created and stored in memory cells 107/108. Controller 106 saves the template. The user then captures one or more button control functions (for example, play, rewind, fast forward, pause, and others) and the button control functions are associated with the keywords and stored in non-volatile memory cells 107/108.
[0100] It is noteworthy that instead of training the device for keywords, certain keywords can be pre-loaded in memory cells 107/108. The pre-loaded keywords are then associated with functions, as described below.
[0101] Figure 7A shows an example of how keywords stored in memory cells are related to functions. Plural keywords (shown as 1 to N) may be stored to perform plural functions (1 to N).
[0102] Figure 8 shows a process flow diagram for training and using a portable device so that device functions can be performed based on voice input. In step S801, a user records a specific keyword. The user is enrolled and authenticated by the portable device as explained above. An application is launched to train and store the keywords. As stated above, the keywords can also be pre-loaded in memory cells 107/108.
[0103] In step S802, controller 106 stores a voiceprint template for the keyword.
[0104] In step S803, the template is stored in non-volatile memory cells.
[0105] In step S804, the user captures a button control sequence for a function that the user intends to associate with the stored keyword. The button sequence can be for a function which has a dedicated button (for example, the play function), or for which a user has to perform a button sequence (for example, to achieve the mute function, in various audio/video players one has to press more than one button/key).
[0106] In step S805, the button control action is associated with the stored keyword. In one aspect, Controller 106 performs this function. In another aspect, a host processor may also perform this function.
[0107] In step S806, the user terminates the button sequence. Termination of a button sequence is signaled by an action that normally does not take place, for example, by holding a specific button for a pre-determined period.
[0108] The foregoing process steps are used to store plural keywords that are associated to plural device functions.
[0109] Figure 9 shows an example of associating the mute function to user keyword "Mute". Each device has a "Menu" option and a user selects the "Menu" option to begin training the device. From the Menu option, the user chooses the "Setting" option. The user then selects the "Voice Command" option that allows the user to move to the Train option.
[0110] The user selects the Train option and is prompted to enter a voice command. The user says "Mute" and device 105 creates a Mute template. The user is then prompted to enter a button sequence (for example, Menu>Volume>Level 0) that can be associated with the voice command "Mute". Pressing certain buttons for a certain duration (for example, the A/B repeat button for 4 seconds) terminates the sequence.
[0111] Once device 105 is trained, the spoken word can be used to activate the function for which it is programmed. For example, when the user says Mute, the device (Player 115) becomes mute.
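The training flow of Figure 8 and the "Mute" example of Figure 9 can be sketched as follows. This is a simplified illustration under stated assumptions, not the device firmware: a keyword is represented by a plain string in place of a voiceprint template, button input is modeled as (name, hold time) pairs, and the names `capture_sequence`, `MacroTable`, `TERMINATOR_BUTTON` and `TERMINATOR_HOLD_S` are hypothetical. The terminator mirrors the A/B repeat button held for 4 seconds mentioned in the example above.

```python
from typing import Dict, Iterable, List, Tuple

# One button event: (button name, hold duration in seconds). Assumed model.
ButtonPress = Tuple[str, float]

TERMINATOR_BUTTON = "A/B"  # assumption: mirrors the A/B repeat example
TERMINATOR_HOLD_S = 4.0    # assumption: hold time that ends a sequence


def capture_sequence(presses: Iterable[ButtonPress]) -> List[str]:
    """Collect button names until the terminating long press (S804, S806)."""
    sequence: List[str] = []
    for button, held in presses:
        if button == TERMINATOR_BUTTON and held >= TERMINATOR_HOLD_S:
            return sequence
        sequence.append(button)
    raise ValueError("button sequence was not terminated")


class MacroTable:
    """Maps a trained keyword to its recorded button sequence (S805)."""

    def __init__(self) -> None:
        self._macros: Dict[str, List[str]] = {}

    def train(self, keyword: str, presses: Iterable[ButtonPress]) -> None:
        # A real device would store a voiceprint template, not the text.
        self._macros[keyword.lower()] = capture_sequence(presses)

    def execute(self, spoken: str) -> List[str]:
        """Return the button sequence to replay, or [] for an unknown keyword."""
        return self._macros.get(spoken.lower(), [])


table = MacroTable()
table.train(
    "Mute",
    [("Menu", 0.2), ("Volume", 0.2), ("Level 0", 0.2), ("A/B", 4.0)],
)
```

A short press of the terminator button is recorded like any other button, so only the deliberate long press ends the sequence.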
[0112] Figure 10 shows a process flow diagram for executing device 105 functions when a user states a command with a file name for a file stored in memory cells 107/108, according to one aspect of the present invention. The process begins in step S1000, when player 115 receives a voice command with a file name from a user. For example, the user states "Play Beethoven", where "play" is a command to play an audio file named "Beethoven".
[0113] In step S1002, player 115 parses the file name and creates a phonetic pattern. For example, "Beethoven" is reduced to a pattern "bee", "tho" and "ven".
[0114] In step S1004, player 115 searches plural files that are stored in a directory in memory cells 107/108 to determine if the phonetic pattern in step S1002 matches the phonetic pattern for the stored files. Player 115 creates a phonetic pattern for the stored file names either in real time when it receives a command in step S1000 or maintains a list of phonetic patterns that is updated every time a file is added. The received file name phonetic pattern (for example, bee, tho, ven) is compared with the phonetic patterns of the stored files. If there is a match, the function is executed in step S1005. In this example, the file named "Beethoven" is played.
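The matching in steps S1002 to S1005 can be illustrated with a small sketch. The specification describes a syllable-style pattern ("bee", "tho", "ven") but does not define how it is derived, so this example swaps in the well-known Soundex code as a stand-in phonetic pattern. It also assumes the spoken command has already been converted to text. The function names `soundex`, `match_file` and `handle_command` and the sample file list are hypothetical.

```python
from typing import List, Optional, Tuple

# Soundex digit for each consonant group; vowels, h, w and y carry no digit.
_CODES = {}
for _letters, _digit in (
    ("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
    ("l", "4"), ("mn", "5"), ("r", "6"),
):
    for _ch in _letters:
        _CODES[_ch] = _digit


def soundex(name: str) -> str:
    """Four-character Soundex code, used here as a stand-in phonetic pattern."""
    letters = [c for c in name.lower() if c.isalpha()]
    if not letters:
        return ""
    result = letters[0].upper()
    prev = _CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate letters with the same code
        code = _CODES.get(ch, "")
        if code and code != prev:
            result += code
        prev = code  # a vowel resets prev, so a repeated code is kept
    return (result + "000")[:4]


def match_file(spoken_name: str, stored_names: List[str]) -> Optional[str]:
    """Return the first stored file whose pattern matches the spoken name."""
    target = soundex(spoken_name)
    for name in stored_names:
        stem = name.rsplit(".", 1)[0]  # ignore a file extension, if any
        if soundex(stem) == target:
            return name
    return None


def handle_command(
    utterance: str, stored_names: List[str]
) -> Optional[Tuple[str, str]]:
    """Split 'Play Beethoven' into (command, matched file); None if no match."""
    command, _, spoken_name = utterance.partition(" ")
    match = match_file(spoken_name, stored_names)
    return (command.lower(), match) if match else None


files = ["Mozart.mp3", "Beethoven.mp3"]
```

Computing the pattern on each lookup corresponds to the real-time option described above; caching `soundex(stem)` per file when a file is added would correspond to the maintained list of phonetic patterns.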
[0115] Figure 7B shows an example of how file names with associated phonetic patterns are stored in memory cells 107/108. The files can be for audio, video or any other information.
[0116] It is noteworthy that although the foregoing example is based on playing an audio file, the adaptive aspects of the present invention are not limited to playing audio files or to any particular file type/format or to any type of command. For example, a user can command the device to "Delete XYZ". The device then deletes the file XYZ after the phonetic pattern for XYZ matches with a stored file named XYZ.
[0117] In one aspect of the present invention, fewer buttons are needed to operate a device like an audio/video player (for example, an MP3 player). The user is given an option to create voice commands for standard functions as well as custom functions. The device is user friendly and cheaper because fewer buttons are needed.
[0118] While the present invention is described above with respect to what is currently considered its preferred embodiments, it is to be understood that the invention is not limited to that described above. To the contrary, the invention is intended to cover various modifications and equivalent arrangements within the spirit and scope of the appended claims.

Claims

What is claimed is:
1. A portable memory device ("device"), comprising: a microphone for receiving a voice command from a user; and a device controller that creates a voice based template for the voice command and stores the voice based template in a plurality of non-volatile memory cells, wherein the voice based template is associated with one or more button control actions entered by the user for certain device functionality.
2. The device of Claim 1, wherein the user after being authenticated can use the voice command for the device to perform the certain device functionality.
3. The device of Claim 1, wherein the non-volatile memory cells store a plurality of voice commands to execute a plurality of device functions.
4. The device of Claim 1, wherein the device is an audio/video player.
5. A portable memory device ("device"), comprising: a microphone for receiving a voice command with a file name from a user; and a device controller that creates a phonetic pattern for the received file name and compares the received file name phonetic pattern with a phonetic pattern for files stored in the memory device.
6. The device of Claim 5, wherein when the user issues a voice command, the controller executes the function associated with the file name if the phonetic pattern for the received file name matches with a phonetic pattern for file names for files stored in the memory device.
7. The device of Claim 5, wherein the user is authenticated before the user can use the voice command for the controller to execute a function.
8. The device of Claim 5, wherein the non-volatile memory cells of the portable memory device store plural files with file names whose phonetic patterns are compared with a phonetic pattern of the received file name.
9. The device of Claim 5, wherein the device is an audio/video player.
10. A portable memory storage device ("device"), comprising: a microphone for receiving a user voice input; a controller that receives the voice input and creates a first template; and a plurality of non-volatile memory cells for storing the first template, wherein the template is used to authenticate the user for any subsequent user request for accessing the device and an application is launched when the device interfaces with a host system to enroll the user as an authorized user to access device functionality and access host system functionality; and the controller creates a second template for a voice command and stores the second template in the plurality of non-volatile memory cells, wherein the second template is associated with one or more button control actions entered by the user for certain device functionality.
11. The device of Claim 10, wherein the user after being authenticated can use the voice command for the device to perform the certain device functionality.
12. The device of Claim 10, wherein the non-volatile memory cells store a plurality of voice commands to execute a plurality of device functions.
13. The device of Claim 10, wherein the user manually elects to enroll to access device functionality and access host system functionality.
14. The device of Claim 10, wherein the first template is associated with a password and username that a user uses to access a website, and the password and username are automatically filled when an enrolled and authenticated user subsequently attempts to access the website.
15. The device of Claim 10, wherein the first template is associated with a password and username that a user uses to access an application; and the password and username are automatically filled when an enrolled and authenticated user subsequently attempts to access the application.
16. The device of Claim 15, wherein the first template is associated with another user voice based keyword, wherein the keyword is used to allow the user to access a data file.
17. The device of Claim 16, wherein the data file is encrypted and stored in the non-volatile memory cells.
18. The device of Claim 10, wherein the first template is associated with a password and username that a user uses to access the host system; and the password and username are automatically filled when an enrolled and authenticated user subsequently attempts to access the host system.
19. The device of Claim 10, wherein the first template is associated with a password and username that the user uses to access a network; and the password and username are automatically filled when an enrolled and authenticated user subsequently attempts to access the network.
20. The device of Claim 10, wherein plural voice inputs are stored as a first template, allowing the user to store plural passwords to access the device.
21. The device of Claim 10, wherein plural user voice inputs are stored in plural first templates, allowing plural users to be enrolled so that the plural users can securely access the device.
22. The device of Claim 10, wherein the first template and the second template are stored in a secured area of the non-volatile memory cells.
23. The device of Claim 10, wherein when the device interfaces with the host system, an application is launched to authenticate the user by receiving a voice input from the user and comparing the voice input with the first template; and after the user is authenticated, the host system is allowed access to information stored in the device.
24. The device of Claim 10, wherein if an unauthorized user attempts to access the device, the device is re-initialized and during re-initialization the first template, the second template and any data associated with the user is erased.
25. The device of Claim 10, wherein the portable memory storage device is an audio/video player.
26. A method for a portable memory device, comprising: recording a keyword and creating a voice based template for the keyword, wherein a processor for the portable memory device creates the voice based template and stores the voice based template in non-volatile memory cells; prompting a user to capture button control actions related to a portable memory device functionality; and associating the button control actions to the voice based template.
27. The method of Claim 26, wherein the user can record plural keywords for different portable memory device functions.
28. The method of Claim 26, wherein the portable memory device is an audio/video player.
29. The method of Claim 26, wherein after the button control action is associated with the voice based template, the user after authentication can state the keyword and the portable memory device will perform the associated function.
30. A method for a portable memory device, comprising: receiving a voice command from a user with a file name; creating a phonetic pattern for the received file name; comparing the phonetic pattern for the received file name with phonetic patterns for file names for files stored in the memory device; and executing a function received with the voice command if the phonetic pattern for the received file name matches with a phonetic pattern for file names for files stored in the memory device.
31. The method of Claim 30, wherein the user is authenticated before the user can use the voice command for a controller to execute the function.
32. The method of Claim 30, wherein the non-volatile memory cells store plural files with file names whose phonetic patterns are compared with a phonetic pattern for the received file name.
33. The method of Claim 30, wherein a controller receives a voice command from the user and compares the phonetic pattern for the received file name with phonetic patterns for stored file names.
34. The method of Claim 30, wherein the function is associated with an audio file, video file and a data file.
35. The device of Claim 30, wherein the device is an audio/video player.
36. A method for a portable memory storage device, comprising: prompting a user to capture button control actions related to a portable memory device functionality; associating the button control actions to a pre-loaded voice command that is stored in a plurality of non-volatile memory cells; and executing a function when the user issues a voice command.
37. The method of Claim 36, wherein plural keywords can be used for different portable memory device functions.
38. The method of Claim 36, wherein the portable memory device is an audio/video player.
PCT/US2006/062336 2005-12-21 2006-12-19 Voice controlled portable memory storage device WO2007079357A2 (en)

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
US11/313,841 2005-12-21
US11/314,476 2005-12-21
US11/314,476 US20070143117A1 (en) 2005-12-21 2005-12-21 Voice controlled portable memory storage device
US11/313,841 US20070143111A1 (en) 2005-12-21 2005-12-21 Voice controlled portable memory storage device

Publications (2)

Publication Number Publication Date
WO2007079357A2 true WO2007079357A2 (en) 2007-07-12
WO2007079357A3 WO2007079357A3 (en) 2007-12-13

Family

ID=38115934

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2006/062336 WO2007079357A2 (en) 2005-12-21 2006-12-19 Voice controlled portable memory storage device

Country Status (2)

Country Link
TW (1) TWI350475B (en)
WO (1) WO2007079357A2 (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7917949B2 (en) 2005-12-21 2011-03-29 Sandisk Corporation Voice controlled portable memory storage device
US8161289B2 (en) 2005-12-21 2012-04-17 SanDisk Technologies, Inc. Voice controlled portable memory storage device

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5365574A (en) * 1990-05-15 1994-11-15 Vcs Industries, Inc. Telephone network voice recognition and verification using selectively-adjustable signal thresholds
EP1014338A1 (en) * 1998-12-23 2000-06-28 Hewlett-Packard Company Voice control input for portable capture devices
WO2001002949A1 (en) * 1999-07-06 2001-01-11 Chuang Li Methods and apparatus for controlling a portable electronic device using a touchpad
EP1073037A2 (en) * 1999-07-27 2001-01-31 Sony Corporation Speech recognition using prestored templates for system control
EP1220518A2 (en) * 2000-12-25 2002-07-03 Nec Corporation Mobile communications terminal, voice recognition method for same, and record medium storing program for voice recognition
US20050096098A1 (en) * 2002-04-25 2005-05-05 Woods Michael R. Wireless telephone system for electrically powered wheelchair
US20060206339A1 (en) * 2005-03-11 2006-09-14 Silvera Marja M System and method for voice-enabled media content selection on mobile devices

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
ERICSSON INC: "Cellular phone with integrated MP3 player" RESEARCH DISCLOSURE, MASON PUBLICATIONS, HAMPSHIRE, GB, vol. 418, no. 15, February 1999 (1999-02), XP007123891 ISSN: 0374-4353 *

Also Published As

Publication number Publication date
TWI350475B (en) 2011-10-11
TW200745934A (en) 2007-12-16
WO2007079357A3 (en) 2007-12-13

Similar Documents

Publication Publication Date Title
US7917949B2 (en) Voice controlled portable memory storage device
US20070143117A1 (en) Voice controlled portable memory storage device
TWI398792B (en) Method and system of digital key
TWI417732B (en) Memory device with near field communications, method of communicating wireless network settings between devices, and universal serial bus flash drive related therewith
KR102453780B1 (en) Apparatuses and methods for securing an access protection scheme
TWI326427B (en) Biometrics signal input device, computer system having the biometrics signal input device, and control method thereof
US9009816B2 (en) Removable memory storage device with multiple authentication processes
US20020073340A1 (en) Secure mass storage device with embedded biometri record that blocks access by disabling plug-and-play configuration
US20050097338A1 (en) Biometrics parameters protected USB interface portable data storage device with USB interface accessible biometrics processor
JP2006092547A (en) Computer system with basic input-output system and control method thereof
US8161289B2 (en) Voice controlled portable memory storage device
US7620761B2 (en) Multi-functional storage apparatus and control method thereof
KR20020087202A (en) Computer
US20070143111A1 (en) Voice controlled portable memory storage device
US20230385395A1 (en) Storage device with concurrent intialization and fingerprint recognition
US20050193195A1 (en) Method and system for protecting data of storage unit
KR100841982B1 (en) Memory card storing host identification information and access method thereof
US20030208698A1 (en) Plug and play device and access control method therefor
WO2007079359A2 (en) Voice controlled portable memory storage device
WO2007079357A2 (en) Voice controlled portable memory storage device
JP2007122731A (en) Hard disk apparatus with biometrics sensor and method of protecting data therein
US20070033648A1 (en) Method for Executing Commands to Control a Portable Storage Device
JP4838735B2 (en) Removable memory unit
JP2007095022A5 (en)
AU2021101257A4 (en) Usb: auto data store your gmail and link share your mobile no.) using ai- based programming

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application
NENP Non-entry into the national phase in:

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 06849094

Country of ref document: EP

Kind code of ref document: A2