JP2010078979A - Voice recording device, recorded voice retrieval method, and program - Google Patents

Voice recording device, recorded voice retrieval method, and program

Info

Publication number
JP2010078979A
Authority
JP
Japan
Prior art keywords
voice
character information
input
converted
recording
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
JP2008247882A
Other languages
Japanese (ja)
Inventor
Shuji Koretsune
修二 是恒
Original Assignee
Nec Infrontia Corp
Necインフロンティア株式会社
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Nec Infrontia Corp (Necインフロンティア株式会社)
Priority to JP2008247882A
Publication of JP2010078979A
Application status is Pending

Abstract

PROBLEM TO BE SOLVED: To quickly retrieve a desired voice from recorded voices.

SOLUTION: A voice recognition unit 50 converts a voice input through a microphone 11 and a voice recording unit 10 into character information. The voice input through the microphone 11 and the voice recording unit 10 is stored as voice recording data 61, the character information converted by the voice recognition unit 50 is stored as voice recognition data 62, and the character information and the voice converted into that character information are associated with each other and managed as voice management data 63. When a retrieval key is designated, the voice associated with the character information is retrieved from the voice recording data 61 using the retrieval key, and is reproduced and output through a voice reproduction unit 20 and a speaker 21.

COPYRIGHT: (C)2010, JPO&INPIT

Description

  The present invention relates to a voice recording apparatus for recording voice, and more particularly to a technique for retrieving recorded voice.

  In recent years, with the rapid spread of computers and mobile phones, voice mail, in which messages are sent and received as voice, has come into use (see, for example, Patent Document 1). In systems that exchange voice, such as voice mail, there is a demand to listen only to the necessary portions of the recorded call contents.

  Therefore, a technique has been considered for detecting, from the recorded call voice, only the recordings addressed to a designated call (see, for example, Patent Document 2).
Patent Document 1: JP 2001-306462 A
Patent Document 2: JP 2008-72614 A

  However, when only the recordings addressed to a designated call are detected from the recorded call voice as described above, the voice addressed to that call can be found, but if the desired content lies in the middle of that voice, it has to be located with a function such as fast-forwarding. As a result, when the recording time is long, the desired call content cannot be found quickly.

  The present invention has been made in view of the problems of the above-described technology, and its object is to provide a voice recording device, a recorded voice search method, and a program capable of quickly searching for a desired voice from recorded voices.

In order to achieve the above object, the present invention provides a voice recording device comprising voice input means for inputting voice, storage means for recording the voice input through the voice input means, and voice output means for outputting the voice recorded in the storage means, the device further comprising:
voice recognition means for converting the voice input through the voice input means into character information, wherein
the storage means stores the character information converted by the voice recognition means and manages the character information and the voice converted into the character information in association with each other, and
when a search key is designated, the voice associated with the character information is retrieved from the storage means using the search key and output from the voice output means.

The present invention also provides a recorded voice search method for recording voice and searching the recorded voice, the method comprising:
a process of converting input voice into character information;
a process of storing the input voice and the converted character information, and managing the character information and the voice converted into the character information in association with each other; and
a process of searching, when a search key is designated, for the voice associated with the character information using the search key.

The present invention further provides a program for causing a computer to execute:
a procedure of converting input voice into character information;
a procedure of storing the input voice and the converted character information, and managing the character information and the voice converted into the character information in association with each other; and
a procedure of searching, when a search key is designated, for the voice associated with the character information using the search key.

  Since the present invention is configured as described above, a desired voice can be quickly searched from recorded voices.

  Embodiments of the present invention will be described below with reference to the drawings.

  FIG. 1 is a block diagram showing an embodiment of a voice recording apparatus of the present invention.

  As shown in FIG. 1, the present embodiment comprises: a voice recording unit 10 and a microphone 11, which are voice input means for inputting voice; a storage memory unit 60, which is storage means for recording the voice input via the microphone 11 and the voice recording unit 10; a voice reproduction unit 20 and a speaker 21, which are voice output means for outputting the voice recorded in the storage memory unit 60; a display unit 30; a key input unit 40; a voice recognition unit 50 that converts the voice input via the microphone 11 and the voice recording unit 10 into character information; and a control unit 70. The storage memory unit 60 holds voice recording data 61, voice recognition data 62, and voice management data 63.

  Hereinafter, a recorded voice search method using the voice recording apparatus configured as described above will be described.

  When voice is input to the voice recording apparatus 1 shown in FIG. 1 via the microphone 11 and the voice recording unit 10, the input voice is recorded in the storage memory unit 60 as voice recording data 61 under the control of the control unit 70.

  The input voice is also passed to the voice recognition unit 50 via the control unit 70, converted into character information such as text, and stored in the storage memory unit 60 as voice recognition data 62. The conversion by the voice recognition unit 50 may be started automatically by the control unit 70 once a predetermined time has elapsed after the voice was input via the microphone 11 and the voice recording unit 10, or it may be started by the control unit 70 after an instruction to that effect is input via the key input unit 40.
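  The two conversion triggers described above can be illustrated with a minimal sketch. Everything named here (`ControlUnit`, `StorageMemoryUnit`, `tick`, `convert`, the `delay_s` value, and the `recognize` callable standing in for the voice recognition unit 50) is a hypothetical illustration and not part of the disclosure; a real recognizer would replace the stub.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict


@dataclass
class StorageMemoryUnit:
    # Loosely mirrors voice recording data 61 and voice recognition data 62,
    # both keyed by a management number (an assumed layout).
    voice_recording_data: Dict[int, bytes] = field(default_factory=dict)
    voice_recognition_data: Dict[int, str] = field(default_factory=dict)
    recorded_at: Dict[int, float] = field(default_factory=dict)


class ControlUnit:
    def __init__(self, recognize: Callable[[bytes], str], delay_s: float = 5.0):
        self.storage = StorageMemoryUnit()
        self.recognize = recognize  # stand-in for the voice recognition unit
        self.delay_s = delay_s      # assumed "predetermined time"
        self._next_number = 1

    def record(self, voice: bytes, now: float) -> int:
        """Store the input voice and return its management number."""
        number = self._next_number
        self._next_number += 1
        self.storage.voice_recording_data[number] = voice
        self.storage.recorded_at[number] = now
        return number

    def convert(self, number: int) -> None:
        """Trigger by explicit instruction (e.g. from the key input unit)."""
        if number not in self.storage.voice_recognition_data:
            voice = self.storage.voice_recording_data[number]
            self.storage.voice_recognition_data[number] = self.recognize(voice)

    def tick(self, now: float) -> None:
        """Automatic trigger: convert every recording older than delay_s."""
        for number, recorded in self.storage.recorded_at.items():
            if now - recorded >= self.delay_s:
                self.convert(number)
```

  In this sketch, `tick` would be called periodically by whatever scheduler the device has, while `convert` models the key-input path; both end up storing the same character information.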

  FIG. 2 is a diagram showing how the voice recording data 61 and the voice recognition data 62 are recorded and stored in the storage memory unit 60 shown in FIG. 1: (a) shows the recording state of the voice recording data 61, (b) shows the storage state of the voice recognition data 62, (c) shows the state in which a pointer has been added to the voice recording data 61, and (d) shows the state in which a pointer has been added to the voice recognition data 62. FIG. 3 is a diagram showing the structure of the voice management data 63.

  The voice recording data 61 and the voice recognition data 62 recorded and stored in the storage memory unit 60 are each assigned a management number that identifies them, as shown in FIG. 2, and are recorded and stored in the storage memory unit 60 together with that number. The voice recording data 61 also records the date and time when the voice was recorded.

  As shown in FIGS. 2C and 2D, the voice recording data 61 and the voice recognition data 62 recorded and stored in the storage memory unit 60 are each given, as a pointer, address information indicating the storage area in the storage memory unit 60 where the voice is recorded as voice recording data 61.

  In the voice management data 63, as shown in FIG. 3, each item of voice recognition data 62, which consists of character information, is managed in association with the address at which the corresponding voice of the voice recording data 61, specified by the pointer, is recorded. That is, in the voice management data 63, the voice recording data 61 and the voice recognition data 62 are managed in association with each other by means of the pointer.
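  The pointer-based association can be sketched as a small data structure. This is only one possible reading of the description: the `Pointer` type, the `length` field, the byte-offset addressing, and the method names `store` and `voice_for` are assumptions introduced for illustration and do not appear in the source.

```python
from dataclasses import dataclass, field
from typing import Dict


@dataclass(frozen=True)
class Pointer:
    address: int  # start offset of the voice inside the voice area
    length: int   # assumption: kept so the voice can be sliced back out


@dataclass
class StorageMemoryUnit:
    voice_area: bytearray = field(default_factory=bytearray)
    # management number -> character information (cf. voice recognition data)
    voice_recognition_data: Dict[int, str] = field(default_factory=dict)
    # management number -> pointer into the voice area (cf. voice management data)
    voice_management_data: Dict[int, Pointer] = field(default_factory=dict)

    def store(self, voice: bytes, character_information: str) -> int:
        """Append the voice, then tie its text to the address where it landed."""
        number = len(self.voice_recognition_data) + 1
        pointer = Pointer(address=len(self.voice_area), length=len(voice))
        self.voice_area.extend(voice)
        self.voice_recognition_data[number] = character_information
        self.voice_management_data[number] = pointer
        return number

    def voice_for(self, number: int) -> bytes:
        """Follow the pointer from character information back to its voice."""
        p = self.voice_management_data[number]
        return bytes(self.voice_area[p.address:p.address + p.length])
```

  Keeping the address alongside the character information means a text match can be turned into a playback position without scanning the audio itself.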

  FIG. 4 is a diagram showing the correspondence between the voice recording data 61 and the voice recognition data 62.

  As shown in FIG. 4, the voice recording data 61 and the voice recognition data 62 shown in FIG. 1 are associated with each other by providing the pointer as described above.

  Thereafter, to reproduce a desired voice from among the voices recorded as voice recording data 61 in the storage memory unit 60, a keyword for the desired voice is entered as a search key via the key input unit 40. The control unit 70 first searches the character information stored as voice recognition data 62 in the storage memory unit 60 for character information corresponding to the input keyword. The keyword entered via the key input unit 40 is also displayed on the display unit 30. The keyword serving as the search key may instead be input by voice via the microphone 11 rather than via the key input unit 40; in that case, the voice recognition unit 50 converts the spoken keyword into character information.
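  The choice between a typed and a spoken search key can be sketched as follows. The function name `resolve_search_key`, its parameters, and the preference for typed input when both are present are assumptions made for illustration; `recognize` is a hypothetical stand-in for the voice recognition unit 50.

```python
from typing import Callable, Optional


def resolve_search_key(
    typed: Optional[str],
    spoken: Optional[bytes],
    recognize: Callable[[bytes], str],
) -> str:
    """Return the search key as character information.

    A typed keyword is used as-is; a spoken keyword is first converted
    to text by the recognizer, so both paths feed the same text search.
    """
    if typed is not None and typed.strip():
        return typed.strip()
    if spoken:
        return recognize(spoken).strip()
    raise ValueError("no search key was supplied")
```

  Because both paths return plain text, the subsequent search over the stored character information does not need to know how the key was entered.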

  Next, the control unit 70 retrieves from the voice recording data 61 the voice recorded at the address specified by the pointer attached to the retrieved character information, that is, the voice associated with the retrieved character information by the voice management data 63.

  FIG. 5 is a diagram for explaining the detailed operation when recorded voice is searched in the voice recording apparatus 1.

  As shown in FIG. 4, the voice recording data 61 and the voice recognition data 62 are associated with each other by the pointer as described above. Therefore, when character information matching the keyword is found in the voice recognition data 62, the voice recorded at the address specified by the pointer assigned to that character information is retrieved from the voice recording data 61.

  Then, the audio retrieved from the audio recording data 61 is reproduced and output via the audio reproducing unit 20 and the speaker 21.
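  The search sequence described above (match the keyword against the character information, follow the pointer, fetch the voice for playback) can be sketched as below. The `Entry` layout, the `length` field, and the use of plain substring matching are assumptions for illustration; the source does not specify how the match is performed.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass(frozen=True)
class Entry:
    character_information: str  # recognized text for one recorded voice
    address: int                # pointer: where that voice starts
    length: int                 # assumption: size of the recorded voice


def search_voice(
    keyword: str,
    management: Dict[int, Entry],
    voice_area: bytes,
) -> List[bytes]:
    """Return the recorded voices whose character information contains keyword."""
    hits: List[bytes] = []
    for number in sorted(management):  # visit in management-number order
        entry = management[number]
        if keyword in entry.character_information:
            # Follow the pointer into the voice area.
            hits.append(voice_area[entry.address:entry.address + entry.length])
    return hits


if __name__ == "__main__":
    area = b"AAAABBBCC"
    table = {
        1: Entry("please send the invoice", 0, 4),
        2: Entry("lunch on friday", 4, 3),
        3: Entry("invoice received", 7, 2),
    }
    print(search_voice("invoice", table, area))
```

  The returned byte strings are what would be handed to the reproduction side; the audio itself is never scanned, only the text.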

  A usage form of the voice recording apparatus described above is explained below.

  FIG. 6 is a diagram showing an example of a voice mail apparatus incorporating the voice recording apparatus 1.

  As shown in FIG. 6, the voice mail apparatus in this example includes an extension telephone 102 and a main apparatus unit 101.

  The extension telephone 102 includes an interface unit 121, a voice processing unit 122, a handset 123, a speaker/microphone 124, dial buttons 125, function buttons 126, a control unit 127, a storage unit 128, and a display unit 129.

  The main unit 101 includes an external line package 111, a call control circuit 112, an extension package 113, a unit interface 114, a call control unit 115, a voice mail unit 116, and a storage unit 117. The voice recording apparatus 1 shown in FIG. 1 is incorporated in the voice mail unit 116.

  When the voice recording apparatus 1 shown in FIG. 1 is incorporated in the voice mail apparatus configured as described above, the voice recording apparatus 1 is provided, in place of the microphone 11 shown in FIG. 1, with voice input means for inputting voice transmitted via the external line package 111 or the extension package 113.

  In the voice mail apparatus configured as described above, the recording and searching described above can be performed on voice mail transmitted via an external line or an extension. A search across a plurality of call recordings is also possible.
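  A search across several call recordings could look like the following sketch. The `Segment` type, the string call identifiers, and the idea of returning (call, address) pairs as playback positions are assumptions for illustration only; the source states merely that such a search is possible.

```python
from dataclasses import dataclass
from typing import Dict, List, Tuple


@dataclass(frozen=True)
class Segment:
    character_information: str  # recognized text of one stretch of a call
    address: int                # pointer to where that stretch is recorded


def search_calls(
    keyword: str,
    calls: Dict[str, List[Segment]],
) -> List[Tuple[str, int]]:
    """Return (call id, address) for every segment whose text contains keyword."""
    hits: List[Tuple[str, int]] = []
    for call_id, segments in calls.items():
        for segment in segments:
            if keyword in segment.character_information:
                hits.append((call_id, segment.address))
    return hits


if __name__ == "__main__":
    calls = {
        "call-1": [Segment("hello this is sales", 0),
                   Segment("the delivery date is monday", 4000)],
        "call-2": [Segment("about the delivery date", 9000)],
    }
    print(search_calls("delivery date", calls))
```

  Each hit identifies both the call and the position inside it, so playback can start in the middle of a long recording instead of relying on fast-forwarding.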

  In the present invention, the processing in the voice recording apparatus 1 may be realized by the dedicated hardware described above, or alternatively a program for realizing its functions may be recorded on a recording medium readable by the voice recording apparatus 1, and the program recorded on the recording medium may be read into the voice recording apparatus 1 and executed. The recording medium readable by the voice recording apparatus 1 refers to a transferable recording medium such as a floppy disk, a magneto-optical disk, a DVD, or a CD, as well as an HDD or the like built into the voice recording apparatus 1. The program recorded on this recording medium is read by, for example, a control block, and the same processing as described above is performed under the control of the control block.

FIG. 1 is a block diagram showing one embodiment of the voice recording apparatus of the present invention.
FIG. 2 is a diagram showing how the voice recording data and the voice recognition data are recorded and stored in the storage memory unit shown in FIG. 1: (a) shows the recording state of the voice recording data, (b) shows the storage state of the voice recognition data, (c) shows the state in which a pointer has been added to the voice recording data, and (d) shows the state in which a pointer has been added to the voice recognition data.
FIG. 3 is a diagram showing the structure of the voice management data.
FIG. 4 is a diagram showing the correspondence between the voice recording data and the voice recognition data shown in FIG. 1.
FIG. 5 is a diagram for explaining the detailed operation when recorded voice is searched in the voice recording apparatus shown in FIG. 1.
FIG. 6 is a diagram showing an example of a voice mail apparatus incorporating the voice recording apparatus.

Explanation of symbols

1 Voice recording device
10 Voice recording unit
11 Microphone
20 Voice reproduction unit
21 Speaker
30, 129 Display unit
40 Key input unit
50 Voice recognition unit
60 Storage memory unit
61 Voice recording data
62 Voice recognition data
63 Voice management data
70, 127 Control unit
101 Main unit
102 Extension telephone
111 External line package
112 Call control circuit
113 Extension package
114 Unit interface
115 Call control unit
116 Voice mail unit
117, 128 Storage unit
121 Interface unit
122 Voice processing unit
123 Handset
124 Speaker/microphone
125 Dial buttons
126 Function buttons

Claims (9)

  1. A voice recording device comprising voice input means for inputting voice, storage means for recording the voice input through the voice input means, and voice output means for outputting the voice recorded in the storage means, the voice recording device further comprising:
    voice recognition means for converting the voice input through the voice input means into character information, wherein
    the storage means stores the character information converted by the voice recognition means and manages the character information and the voice converted into the character information in association with each other, and
    when a search key is designated, the voice associated with the character information is retrieved from the storage means using the search key and output from the voice output means.
  2. The voice recording device according to claim 1, wherein
    the storage means assigns to the character information address information indicating where the voice converted into the character information is recorded, and uses the address information to manage the character information converted by the voice recognition means and the voice converted into the character information in association with each other.
  3. The voice recording device according to claim 1 or 2, wherein
    the search key is input to the voice input means by voice, and
    the voice recognition means converts the search key input by voice through the voice input means into character information.
  4. A recorded voice search method for recording voice and searching the recorded voice, the method comprising:
    a process of converting input voice into character information;
    a process of storing the input voice and the converted character information, and managing the character information and the voice converted into the character information in association with each other; and
    a process of searching, when a search key is designated, for the voice associated with the character information using the search key.
  5. The recorded voice search method according to claim 4, wherein
    address information indicating where the voice converted into the character information is recorded is assigned to the character information, and the character information and the voice converted into the character information are managed in association with each other using the address information.
  6. The recorded voice search method according to claim 4 or 5, wherein
    the search key is input by voice, the method further comprising:
    a process of converting the search key input by voice into character information; and
    a process of searching for the voice associated with the converted character information.
  7. A program for causing a computer to execute:
    a procedure of converting input voice into character information;
    a procedure of storing the input voice and the converted character information, and managing the character information and the voice converted into the character information in association with each other; and
    a procedure of searching, when a search key is designated, for the voice associated with the character information using the search key.
  8. The program according to claim 7, further causing the computer to execute:
    a procedure of assigning to the character information address information indicating where the voice converted into the character information is recorded, and managing the character information and the voice converted into the character information in association with each other using the address information.
  9. The program according to claim 7 or 8, further causing the computer to execute:
    a procedure of converting, when the search key is input by voice, the search key input by voice into character information; and
    a procedure of searching for the voice associated with the converted character information.
JP2008247882A 2008-09-26 2008-09-26 Voice recording device, recorded voice retrieval method, and program Pending JP2010078979A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP2008247882A JP2010078979A (en) 2008-09-26 2008-09-26 Voice recording device, recorded voice retrieval method, and program

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP2008247882A JP2010078979A (en) 2008-09-26 2008-09-26 Voice recording device, recorded voice retrieval method, and program

Publications (1)

Publication Number Publication Date
JP2010078979A true JP2010078979A (en) 2010-04-08

Family

ID=42209494

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2008247882A Pending JP2010078979A (en) 2008-09-26 2008-09-26 Voice recording device, recorded voice retrieval method, and program

Country Status (1)

Country Link
JP (1) JP2010078979A (en)

Cited By (88)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2013008357A (en) * 2011-06-03 2013-01-10 Apple Inc Automatic creation of mapping between text data and audio data
KR101462788B1 (en) * 2013-06-18 2014-11-21 정지성 recorder with voice search function
US9262612B2 (en) 2011-03-21 2016-02-16 Apple Inc. Device access using voice authentication
US9318108B2 (en) 2010-01-18 2016-04-19 Apple Inc. Intelligent automated assistant
US9330720B2 (en) 2008-01-03 2016-05-03 Apple Inc. Methods and apparatus for altering audio output signals
US9338493B2 (en) 2014-06-30 2016-05-10 Apple Inc. Intelligent automated assistant for TV user interactions
US9483461B2 (en) 2012-03-06 2016-11-01 Apple Inc. Handling speech synthesis of content for multiple languages
US9495129B2 (en) 2012-06-29 2016-11-15 Apple Inc. Device, method, and user interface for voice-activated navigation and browsing of a document
US9535906B2 (en) 2008-07-31 2017-01-03 Apple Inc. Mobile device having human language translation capability with positional feedback
US9565301B2 (en) 2014-02-11 2017-02-07 Samsung Electronics Co., Ltd. Apparatus and method for providing call log
US9582608B2 (en) 2013-06-07 2017-02-28 Apple Inc. Unified ranking with entropy-weighted information for phrase-based semantic auto-completion
US9620104B2 (en) 2013-06-07 2017-04-11 Apple Inc. System and method for user-specified pronunciation of words for speech synthesis and recognition
US9626955B2 (en) 2008-04-05 2017-04-18 Apple Inc. Intelligent text-to-speech conversion
US9633674B2 (en) 2013-06-07 2017-04-25 Apple Inc. System and method for detecting errors in interactions with a voice-based digital assistant
US9633660B2 (en) 2010-02-25 2017-04-25 Apple Inc. User profiling for voice input processing
US9646609B2 (en) 2014-09-30 2017-05-09 Apple Inc. Caching apparatus for serving phonetic pronunciations
US9646614B2 (en) 2000-03-16 2017-05-09 Apple Inc. Fast, language-independent method for user authentication by voice
US9668121B2 (en) 2014-09-30 2017-05-30 Apple Inc. Social reminders
US9697820B2 (en) 2015-09-24 2017-07-04 Apple Inc. Unit-selection text-to-speech synthesis using concatenation-sensitive neural networks
US9715875B2 (en) 2014-05-30 2017-07-25 Apple Inc. Reducing the need for manual start/end-pointing and trigger phrases
US9721566B2 (en) 2015-03-08 2017-08-01 Apple Inc. Competing devices responding to voice triggers
US9760559B2 (en) 2014-05-30 2017-09-12 Apple Inc. Predictive text input
US9785630B2 (en) 2014-05-30 2017-10-10 Apple Inc. Text prediction using combined word N-gram and unigram language models
US9798393B2 (en) 2011-08-29 2017-10-24 Apple Inc. Text correction processing
US9818400B2 (en) 2014-09-11 2017-11-14 Apple Inc. Method and apparatus for discovering trending terms in speech requests
US9842101B2 (en) 2014-05-30 2017-12-12 Apple Inc. Predictive conversion of language input
US9842105B2 (en) 2015-04-16 2017-12-12 Apple Inc. Parsimonious continuous-space phrase representations for natural language processing
US9858925B2 (en) 2009-06-05 2018-01-02 Apple Inc. Using context information to facilitate processing of commands in a virtual assistant
US9865280B2 (en) 2015-03-06 2018-01-09 Apple Inc. Structured dictation using intelligent automated assistants
US9886432B2 (en) 2014-09-30 2018-02-06 Apple Inc. Parsimonious handling of word inflection via categorical stem + suffix N-gram language models
US9886953B2 (en) 2015-03-08 2018-02-06 Apple Inc. Virtual assistant activation
US9899019B2 (en) 2015-03-18 2018-02-20 Apple Inc. Systems and methods for structured stem and suffix language models
US9934775B2 (en) 2016-05-26 2018-04-03 Apple Inc. Unit-selection text-to-speech synthesis based on predicted concatenation parameters
US9953088B2 (en) 2012-05-14 2018-04-24 Apple Inc. Crowd sourcing information to fulfill user requests
US9966068B2 (en) 2013-06-08 2018-05-08 Apple Inc. Interpreting and acting upon commands that involve sharing information with remote devices
US9966065B2 (en) 2014-05-30 2018-05-08 Apple Inc. Multi-command single utterance input method
US9971774B2 (en) 2012-09-19 2018-05-15 Apple Inc. Voice-based media searching
US9972304B2 (en) 2016-06-03 2018-05-15 Apple Inc. Privacy preserving distributed evaluation framework for embedded personalized systems
US10043516B2 (en) 2016-09-23 2018-08-07 Apple Inc. Intelligent automated assistant
US10049668B2 (en) 2015-12-02 2018-08-14 Apple Inc. Applying neural network language models to weighted finite state transducers for automatic speech recognition
US10049663B2 (en) 2016-06-08 2018-08-14 Apple, Inc. Intelligent automated assistant for media exploration
US10057736B2 (en) 2011-06-03 2018-08-21 Apple Inc. Active transport based notifications
US10067938B2 (en) 2016-06-10 2018-09-04 Apple Inc. Multilingual word prediction
US10074360B2 (en) 2014-09-30 2018-09-11 Apple Inc. Providing an indication of the suitability of speech recognition
US10078631B2 (en) 2014-05-30 2018-09-18 Apple Inc. Entropy-guided text prediction using combined word and character n-gram language models
US10079014B2 (en) 2012-06-08 2018-09-18 Apple Inc. Name recognition system
US10083690B2 (en) 2014-05-30 2018-09-25 Apple Inc. Better resolution when referencing to concepts
US10083688B2 (en) 2015-05-27 2018-09-25 Apple Inc. Device voice control for selecting a displayed affordance
US10089072B2 (en) 2016-06-11 2018-10-02 Apple Inc. Intelligent device arbitration and control
US10101822B2 (en) 2015-06-05 2018-10-16 Apple Inc. Language input correction
US10127220B2 (en) 2015-06-04 2018-11-13 Apple Inc. Language identification from short strings
US10127911B2 (en) 2014-09-30 2018-11-13 Apple Inc. Speaker identification and unsupervised speaker adaptation techniques
US10169329B2 (en) 2014-05-30 2019-01-01 Apple Inc. Exemplar-based natural language processing
US10176167B2 (en) 2013-06-09 2019-01-08 Apple Inc. System and method for inferring user intent from speech inputs
US10186254B2 (en) 2015-06-07 2019-01-22 Apple Inc. Context-based endpoint detection
US10185542B2 (en) 2013-06-09 2019-01-22 Apple Inc. Device, method, and graphical user interface for enabling conversation persistence across two or more instances of a digital assistant
US10192552B2 (en) 2016-06-10 2019-01-29 Apple Inc. Digital assistant providing whispered speech
US10223066B2 (en) 2015-12-23 2019-03-05 Apple Inc. Proactive assistance based on dialog communication between devices
US10241752B2 (en) 2011-09-30 2019-03-26 Apple Inc. Interface for a virtual digital assistant
US10241644B2 (en) 2011-06-03 2019-03-26 Apple Inc. Actionable reminder entries
US10249300B2 (en) 2016-06-06 2019-04-02 Apple Inc. Intelligent list reading
US10255907B2 (en) 2015-06-07 2019-04-09 Apple Inc. Automatic accent detection using acoustic models
US10269345B2 (en) 2016-06-11 2019-04-23 Apple Inc. Intelligent task discovery
US10276170B2 (en) 2010-01-18 2019-04-30 Apple Inc. Intelligent automated assistant
US10283110B2 (en) 2009-07-02 2019-05-07 Apple Inc. Methods and apparatuses for automatic speech recognition
US10297253B2 (en) 2016-06-11 2019-05-21 Apple Inc. Application integration with a digital assistant
US10303715B2 (en) 2017-05-16 2019-05-28 Apple Inc. Intelligent automated assistant for media exploration
US10311144B2 (en) 2017-05-16 2019-06-04 Apple Inc. Emoji word sense disambiguation
US10318871B2 (en) 2005-09-08 2019-06-11 Apple Inc. Method and apparatus for building an intelligent automated assistant
US10332518B2 (en) 2017-05-09 2019-06-25 Apple Inc. User interface for correcting recognition errors
US10354011B2 (en) 2016-06-09 2019-07-16 Apple Inc. Intelligent automated assistant in a home environment
US10356243B2 (en) 2015-06-05 2019-07-16 Apple Inc. Virtual assistant aided communication with 3rd party service in a communication session
US10366158B2 (en) 2015-09-29 2019-07-30 Apple Inc. Efficient word encoding for recurrent neural network language models
US10395654B2 (en) 2017-05-11 2019-08-27 Apple Inc. Text normalization based on a data-driven learning network
US10403283B1 (en) 2018-06-01 2019-09-03 Apple Inc. Voice interaction at a primary device to access call functionality of a companion device
US10403278B2 (en) 2017-05-16 2019-09-03 Apple Inc. Methods and systems for phonetic matching in digital assistant services
US10410637B2 (en) 2017-05-12 2019-09-10 Apple Inc. User-specific acoustic models
US10417266B2 (en) 2017-05-09 2019-09-17 Apple Inc. Context-aware ranking of intelligent response suggestions
US10445429B2 (en) 2017-09-21 2019-10-15 Apple Inc. Natural language understanding using vocabularies with compressed serialized tries
US10446141B2 (en) 2014-08-28 2019-10-15 Apple Inc. Automatic speech recognition based on user feedback
US10446143B2 (en) 2016-03-14 2019-10-15 Apple Inc. Identification of voice inputs providing credentials
US10474753B2 (en) 2016-09-07 2019-11-12 Apple Inc. Language identification using recurrent neural networks
US10482874B2 (en) 2017-05-15 2019-11-19 Apple Inc. Hierarchical belief states for digital assistants
US10490187B2 (en) 2016-06-10 2019-11-26 Apple Inc. Digital assistant providing automated status report
US10496705B1 (en) 2018-06-03 2019-12-03 Apple Inc. Accelerated task performance
US10496753B2 (en) 2010-01-18 2019-12-03 Apple Inc. Automatically adapting user interfaces for hands-free interaction
US10509862B2 (en) 2016-06-10 2019-12-17 Apple Inc. Dynamic phrase expansion of language input
US10521466B2 (en) 2016-06-11 2019-12-31 Apple Inc. Data driven natural language event detection and classification

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2002366552A (en) * 2001-04-10 2002-12-20 Internatl Business Mach Corp <Ibm> Method and system for searching recorded speech and retrieving relevant segment

Cited By (109)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9646614B2 (en) 2000-03-16 2017-05-09 Apple Inc. Fast, language-independent method for user authentication by voice
US10318871B2 (en) 2005-09-08 2019-06-11 Apple Inc. Method and apparatus for building an intelligent automated assistant
US10381016B2 (en) 2008-01-03 2019-08-13 Apple Inc. Methods and apparatus for altering audio output signals
US9330720B2 (en) 2008-01-03 2016-05-03 Apple Inc. Methods and apparatus for altering audio output signals
US9865248B2 (en) 2008-04-05 2018-01-09 Apple Inc. Intelligent text-to-speech conversion
US9626955B2 (en) 2008-04-05 2017-04-18 Apple Inc. Intelligent text-to-speech conversion
US10108612B2 (en) 2008-07-31 2018-10-23 Apple Inc. Mobile device having human language translation capability with positional feedback
US9535906B2 (en) 2008-07-31 2017-01-03 Apple Inc. Mobile device having human language translation capability with positional feedback
US9858925B2 (en) 2009-06-05 2018-01-02 Apple Inc. Using context information to facilitate processing of commands in a virtual assistant
US10475446B2 (en) 2009-06-05 2019-11-12 Apple Inc. Using context information to facilitate processing of commands in a virtual assistant
US10283110B2 (en) 2009-07-02 2019-05-07 Apple Inc. Methods and apparatuses for automatic speech recognition
US9548050B2 (en) 2010-01-18 2017-01-17 Apple Inc. Intelligent automated assistant
US10496753B2 (en) 2010-01-18 2019-12-03 Apple Inc. Automatically adapting user interfaces for hands-free interaction
US9318108B2 (en) 2010-01-18 2016-04-19 Apple Inc. Intelligent automated assistant
US10276170B2 (en) 2010-01-18 2019-04-30 Apple Inc. Intelligent automated assistant
US9633660B2 (en) 2010-02-25 2017-04-25 Apple Inc. User profiling for voice input processing
US10049675B2 (en) 2010-02-25 2018-08-14 Apple Inc. User profiling for voice input processing
US10417405B2 (en) 2011-03-21 2019-09-17 Apple Inc. Device access using voice authentication
US9262612B2 (en) 2011-03-21 2016-02-16 Apple Inc. Device access using voice authentication
US10102359B2 (en) 2011-03-21 2018-10-16 Apple Inc. Device access using voice authentication
JP2013008357A (en) * 2011-06-03 2013-01-10 Apple Inc Automatic creation of mapping between text data and audio data
US10057736B2 (en) 2011-06-03 2018-08-21 Apple Inc. Active transport based notifications
US10241644B2 (en) 2011-06-03 2019-03-26 Apple Inc. Actionable reminder entries
US9798393B2 (en) 2011-08-29 2017-10-24 Apple Inc. Text correction processing
US10241752B2 (en) 2011-09-30 2019-03-26 Apple Inc. Interface for a virtual digital assistant
US9483461B2 (en) 2012-03-06 2016-11-01 Apple Inc. Handling speech synthesis of content for multiple languages
US9953088B2 (en) 2012-05-14 2018-04-24 Apple Inc. Crowd sourcing information to fulfill user requests
US10079014B2 (en) 2012-06-08 2018-09-18 Apple Inc. Name recognition system
US9495129B2 (en) 2012-06-29 2016-11-15 Apple Inc. Device, method, and user interface for voice-activated navigation and browsing of a document
US9971774B2 (en) 2012-09-19 2018-05-15 Apple Inc. Voice-based media searching
US9582608B2 (en) 2013-06-07 2017-02-28 Apple Inc. Unified ranking with entropy-weighted information for phrase-based semantic auto-completion
US9966060B2 (en) 2013-06-07 2018-05-08 Apple Inc. System and method for user-specified pronunciation of words for speech synthesis and recognition
US9620104B2 (en) 2013-06-07 2017-04-11 Apple Inc. System and method for user-specified pronunciation of words for speech synthesis and recognition
US9633674B2 (en) 2013-06-07 2017-04-25 Apple Inc. System and method for detecting errors in interactions with a voice-based digital assistant
US9966068B2 (en) 2013-06-08 2018-05-08 Apple Inc. Interpreting and acting upon commands that involve sharing information with remote devices
US10176167B2 (en) 2013-06-09 2019-01-08 Apple Inc. System and method for inferring user intent from speech inputs
US10185542B2 (en) 2013-06-09 2019-01-22 Apple Inc. Device, method, and graphical user interface for enabling conversation persistence across two or more instances of a digital assistant
KR101462788B1 (en) * 2013-06-18 2014-11-21 정지성 Recorder with voice search function
US9565301B2 (en) 2014-02-11 2017-02-07 Samsung Electronics Co., Ltd. Apparatus and method for providing call log
US9842101B2 (en) 2014-05-30 2017-12-12 Apple Inc. Predictive conversion of language input
US9966065B2 (en) 2014-05-30 2018-05-08 Apple Inc. Multi-command single utterance input method
US10497365B2 (en) 2014-05-30 2019-12-03 Apple Inc. Multi-command single utterance input method
US9785630B2 (en) 2014-05-30 2017-10-10 Apple Inc. Text prediction using combined word N-gram and unigram language models
US10417344B2 (en) 2014-05-30 2019-09-17 Apple Inc. Exemplar-based natural language processing
US10083690B2 (en) 2014-05-30 2018-09-25 Apple Inc. Better resolution when referencing to concepts
US9715875B2 (en) 2014-05-30 2017-07-25 Apple Inc. Reducing the need for manual start/end-pointing and trigger phrases
US9760559B2 (en) 2014-05-30 2017-09-12 Apple Inc. Predictive text input
US10169329B2 (en) 2014-05-30 2019-01-01 Apple Inc. Exemplar-based natural language processing
US10078631B2 (en) 2014-05-30 2018-09-18 Apple Inc. Entropy-guided text prediction using combined word and character n-gram language models
US9338493B2 (en) 2014-06-30 2016-05-10 Apple Inc. Intelligent automated assistant for TV user interactions
US9668024B2 (en) 2014-06-30 2017-05-30 Apple Inc. Intelligent automated assistant for TV user interactions
US10446141B2 (en) 2014-08-28 2019-10-15 Apple Inc. Automatic speech recognition based on user feedback
US9818400B2 (en) 2014-09-11 2017-11-14 Apple Inc. Method and apparatus for discovering trending terms in speech requests
US10431204B2 (en) 2014-09-11 2019-10-01 Apple Inc. Method and apparatus for discovering trending terms in speech requests
US10074360B2 (en) 2014-09-30 2018-09-11 Apple Inc. Providing an indication of the suitability of speech recognition
US10390213B2 (en) 2014-09-30 2019-08-20 Apple Inc. Social reminders
US9986419B2 (en) 2014-09-30 2018-05-29 Apple Inc. Social reminders
US10453443B2 (en) 2014-09-30 2019-10-22 Apple Inc. Providing an indication of the suitability of speech recognition
US10438595B2 (en) 2014-09-30 2019-10-08 Apple Inc. Speaker identification and unsupervised speaker adaptation techniques
US10127911B2 (en) 2014-09-30 2018-11-13 Apple Inc. Speaker identification and unsupervised speaker adaptation techniques
US9668121B2 (en) 2014-09-30 2017-05-30 Apple Inc. Social reminders
US9886432B2 (en) 2014-09-30 2018-02-06 Apple Inc. Parsimonious handling of word inflection via categorical stem + suffix N-gram language models
US9646609B2 (en) 2014-09-30 2017-05-09 Apple Inc. Caching apparatus for serving phonetic pronunciations
US9865280B2 (en) 2015-03-06 2018-01-09 Apple Inc. Structured dictation using intelligent automated assistants
US10311871B2 (en) 2015-03-08 2019-06-04 Apple Inc. Competing devices responding to voice triggers
US9886953B2 (en) 2015-03-08 2018-02-06 Apple Inc. Virtual assistant activation
US9721566B2 (en) 2015-03-08 2017-08-01 Apple Inc. Competing devices responding to voice triggers
US10529332B2 (en) 2015-03-08 2020-01-07 Apple Inc. Virtual assistant activation
US9899019B2 (en) 2015-03-18 2018-02-20 Apple Inc. Systems and methods for structured stem and suffix language models
US9842105B2 (en) 2015-04-16 2017-12-12 Apple Inc. Parsimonious continuous-space phrase representations for natural language processing
US10083688B2 (en) 2015-05-27 2018-09-25 Apple Inc. Device voice control for selecting a displayed affordance
US10127220B2 (en) 2015-06-04 2018-11-13 Apple Inc. Language identification from short strings
US10101822B2 (en) 2015-06-05 2018-10-16 Apple Inc. Language input correction
US10356243B2 (en) 2015-06-05 2019-07-16 Apple Inc. Virtual assistant aided communication with 3rd party service in a communication session
US10186254B2 (en) 2015-06-07 2019-01-22 Apple Inc. Context-based endpoint detection
US10255907B2 (en) 2015-06-07 2019-04-09 Apple Inc. Automatic accent detection using acoustic models
US9697820B2 (en) 2015-09-24 2017-07-04 Apple Inc. Unit-selection text-to-speech synthesis using concatenation-sensitive neural networks
US10366158B2 (en) 2015-09-29 2019-07-30 Apple Inc. Efficient word encoding for recurrent neural network language models
US10049668B2 (en) 2015-12-02 2018-08-14 Apple Inc. Applying neural network language models to weighted finite state transducers for automatic speech recognition
US10354652B2 (en) 2015-12-02 2019-07-16 Apple Inc. Applying neural network language models to weighted finite state transducers for automatic speech recognition
US10223066B2 (en) 2015-12-23 2019-03-05 Apple Inc. Proactive assistance based on dialog communication between devices
US10446143B2 (en) 2016-03-14 2019-10-15 Apple Inc. Identification of voice inputs providing credentials
US9934775B2 (en) 2016-05-26 2018-04-03 Apple Inc. Unit-selection text-to-speech synthesis based on predicted concatenation parameters
US9972304B2 (en) 2016-06-03 2018-05-15 Apple Inc. Privacy preserving distributed evaluation framework for embedded personalized systems
US10249300B2 (en) 2016-06-06 2019-04-02 Apple Inc. Intelligent list reading
US10049663B2 (en) 2016-06-08 2018-08-14 Apple Inc. Intelligent automated assistant for media exploration
US10354011B2 (en) 2016-06-09 2019-07-16 Apple Inc. Intelligent automated assistant in a home environment
US10067938B2 (en) 2016-06-10 2018-09-04 Apple Inc. Multilingual word prediction
US10509862B2 (en) 2016-06-10 2019-12-17 Apple Inc. Dynamic phrase expansion of language input
US10490187B2 (en) 2016-06-10 2019-11-26 Apple Inc. Digital assistant providing automated status report
US10192552B2 (en) 2016-06-10 2019-01-29 Apple Inc. Digital assistant providing whispered speech
US10297253B2 (en) 2016-06-11 2019-05-21 Apple Inc. Application integration with a digital assistant
US10521466B2 (en) 2016-06-11 2019-12-31 Apple Inc. Data driven natural language event detection and classification
US10269345B2 (en) 2016-06-11 2019-04-23 Apple Inc. Intelligent task discovery
US10089072B2 (en) 2016-06-11 2018-10-02 Apple Inc. Intelligent device arbitration and control
US10474753B2 (en) 2016-09-07 2019-11-12 Apple Inc. Language identification using recurrent neural networks
US10043516B2 (en) 2016-09-23 2018-08-07 Apple Inc. Intelligent automated assistant
US10332518B2 (en) 2017-05-09 2019-06-25 Apple Inc. User interface for correcting recognition errors
US10417266B2 (en) 2017-05-09 2019-09-17 Apple Inc. Context-aware ranking of intelligent response suggestions
US10395654B2 (en) 2017-05-11 2019-08-27 Apple Inc. Text normalization based on a data-driven learning network
US10410637B2 (en) 2017-05-12 2019-09-10 Apple Inc. User-specific acoustic models
US10482874B2 (en) 2017-05-15 2019-11-19 Apple Inc. Hierarchical belief states for digital assistants
US10403278B2 (en) 2017-05-16 2019-09-03 Apple Inc. Methods and systems for phonetic matching in digital assistant services
US10311144B2 (en) 2017-05-16 2019-06-04 Apple Inc. Emoji word sense disambiguation
US10303715B2 (en) 2017-05-16 2019-05-28 Apple Inc. Intelligent automated assistant for media exploration
US10445429B2 (en) 2017-09-21 2019-10-15 Apple Inc. Natural language understanding using vocabularies with compressed serialized tries
US10403283B1 (en) 2018-06-01 2019-09-03 Apple Inc. Voice interaction at a primary device to access call functionality of a companion device
US10504518B1 (en) 2018-06-03 2019-12-10 Apple Inc. Accelerated task performance
US10496705B1 (en) 2018-06-03 2019-12-03 Apple Inc. Accelerated task performance

Similar Documents

Publication Title
US7779357B2 (en) Audio user interface for computing devices
CN101035092B (en) Information processor, method
US20070098145A1 (en) Hands free contact database information entry at a communication device
US20120084634A1 (en) Method and apparatus for annotating text
KR20090010960A (en) User experience for multimedia mobile note taking
JP2009163496A (en) Content reproduction system
JP2010205394A (en) Sound source-reproducing device and sound source-selecting and reproducing method
US20050107120A1 (en) Mobile storage device with wireless bluetooth module attached thereto
CN101291268A (en) Data communication system, portable electronic apparatus, server apparatus, data communication method and data communication program
CN104276100A (en) Cradle for mobile telephone, videophone system, karaoke system, car navigation system and emergency information notification system
US7818170B2 (en) Method and apparatus for distributed voice searching
US7953590B2 (en) Using separate recording channels for speech-to-speech translation systems
US9412368B2 (en) Display apparatus, interactive system, and response information providing method
KR20080096040A (en) Mobile communication device capable of storing video chatting log and operating method thereof
CN102256030A (en) Photo album showing system capable of matching background music and background matching method thereof
CN101790850A (en) Method for storing telephone number by automatically analyzing message and mobile terminal executing the method
US9824143B2 (en) Apparatus, method and program to facilitate retrieval of voice messages
EP2828736B1 (en) Information processing device, information processing method, information processing program, and terminal device
CN103926981B (en) Electronic equipment and its control method
US9049540B2 (en) Wireless attached reader screen for cell phones
JP2007527575A (en) Method and apparatus for synchronizing and identifying content
KR20110000679A (en) Method, apparatus and computer program product for providing an information model-based user interface
JP5765940B2 (en) Method and apparatus for reproducing images
US20090157830A1 (en) Apparatus for and method of generating a multimedia email
KR20140074549A (en) Method and apparatus for providing context aware service using speech recognition

Legal Events

Code Title Description
A621 Written request for application examination (Free format text: JAPANESE INTERMEDIATE CODE: A621; Effective date: 20100921)
A977 Report on retrieval (Free format text: JAPANESE INTERMEDIATE CODE: A971007; Effective date: 20111227)
A131 Notification of reasons for refusal (Free format text: JAPANESE INTERMEDIATE CODE: A131; Effective date: 20120110)
A521 Written amendment (Free format text: JAPANESE INTERMEDIATE CODE: A523; Effective date: 20120227)
A02 Decision of refusal (Free format text: JAPANESE INTERMEDIATE CODE: A02; Effective date: 20120814)