CN107818786A - A kind of call voice processing method, mobile terminal - Google Patents

A kind of call voice processing method, mobile terminal Download PDF

Info

Publication number
CN107818786A
CN107818786A CN201711015890.3A CN201711015890A CN107818786A CN 107818786 A CN107818786 A CN 107818786A CN 201711015890 A CN201711015890 A CN 201711015890A CN 107818786 A CN107818786 A CN 107818786A
Authority
CN
China
Prior art keywords
call
call voice
keyword
voice
text
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201711015890.3A
Other languages
Chinese (zh)
Inventor
王健
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Vivo Mobile Communication Co Ltd
Original Assignee
Vivo Mobile Communication Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Vivo Mobile Communication Co Ltd filed Critical Vivo Mobile Communication Co Ltd
Priority to CN201711015890.3A priority Critical patent/CN107818786A/en
Publication of CN107818786A publication Critical patent/CN107818786A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/26Speech to text systems
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/48Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
    • G10L25/51Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination
    • G10L25/54Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination for retrieval
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/48Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
    • G10L25/51Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination
    • G10L25/63Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination for estimating an emotional state

Landscapes

  • Engineering & Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • Acoustics & Sound (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Child & Adolescent Psychology (AREA)
  • General Health & Medical Sciences (AREA)
  • Hospice & Palliative Care (AREA)
  • Psychiatry (AREA)
  • Telephone Function (AREA)
  • Mobile Radio Communication Systems (AREA)

Abstract

The embodiments of the invention provide a kind of call voice processing method and mobile terminal, wherein methods described includes:Call voice is changed into text;Wherein, the call voice is that user carries out the caused voice that communicates by the mobile terminal;Identify the call tone corresponding to the call voice;It is determined that call mood keyword corresponding with the call tone, and the identity of conversation object corresponding with the call voice;The text, call mood keyword, the identity and the call voice are subjected to corresponding storage.Pass through call voice processing scheme provided in an embodiment of the present invention, when user searches certain call voice in advance, input keyword mobile terminal is only needed to find the call voice with Keywords matching, call voice lookup is carried out one by one manually without user, search and take short, efficiency high, the usage experience of user can be lifted.

Description

A kind of call voice processing method, mobile terminal
Technical field
The present embodiments relate to communication technical field, more particularly to a kind of call voice processing method, mobile terminal.
Background technology
With the continuous lifting of mobile terminal function, user not only can carry out language by taking the form of phone with other people Sound is conversed, and can also carry out voice call with other people by the social class application program installed in mobile terminal.
At present when the form by calling carries out voice call, most of mobile terminals can not enter to call voice Therefore row storage can not also provide the user the service of voice backtracking.Even with the presence of a small number of mobile terminal calling record work( Can, the call voice recorded also only be it is single be stored in the storage card of mobile terminal, call voice is not easy to recall.Tool Body, if user searches certain section of history call voice in advance, but user can only obscure the conversation content remembered substantially, now, user Audition is searched one by one in the call voice for needing to store into storage card, and time-consuming, and search efficiency is low, and influence user uses body Test.It can be seen that efficiently history call voice can not be searched in the prior art.
The content of the invention
The embodiment of the present invention provides a kind of call voice processing method, mobile terminal, to solve present in prior art The problem of efficiently can not searching history call voice.
In order to solve the above-mentioned technical problem, the present invention is realized in:A kind of call voice processing method, including:Will Call voice changes into text;Wherein, the call voice is that user carries out the caused voice that communicates by the mobile terminal; Identify the call tone corresponding to the call voice;It is determined that call mood keyword corresponding with the call tone, Yi Jiyu The identity of conversation object corresponding to the call voice;By the text, the call mood keyword, the identity mark Know and the call voice carries out corresponding storage.
In a first aspect, the embodiment of the present invention additionally provides a kind of mobile terminal, including:Conversion module, for the language that will converse Sound changes into text;Wherein, the call voice is that user carries out the caused voice that communicates by the mobile terminal;Identify mould Block, for identifying the tone of being conversed corresponding to the call voice;First determining module, for determining corresponding to the call tone Call mood keyword, and the identity of conversation object corresponding with the call voice;Memory module, for by described in Text, call mood keyword, the identity and the call voice carry out corresponding storage.
Second aspect, the embodiments of the invention provide a kind of mobile terminal, including processor, memory and it is stored in described On memory and the computer program that can run on the processor, the computer program is by real during the computing device In the existing embodiment of the present invention the step of any described call voice processing method.
The third aspect, the embodiments of the invention provide a kind of computer-readable recording medium, it is characterised in that the calculating Computer program is stored on machine readable storage medium storing program for executing, the computer program is realized in the embodiment of the present invention when being executed by processor The step of any described call voice processing method.
In embodiments of the present invention, by the way that call voice is changed into herein, call mood corresponding to call voice is determined Keyword, the identity of conversation object is determined, when storing call voice by the information and call voice of above three dimension Carry out corresponding storage.This kind stores the scheme of call voice, when user searches certain call voice in advance, it is only necessary to inputs keyword shifting Dynamic terminal can find the call voice with Keywords matching, carry out call voice lookup one by one manually without user, search Short, efficiency high is taken, the usage experience of user can be lifted.
Described above is only the general introduction of technical solution of the present invention, in order to better understand the technological means of the present invention, And can be practiced according to the content of specification, and in order to allow above and other objects of the present invention, feature and advantage can Become apparent, below especially exemplified by the embodiment of the present invention.
Brief description of the drawings
By reading the detailed description of hereafter preferred embodiment, various advantages and benefit are for ordinary skill people Member will be clear understanding.Accompanying drawing is only used for showing preferred embodiment, and is not considered as limitation of the present invention.And In whole accompanying drawing, identical part is denoted by the same reference numerals.In the accompanying drawings:
Fig. 1 is a kind of step flow chart of according to embodiments of the present invention one call voice processing method;
Fig. 2 is a kind of step flow chart of according to embodiments of the present invention two call voice processing method;
Fig. 3 is a kind of step flow chart of according to embodiments of the present invention three call voice processing method;
Fig. 4 is a kind of structured flowchart of according to embodiments of the present invention four mobile terminal;
Fig. 5 is a kind of hardware block diagram of according to embodiments of the present invention five mobile terminal.
Embodiment
Below in conjunction with the accompanying drawing in the embodiment of the present invention, the technical scheme in the embodiment of the present invention is carried out clear, complete Site preparation describes, it is clear that described embodiment is part of the embodiment of the present invention, rather than whole embodiments.Based on this hair Embodiment in bright, the every other implementation that those of ordinary skill in the art are obtained under the premise of creative work is not made Example, belongs to the scope of protection of the invention.
Embodiment one
Reference picture 1, show a kind of step flow chart of call voice processing method of the embodiment of the present invention one.
The call voice processing method of the embodiment of the present invention comprises the following steps:
Step 101:Call voice is changed into text.
Wherein, call voice is that user carries out caused voice in communication process by mobile terminal.The communication process can Think that user takes the communication of phone progress, or the communication that user is carried out by social class application program.
Mobile terminal records the call voice of user, and call voice storage is carried out after end of conversation.To call voice Before being stored, the information of three dimensions corresponding to call voice need to be determined, is respectively:Text, call mood keyword and The identity of conversation object.It is determined that after the information of these three dimensions, the information of these three dimensions and call voice are carried out Corresponding storage.
In the embodiment of the present invention, illustrated exemplified by handling a call voice, during specific implementation, The call voice handling process shown in the embodiment of the present invention can be repeated, caused by during communication of mobile terminal Each bar call voice is handled.
Step 102:Identify the call tone corresponding to call voice.
The call tone can reflect mood during user's communication, therefore by identifying call tone energy corresponding to call voice Enough reversely derive mood during user's communication.
Step 103:It is determined that call mood keyword corresponding with the call tone, and call pair corresponding with call voice The identity of elephant.
Call mood can include but is not limited to:Happily, it is angry, sad, surprised, frightened and gentle etc., correspondingly lead to Words mood corresponding to keyword can be correspondingly arranged for:Happily, it is angry, sad, surprised, frightened and gentle.
The collection scene of call voice is different, and correspondingly the identity acquiring way of conversation object is also different.Call pair The identity of elephant can be contact name of the conversation object in address list, or the social networking application journey of conversation object Sequence account or the pet name etc..
It should be noted that the identity mark that step 101 is text flow path switch, step 102 to step 103 is conversation object Knowledge and mood keyword constant current journey really of conversing, during specific implementation, the two flows have no inevitable priority and performed Sequentially, the two can also be performed parallel.
Step 104:Text, call mood keyword, identity and call voice are subjected to corresponding storage.
The corresponding relation can be stored in any appropriate memory space in the terminal.Lead to when user carries out history When language sound is searched, triggering mobile terminal ejection call voice search box, search key is inputted in call voice search box, Mobile terminal carries out Keywords matching from corresponding memory space and can search for obtaining destination call voice.
Call voice processing method provided in an embodiment of the present invention, by the way that call voice is changed into herein, it is determined that call Call mood keyword, determines the identity of conversation object, ties up above three when storing call voice corresponding to voice The information of degree carries out corresponding storage with call voice.This kind stores the scheme of call voice, when user searches certain call voice in advance When, it is only necessary to call voice with Keywords matching can be found by inputting keyword mobile terminal, be entered one by one manually without user Row call voice is searched, and is searched and is taken short, efficiency high, can lift the usage experience of user.
Embodiment two
Reference picture 2, show a kind of step flow chart of call voice processing method of the embodiment of the present invention two.
In the call voice processing method of the embodiment of the present invention, including call voice storage and two stages of lookup, lead to Language voice handling method specifically includes following steps:
Step 201:Call voice is changed into text.
Wherein, call voice is that user carries out the caused voice that communicates by mobile terminal.
Mobile terminal is provided with text conversion module and the tone and Emotion identification module.Mobile terminal records the logical of user Language sound, call text conversion module that the call voice recorded is changed into text after end of conversation.
Call voice is changed into the specific transform mode of text with reference to correlation technique, to this in the embodiment of the present invention It is not specifically limited.
Step 202:Identify the call tone corresponding to call voice.
The tone and Emotion identification module utilize the tone and Emotion identification technology, identify this section of call voice pair recorded The call mood answered, and further determine that keyword corresponding to call mood.The particular technique reference of the tone and Emotion identification Correlation technique, this is not specifically limited in the embodiment of the present invention.
Call mood keyword can include:Happily, it is at least one of angry, sad, surprised, frightened and gentle.
Step 203:It is determined that call mood keyword corresponding with the call tone, and call pair corresponding with call voice The identity of elephant.
The identity of conversation object can obtain from the address list of mobile terminal, can also be from the society that user is logged in Hand over and obtained in class application program.Specific acquiring way needs the way of production accommodation according to the call voice, such as: The call voice is recorded for subscriber phone, then the identity of conversation object needs to obtain from address list;Again for example:The call Voice is that user is recorded by the voice call of social class application program and good friend, then the identity of conversation object is needed from society Hand over and obtained in the Affiliates List of class application program.
Step 204:Text, call mood keyword, identity and call voice are subjected to corresponding storage.
The corresponding relation that the information of three dimensions is carried out with call voice is established in the embodiment of the present invention.When carrying out call language When sound is searched, the information for any one dimension that user is inputted in above three dimensional information, mobile terminal can be its matching To destination call voice, call voice lookup is carried out one by one manually without user.
Step 205:User is received in call voice search box, the first keyword of input.
Call voice search box can be controlled by the corresponding interface and shown, show that call voice is searched for when the interface is called Frame.Specifically, the interface can be arranged in the administration interface of call voice, and user enters in administration interface in advance searches call language During sound, call the interface to can trigger mobile terminal and show call voice search box.
Wherein, the first keyword is text key word, call at least one of mood keyword and identity.When So, user can also input text key word, the keyword for these three dimensions of mood keyword and identity of conversing simultaneously, Or the keyword of any two dimension.
The dimension of the keyword inputted is more, then the accuracy of the call voice finally found is higher.
Step 206:Travel through each text, call mood keyword, identity and the call voice that have stored, search with The destination call voice of first Keywords matching simultaneously exports.
When traveling through text, call mood keyword, identity and call voice, each text can be traveled through one by one, is led to Talk about mood keyword, identity and call voice corresponding relation.Specifically, by the first keyword and first corresponding relation In call mood keyword, identity and text matched respectively, if with the information matches of one of dimension into Work(, then the call voice in the corresponding relation is defined as destination call voice;And carry out the matching of next corresponding relation.If Fail with the information matches of three dimensions, then with the call mood keyword in second corresponding relation, identity and Text is matched respectively, the like untill target complete call voice is matched.
It should be noted that a target call voice can be obtained by searching, a plurality of destination call can also be obtained Voice, mobile terminal are showing each bar target call voice for user's selection, and user therefrom selects required call voice point Hit broadcasting.
By call voice processing method provided in an embodiment of the present invention, user only passes through the keyword of a dimension Find required call voice.Such as:Telephonograph with someone before user searches one month in advance, but the user can not remember Conversation object and dialog context are obtained, only remember vaguelying, it is very irritated to converse at that time, therefore input is " raw in user speech search box This sign call mood keyword of gas ", you can match mood in the call voice that triggering mobile terminal is stored from history To there is the history call voice of " anger " two word inside the history call voice of " anger ", or corresponding text, then these The destination call voice found is listed, and user selects required message registration i.e. in listed destination call voice Can.
Call voice processing method provided in an embodiment of the present invention, by the way that call voice is changed into herein, it is determined that call Call mood keyword, determines the identity of conversation object, ties up above three when storing call voice corresponding to voice The information of degree carries out corresponding storage with call voice.This kind stores the scheme of call voice, when user searches certain call voice in advance When, it is only necessary to call voice with Keywords matching can be found by inputting keyword mobile terminal, be entered one by one manually without user Row call voice is searched, and is searched and is taken short, efficiency high, can lift the usage experience of user.
Embodiment three
Reference picture 3, show a kind of step flow chart of call voice processing method of the embodiment of the present invention three.
In the call voice processing method of the embodiment of the present invention, including call voice storage and two stages of lookup, lead to Language voice handling method specifically includes following steps:
Step 301:Call voice is changed into text.
Wherein, call voice is that user carries out the caused voice that communicates by mobile terminal.
Mobile terminal is provided with text conversion module and the tone and Emotion identification module.Mobile terminal records the logical of user Language sound, call text conversion module that the call voice recorded is changed into text after end of conversation.Call voice is turned The specific transform mode reference correlation technique of text is melted into, this is not specifically limited in the embodiment of the present invention.
Substantial amounts of word and short sentence, therefore the text after call voice conversion are included in text after call voice conversion Can be as a dimension of Keywords matching.
Step 302:The call tone corresponding to call voice is identified, and the call mood corresponding to the tone that determines to converse is crucial Word.
The tone and Emotion identification module utilize the tone and Emotion identification technology, identify this section of call voice pair recorded The call mood answered, and further determine that keyword corresponding to call mood.The particular technique reference of the tone and Emotion identification Correlation technique, this is not specifically limited in the embodiment of the present invention.
Call mood keyword can include:Happily, it is at least one of angry, sad, surprised, frightened and gentle.
Step 303:It is determined that call mood keyword corresponding to the call tone, and conversation object corresponding with call voice Identity.
The identity of conversation object can obtain from the address list of mobile terminal, can also be from the society that user is logged in Hand over and obtained in class application program.Specific acquiring way needs the way of production accommodation according to the call voice.
Step 304:Determine time and the place of this call.
Call place can (Global Positioning System, the whole world be fixed by the GPS that is set in mobile terminal Position system) determine.Air time can be determined by the clock application program set in mobile terminal.
Step 305:Text, call mood keyword, time, place, identity and call voice are carried out correspondingly Storage.
The corresponding relation that the information of five dimensions is carried out with call voice is established in the embodiment of the present invention.When carrying out call language When sound is searched, the information for any one or more dimensions that user is inputted in above-mentioned five dimensional informations, mobile terminal can be It matches destination call voice, and call voice lookup is carried out one by one manually without user.
The embodiment of the present invention increases compared to the call voice processing method in embodiment two when carrying out call voice storage Time, the information of the two dimensions of place are added, so when user carries out call voice lookup, by increasing capacitance it is possible to increase user searches logical The range of language sound keyword.
Step 306:User is received in call voice search box, the second keyword of input.
Wherein, the second keyword be in text key word, call mood keyword, identity, time, place at least One of.Certainly, user can also input the keyword of five dimensions, or any two dimension, three dimensions or four simultaneously The keyword of dimension.The dimension of the keyword inputted is more, then the accuracy of the call voice finally found is higher.
Step 307:Travel through each text, call mood keyword, identity, time, place and the call stored Voice, search the destination call voice with the second Keywords matching and export.
, can one by one time when traveling through each text, call mood keyword, identity, time, place and call voice Go through each corresponding relation.Specifically, by the call mood keyword in the second keyword and first corresponding relation, identity, Text, time and place are matched respectively, if the information matches success with one of dimension, by the corresponding relation Call voice be defined as destination call voice;And carry out the matching of next corresponding relation.If the information with five dimensions With failing, then distinguish with the call mood keyword in second corresponding relation, identity, text, time and place Matched, the like untill target complete call voice is matched.It should be noted that it can be obtained by searching One target call voice, can also obtain a plurality of destination call voice, and mobile terminal is showing each bar target call voice Selected for user, user therefrom selects required call voice and clicks on broadcasting.
By call voice processing method provided in an embodiment of the present invention, user only passes through the keyword of a dimension Find required call voice.Such as:User searches call voice when being conversed by certain social networking application program in advance, only remembers This call voice is obtained on transferring accounts, but user can not remember the information such as conversation object, time and place, therefore use The keyword of " transferring accounts " this sign dialog context is inputted in the phonetic search frame of family, you can triggering mobile terminal is deposited from history Store up matching inside each text and include " transferring accounts " two word text, and call voice corresponding to the text that matches of determination, then These call voices found are listed as destination call voice, and user selects institute in listed destination call voice The message registration needed.
Call voice processing method provided in an embodiment of the present invention, except with the call voice processing shown in embodiment one Possessed by method outside beneficial effect, time and place also corresponding to determination call voice, time, place are also served as into key Word be added to in the corresponding relation of call voice, by increasing the keywords such as time, place, by increasing capacitance it is possible to increase user search call The range of voice keyword, even user only remember that air time or call place can also find destination call voice, The usage experience of user can further be lifted.
Example IV
Reference picture 4, show a kind of structured flowchart of mobile terminal of the embodiment of the present invention four.
The mobile terminal of the embodiment of the present invention can include:Conversion module 401, for call voice to be changed into text; Wherein, the call voice is that user carries out the caused voice that communicates by the mobile terminal;Identification module 402, for knowing The call tone corresponding to not described call voice;First determining module 403, for determining call corresponding with the call tone Mood keyword, and the identity of conversation object corresponding with the call voice;Memory module 404, for by described in Text, call mood keyword, the identity and the call voice carry out corresponding storage.
Preferably, the mobile terminal also includes:Second determining module 405, for determine this call time and Place;The memory module 404 is specifically used for:By the text, the call mood keyword, the time, the place, The identity and the call voice carry out corresponding storage.
Preferably, the call mood keyword includes:Happily, in angry, sad, surprised, frightened and gentle extremely It is one of few.
Preferably, the mobile terminal also includes:First receiving module 406, in the corresponding storage of the memory module After the text, the call mood keyword, the identity and the call voice, user is received in call language In sound search box, the first keyword of input;Wherein, first keyword be text key word, call mood keyword with And at least one of identity;First searching modul 407, for travel through stored each text, call mood keyword, Identity and call voice, search the destination call voice with first Keywords matching and export.
Preferably, the mobile terminal also includes:Second receiving module 408, in the memory module by the text Originally after, call mood keyword, the identity and the call voice carry out corresponding storage, receive user and exist In call voice search box, the second keyword of input;Wherein, second keyword is text key word, call mood pass At least one of keyword, identity, time, place;Second searching modul 409, for traveling through each text stored, leading to Mood keyword, identity, time, place and call voice are talked about, searches and leads to the target of second Keywords matching Language sound simultaneously exports.
Mobile terminal provided in an embodiment of the present invention can realize that mobile terminal is realized in Fig. 1 to Fig. 3 embodiment of the method Each process, to avoid repeating, repeat no more here.
Mobile terminal provided in an embodiment of the present invention, by the way that call voice is changed into herein, determine that call voice is corresponding Call mood keyword, the identity of conversation object is determined, when storing call voice by the information of above three dimension Corresponding storage is carried out with call voice.This kind stores the scheme of call voice, when user searches certain call voice in advance, it is only necessary to defeated Call voice with Keywords matching can be found by entering keyword mobile terminal, and call voice is carried out one by one manually without user Search, search and take short, efficiency high, the usage experience of user can be lifted.
Embodiment five
Reference picture 5, show a kind of hardware block diagram of mobile terminal of the embodiment of the present invention five.
Fig. 5 is a kind of hardware architecture diagram for the mobile terminal for realizing each embodiment of the present invention, the mobile terminal 500 Including but not limited to:It is radio frequency unit 501, mixed-media network modules mixed-media 502, audio output unit 503, input block 504, sensor 505, aobvious Show the parts such as unit 506, user input unit 507, interface unit 508, memory 509, processor 510 and power supply 511. It will be understood by those skilled in the art that the mobile terminal structure shown in Fig. 5 does not form the restriction to mobile terminal, it is mobile whole End can be included than illustrating more or less parts, either combine some parts or different parts arrangement.In the present invention In embodiment, mobile terminal includes but is not limited to mobile phone, tablet personal computer, notebook computer, palm PC, car-mounted terminal, can worn Wear equipment and pedometer etc..
Wherein, processor 510, for call voice to be changed into text;Wherein, the call voice passes through institute for user State mobile terminal and carry out the caused voice that communicates;Identify the call tone corresponding to the call voice;It is determined that with the call language Call mood keyword corresponding to gas, and the identity of conversation object corresponding with the call voice;By the text, Call mood keyword, the identity and the call voice carry out corresponding storage.
Mobile terminal provided in an embodiment of the present invention, by the way that call voice is changed into herein, determine that call voice is corresponding Call mood keyword, the identity of conversation object is determined, when storing call voice by the information of above three dimension Corresponding storage is carried out with call voice.This kind stores the scheme of call voice, when user searches certain call voice in advance, it is only necessary to defeated Call voice with Keywords matching can be found by entering keyword mobile terminal, and call voice is carried out one by one manually without user Search, search and take short, efficiency high, the usage experience of user can be lifted.
It should be understood that in the embodiment of the present invention, radio frequency unit 501 can be used for receiving and sending messages or communication process in, signal Reception and transmission, specifically, by from base station downlink data receive after, handled to processor 510;In addition, will be up Data are sent to base station.Generally, radio frequency unit 501 includes but is not limited to antenna, at least one amplifier, transceiver, coupling Device, low-noise amplifier, duplexer etc..In addition, radio frequency unit 501 can also by wireless communication system and network and other set Standby communication.
Mobile terminal has provided the user wireless broadband internet by mixed-media network modules mixed-media 502 and accessed, and such as helps user to receive Send e-mails, browse webpage and access streaming video etc..
Audio output unit 503 can be receiving by radio frequency unit 501 or mixed-media network modules mixed-media 502 or in memory 509 It is sound that the voice data of storage, which is converted into audio signal and exported,.Moreover, audio output unit 503 can also be provided and moved The audio output for the specific function correlation that dynamic terminal 500 performs is (for example, call signal receives sound, message sink sound etc. Deng).Audio output unit 503 includes loudspeaker, buzzer and receiver etc..
Input block 504 is used to receive audio or video signal.Input block 504 can include graphics processor (Graphics Processing Unit, GPU) 5041 and microphone 5042, graphics processor 5041 is in video acquisition mode Or the static images or the view data of video obtained in image capture mode by image capture apparatus (such as camera) are carried out Reason.Picture frame after processing may be displayed on display unit 506.Picture frame after the processing of graphics processor 5041 can be deposited Storage is transmitted in memory 509 (or other storage mediums) or via radio frequency unit 501 or mixed-media network modules mixed-media 502.Mike Wind 5042 can receive sound, and can be voice data by such acoustic processing.Voice data after processing can be The form output of mobile communication base station can be sent to via radio frequency unit 501 by being converted in the case of telephone calling model.
Mobile terminal 500 also includes at least one sensor 505, such as optical sensor, motion sensor and other biographies Sensor.Specifically, optical sensor includes ambient light sensor and proximity transducer, wherein, ambient light sensor can be according to environment The light and shade of light adjusts the brightness of display panel 5061, and proximity transducer can close when mobile terminal 500 is moved in one's ear Display panel 5061 and/or backlight.As one kind of motion sensor, accelerometer sensor can detect in all directions (general For three axles) size of acceleration, size and the direction of gravity are can detect that when static, available for identification mobile terminal posture (ratio Such as horizontal/vertical screen switching, dependent game, magnetometer pose calibrating), Vibration identification correlation function (such as pedometer, tap);Pass Sensor 505 can also include fingerprint sensor, pressure sensor, iris sensor, molecule sensor, gyroscope, barometer, wet Meter, thermometer, infrared ray sensor etc. are spent, will not be repeated here.
Display unit 506 is used for the information for showing the information inputted by user or being supplied to user.Display unit 506 can wrap Display panel 5061 is included, liquid crystal display (Liquid Crystal Display, LCD), Organic Light Emitting Diode can be used Forms such as (Organic Light-Emitting Diode, OLED) configures display panel 5061.
User input unit 507 can be used for the numeral or character information for receiving input, and produce the use with mobile terminal The key signals input that family is set and function control is relevant.Specifically, user input unit 507 include contact panel 5071 and Other input equipments 5072.Contact panel 5071, also referred to as touch-screen, collect touch operation of the user on or near it (for example user uses any suitable objects or annex such as finger, stylus on contact panel 5071 or in contact panel 5071 Neighbouring operation).Contact panel 5071 may include both touch detecting apparatus and touch controller.Wherein, touch detection Device detects the touch orientation of user, and detects the signal that touch operation is brought, and transmits a signal to touch controller;Touch control Device processed receives touch information from touch detecting apparatus, and is converted into contact coordinate, then gives processor 510, receiving area Manage the order that device 510 is sent and performed.It is furthermore, it is possible to more using resistance-type, condenser type, infrared ray and surface acoustic wave etc. Type realizes contact panel 5071.Except contact panel 5071, user input unit 507 can also include other input equipments 5072.Specifically, other input equipments 5072 can include but is not limited to physical keyboard, function key (such as volume control button, Switch key etc.), trace ball, mouse, action bars, will not be repeated here.
Further, contact panel 5071 can be covered on display panel 5061, when contact panel 5071 is detected at it On or near touch operation after, send processor 510 to determine the type of touch event, be followed by subsequent processing device 510 according to touch The type for touching event provides corresponding visual output on display panel 5061.Although in Figure 5, contact panel 5071 and display Panel 5061 is the part independent as two to realize the input of mobile terminal and output function, but in some embodiments In, can be integrated by contact panel 5071 and display panel 5061 and realize input and the output function of mobile terminal, it is specific this Place does not limit.
Interface unit 508 is the interface that external device (ED) is connected with mobile terminal 500.For example, external device (ED) can include Line or wireless head-band earphone port, external power source (or battery charger) port, wired or wireless FPDP, storage card end Mouth, port, audio input/output (I/O) port, video i/o port, earphone end for connecting the device with identification module Mouthful etc..Interface unit 508 can be used for receive the input (for example, data message, electric power etc.) from external device (ED) and One or more elements that the input received is transferred in mobile terminal 500 can be used in the He of mobile terminal 500 Data are transmitted between external device (ED).
Memory 509 can be used for storage software program and various data.Memory 509 can mainly include storing program area And storage data field, wherein, storing program area can storage program area, application program (such as the sound needed at least one function Sound playing function, image player function etc.) etc.;Storage data field can store according to mobile phone use created data (such as Voice data, phone directory etc.) etc..In addition, memory 509 can include high-speed random access memory, can also include non-easy The property lost memory, a for example, at least disk memory, flush memory device or other volatile solid-state parts.
Processor 510 is the control centre of mobile terminal, utilizes each of various interfaces and the whole mobile terminal of connection Individual part, by running or performing the software program and/or module that are stored in memory 509, and call and be stored in storage Data in device 509, the various functions and processing data of mobile terminal are performed, so as to carry out integral monitoring to mobile terminal.Place Reason device 510 may include one or more processing units;Preferably, processor 510 can integrate application processor and modulatedemodulate is mediated Device is managed, wherein, application processor mainly handles operating system, user interface and application program etc., and modem processor is main Handle radio communication.It is understood that above-mentioned modem processor can not also be integrated into processor 510.
Mobile terminal 500 can also include the power supply 511 (such as battery) to all parts power supply, it is preferred that power supply 511 Can be logically contiguous by power-supply management system and processor 510, so as to realize management charging by power-supply management system, put The function such as electricity and power managed.
In addition, mobile terminal 500 includes some unshowned functional modules, will not be repeated here.
Preferably, the embodiment of the present invention also provides a kind of mobile terminal, including processor 510, memory 509, is stored in On memory 509 and the computer program that can be run on the processor 510, the computer program are performed by processor 510 Each process of the above-mentioned call voice processing method embodiments of Shi Shixian, and identical technique effect can be reached, to avoid repeating, Here repeat no more.
The embodiment of the present invention also provides a kind of computer-readable recording medium, and meter is stored with computer-readable recording medium Calculation machine program, the computer program realize each process of above-mentioned call voice processing method embodiment when being executed by processor, And identical technique effect can be reached, to avoid repeating, repeat no more here.Wherein, described computer-readable recording medium, Such as read-only storage (Read-OnlyMemory, abbreviation ROM), random access memory (Random Access Memory, letter Claim RAM), magnetic disc or CD etc..
It should be noted that herein, term " comprising ", "comprising" or its any other variant are intended to non-row His property includes, so that process, method, article or device including a series of elements not only include those key elements, and And also include the other element being not expressly set out, or also include for this process, method, article or device institute inherently Key element.In the absence of more restrictions, the key element limited by sentence "including a ...", it is not excluded that including this Other identical element also be present in the process of key element, method, article or device.
Through the above description of the embodiments, those skilled in the art can be understood that above-described embodiment side Method can add the mode of required general hardware platform to realize by software, naturally it is also possible to by hardware, but in many cases The former is more preferably embodiment.Based on such understanding, technical scheme is substantially done to prior art in other words Going out the part of contribution can be embodied in the form of software product, and the computer software product is stored in a storage medium In (such as ROM/RAM, magnetic disc, CD), including some instructions to cause a station terminal (can be mobile phone, computer, service Device, air conditioner, or network equipment etc.) perform method described in each embodiment of the present invention.
Embodiments of the invention are described above in conjunction with accompanying drawing, but the invention is not limited in above-mentioned specific Embodiment, above-mentioned embodiment is only schematical, rather than restricted, one of ordinary skill in the art Under the enlightenment of the present invention, in the case of present inventive concept and scope of the claimed protection is not departed from, it can also make a lot Form, belong within the protection of the present invention.

Claims (12)

1. a kind of call voice processing method, applied to mobile terminal, it is characterised in that methods described includes:
Call voice is changed into text;Wherein, the call voice is that user carries out communication generation by the mobile terminal Voice;
Identify the call tone corresponding to the call voice;
It is determined that call mood keyword corresponding with the call tone, and conversation object corresponding with the call voice Identity;
The text, call mood keyword, the identity and the call voice are subjected to corresponding storage.
2. according to the method for claim 1, it is characterised in that methods described also includes:
Determine time and the place of this call;
It is described by the text, call mood keyword, the identity and the call voice carry out corresponding deposit The step of storage, including:
By the text, call mood keyword, the time, the place, the identity and the call Voice carries out corresponding storage.
3. according to the method for claim 1, it is characterised in that the call mood keyword includes:Happily, it is angry, sad It is wound, surprised, frightened and at least one of gentle.
4. according to the method for claim 1, it is characterised in that store the text, the call mood in described correspond to After the step of keyword, the identity and call voice, methods described also includes:
User is received in call voice search box, the first keyword of input;Wherein, first keyword is that text is crucial Word, call at least one of mood keyword and identity;
Each text, call mood keyword, identity and the call voice stored is traveled through, is searched and the described first key The destination call voice of word matching simultaneously exports.
5. according to the method for claim 2, it is characterised in that it is described by the text, the call mood keyword, After the identity and the call voice carry out corresponding the step of storing, methods described also includes:
User is received in call voice search box, the second keyword of input;Wherein, second keyword is that text is crucial At least one of word, call mood keyword, identity, time, place;
Travel through each text, call mood keyword, identity, time, place and the call voice stored, lookup and institute State the destination call voice of the second Keywords matching and export.
A kind of 6. mobile terminal, it is characterised in that including:
Conversion module, for call voice to be changed into text;Wherein, the call voice is that user passes through the mobile terminal Carry out the caused voice that communicates;
Identification module, for identifying the tone of being conversed corresponding to the call voice;
First determining module, for determine it is described call the tone corresponding to converse mood keyword, and with the call voice The identity of corresponding conversation object;
Memory module, for the text, call mood keyword, the identity and the call voice to be entered The corresponding storage of row.
7. mobile terminal according to claim 6, it is characterised in that the mobile terminal also includes:
Second determining module, for determining time and the place of this call;
The memory module is specifically used for:By the text, the call mood keyword, the time, the place, described Identity and the call voice carry out corresponding storage.
8. mobile terminal according to claim 6, it is characterised in that the call mood keyword includes:Happily, it is raw It is gas, sadness, surprised, frightened and at least one of gentle.
9. mobile terminal according to claim 6, it is characterised in that the mobile terminal also includes:
First receiving module, for correspondingly storing the text, the call mood keyword, the body in the memory module After part mark and the call voice, user is received in call voice search box, the first keyword of input;Wherein, First keyword is text key word, call at least one of mood keyword and identity;
First searching modul, for traveling through each text, call mood keyword, identity and the call voice that have stored, Search the destination call voice with first Keywords matching and export.
10. mobile terminal according to claim 7, it is characterised in that the mobile terminal also includes:
Second receiving module, in the memory module by the text, the call mood keyword, the identity And after the call voice carries out corresponding storage, user is received in call voice search box, the second keyword of input; Wherein, second keyword be in text key word, call mood keyword, identity, time, place at least it One;
Second searching modul, for travel through stored each text, call mood keyword, identity, the time, place and Call voice, search the destination call voice with second Keywords matching and export.
11. a kind of mobile terminal, it is characterised in that including processor, memory and be stored on the memory and can be in institute The computer program run on processor is stated, the computer program is realized such as claim 1 to 5 during the computing device Any one of call voice processing method the step of.
12. a kind of computer-readable recording medium, it is characterised in that computer journey is stored on the computer-readable recording medium Sequence, the call voice processing side as any one of claim 1 to 5 is realized when the computer program is executed by processor The step of method.
CN201711015890.3A 2017-10-25 2017-10-25 A kind of call voice processing method, mobile terminal Pending CN107818786A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201711015890.3A CN107818786A (en) 2017-10-25 2017-10-25 A kind of call voice processing method, mobile terminal

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201711015890.3A CN107818786A (en) 2017-10-25 2017-10-25 A kind of call voice processing method, mobile terminal

Publications (1)

Publication Number Publication Date
CN107818786A true CN107818786A (en) 2018-03-20

Family

ID=61604122

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201711015890.3A Pending CN107818786A (en) 2017-10-25 2017-10-25 A kind of call voice processing method, mobile terminal

Country Status (1)

Country Link
CN (1) CN107818786A (en)

Cited By (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109194807A (en) * 2018-10-16 2019-01-11 珠海格力电器股份有限公司 A kind of telephone number management method, device and terminal device
CN109215683A (en) * 2018-08-10 2019-01-15 维沃移动通信有限公司 A kind of reminding method and terminal
CN109407504A (en) * 2018-11-30 2019-03-01 华南理工大学 A kind of personal safety detection system and method based on smartwatch
CN110933225A (en) * 2019-11-04 2020-03-27 Oppo(重庆)智能科技有限公司 Call information acquisition method and device, storage medium and electronic equipment
CN111092996A (en) * 2019-10-31 2020-05-01 国网山东省电力公司信息通信公司 Centralized scheduling recording system and control method
CN111354377A (en) * 2019-06-27 2020-06-30 深圳市鸿合创新信息技术有限责任公司 Method and device for recognizing emotion through voice and electronic equipment
CN111460210A (en) * 2019-12-04 2020-07-28 上海明略人工智能(集团)有限公司 Target voice processing method and device
CN114221920A (en) * 2021-12-13 2022-03-22 中国平安财产保险股份有限公司 Automatic contact method, device, computer equipment and medium based on artificial intelligence
CN115643229A (en) * 2022-09-29 2023-01-24 深圳市毅光信电子有限公司 Call item processing method, device, system, electronic equipment and storage medium

Citations (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101030368A (en) * 2006-03-03 2007-09-05 国际商业机器公司 Method and system for communicating across channels simultaneously with emotion preservation
CN101237489A (en) * 2008-03-05 2008-08-06 北京邮电大学 Processing method and device based on voice communication content
CN102572372A (en) * 2011-12-28 2012-07-11 中兴通讯股份有限公司 Extraction method and device for conference summary
CN103067608A (en) * 2013-01-23 2013-04-24 广东欧珀移动通信有限公司 Method and system for mobile terminal recent call searching
CN103354575A (en) * 2013-06-14 2013-10-16 广东欧珀移动通信有限公司 Method for prompting history conversation content at time of calling or being called, and mobile terminal
CN104184870A (en) * 2014-07-29 2014-12-03 小米科技有限责任公司 Call log marking method and device and electronic equipment
CN104714981A (en) * 2013-12-17 2015-06-17 腾讯科技(深圳)有限公司 Voice message search method, device and system
US20150235654A1 (en) * 2011-06-17 2015-08-20 At&T Intellectual Property I, L.P. Speaker association with a visual representation of spoken content
CN106024013A (en) * 2016-04-29 2016-10-12 努比亚技术有限公司 Voice data searching method and system
CN106024009A (en) * 2016-04-29 2016-10-12 北京小米移动软件有限公司 Audio processing method and device

Patent Citations (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101030368A (en) * 2006-03-03 2007-09-05 国际商业机器公司 Method and system for communicating across channels simultaneously with emotion preservation
CN101237489A (en) * 2008-03-05 2008-08-06 北京邮电大学 Processing method and device based on voice communication content
US20150235654A1 (en) * 2011-06-17 2015-08-20 At&T Intellectual Property I, L.P. Speaker association with a visual representation of spoken content
CN102572372A (en) * 2011-12-28 2012-07-11 中兴通讯股份有限公司 Extraction method and device for conference summary
CN103067608A (en) * 2013-01-23 2013-04-24 广东欧珀移动通信有限公司 Method and system for mobile terminal recent call searching
CN103354575A (en) * 2013-06-14 2013-10-16 广东欧珀移动通信有限公司 Method for prompting history conversation content at time of calling or being called, and mobile terminal
CN104714981A (en) * 2013-12-17 2015-06-17 腾讯科技(深圳)有限公司 Voice message search method, device and system
CN104184870A (en) * 2014-07-29 2014-12-03 小米科技有限责任公司 Call log marking method and device and electronic equipment
CN106024013A (en) * 2016-04-29 2016-10-12 努比亚技术有限公司 Voice data searching method and system
CN106024009A (en) * 2016-04-29 2016-10-12 北京小米移动软件有限公司 Audio processing method and device

Cited By (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109215683A (en) * 2018-08-10 2019-01-15 维沃移动通信有限公司 A kind of reminding method and terminal
CN109215683B (en) * 2018-08-10 2021-09-14 维沃移动通信有限公司 Prompting method and terminal
CN109194807A (en) * 2018-10-16 2019-01-11 珠海格力电器股份有限公司 A kind of telephone number management method, device and terminal device
CN109407504B (en) * 2018-11-30 2021-05-14 华南理工大学 Personal safety detection system and method based on smart watch
CN109407504A (en) * 2018-11-30 2019-03-01 华南理工大学 A kind of personal safety detection system and method based on smartwatch
CN111354377A (en) * 2019-06-27 2020-06-30 深圳市鸿合创新信息技术有限责任公司 Method and device for recognizing emotion through voice and electronic equipment
CN111092996A (en) * 2019-10-31 2020-05-01 国网山东省电力公司信息通信公司 Centralized scheduling recording system and control method
CN110933225B (en) * 2019-11-04 2022-03-15 Oppo(重庆)智能科技有限公司 Call information acquisition method and device, storage medium and electronic equipment
CN110933225A (en) * 2019-11-04 2020-03-27 Oppo(重庆)智能科技有限公司 Call information acquisition method and device, storage medium and electronic equipment
CN111460210A (en) * 2019-12-04 2020-07-28 上海明略人工智能(集团)有限公司 Target voice processing method and device
CN111460210B (en) * 2019-12-04 2024-04-05 上海明略人工智能(集团)有限公司 Target voice processing method and device
CN114221920A (en) * 2021-12-13 2022-03-22 中国平安财产保险股份有限公司 Automatic contact method, device, computer equipment and medium based on artificial intelligence
CN114221920B (en) * 2021-12-13 2023-06-23 中国平安财产保险股份有限公司 Automatic contact method, device, computer equipment and medium based on artificial intelligence
CN115643229A (en) * 2022-09-29 2023-01-24 深圳市毅光信电子有限公司 Call item processing method, device, system, electronic equipment and storage medium

Similar Documents

Publication Publication Date Title
CN107818786A (en) A kind of call voice processing method, mobile terminal
CN103578474B (en) A kind of sound control method, device and equipment
CN104966086B (en) Live body discrimination method and device
CN108334539A (en) Object recommendation method, mobile terminal and computer readable storage medium
CN107633051A (en) Desktop searching method, mobile terminal and computer-readable recording medium
CN107798107A (en) The method and mobile device of song recommendations
CN108492836A (en) A kind of voice-based searching method, mobile terminal and storage medium
CN106973168A (en) Speech playing method, device and computer equipment
CN107967339A (en) Image processing method, device, computer-readable recording medium and computer equipment
CN108536638A (en) Setting method, mobile terminal, system and the readable storage medium storing program for executing of intelligent bookmark
CN107623794A (en) A kind of processing method of speech data, device and mobile terminal
CN107862059A (en) A kind of song recommendations method and mobile terminal
CN108549681A (en) Data processing method and device, electronic equipment, computer readable storage medium
CN107450744A (en) A kind of personal data inputting method and mobile terminal
CN107831988A (en) The operating method and mobile terminal of a kind of mobile terminal
CN107622137A (en) The method and apparatus for searching speech message
CN110222245A (en) A kind of reminding method and device
CN109063076A (en) A kind of Picture Generation Method and mobile terminal
CN108459813A (en) A kind of searching method and mobile terminal
CN107809515A (en) A kind of display control method and mobile terminal
CN107832420A (en) photo management method and mobile terminal
CN107360297A (en) A kind of contact searching method, terminal and computer-readable recording medium
CN108494949B (en) A kind of image classification method and mobile terminal
CN110136724A (en) A kind of data processing method and terminal device
CN108200266A (en) A kind of schedule creation method, device and mobile terminal

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20180320