CN108492833A - Voice messaging acquisition method, instantaneous communication system, mobile terminal and storage medium - Google Patents
Voice messaging acquisition method, instantaneous communication system, mobile terminal and storage medium Download PDFInfo
- Publication number
- CN108492833A CN108492833A CN201810277436.3A CN201810277436A CN108492833A CN 108492833 A CN108492833 A CN 108492833A CN 201810277436 A CN201810277436 A CN 201810277436A CN 108492833 A CN108492833 A CN 108492833A
- Authority
- CN
- China
- Prior art keywords
- voice input
- data
- voice
- input data
- acquisition
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000000034 method Methods 0.000 title claims abstract description 35
- 238000004891 communication Methods 0.000 title claims abstract description 25
- 238000006243 chemical reaction Methods 0.000 claims abstract description 21
- 238000001228 spectrum Methods 0.000 claims abstract description 21
- 238000009877 rendering Methods 0.000 claims description 21
- 230000003542 behavioural effect Effects 0.000 claims description 16
- 238000004590 computer program Methods 0.000 claims description 5
- 230000004069 differentiation Effects 0.000 abstract description 9
- 238000010586 diagram Methods 0.000 description 4
- 230000000694 effects Effects 0.000 description 4
- 230000005540 biological transmission Effects 0.000 description 3
- 230000008054 signal transmission Effects 0.000 description 3
- 230000011664 signaling Effects 0.000 description 3
- 230000002123 temporal effect Effects 0.000 description 3
- 238000005516 engineering process Methods 0.000 description 2
- 238000000605 extraction Methods 0.000 description 2
- 230000003213 activating effect Effects 0.000 description 1
- 238000004364 calculation method Methods 0.000 description 1
- 238000013500 data storage Methods 0.000 description 1
- 230000008447 perception Effects 0.000 description 1
- 235000015170 shellfish Nutrition 0.000 description 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/06—Transformation of speech into a non-audible representation, e.g. speech visualisation or speech processing for tactile aids
- G10L21/10—Transforming into visible information
- G10L21/12—Transforming into visible information by displaying time domain information
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/26—Speech to text systems
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/78—Detection of presence or absence of voice signals
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L51/00—User-to-user messaging in packet-switching networks, transmitted according to store-and-forward or real-time protocols, e.g. e-mail
- H04L51/07—User-to-user messaging in packet-switching networks, transmitted according to store-and-forward or real-time protocols, e.g. e-mail characterised by the inclusion of specific contents
- H04L51/10—Multimedia information
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/78—Detection of presence or absence of voice signals
- G10L2025/783—Detection of presence or absence of voice signals based on threshold decision
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Physics & Mathematics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Signal Processing (AREA)
- Acoustics & Sound (AREA)
- Computational Linguistics (AREA)
- Data Mining & Analysis (AREA)
- Quality & Reliability (AREA)
- Computer Networks & Wireless Communication (AREA)
- Telephonic Communication Services (AREA)
- Telephone Function (AREA)
Abstract
The present invention provides a kind of voice messaging acquisition method, instantaneous communication system, mobile terminal and storage medium, methods to include:Start timing when receiving voice input instruction, obtains current speech data in real time, and according to current speech data Dynamic Announce voice input picture;It when receiving Speech Record stop instruction, closes voice input picture and stops the acquisition of current speech data, to obtain voice input data, format conversion is carried out to voice input data, and transformed voice input data are stored;The preset icon image being locally stored is obtained, and obtains the sonic data in voice input data and voice input time;Sonic data and voice input time are shown on preset icon image by the way of sound spectrum image with mode that numerical value is shown respectively, the mode that the present invention is shown by using the mode and numerical value of sound spectrum image, so as to be distinctly displayed to voice input data, differentiation of the user to different voice input data is facilitated.
Description
Technical field
The present invention relates to field of communication technology, more particularly to a kind of voice messaging acquisition method, instantaneous communication system, movement
Terminal and storage medium.
Background technology
Voice collecting is the preposition stage of speech recognition, and by carrying out data under voice to user pronunciation, extraction is adopted
The phonetic feature of the voice data of collection according to the phonetic feature that is extracted carries out speech recognition, it can be achieved that user pronunciation content
The purpose of determining, identifying user identity.Current voice collecting mode is that (such as smart mobile phone is put down using terminal device is set to
The user equipmenies such as plate computer) on voice acquisition device (such as microphone) user pronunciation is acquired, obtain voice data,
Feature extraction then carried out to the voice data that is acquired, voice messaging acquisition method in the use of instantaneous communication system particularly
Frequently, thus convenience of the people to voice messaging acquisition method more stringent requirements are proposed.
In existing voice messaging acquisition method when carrying out voice input, the typing picture of display is more single, reduces
User experience, and in existing voice messaging acquisition method, when carrying out phonetic storage after the typing for completing voice data, no
The display icon all same used after same voice data storage, and then it is more tired to cause user to distinguish different voice data
It is difficult.
Invention content
Based on this, being designed to provide for the embodiment of the present invention a kind of facilitates user to distinguish different phonetic data
Voice messaging acquisition method, instantaneous communication system, mobile terminal and storage medium.
In a first aspect, the present invention provides a kind of voice messaging acquisition method, the method includes:
Start timing when receiving voice input instruction, obtains current speech data in real time, and according to the current language
Behavioral characteristics Dynamic Announce voice input picture in sound data;
When receiving Speech Record stop instruction, closes the voice input picture and stop obtaining for the current speech data
It takes, to obtain voice input data, format conversion is carried out to the voice input data, and by the transformed voice input
Data are stored;
The preset icon image being locally stored is obtained, and obtains the sonic data and Speech Record in the voice input data
The angle of incidence;
The sonic data and the voice input time are sequentially corresponded to and shown with numerical value by the way of sound spectrum image
Mode shown on the preset icon image respectively.
Above-mentioned voice messaging acquisition method shows have in such a way that the sonic data is used sound spectrum image
Effect is distinctly displayed the different voice input data, and then facilitates subsequent user to the different voices
The differentiation of logging data, the design shown by the acquisition and use numerical value of the voice input time, has further facilitated use
The family subsequently differentiation to the different voice input data, by the design of voice input picture described in Dynamic Announce, effectively
Improve user experience, it is therefore prevented that uninteresting phenomenon during user speech occurs.
Further, the behavioral characteristics Dynamic Announce voice input picture according in the current speech data:
Present timing time and the default background image being locally stored, recording icon are obtained in real time, and according to described current
Timing time, the default background image and the recording icon perform image display;
The voice decibel information in the current speech data is obtained in real time, and according to the voice decibel information to described
Icon of recording is rendered into Mobile state.
Further, described the step of being rendered into Mobile state to the recording icon according to the voice decibel information, wraps
It includes:
Judge the decibel ratings of the voice decibel information, and judges rendering region according to the decibel ratings;
The rendered color being locally stored is obtained, and color rendering is carried out to the rendering region according to the rendered color.
Further, after the step of real-time acquisition current speech data, the method further includes:
Judge the current decibels in the current speech data received in preset time whether continuously less than default
Decibels;
If so, closing the voice input picture and stopping the acquisition of the current speech data.
Further, described that format conversion is carried out to the voice input data, and by the transformed voice input
The step of data are stored include:
The voice input data are converted into amr formats, and Text region is carried out to obtain to the voice input data
It is the word that number of repetition is most in the voice input data to take Feature Words, the Feature Words;
Renaming is carried out according to the Feature Words voice input data transformed to format.
Further, described that format conversion is carried out to the voice input data, and by the transformed voice input
The step of data are stored include:
The voice input data are converted into amr formats, and obtain current time;
Renaming is carried out according to the current time voice input data transformed to format.
Second aspect, the present invention provides a kind of instantaneous communication systems, including:
Immediate communication platform, the operational order for receiving user;
Voice capture device is communicated to connect with the immediate communication platform, for being received according to the immediate communication platform
The operational order arrived, with the broadcasting of the corresponding acquisition or voice data for carrying out voice data;
The voice capture device includes:
First acquisition module, for when receive voice input instruction when start timing, obtain current speech data in real time,
And according to the behavioral characteristics Dynamic Announce voice input picture in the current speech data;
Memory module is worked as when receiving Speech Record stop instruction, closing described in the voice input picture and stopping
The acquisition of preceding voice data carries out format conversion, and will be after conversion to obtain voice input data to the voice input data
The voice input data stored;
Second acquisition module for obtaining the preset icon image being locally stored, and obtains in the voice input data
Sonic data and the voice input time;
Display module, for the sonic data and the voice input time sequentially to be corresponded to the side using sound spectrum image
The mode that formula and numerical value are shown is shown on the preset icon image respectively.
Above-mentioned instantaneous communication system carries out in such a way that the sonic data is used sound spectrum image by the display module
It has been shown that, is effectively distinctly displayed the different voice input data, and then facilitate subsequent user to different
The differentiation of the voice input data is shown by the display module to the acquisition of the voice input time and using numerical value
Design, further facilitated differentiation of the user subsequently to the different voice input data, pass through it is described first obtain mould
The design of voice input picture, effectively raises user experience described in block Dynamic Announce, it is therefore prevented that during user speech
Uninteresting phenomenon occurs.
Further, first acquisition module includes:
First acquisition unit, default background image, recording figure for obtaining the present timing time in real time He being locally stored
Mark, and performed image display according to the present timing time, the default background image and the recording icon;
Second acquisition unit, for obtaining the voice decibel information in the current speech data in real time, and according to described
Voice decibel information renders the recording icon into Mobile state.
The third aspect, the present invention provides a kind of mobile terminal, including memory and processor, the memory is used for
Computer program is stored, the processor runs the computer program so that the above-mentioned voice messaging of the mobile terminal execution
Acquisition method.
Fourth aspect, the present invention provides a kind of storage medium, the meter being stored thereon with used in above-mentioned mobile terminal
Calculation machine program.
Description of the drawings
Fig. 1 is the flow chart for the voice messaging acquisition method that first embodiment of the invention provides;
Fig. 2 is the flow chart for the voice messaging acquisition method that second embodiment of the invention provides;
Fig. 3 is the flow chart of the specific implementation step of step S21 in Fig. 2;
Fig. 4 is the structural schematic diagram for the instantaneous communication system that third embodiment of the invention provides;
Fig. 5 is the structural schematic diagram for the instantaneous communication system that fourth embodiment of the invention provides;
Essential element symbol description
Instantaneous communication system | 100 | Immediate communication platform | 101 |
Voice capture device | 102 | First acquisition module | 10 |
First acquisition unit | 11 | Second acquisition unit | 12 |
Judging unit | 13 | Rendering unit | 14 |
Memory module | 20 | First converting unit | 21 |
First name unit | 22 | Second converting unit | 23 |
Second name unit | 24 | Second acquisition module | 30 |
Display module | 31 | Judgment module | 40 |
Stopping modular | 41 |
Specific implementation mode
For the ease of more fully understanding the present invention, the present invention is carried out further below in conjunction with related embodiment attached drawing
It explains.The embodiment of the present invention is given in attached drawing, but the present invention is not limited in above-mentioned preferred embodiment.On the contrary, providing
The purpose of these embodiments be in order to make disclosure of the invention face more fully.
Referring to Fig. 1, the flow chart of the voice messaging acquisition method provided for first embodiment of the invention, including step
S10 to S50.
Step S10 starts timing when receiving voice input instruction, obtains current speech data in real time, and according to institute
State the behavioral characteristics Dynamic Announce voice input picture in current speech data;
Wherein, the voice input instruction is carried out by the way of electric signal, push button signalling, wireless signal or voice signal
Transmission, the acquisition of the current speech data by way of activate microphone to be obtained, specifically, logical in the present embodiment
Setting behavioral characteristics are crossed, the Dynamic Announce for carrying out the voice input picture in equipment are shown with control, the behavioral characteristics can
Think temporal characteristics or voice decibel feature etc..
Step S20 closes the voice input picture and stops the current speech when receiving Speech Record stop instruction
The acquisition of data, to obtain voice input data;
Wherein, the transmission mode of the Speech Record stop instruction is identical as the voice input instruction, is sent out when receiving user
When the Speech Record stop instruction gone out, control shows the display for stopping the voice input picture in equipment, and mute microphone (MIC)
To the acquisition when close voice data, at this point, being institute from the data got when voice input stops to typing are proceeded by
Predicate sound logging data.
Step S30, to the voice input data carry out format conversion, and by the transformed voice input data into
Row storage;
Wherein, since the original document of the voice input data is larger, use the mode of format conversion to reduce
The size of the voice input data file facilitates the storage subsequently to the voice input data;
It preferably, will by background thread when locally completing format conversion and the storage to the voice input data
The voice input data upload to a server, and control received server-side program and carried out while receiving recording file
Encryption, server is arrived in storage in the form of a file after the completion of encryption, and then when the local voice input loss of data
Or it when damage, to carry out the acquisition of the voice input data, and then can be improved by carrying out file download in the server
The security performance of the voice messaging acquisition method.
Step S40 obtains the preset icon image being locally stored, and obtains the sonic data in the voice input data
With the voice input time;
Wherein, the preset icon image is the pre-set picture of user, which can be for local picture or based on net
The picture that network is downloaded, the sonic data are obtained by sound spectrograph, the voice input time used timing
Device is obtained.
Step S50, the sonic data and the voice input time are sequentially corresponded to using sound spectrum image by the way of with
The mode that numerical value is shown is shown on the preset icon image respectively;
Wherein, since the corresponding sound spectrum image of the difference sonic data is different, sound spectrum is used in the present embodiment
The mode that image is shown carries out distinctly displaying for the voice input data, facilitates area of the user to the voice input data
Point;
Preferably, since uncontrollable factor leads to the not phase of voice input time described in the different voice input data
Together, therefore distinctly displaying for the voice input can be carried out by way of showing voice input time progress numerical value.
In the present embodiment, shown in such a way that the sonic data is used sound spectrum image, it effectively will be different
The voice input data distinctly displayed, and then facilitate subsequent user to the different voice input data
It distinguishes, the design shown by the acquisition and use numerical value of the voice input time has further facilitated user subsequently to not
The differentiation of the same voice input data, by the Dynamic Announce for carrying out the voice input picture based on the behavioral characteristics
Design, effectively raise user experience, it is therefore prevented that uninteresting phenomenon during user speech occurs.
Referring to Fig. 2, the flow chart of the voice messaging acquisition method provided for second embodiment of the invention, the method packet
Include step S11 to S71.
Step S11 starts timing when receiving voice input instruction, obtains current speech data in real time, obtains in real time
Present timing time and the default background image being locally stored, recording icon, and according to present timing time, described default
Background image and the recording icon perform image display;
Wherein, the voice input instruction is carried out by the way of electric signal, push button signalling, wireless signal or voice signal
Transmission, the acquisition of the current speech data by way of activating microphone to be obtained, when by the present timing
Between acquisition and display, facilitate user voice input operation.
Step S21 obtains the voice decibel information in the current speech data, and is believed according to the voice decibel in real time
Breath renders the recording icon into Mobile state;
Wherein, it by the way that behavioral characteristics are arranged in the present embodiment, is shown with control and carries out the voice input picture in equipment
Dynamic Announce, the behavioral characteristics can be temporal characteristics or voice decibel feature etc., due to user carry out voice input
Shi Suoshu voice decibel information is changed in real time, therefore carries out the recording figure according to the variation with corresponding in the present embodiment
Target dynamic renders, to prevent the uninteresting phenomenon during user speech from occurring.It preferably, can to the rendering of the recording icon
It is rendered into Mobile state in a manner of using color rendering or image change, the perception to improve user is experienced.
Referring to Fig. 3, for the specific implementation step of step S21 in Fig. 2:
Step S210 judges the decibel ratings of the voice decibel information, and judges rendering area according to the decibel ratings
Domain;
Wherein, be locally stored render region table, it is described render region table in be stored with different decibel ratings for wash with watercolours
Coordinate is contaminated, the rendering of different zones should be carried out according to the rendering coordinate pair inquired, such as when decibel ratings are 0, recording
Icon both sides show the camber line of grey, with the increase of the decibel ratings of judgement, gradually carry out the color rendering of different zones, with
Dynamic Announce effect is formed, user experience is improved.
Step S211 obtains the rendered color being locally stored, and is carried out to the rendering region according to the rendered color
Color rendering;
Wherein, the rendered color can be independently configured according to the demand of user, rendered color described in the present embodiment
For blue, specifically, with the increase of decibel ratings, the camber line of grey is gradually replaced with the camber line of blue from inside to outside, with
Dynamic Announce effect is formed on the recording icon.
Please continue to refer to Fig. 2, step S31, when receiving Speech Record stop instruction, closes the voice input picture and stop
The only acquisition of the current speech data, to obtain voice input data;
Wherein, the transmission mode of the Speech Record stop instruction is identical as the voice input instruction, is sent out when receiving user
When the Speech Record stop instruction gone out, control shows the display for stopping the voice input picture in equipment, and mute microphone (MIC)
To the acquisition when close voice data, at this point, being institute from the data got when voice input stops to typing are proceeded by
Predicate sound logging data.
The voice input data are converted to amr formats, and carry out word to the voice input data by step S41
To obtain Feature Words, the Feature Words are the word that number of repetition is most in the voice input data for identification;
Wherein, since the original document of the voice input data is larger, use the mode of format conversion to reduce
The size of the voice input data file facilitates the storage subsequently to the voice input data, and by institute's predicate
Sound logging data carries out the design of Text region, facilitates the acquisition of the follow-up Feature Words;
It preferably, will by background thread when locally completing format conversion and the storage to the voice input data
The voice input data upload to a server, and control received server-side program and carried out while receiving recording file
Encryption, server is arrived in storage in the form of a file after the completion of encryption, and then when the local voice input loss of data
Or it when damage, to carry out the acquisition of the voice input data, and then can be improved by carrying out file download in the server
The security performance of the voice messaging acquisition method.
Step S51 carries out renaming according to the Feature Words voice input data transformed to format;
Wherein, the number occurred in the voice input data due to the Feature Words is most, and then can be by with institute
The corresponding voice input data of Feature Words statement are stated, and carries out the voice input data by the way of renaming and corresponds to text
The name of part is shown.
Preferably, described that format conversion is carried out to the voice input data, and by the transformed voice input number
It may also include according to the step of being stored:
The voice input data are converted into amr formats, and obtain current time;
Renaming is carried out according to the current time voice input data transformed to format;
Step S61 obtains the preset icon image being locally stored, and obtains the sonic data in the voice input data
With the voice input time;
Wherein, the preset icon image is the pre-set picture of user, which can be for local picture or based on net
The picture that network is downloaded, the sonic data are obtained by sound spectrograph, the voice input time used timing
Device is obtained.
Step S71, the sonic data and the voice input time are sequentially corresponded to using sound spectrum image by the way of with
The mode that numerical value is shown is shown on the preset icon image respectively;
Wherein, since the corresponding sound spectrum image of the difference sonic data is different, sound spectrum is used in the present embodiment
The mode that image is shown carries out distinctly displaying for the voice input data, facilitates area of the user to the voice input data
Point;
Preferably, since uncontrollable factor leads to the not phase of voice input time described in the different voice input data
Together, therefore distinctly displaying for the voice input can be carried out by way of showing voice input time progress numerical value.
Preferably, after the step of real-time acquisition current speech data, the method further includes:
Judge the current decibels in the current speech data received in preset time whether continuously less than default
Decibels;
If so, closing the voice input picture and stopping the acquisition of the current speech data;
Wherein, whether continued by the current decibels in the current speech data that judges to receive in preset time
Less than the design of default decibels, effectively prevent since user forgets kwh loss caused by stopping voice input.
In the present embodiment, shown in such a way that the sonic data is used sound spectrum image, it effectively will be different
The voice input data distinctly displayed, and then facilitate subsequent user to the different voice input data
It distinguishes, the design shown by the acquisition and use numerical value of the voice input time has further facilitated user subsequently to not
The differentiation of the same voice input data effectively raises use by the design of voice input picture described in Dynamic Announce
It experiences at family, it is therefore prevented that the uninteresting phenomenon during user speech occurs.
Referring to Fig. 4, the structural schematic diagram of the instantaneous communication system 100 provided for third embodiment of the invention, including:
Immediate communication platform 101, the operational order for receiving user;
Voice capture device 102 is communicated to connect with the immediate communication platform 101, for flat according to the instant messaging
The operational order that platform receives, with the broadcasting of the corresponding acquisition or voice data for carrying out voice data;
The voice capture device 102 includes:
First acquisition module 10 obtains current speech number in real time for starting timing when receiving voice input instruction
According to, and according to the behavioral characteristics Dynamic Announce voice input picture in the current speech data;
Wherein, the voice input instruction is carried out by the way of electric signal, push button signalling, wireless signal or voice signal
Transmission, the acquisition of the current speech data by way of activate microphone to be obtained, specifically, logical in the present embodiment
Setting behavioral characteristics are crossed, the Dynamic Announce for carrying out the voice input picture in equipment are shown with control, the behavioral characteristics can
Think temporal characteristics or voice decibel feature etc..
Memory module 20, for when receiving Speech Record stop instruction, closing described in the voice input picture and stopping
The acquisition of current speech data carries out format conversion, and will conversion to obtain voice input data to the voice input data
The voice input data afterwards are stored;
Wherein, the transmission mode of the Speech Record stop instruction is identical as the voice input instruction, is sent out when receiving user
When the Speech Record stop instruction gone out, control shows the display for stopping the voice input picture in equipment, and mute microphone (MIC)
To the acquisition when close voice data, at this point, being institute from the data got when voice input stops to typing are proceeded by
Predicate sound logging data.
Specifically, since the original document of the voice input data is larger, use the mode of format conversion to drop
The size of the low voice input data file, facilitates the storage subsequently to the voice input data;
It preferably, will by background thread when locally completing format conversion and the storage to the voice input data
The voice input data upload to a server, and control received server-side program and carried out while receiving recording file
Encryption, server is arrived in storage in the form of a file after the completion of encryption, and then when the local voice input loss of data
Or it when damage, to carry out the acquisition of the voice input data, and then can be improved by carrying out file download in the server
The security performance of the voice messaging acquisition method.
Second acquisition module 30 for obtaining the preset icon image being locally stored, and obtains the voice input data
In sonic data and the voice input time;
Wherein, the preset icon image is the pre-set picture of user, which can be for local picture or based on net
The picture that network is downloaded, the sonic data are obtained by sound spectrograph, the voice input time used timing
Device is obtained,
Display module 31, for sequentially corresponding to the sonic data and the voice input time using sound spectrum image
The mode that mode and numerical value are shown is shown on the preset icon image respectively;
Wherein, since the corresponding sound spectrum image of the difference sonic data is different, sound spectrum is used in the present embodiment
The mode that image is shown carries out distinctly displaying for the voice input data, facilitates area of the user to the voice input data
Point;
Preferably, since uncontrollable factor leads to the not phase of voice input time described in the different voice input data
Together, therefore distinctly displaying for the voice input can be carried out by way of showing voice input time progress numerical value.
First acquisition module 10 includes:
First acquisition unit 11, default background image, recording for obtaining the present timing time in real time and being locally stored
Icon, and performed image display according to the present timing time, the default background image and the recording icon;
Second acquisition unit 12, for obtaining the voice decibel information in the current speech data in real time, and according to institute
Predicate cent shellfish information renders the recording icon into Mobile state.
The second acquisition unit 12 includes:
Judging unit 13, the decibel ratings for judging the voice decibel information, and judged according to the decibel ratings
Render region;
Rendering unit 14, for obtaining the rendered color being locally stored, and according to the rendered color to the rendering area
Domain carries out color rendering.
Wherein, be locally stored render region table, it is described render region table in be stored with different decibel ratings for wash with watercolours
Coordinate is contaminated, the rendering of different zones should be carried out according to the rendering coordinate pair inquired, such as when decibel ratings are 0, recording
Icon both sides show the camber line of grey, with the increase of the decibel ratings of judgement, gradually carry out the color rendering of different zones, with
Dynamic Announce effect is formed, user experience is improved.
The memory module 20 includes:
First converting unit 21, for the voice input data to be converted to amr formats, and to the voice input number
Feature Words are obtained according to Text region is carried out, the Feature Words are the word that number of repetition is most in the voice input data;
Wherein, since the original document of the voice input data is larger, use the mode of format conversion to reduce
The size of the voice input data file facilitates the storage subsequently to the voice input data, and by institute's predicate
Sound logging data carries out the design of Text region, facilitates the acquisition of the follow-up Feature Words;
It preferably, will by background thread when locally completing format conversion and the storage to the voice input data
The voice input data upload to a server, and control received server-side program and carried out while receiving recording file
Encryption, server is arrived in storage in the form of a file after the completion of encryption, and then when the local voice input loss of data
Or it when damage, to carry out the acquisition of the voice input data, and then can be improved by carrying out file download in the server
The security performance of the voice messaging acquisition method.
First name unit 22, for carrying out weight according to the Feature Words voice input data transformed to format
Name;
Wherein, the number occurred in the voice input data due to the Feature Words is most, and then can be by with institute
The corresponding voice input data of Feature Words statement are stated, and carries out the voice input data by the way of renaming and corresponds to text
The name of part is shown.
Second converting unit 23 for the voice input data to be converted to amr formats, and obtains current time;
Second name unit 24, for being carried out according to the current time voice input data transformed to format
Renaming.
In the present embodiment, shown in such a way that the sonic data is used sound spectrum image by the display module 31
Show, effectively distinctly displayed the different voice input data, and then facilitates subsequent user to different institutes
The differentiation of predicate sound logging data is shown by the display module 31 to the acquisition of the voice input time and using numerical value
Design, further facilitated differentiation of the user subsequently to the different voice input data, pass through it is described first obtain mould
The design of voice input picture, effectively raises user experience, it is therefore prevented that during user speech described in 10 Dynamic Announce of block
Uninteresting phenomenon occur.
Referring to Fig. 5, the structural schematic diagram of the instantaneous communication system 100 provided for fourth embodiment of the invention, the 4th
The structure of embodiment and 3rd embodiment is more or less the same, and difference lies in voice capture device 102 described in the present embodiment is also wrapped
It includes:.
Judgment module 40, for judging that the current decibels in the current speech data received in preset time are
It is no continuously less than default decibels;
Stopping modular 41, for when the judging result of the judgment module 40 is to be, closing the voice input picture
And stop the acquisition of the current speech data.
In the present embodiment, judged in the current speech data received in preset time by the judgment module 40
Current decibels whether continuously less than default decibels design, effectively prevent due to user forget stop voice input
Caused kwh loss.
The present embodiment additionally provides a kind of mobile terminal, including memory and processor, and the memory is for storing
Computer program, the processor run the computer program so that the mobile terminal execution it is above-mentioned voice messaging acquisition
Method.
The present embodiment additionally provides a kind of storage medium, the computer journey being stored thereon with used in above-mentioned mobile terminal
Sequence, the program when being executed, include the following steps:
Start timing when receiving voice input instruction, obtains current speech data in real time, and according to the current language
Behavioral characteristics Dynamic Announce voice input picture in sound data;
When receiving Speech Record stop instruction, closes the voice input picture and stop obtaining for the current speech data
It takes, to obtain voice input data, format conversion is carried out to the voice input data, and by the transformed voice input
Data are stored;
The preset icon image being locally stored is obtained, and obtains the sonic data and Speech Record in the voice input data
The angle of incidence;
The sonic data and the voice input time are sequentially corresponded to and shown with numerical value by the way of sound spectrum image
Mode shown on the preset icon image respectively.The storage medium, such as:ROM/RAM, magnetic disc, CD etc..
Above embodiment described the technical principles of the present invention, and the description is merely to explain the principles of the invention, and
It cannot be construed to the limitation of the scope of the present invention in any way.Based on the explanation herein, those skilled in the art is not required to
Other specific implementation modes of the present invention can be associated by paying performing creative labour, these modes fall within the present invention's
In protection domain.
Claims (10)
1. a kind of voice messaging acquisition method, which is characterized in that the method includes:
Start timing when receiving voice input instruction, obtains current speech data in real time, and according to the current speech number
Behavioral characteristics Dynamic Announce voice input picture in;
When receiving Speech Record stop instruction, closes the voice input picture and stops the acquisition of the current speech data,
To obtain voice input data, format conversion is carried out to the voice input data, and by the transformed voice input number
According to being stored;
When obtaining the preset icon image being locally stored, and obtaining sonic data and voice input in the voice input data
Between;
The sonic data and the voice input time are sequentially corresponded into the side that by the way of sound spectrum image and numerical value is shown
Formula is shown on the preset icon image respectively.
2. voice messaging acquisition method according to claim 1, which is characterized in that described according to the current speech data
In behavioral characteristics Dynamic Announce voice input picture:
Present timing time and the default background image being locally stored, recording icon are obtained in real time, and according to the present timing
Time, the default background image and the recording icon perform image display;
The voice decibel information in the current speech data is obtained in real time, and according to the voice decibel information to the recording
Icon is rendered into Mobile state.
3. voice messaging acquisition method according to claim 2, which is characterized in that described according to the voice decibel information
The step of being rendered into Mobile state to the recording icon include:
Judge the decibel ratings of the voice decibel information, and judges rendering region according to the decibel ratings;
The rendered color being locally stored is obtained, and color rendering is carried out to the rendering region according to the rendered color.
4. voice messaging acquisition method according to claim 1, which is characterized in that the real-time acquisition current speech data
The step of after, the method further includes:
Judge the current decibels in the current speech data received in preset time whether continuously less than default decibel
Number;
If so, closing the voice input picture and stopping the acquisition of the current speech data.
5. voice messaging acquisition method according to claim 1, which is characterized in that it is described to the voice input data into
Row format is converted, and the step of transformed voice input data are stored includes:
The voice input data are converted into amr formats, and Text region is carried out to obtain spy to the voice input data
Word is levied, the Feature Words are the word that number of repetition is most in the voice input data;
Renaming is carried out according to the Feature Words voice input data transformed to format.
6. voice messaging acquisition method according to claim 1, which is characterized in that it is described to the voice input data into
Row format is converted, and the step of transformed voice input data are stored includes:
The voice input data are converted into amr formats, and obtain current time;
Renaming is carried out according to the current time voice input data transformed to format.
7. a kind of instantaneous communication system, which is characterized in that including:
Immediate communication platform, the operational order for receiving user;
Voice capture device is communicated to connect with the immediate communication platform, for what is received according to the immediate communication platform
The operational order, with the broadcasting of the corresponding acquisition or voice data for carrying out voice data;
The voice capture device includes:
First acquisition module obtains current speech data, and root in real time for starting timing when receiving voice input instruction
According to the behavioral characteristics Dynamic Announce voice input picture in the current speech data;
Memory module, for when receiving Speech Record stop instruction, closing the voice input picture and stopping the current language
The acquisition of sound data carries out format conversion, and by transformed institute to obtain voice input data to the voice input data
Predicate sound logging data is stored;
Second acquisition module for obtaining the preset icon image being locally stored, and obtains the sound in the voice input data
Wave number evidence and voice input time;
Display module, for the sonic data and the voice input time are sequentially corresponded to using sound spectrum image by the way of with
The mode that numerical value is shown is shown on the preset icon image respectively.
8. instantaneous communication system according to claim 7, which is characterized in that first acquisition module includes:
First acquisition unit, default background image, recording icon for obtaining the present timing time in real time and being locally stored, and
It is performed image display according to the present timing time, the default background image and the recording icon;
Second acquisition unit, for obtaining the voice decibel information in the current speech data in real time, and according to the voice
Decibel information renders the recording icon into Mobile state.
9. a kind of mobile terminal, which is characterized in that including memory and processor, the memory is for storing computer journey
Sequence, the processor runs the computer program so that the mobile terminal execution is according to described in any one of claim 1 to 6
Voice messaging acquisition method.
10. a kind of storage medium, which is characterized in that it is stored with the calculating used in the mobile terminal described in claim 9
Machine program.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201810277436.3A CN108492833A (en) | 2018-03-30 | 2018-03-30 | Voice messaging acquisition method, instantaneous communication system, mobile terminal and storage medium |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201810277436.3A CN108492833A (en) | 2018-03-30 | 2018-03-30 | Voice messaging acquisition method, instantaneous communication system, mobile terminal and storage medium |
Publications (1)
Publication Number | Publication Date |
---|---|
CN108492833A true CN108492833A (en) | 2018-09-04 |
Family
ID=63317239
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201810277436.3A Pending CN108492833A (en) | 2018-03-30 | 2018-03-30 | Voice messaging acquisition method, instantaneous communication system, mobile terminal and storage medium |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN108492833A (en) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109902199A (en) * | 2019-03-01 | 2019-06-18 | 深圳市伟文无线通讯技术有限公司 | A kind of near field corpus acquisition method and device |
CN111326162A (en) * | 2020-04-15 | 2020-06-23 | 厦门快商通科技股份有限公司 | Voiceprint feature acquisition method, device and equipment |
Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103186557A (en) * | 2011-12-28 | 2013-07-03 | 宇龙计算机通信科技(深圳)有限公司 | Method and device for automatically naming sound record or video files |
CN103942048A (en) * | 2014-04-09 | 2014-07-23 | Tcl集团股份有限公司 | Method and device for displaying voice volume in cartoon mode |
US9147393B1 (en) * | 2013-02-15 | 2015-09-29 | Boris Fridman-Mintz | Syllable based speech processing method |
CN105788612A (en) * | 2016-03-31 | 2016-07-20 | 广州酷狗计算机科技有限公司 | Method and device for testing tone quality |
CN105915831A (en) * | 2015-02-25 | 2016-08-31 | 卡西欧计算机株式会社 | Audio recording apparatus and audio recording method |
CN106468998A (en) * | 2016-09-23 | 2017-03-01 | 华为技术有限公司 | A kind of display packing of information and terminal |
JP2017134326A (en) * | 2016-01-29 | 2017-08-03 | 株式会社コシダカホールディングス | Cv which allows user to get right into one's role |
-
2018
- 2018-03-30 CN CN201810277436.3A patent/CN108492833A/en active Pending
Patent Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103186557A (en) * | 2011-12-28 | 2013-07-03 | 宇龙计算机通信科技(深圳)有限公司 | Method and device for automatically naming sound record or video files |
US9147393B1 (en) * | 2013-02-15 | 2015-09-29 | Boris Fridman-Mintz | Syllable based speech processing method |
CN103942048A (en) * | 2014-04-09 | 2014-07-23 | Tcl集团股份有限公司 | Method and device for displaying voice volume in cartoon mode |
CN105915831A (en) * | 2015-02-25 | 2016-08-31 | 卡西欧计算机株式会社 | Audio recording apparatus and audio recording method |
JP2017134326A (en) * | 2016-01-29 | 2017-08-03 | 株式会社コシダカホールディングス | Cv which allows user to get right into one's role |
CN105788612A (en) * | 2016-03-31 | 2016-07-20 | 广州酷狗计算机科技有限公司 | Method and device for testing tone quality |
CN106468998A (en) * | 2016-09-23 | 2017-03-01 | 华为技术有限公司 | A kind of display packing of information and terminal |
Non-Patent Citations (1)
Title |
---|
刘景云: "执行"隐蔽行动",保护本本安全", 《电脑知识与技术》 * |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109902199A (en) * | 2019-03-01 | 2019-06-18 | 深圳市伟文无线通讯技术有限公司 | A kind of near field corpus acquisition method and device |
CN111326162A (en) * | 2020-04-15 | 2020-06-23 | 厦门快商通科技股份有限公司 | Voiceprint feature acquisition method, device and equipment |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN109087669B (en) | Audio similarity detection method and device, storage medium and computer equipment | |
CN109858702A (en) | Client upgrades prediction technique, device, equipment and the readable storage medium storing program for executing complained | |
CN107919138B (en) | Emotion processing method in voice and mobile terminal | |
CN113488024B (en) | Telephone interrupt recognition method and system based on semantic recognition | |
CN112017650B (en) | Voice control method and device of electronic equipment, computer equipment and storage medium | |
CN110113677A (en) | The generation method and device of video subject | |
CN108391020A (en) | A kind of call control method, device, equipment and storage medium | |
CN108304368A (en) | The kind identification method and device and storage medium and processor of text message | |
CN106601251A (en) | Generating method and system of communication text | |
CN107871494A (en) | The method, apparatus and electronic equipment of a kind of phonetic synthesis | |
CN107301030A (en) | A kind of method for controlling volume, device and a kind of terminal | |
CN107135452A (en) | Audiphone adaptation method and device | |
CN108492833A (en) | Voice messaging acquisition method, instantaneous communication system, mobile terminal and storage medium | |
CN113033245A (en) | Function adjusting method and device, storage medium and electronic equipment | |
CN112420049A (en) | Data processing method, device and storage medium | |
CN110931028A (en) | Voice processing method and device and electronic equipment | |
JP5532781B2 (en) | Waiting service server, waiting service system using the server, and expected end time calculation method for waiting service | |
CN111048115A (en) | Voice recognition method and device | |
CN108804897B (en) | Screen control method and device, computer equipment and storage medium | |
CN113065879A (en) | Data stream quality inspection method and system | |
CN105812535A (en) | Method of recording speech communication information and terminal | |
CN107239307A (en) | The display methods and device of version updating information | |
CN111710339A (en) | Voice recognition interaction system and method based on data visualization display technology | |
CN112017651A (en) | Voice control method and device of electronic equipment, computer equipment and storage medium | |
JP2020030739A5 (en) |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20180904 |
|
RJ01 | Rejection of invention patent application after publication |