CN106649807A - Audio file processing method and mobile terminal - Google Patents
Audio file processing method and mobile terminal Download PDFInfo
- Publication number
- CN106649807A CN106649807A CN201611243118.2A CN201611243118A CN106649807A CN 106649807 A CN106649807 A CN 106649807A CN 201611243118 A CN201611243118 A CN 201611243118A CN 106649807 A CN106649807 A CN 106649807A
- Authority
- CN
- China
- Prior art keywords
- target
- audio file
- voice
- text content
- content
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/60—Information retrieval; Database structures therefor; File system structures therefor of audio data
- G06F16/68—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
Abstract
The invention provides an audio file processing method and a mobile terminal. The method comprises the steps of recognizing a voice in a target audio file, and determining target textual content corresponding to the target audio file; adding the target textual content to a filename of the target audio file. In this way, according to the audio file processing method and the mobile terminal, the corresponding target textual content can be determined according to the voice in the target audio file, the target textual content is added to the filename of the target audio file, a user can seek the target audio file according to the target textual content, thus the efficiency of seeking the audio file of the user is improved, and the user experience is strengthened.
Description
Technical field
The present invention relates to audio file processing technology field, more particularly to a kind of audio file processing method and movement are eventually
End.
Background technology
With the continuous development of society, the information that people touch daily is also more and more, at present many information be all with
The form of voice is passed on, and user can obtain information by playing audio file.Audio file is supported that user is obtained with ear and is believed
Breath, can liberate the eyes of user, there is certain help to sight protectio.
When there is multiple audio files on mobile terminal, user needs to put out audio file one by one and play a period of time
That audio file for oneself wanting to look up can be just found, is taken longer.Certainly, user can also pass through some of audio file
Feature (such as voice duration or file designation) goes to search the audio file for needing, however, for voice duration identical sound
Frequency file, user cannot be distinguished by, and voice duration causes certain puzzlement to the memory of user;Additionally, for by file
The mode that name is searched, user needs the filename for pre-setting each audio file, operates comparatively laborious.It can be seen that, prior art
Middle pitch frequency file inconvenience user search, and affects Consumer's Experience.
The content of the invention
The embodiment of the present invention provides a kind of audio file processing method and mobile terminal, to solve prior art sound intermediate frequency text
Part inconvenience user search, and affects the problem of Consumer's Experience.
In a first aspect, embodiments providing a kind of audio file processing method, methods described includes:
Voice in identification target audio file, and determine target text content corresponding with the target audio file;
The target text content is added in the filename of the target audio file.
Second aspect, the embodiment of the present invention also provides a kind of mobile terminal, and the mobile terminal includes:
Determining module, for recognizing target audio in voice, and determine corresponding with target audio file target
Word content;
Add module, for the target text content to be added in the filename of the target audio file.
In embodiments of the present invention, the voice in the audio file processing method identification target audio file, and determine
Target text content corresponding with the target audio file;The target text content is added to into the target audio file
Filename in.So, the audio file processing method and mobile terminal that the present invention is provided can be according in target audio file
Voice determine corresponding target text content, and the target text content is added to into the file of the target audio file
Name in, user can according to the target text content search target audio file, improve user search audio file
Efficiency, enhance Consumer's Experience.
Description of the drawings
In order to be illustrated more clearly that the technical scheme of the embodiment of the present invention, below will be to needed for embodiment of the present invention description
The accompanying drawing to be used is briefly described, it should be apparent that, drawings in the following description are only some embodiments of the present invention,
For those of ordinary skill in the art, without having to pay creative labor, can be being obtained according to these accompanying drawings
Obtain other accompanying drawings.
Fig. 1 is a kind of flow chart of audio file processing method that first embodiment of the invention is provided;
Fig. 2 is the flow chart of another kind of audio file processing method that second embodiment of the invention is provided;
Fig. 3 is the flow chart of another kind of audio file processing method that third embodiment of the invention is provided;
Fig. 4 is a kind of structure chart of mobile terminal that fourth embodiment of the invention is provided;
Fig. 5 is the structure chart of another kind of mobile terminal that fourth embodiment of the invention is provided;
Fig. 6 is the structure chart of another kind of mobile terminal that fourth embodiment of the invention is provided;
Fig. 7 is the structure chart of another kind of mobile terminal that fourth embodiment of the invention is provided;
Fig. 8 is a kind of structure chart of second converting unit that fourth embodiment of the invention is provided;
Fig. 9 is the structure chart of another kind of mobile terminal that fifth embodiment of the invention is provided.
Specific embodiment
Below in conjunction with the accompanying drawing in the embodiment of the present invention, the technical scheme in the embodiment of the present invention is carried out clear, complete
Site preparation is described, it is clear that described embodiment is a part of embodiment of the invention, rather than the embodiment of whole.Based on this
Embodiment in bright, the every other enforcement that those of ordinary skill in the art are obtained under the premise of creative work is not made
Example, belongs to the scope of protection of the invention.
First embodiment
Referring to Fig. 1, Fig. 1 is a kind of flow chart of audio file processing method provided in an embodiment of the present invention, methods described
In being applied to a mobile terminal, as shown in figure 1, the audio file processing method includes:
Voice in step 101, identification target audio file, and determine target text corresponding with the target voice file
Word content.
In the step, the audio file processing method carries out speech recognition to target audio file, and according to identification knot
Fruit determines target text content corresponding with the target voice.The target audio file can be the multimedia for including sound
File, or only including sound file, can also be the speech message in social software, it should be noted that
In embodiments of the invention, the type of the target audio file is not limited.
The audio file processing method can recognize all voices in the target audio file, then by the mesh
All voices in mark with phonetic symbols frequency file are converted into the first word content.The audio file processing method can determine described first
Word content is the target text content, it is also possible to determine that the first keyword in first word content is the target
Word content.
The audio file can also recognize the target sound frequency range in the target audio file, then by the audio section
Interior voice is converted into the second word content.The audio file processing method can determine that second word content is described
Target text content, it is also possible to determine that the second keyword in second word content is the target text content.
Step 102, the target text content is added in the filename of the target audio file.
In the step, the target text content is added to the target audio file by the audio file processing method
Filename in, so, user can according to the file destination content search target audio file.
The audio file processing method can delete the old file name of the target audio file, and the target is literary
New file name of the word content as the target audio file;The old file name of the target audio file can not also be deleted,
But combine to form new filename with old file name in the target text content, the combination can be the target
Word content is before the old file name, it is also possible to after the old file name, can also be interspersed in the old file name
Centre, here do not limit.
Alternatively, it is described that the target text content is added in the filename of the target audio file, including:
Using the target text content as the target audio file new file name;Or
The target text content and old file name are combined the new file name as the target audio file.
In the embodiment, the audio file processing method deletes the old file name of the target audio file, and by institute
Target text content is stated as the new file name of the target audio file;Or do not delete the original text of the target audio file
Part name, but combine to form new filename with old file name in the target text content.It should be noted that the combination
Mode can be the target text content before the old file name, it is also possible to after the old file name, can be with
The centre of the old file name is interspersed in, here is not limited.
For example, for old file name is the target audio file of " A ", the audio file processing method can be true
Fixed its target text content is after " examination ", the target text content " examination " to be substituted into old file name " A ", will be described
The filename of target audio file is updated to the target text content " examination ".It is understood that at the audio file
Reason method the target text content " examination " can also be combined with the old file name " A " after as the target audio
The new file name of file, for example, be updated to " examination A " or " A examinations " by the filename of the file destination.
In the embodiment of the present invention, above-mentioned mobile terminal can be any mobile terminal for possessing shoot function, for example:Hand
Machine, panel computer (Tablet Personal Computer), kneetop computer (Laptop Computer), individual digital are helped
Reason (personal digital assistant, abbreviation PDA), mobile Internet access device (Mobile Internet Device,
) or Wearable device (Wearable Device) etc. MID.
The audio file processing method of first embodiment of the invention, recognize target audio file in voice, and determine with
The corresponding target text content of the target audio file;The target text content is added to into the target audio file
In filename.So, user can according to the target text content search target audio file, the present embodiment provide
Audio file processing method facilitates user rapidly and accurately to search audio file, improves the efficiency that user searches audio file,
Enhance Consumer's Experience.
Second embodiment
Referring to Fig. 2, Fig. 2 is the flow chart of another kind of audio file processing method that second embodiment of the invention is provided, such as
Shown in Fig. 2, the audio file processing method includes:
Step 201, the whole voices in target audio file are converted into into the first word content.
In the step, the audio file processing method recognizes the whole voices in the target audio file, and by institute
The whole voices stated in target audio file are converted into the first word content.The identification voice simultaneously converts speech into word category
In prior art category, will not be described here.
Step 202, the first keyword determined according to the first preset rules in first word content are target text
Content.
In the present embodiment, the audio file processing method determines first word content according to the first preset rules
The first keyword be target text content, first preset rules can be word occur frequency, described first is crucial
Word can be a keyword, or multiple keywords, and here is not limited.For example, the audio file processing method
Can determine in first word content that one or more most words of occurrence number are first keyword, or can be with
Determine that occurrence number is first keyword more than one or more words of preset times in first word content.This
Sample, when the target audio file content is longer, by determine keyword for target text content method can take compared with
Few space.
Step 203, the target text content is added in the filename of the target audio file.
The step 203 is identical with the step 102 in first embodiment of the invention, will not be described here.
In second embodiment of the invention, the audio file processing method changes the whole voices in target audio file
Into the first word content;First word content is determined for target text content, or it is true according to first preset rules
The first keyword in fixed first word content is target text content;The target text content is added to into the mesh
In the filename of mark with phonetic symbols frequency file.So, user can according to the target text content search target audio file, this
The audio file processing method that embodiment is provided facilitates user rapidly and accurately to search audio file, improves user and searches audio frequency
The efficiency of file, enhances Consumer's Experience.
Referring to Fig. 3, Fig. 3 is the flow chart of another kind of audio file processing method that third embodiment of the invention is provided, such as
Shown in Fig. 3, the audio file processing method includes:
Voice in step 301, identification target audio file in target sound frequency range, and by the language in the target sound frequency range
Sound is converted into the second word content.
In the step, the audio file processing method obtains the target sound frequency range in the target audio file, identification
Voice in the target sound frequency range, and the voice in the target sound frequency range is converted into into the second word content.
In the embodiment, the target sound frequency range can be preset duration, or random duration.The audio file
Processing method can obtain the audio section of preset duration as target audio according to preset rules from the target audio file
Section, it is also possible to which the random audio section of preset duration that obtains from the target audio file is used as the target sound frequency range.
It should be noted that when not having voice in the target sound frequency range, the audio file processing method can be from institute
State target audio file to reacquire target sound frequency range and recognize voice, for example, can also may be used with another audio section of preset duration
To obtain audio section by adjustment duration, here is not limited, until identifying the voice in target sound frequency range.
Step 302, second word content is determined for target text content, or determine institute according to the second preset rules
It is target text content to state the second keyword in the second word content.
In the present embodiment, the audio file processing method can determine second word content in target text
Hold, it is also possible to which the second preset rules determine that the second keyword of second word content is target text content.Described second
Preset rules can be identical with the first preset rules in second embodiment of the invention, it is also possible to first preset rules not
Together, here is not limited.Similarly, second keyword can be a keyword, or multiple keywords.
Step 303, the target text content is added in the filename of the target audio file.
The step 303 is identical with the step 102 in first embodiment of the invention, will not be described here.
Alternatively, the voice in the identification target audio file in target sound frequency range, and by the target audio
Voice in section is converted into the second word content, including:
Recognize the first object audio section in the target audio file, and whether judge in the first object audio section
Including voice;
If including voice in the first object audio section, the voice in the first object audio section is converted into into second
Word content;
If not including voice in the first object audio section, the second target audio in the target audio file is recognized
Section, and the voice in the second target sound frequency range is converted into into the second word content.
In the embodiment, the audio file processing method first recognizes the first object audio frequency in the target audio file
Section (audio section of such as 0~a durations, a represents the time), and judge whether include voice in the first object audio section, if
Include voice in the first object audio section, the voice in the first object audio section is converted into into the second word content.
If on the contrary, not including voice in the first object audio section, the audio file processing method recognizes the target audio
The voice in the second target sound frequency range (such as the audio section of the audio section of a~2a durations or 0~2a durations) in file, and
Voice in the second target sound frequency range is converted into into the second word content.If it is understood that second target sound
Do not include voice in frequency range, the audio file processing method can again adjust the target sound frequency range (such as 2a~3a),
Until identifying the voice in target sound frequency range.
Third embodiment of the invention, in the audio file processing method identification target audio file in target sound frequency range
Voice, and the voice in the target sound frequency range is converted into into the second word content;Determine that second word content is target
Word content, or determine that the second keyword in second word content is in target text according to the second preset rules
Hold;The target text content is added in the filename of the target audio file.So, user can be according to the mesh
Mark word content searches the target audio file, and the audio file processing method that the present embodiment is provided facilitates user quick and precisely
Audio file is searched on ground, improves the efficiency that user searches audio file, enhances Consumer's Experience.
Fourth embodiment
It is a kind of structure chart of mobile terminal that fourth embodiment of the invention is provided, as shown in figure 4, the shifting referring to Fig. 4
Dynamic terminal 400 includes:
Determining module 401, for recognizing target audio in voice, and determine corresponding with target audio file mesh
Mark word content;
Add module 402, for the target text content to be added in the filename of the target audio file.
Alternatively, referring to Fig. 5, Fig. 5 is the structure chart of another kind of mobile terminal that fourth embodiment of the invention is provided, and is such as schemed
Shown in 5, the determining module 401 includes:
First converting unit 4011, for the whole voices in the target audio file to be converted in the first word
Hold;
First determining unit 4012, for determining that in first word content first is crucial according to the first preset rules
Word is the target text content.
Alternatively, referring to Fig. 6, Fig. 6 is the structure chart of another kind of mobile terminal that fourth embodiment of the invention is provided, and is such as schemed
Shown in 6, the determining module 401 includes:
Second converting unit 4013, for recognizing the target audio file in voice in target sound frequency range, and by institute
State the voice in target sound frequency range and be converted into the second word content;
Second determining unit 4014, for determining that second word content is the target text content;Or
Referring to Fig. 7, Fig. 7 is the structure chart of another kind of mobile terminal that fourth embodiment of the invention is provided, as shown in fig. 7,
The determining module 401 can include:3rd determining unit 4015, for determining second word according to the second preset rules
The second keyword in content is the target text content.
Alternatively, referring to Fig. 8, Fig. 8 is a kind of structure of second converting unit 4013 that fourth embodiment of the invention is provided
Figure, as shown in figure 8, second converting unit 4013 includes:
Judgment sub-unit 40131, for recognizing the target audio file in first object audio section, and judge described
Whether include voice in first object audio section;
First conversion subunit 40132, if for including voice in the first object audio section, by the first object
Voice in audio section is converted into the second word content;
Second conversion subunit 40133, if for not including voice in the first object audio section, recognizing the target
The second target sound frequency range in audio frequency, and the voice in the second target sound frequency range is converted into into the second word content.
Alternatively, the add module 402 is used for:
Using the target text content as the target audio file new file name;Or
The target text content and old file name are combined the new file name as the target audio file.
Mobile terminal 400 can realize each process that mobile terminal is realized in the embodiment of the method for Fig. 1 to Fig. 3, to keep away
Exempt to repeat, will not be described here.
5th embodiment
Referring to Fig. 9, Fig. 9 is the structural representation of another kind of mobile terminal 900 that the present invention is provided, as shown in figure 9, mobile
Terminal 900 includes:At least one processor 901, memory 902, at least one user interface 903 and network interface 904.It is mobile
Each component in terminal 900 is coupled by bus system 905, it is understood that bus system 905 is used to realize
Connection communication between these components.Bus system 905 except including in addition to data wire, also including power bus, controlling bus and
Status signal bus in addition.But for the sake of for clear explanation, in fig .9 various buses are all designated as into bus system 905.
Wherein, user interface 903 can include display, keyboard or pointing device, such as mouse, trace ball
(trackball), touch-sensitive plate or touch-screen etc..
It is appreciated that the memory 902 in the embodiment of the present invention can be volatile memory or nonvolatile memory,
Or may include both volatibility and nonvolatile memory.Wherein, nonvolatile memory can be read-only storage (Read-
Only Memory, ROM), programmable read only memory (Programmable ROM, PROM), the read-only storage of erasable programmable
Device (Erasable PROM, EPROM), Electrically Erasable Read Only Memory (Electrically EPROM, EEPROM) or
Flash memory.Volatile memory can be random access memory (Random Access Memory, RAM), and it is used as outside height
Speed caching.By exemplary but be not restricted explanation, the RAM of many forms can use, such as static RAM
(Static RAM, SRAM), dynamic random access memory (Dynamic RAM, DRAM), Synchronous Dynamic Random Access Memory
(Synchronous DRAM, SDRAM), double data speed synchronous dynamic RAM (Double Data Rate
SDRAM, DDRSDRAM), enhancement mode Synchronous Dynamic Random Access Memory (Enhanced SDRAM, ESDRAM), synchronized links
Dynamic random access memory (Synchlink DRAM, SLDRAM) and direct rambus random access memory (Direct
Rambus RAM, DRRAM).The memory 902 of system and method described herein be intended to including but not limited to these and arbitrarily its
It is adapted to the memory of type.
In some embodiments, memory 902 stores following element, can perform module or data structure, or
Person their subset, or their superset:Operating system 9021 and application program 9022.
Wherein, operating system 9021, comprising various system programs, such as ccf layer, core library layer, driving layer etc., are used for
Realize various basic businesses and process hardware based task.Application program 9022, comprising various application programs, such as media
Player (Media Player), browser (Browser) etc., for realizing various applied business.Realize the embodiment of the present invention
The program of method may be embodied in application program 9022.
In embodiments of the present invention, by call memory 902 store program or instruction, specifically, can be application
The program stored in program 9022 or instruction, processor 901 is used for:
Voice in identification target audio file, and determine target text content corresponding with the target audio file;
The target text content is added in the filename of the target audio file.
The method that the embodiments of the present invention are disclosed can apply in processor 901, or be realized by processor 901.
A kind of possibly IC chip of processor 901, the disposal ability with signal.During realization, said method it is each
Step can be completed by the instruction of the integrated logic circuit of the hardware in processor 901 or software form.Above-mentioned process
Device 901 can be general processor, digital signal processor (Digital Signal Processor, DSP), special integrated electricity
Road (Application Specific Integrated Circuit, ASIC), ready-made programmable gate array (Field
Programmable Gate Array, FPGA) either other PLDs, discrete gate or transistor logic,
Discrete hardware components.Can realize or perform disclosed each method in the embodiment of the present invention, step and logic diagram.It is general
Processor can be microprocessor or the processor can also be any conventional processor etc..With reference to embodiment of the present invention institute
The step of disclosed method, can be embodied directly in hardware decoding processor and perform and complete, or with the hardware in decoding processor
And software module combination execution is completed.Software module may be located at random access memory, and flash memory, read-only storage may be programmed read-only
In the ripe storage medium in this area such as memory or electrically erasable programmable memory, register.The storage medium is located at
Memory 902, processor 901 reads the information in memory 902, the step of complete said method with reference to its hardware.
It is understood that embodiments described herein can with hardware, software, firmware, middleware, microcode or its
Combine to realize.For hardware is realized, processing unit can be realized in one or more special ICs (Application
Specific Integrated Circuits, ASIC), digital signal processor (Digital Signal Processing,
DSP), digital signal processing appts (DSP Device, DSPD), programmable logic device (Programmable Logic
Device, PLD), field programmable gate array (Field-Programmable Gate Array, FPGA), general processor,
In controller, microcontroller, microprocessor, other electronic units for performing herein described function or its combination.
For software is realized, can be realized herein by performing the module (such as process, function etc.) of function described herein
Described technology.Software code is storable in memory and by computing device.Memory can within a processor or
Realize processor outside.
Alternatively, the voice in the identification of the processor 901 target audio file, and determine and the target audio file
Corresponding target text content, including:
Whole voices in the target audio file are converted into into the first word content;
Determine that the first keyword in first word content is the target text content according to the first preset rules.
Alternatively, the voice in the identification of the processor 901 target audio file, and determine and the target audio file
Corresponding target text content, including:
The voice in target sound frequency range in the target audio file is recognized, and the voice in the target sound frequency range is turned
Change the second word content into;
Determine that second word content is the target text content;Or
Determine that the second keyword in second word content is the target text content according to the second preset rules.
The processor 901 recognizes the voice in the target audio file in target sound frequency range, and by the target sound
Voice in frequency range is converted into the second word content, including:
Recognize the first object audio section in the target audio file, and whether judge in the first object audio section
Including voice;
If including voice in the first object audio section, the voice in the first object audio section is converted into into second
Word content;
If not including voice in the first object audio section, the second target audio in the target audio file is recognized
Section, and the voice in the second target sound frequency range is converted into into the second word content.
Alternatively, it is described that the target text content is added in the filename of the target audio file, including:
Using the target text content as the target audio file new file name;Or
The target text content and old file name are combined the new file name as the target audio file.
Mobile terminal 900 can realize each process that mobile terminal 900 is realized in previous embodiment, to avoid repeating,
Here repeat no more.
The mobile terminal 900 of the embodiment of the present invention, recognizes the voice in target audio file, and determines and the target sound
The corresponding target text content of frequency file;The target text content is associated with the target audio file.So, Yong Huke
With the target audio file according to the target text content search, the audio file processing method that the present embodiment is provided is convenient
User rapidly and accurately searches audio file, improves the efficiency that user searches audio file, enhances Consumer's Experience.
Those of ordinary skill in the art are it is to be appreciated that the list of each example with reference to the embodiments described herein description
Unit and algorithm steps, being capable of being implemented in combination in electronic hardware or computer software and electronic hardware.These functions are actually
Performed with hardware or software mode, depending on the application-specific and design constraint of technical scheme.Professional and technical personnel
Each specific application can be used different methods to realize described function, but this realization it is not considered that exceeding
The scope of the present invention.
Those skilled in the art can be understood that, for convenience and simplicity of description, the system of foregoing description,
The specific work process of device and unit, may be referred to the corresponding process in preceding method embodiment, will not be described here.
In embodiment provided herein, it should be understood that disclosed apparatus and method, can pass through other
Mode is realized.For example, device embodiment described above is only schematic, and for example, the division of the unit is only
A kind of division of logic function, can there is an other dividing mode when actually realizing, such as multiple units or component can with reference to or
Person is desirably integrated into another system, or some features can be ignored, or does not perform.Another, shown or discussed is mutual
Between coupling or direct-coupling or communication connection can be INDIRECT COUPLING or communication link by some interfaces, device or unit
Connect, can be electrical, mechanical or other forms.
The unit as separating component explanation can be or may not be it is physically separate, it is aobvious as unit
The part for showing can be or may not be physical location, you can with positioned at a place, or can also be distributed to multiple
On NE.Some or all of unit therein can be according to the actual needs selected to realize embodiment of the present invention scheme
Purpose.
In addition, each functional unit in each embodiment of the invention can be integrated in a processing unit, it is also possible to
It is that unit is individually physically present, it is also possible to which two or more units are integrated in a unit.
If the function is realized and as independent production marketing or when using using in the form of SFU software functional unit, can be with
In being stored in a computer read/write memory medium.Based on such understanding, technical scheme is substantially in other words
The part contributed to prior art or the part of the technical scheme can be embodied in the form of software product, the meter
Calculation machine software product is stored in a storage medium, including some instructions are used so that a computer equipment (can be individual
People's computer, server, or network equipment etc.) perform all or part of step of each embodiment methods described of the invention.
And aforesaid storage medium includes:USB flash disk, portable hard drive, ROM, RAM, magnetic disc or CD etc. are various can be with store program codes
Medium.
The above, the only specific embodiment of the present invention, but protection scope of the present invention is not limited thereto, any
Those familiar with the art the invention discloses technical scope in, change or replacement can be readily occurred in, all should contain
Cover within protection scope of the present invention.Therefore, protection scope of the present invention should be defined by scope of the claims.
Claims (10)
1. a kind of audio file processing method, it is characterised in that methods described includes:
Voice in identification target audio file, and determine target text content corresponding with the target audio file;
The target text content is added in the filename of the target audio file.
2. the method for claim 1, it is characterised in that the voice in the identification target audio file, and determine with
The corresponding target text content of the target audio file, including:
Whole voices in the target audio file are converted into into the first word content;
Determine that the first keyword in first word content is the target text content according to the first preset rules.
3. the method for claim 1, it is characterised in that the voice in the identification target audio file, and determine with
The corresponding target text content of the target audio file, including:
The voice in target sound frequency range in the target audio file is recognized, and the voice in the target sound frequency range is converted into
Second word content;
Determine that second word content is the target text content;Or
Determine that the second keyword in second word content is the target text content according to the second preset rules.
4. method as claimed in claim 3, it is characterised in that in the identification target audio file in target sound frequency range
Voice, and the voice in the target sound frequency range is converted into into the second word content, including:
The first object audio section in the target audio file is recognized, and judges whether include in the first object audio section
Voice;
If including voice in the first object audio section, the voice in the first object audio section is converted into into the second word
Content;
If not including voice in the first object audio section, the second target sound frequency range in the target audio file is recognized,
And the voice in the second target sound frequency range is converted into into the second word content.
5. the method as described in any one of Claims 1 to 4, it is characterised in that described to be added to the target text content
In the filename of the target audio file, including:
Using the target text content as the target audio file new file name;Or
The target text content and old file name are combined the new file name as the target audio file.
6. a kind of mobile terminal, it is characterised in that the mobile terminal includes:
Determining module, for recognizing target audio in voice, and determine corresponding with target audio file target text
Content;
Add module, for the target text content to be added in the filename of the target audio file.
7. mobile terminal as claimed in claim 6, it is characterised in that the determining module includes:
First converting unit, for the whole voices in the target audio file to be converted into into the first word content;
First determining unit, for determining that the first keyword in first word content is described according to the first preset rules
Target text content.
8. mobile terminal as claimed in claim 6, it is characterised in that the determining module includes:
Second converting unit, for recognizing the target audio file in voice in target sound frequency range, and by the target sound
Voice in frequency range is converted into the second word content;
Second determining unit, for determining that second word content is the target text content;Or
3rd determining unit, for determining that the second keyword in second word content is described according to the second preset rules
Target text content.
9. mobile terminal as claimed in claim 8, it is characterised in that second converting unit includes:
Judgment sub-unit, for recognizing the target audio file in first object audio section, and judge the first object
Whether include voice in audio section;
First conversion subunit, if for including voice in the first object audio section, by the first object audio section
Voice be converted into the second word content;
Second conversion subunit, if for not including voice in the first object audio section, in recognizing the target audio
Second target sound frequency range, and the voice in the second target sound frequency range is converted into into the second word content.
10. the mobile terminal as described in any one of claim 6~9, it is characterised in that the add module is used for:
Using the target text content as the target audio file new file name;Or
The target text content and old file name are combined the new file name as the target audio file.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201611243118.2A CN106649807A (en) | 2016-12-29 | 2016-12-29 | Audio file processing method and mobile terminal |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201611243118.2A CN106649807A (en) | 2016-12-29 | 2016-12-29 | Audio file processing method and mobile terminal |
Publications (1)
Publication Number | Publication Date |
---|---|
CN106649807A true CN106649807A (en) | 2017-05-10 |
Family
ID=58835898
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201611243118.2A Pending CN106649807A (en) | 2016-12-29 | 2016-12-29 | Audio file processing method and mobile terminal |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN106649807A (en) |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101437115A (en) * | 2007-11-12 | 2009-05-20 | 鸿富锦精密工业(深圳)有限公司 | Digital camera and method for setting image name |
CN102074235A (en) * | 2010-12-20 | 2011-05-25 | 上海华勤通讯技术有限公司 | Method of video speech recognition and search |
CN103390016A (en) * | 2012-05-07 | 2013-11-13 | Lg电子株式会社 | Method for displaying text associated with audio file and electronic device |
-
2016
- 2016-12-29 CN CN201611243118.2A patent/CN106649807A/en active Pending
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101437115A (en) * | 2007-11-12 | 2009-05-20 | 鸿富锦精密工业(深圳)有限公司 | Digital camera and method for setting image name |
CN102074235A (en) * | 2010-12-20 | 2011-05-25 | 上海华勤通讯技术有限公司 | Method of video speech recognition and search |
CN103390016A (en) * | 2012-05-07 | 2013-11-13 | Lg电子株式会社 | Method for displaying text associated with audio file and electronic device |
Non-Patent Citations (3)
Title |
---|
王永琦: "《MATLAB与音视频技术》", 30 November 2013 * |
赵力: "《语音信号处理》", 31 May 2016 * |
陆虎敏: "《飞机座舱显示与控制技术》", 31 December 2015 * |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US20220148594A1 (en) | Using multiple modality input to feedback context for natural language understanding | |
US20220214775A1 (en) | Method for extracting salient dialog usage from live data | |
US9299342B2 (en) | User query history expansion for improving language model adaptation | |
US20230072352A1 (en) | Speech Recognition Method and Apparatus, Terminal, and Storage Medium | |
JP5799621B2 (en) | Information processing apparatus, information processing method, and program | |
CN103339623B (en) | It is related to the method and apparatus of Internet search | |
US20160055251A1 (en) | System and method for compending blogs | |
CN108090174A (en) | A kind of robot answer method and device based on system function syntax | |
US20190196782A1 (en) | Techniques to present a user interface for the visually impaired | |
CN1637741A (en) | Annotation management in pen-based computing system | |
CN103902533B (en) | It is a kind of to search for through method and apparatus | |
AU2017216520A1 (en) | Common data repository for improving transactional efficiencies of user interactions with a computing device | |
US20150169676A1 (en) | Generating a Table of Contents for Unformatted Text | |
JP2017146720A (en) | Patent requirement adequacy prediction device and patent requirement adequacy prediction program | |
CN101689198A (en) | Phonetic search using normalized string | |
CN111428030B (en) | Corpus classifying method and system | |
CN107977420A (en) | The abstract extraction method, apparatus and readable storage medium storing program for executing of a kind of evolved document | |
US20190317648A1 (en) | System enabling audio-based navigation and presentation of a website | |
CN109710732A (en) | Information query method, device, storage medium and electronic equipment | |
CN108197105A (en) | Natural language processing method, apparatus, storage medium and electronic equipment | |
CN109657043B (en) | Method, device and equipment for automatically generating article and storage medium | |
CN104933099B (en) | Method and device for providing target search result for user | |
CN106650351A (en) | running method of application program and mobile terminal | |
CN103514182B (en) | Music searching method and device | |
CN106649807A (en) | Audio file processing method and mobile terminal |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
TA01 | Transfer of patent application right |
Effective date of registration: 20171107 Address after: 283 No. 523860 Guangdong province Dongguan city Changan town usha BBK Avenue Applicant after: VIVO MOBILE COMMUNICATION CO., LTD. Applicant after: Wewo Mobile Communication Co. Ltd. Beijing branch Address before: 283 No. 523860 Guangdong province Dongguan city Changan town usha BBK Avenue Applicant before: VIVO MOBILE COMMUNICATION CO., LTD. |
|
TA01 | Transfer of patent application right | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20170510 |
|
RJ01 | Rejection of invention patent application after publication |