CN106649807A - Audio file processing method and mobile terminal - Google Patents

Audio file processing method and mobile terminal Download PDF

Info

Publication number
CN106649807A
CN106649807A CN201611243118.2A CN201611243118A CN106649807A CN 106649807 A CN106649807 A CN 106649807A CN 201611243118 A CN201611243118 A CN 201611243118A CN 106649807 A CN106649807 A CN 106649807A
Authority
CN
China
Prior art keywords
target
audio file
voice
text content
content
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201611243118.2A
Other languages
Chinese (zh)
Inventor
唐俊坤
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Vivo Mobile Communication Co Ltd Beijing Branch
Original Assignee
Vivo Mobile Communication Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Vivo Mobile Communication Co Ltd filed Critical Vivo Mobile Communication Co Ltd
Priority to CN201611243118.2A priority Critical patent/CN106649807A/en
Publication of CN106649807A publication Critical patent/CN106649807A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/60Information retrieval; Database structures therefor; File system structures therefor of audio data
    • G06F16/68Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually

Abstract

The invention provides an audio file processing method and a mobile terminal. The method comprises the steps of recognizing a voice in a target audio file, and determining target textual content corresponding to the target audio file; adding the target textual content to a filename of the target audio file. In this way, according to the audio file processing method and the mobile terminal, the corresponding target textual content can be determined according to the voice in the target audio file, the target textual content is added to the filename of the target audio file, a user can seek the target audio file according to the target textual content, thus the efficiency of seeking the audio file of the user is improved, and the user experience is strengthened.

Description

A kind of audio file processing method and mobile terminal
Technical field
The present invention relates to audio file processing technology field, more particularly to a kind of audio file processing method and movement are eventually End.
Background technology
With the continuous development of society, the information that people touch daily is also more and more, at present many information be all with The form of voice is passed on, and user can obtain information by playing audio file.Audio file is supported that user is obtained with ear and is believed Breath, can liberate the eyes of user, there is certain help to sight protectio.
When there is multiple audio files on mobile terminal, user needs to put out audio file one by one and play a period of time That audio file for oneself wanting to look up can be just found, is taken longer.Certainly, user can also pass through some of audio file Feature (such as voice duration or file designation) goes to search the audio file for needing, however, for voice duration identical sound Frequency file, user cannot be distinguished by, and voice duration causes certain puzzlement to the memory of user;Additionally, for by file The mode that name is searched, user needs the filename for pre-setting each audio file, operates comparatively laborious.It can be seen that, prior art Middle pitch frequency file inconvenience user search, and affects Consumer's Experience.
The content of the invention
The embodiment of the present invention provides a kind of audio file processing method and mobile terminal, to solve prior art sound intermediate frequency text Part inconvenience user search, and affects the problem of Consumer's Experience.
In a first aspect, embodiments providing a kind of audio file processing method, methods described includes:
Voice in identification target audio file, and determine target text content corresponding with the target audio file;
The target text content is added in the filename of the target audio file.
Second aspect, the embodiment of the present invention also provides a kind of mobile terminal, and the mobile terminal includes:
Determining module, for recognizing target audio in voice, and determine corresponding with target audio file target Word content;
Add module, for the target text content to be added in the filename of the target audio file.
In embodiments of the present invention, the voice in the audio file processing method identification target audio file, and determine Target text content corresponding with the target audio file;The target text content is added to into the target audio file Filename in.So, the audio file processing method and mobile terminal that the present invention is provided can be according in target audio file Voice determine corresponding target text content, and the target text content is added to into the file of the target audio file Name in, user can according to the target text content search target audio file, improve user search audio file Efficiency, enhance Consumer's Experience.
Description of the drawings
In order to be illustrated more clearly that the technical scheme of the embodiment of the present invention, below will be to needed for embodiment of the present invention description The accompanying drawing to be used is briefly described, it should be apparent that, drawings in the following description are only some embodiments of the present invention, For those of ordinary skill in the art, without having to pay creative labor, can be being obtained according to these accompanying drawings Obtain other accompanying drawings.
Fig. 1 is a kind of flow chart of audio file processing method that first embodiment of the invention is provided;
Fig. 2 is the flow chart of another kind of audio file processing method that second embodiment of the invention is provided;
Fig. 3 is the flow chart of another kind of audio file processing method that third embodiment of the invention is provided;
Fig. 4 is a kind of structure chart of mobile terminal that fourth embodiment of the invention is provided;
Fig. 5 is the structure chart of another kind of mobile terminal that fourth embodiment of the invention is provided;
Fig. 6 is the structure chart of another kind of mobile terminal that fourth embodiment of the invention is provided;
Fig. 7 is the structure chart of another kind of mobile terminal that fourth embodiment of the invention is provided;
Fig. 8 is a kind of structure chart of second converting unit that fourth embodiment of the invention is provided;
Fig. 9 is the structure chart of another kind of mobile terminal that fifth embodiment of the invention is provided.
Specific embodiment
Below in conjunction with the accompanying drawing in the embodiment of the present invention, the technical scheme in the embodiment of the present invention is carried out clear, complete Site preparation is described, it is clear that described embodiment is a part of embodiment of the invention, rather than the embodiment of whole.Based on this Embodiment in bright, the every other enforcement that those of ordinary skill in the art are obtained under the premise of creative work is not made Example, belongs to the scope of protection of the invention.
First embodiment
Referring to Fig. 1, Fig. 1 is a kind of flow chart of audio file processing method provided in an embodiment of the present invention, methods described In being applied to a mobile terminal, as shown in figure 1, the audio file processing method includes:
Voice in step 101, identification target audio file, and determine target text corresponding with the target voice file Word content.
In the step, the audio file processing method carries out speech recognition to target audio file, and according to identification knot Fruit determines target text content corresponding with the target voice.The target audio file can be the multimedia for including sound File, or only including sound file, can also be the speech message in social software, it should be noted that In embodiments of the invention, the type of the target audio file is not limited.
The audio file processing method can recognize all voices in the target audio file, then by the mesh All voices in mark with phonetic symbols frequency file are converted into the first word content.The audio file processing method can determine described first Word content is the target text content, it is also possible to determine that the first keyword in first word content is the target Word content.
The audio file can also recognize the target sound frequency range in the target audio file, then by the audio section Interior voice is converted into the second word content.The audio file processing method can determine that second word content is described Target text content, it is also possible to determine that the second keyword in second word content is the target text content.
Step 102, the target text content is added in the filename of the target audio file.
In the step, the target text content is added to the target audio file by the audio file processing method Filename in, so, user can according to the file destination content search target audio file.
The audio file processing method can delete the old file name of the target audio file, and the target is literary New file name of the word content as the target audio file;The old file name of the target audio file can not also be deleted, But combine to form new filename with old file name in the target text content, the combination can be the target Word content is before the old file name, it is also possible to after the old file name, can also be interspersed in the old file name Centre, here do not limit.
Alternatively, it is described that the target text content is added in the filename of the target audio file, including:
Using the target text content as the target audio file new file name;Or
The target text content and old file name are combined the new file name as the target audio file.
In the embodiment, the audio file processing method deletes the old file name of the target audio file, and by institute Target text content is stated as the new file name of the target audio file;Or do not delete the original text of the target audio file Part name, but combine to form new filename with old file name in the target text content.It should be noted that the combination Mode can be the target text content before the old file name, it is also possible to after the old file name, can be with The centre of the old file name is interspersed in, here is not limited.
For example, for old file name is the target audio file of " A ", the audio file processing method can be true Fixed its target text content is after " examination ", the target text content " examination " to be substituted into old file name " A ", will be described The filename of target audio file is updated to the target text content " examination ".It is understood that at the audio file Reason method the target text content " examination " can also be combined with the old file name " A " after as the target audio The new file name of file, for example, be updated to " examination A " or " A examinations " by the filename of the file destination.
In the embodiment of the present invention, above-mentioned mobile terminal can be any mobile terminal for possessing shoot function, for example:Hand Machine, panel computer (Tablet Personal Computer), kneetop computer (Laptop Computer), individual digital are helped Reason (personal digital assistant, abbreviation PDA), mobile Internet access device (Mobile Internet Device, ) or Wearable device (Wearable Device) etc. MID.
The audio file processing method of first embodiment of the invention, recognize target audio file in voice, and determine with The corresponding target text content of the target audio file;The target text content is added to into the target audio file In filename.So, user can according to the target text content search target audio file, the present embodiment provide Audio file processing method facilitates user rapidly and accurately to search audio file, improves the efficiency that user searches audio file, Enhance Consumer's Experience.
Second embodiment
Referring to Fig. 2, Fig. 2 is the flow chart of another kind of audio file processing method that second embodiment of the invention is provided, such as Shown in Fig. 2, the audio file processing method includes:
Step 201, the whole voices in target audio file are converted into into the first word content.
In the step, the audio file processing method recognizes the whole voices in the target audio file, and by institute The whole voices stated in target audio file are converted into the first word content.The identification voice simultaneously converts speech into word category In prior art category, will not be described here.
Step 202, the first keyword determined according to the first preset rules in first word content are target text Content.
In the present embodiment, the audio file processing method determines first word content according to the first preset rules The first keyword be target text content, first preset rules can be word occur frequency, described first is crucial Word can be a keyword, or multiple keywords, and here is not limited.For example, the audio file processing method Can determine in first word content that one or more most words of occurrence number are first keyword, or can be with Determine that occurrence number is first keyword more than one or more words of preset times in first word content.This Sample, when the target audio file content is longer, by determine keyword for target text content method can take compared with Few space.
Step 203, the target text content is added in the filename of the target audio file.
The step 203 is identical with the step 102 in first embodiment of the invention, will not be described here.
In second embodiment of the invention, the audio file processing method changes the whole voices in target audio file Into the first word content;First word content is determined for target text content, or it is true according to first preset rules The first keyword in fixed first word content is target text content;The target text content is added to into the mesh In the filename of mark with phonetic symbols frequency file.So, user can according to the target text content search target audio file, this The audio file processing method that embodiment is provided facilitates user rapidly and accurately to search audio file, improves user and searches audio frequency The efficiency of file, enhances Consumer's Experience.
Referring to Fig. 3, Fig. 3 is the flow chart of another kind of audio file processing method that third embodiment of the invention is provided, such as Shown in Fig. 3, the audio file processing method includes:
Voice in step 301, identification target audio file in target sound frequency range, and by the language in the target sound frequency range Sound is converted into the second word content.
In the step, the audio file processing method obtains the target sound frequency range in the target audio file, identification Voice in the target sound frequency range, and the voice in the target sound frequency range is converted into into the second word content.
In the embodiment, the target sound frequency range can be preset duration, or random duration.The audio file Processing method can obtain the audio section of preset duration as target audio according to preset rules from the target audio file Section, it is also possible to which the random audio section of preset duration that obtains from the target audio file is used as the target sound frequency range.
It should be noted that when not having voice in the target sound frequency range, the audio file processing method can be from institute State target audio file to reacquire target sound frequency range and recognize voice, for example, can also may be used with another audio section of preset duration To obtain audio section by adjustment duration, here is not limited, until identifying the voice in target sound frequency range.
Step 302, second word content is determined for target text content, or determine institute according to the second preset rules It is target text content to state the second keyword in the second word content.
In the present embodiment, the audio file processing method can determine second word content in target text Hold, it is also possible to which the second preset rules determine that the second keyword of second word content is target text content.Described second Preset rules can be identical with the first preset rules in second embodiment of the invention, it is also possible to first preset rules not Together, here is not limited.Similarly, second keyword can be a keyword, or multiple keywords.
Step 303, the target text content is added in the filename of the target audio file.
The step 303 is identical with the step 102 in first embodiment of the invention, will not be described here.
Alternatively, the voice in the identification target audio file in target sound frequency range, and by the target audio Voice in section is converted into the second word content, including:
Recognize the first object audio section in the target audio file, and whether judge in the first object audio section Including voice;
If including voice in the first object audio section, the voice in the first object audio section is converted into into second Word content;
If not including voice in the first object audio section, the second target audio in the target audio file is recognized Section, and the voice in the second target sound frequency range is converted into into the second word content.
In the embodiment, the audio file processing method first recognizes the first object audio frequency in the target audio file Section (audio section of such as 0~a durations, a represents the time), and judge whether include voice in the first object audio section, if Include voice in the first object audio section, the voice in the first object audio section is converted into into the second word content. If on the contrary, not including voice in the first object audio section, the audio file processing method recognizes the target audio The voice in the second target sound frequency range (such as the audio section of the audio section of a~2a durations or 0~2a durations) in file, and Voice in the second target sound frequency range is converted into into the second word content.If it is understood that second target sound Do not include voice in frequency range, the audio file processing method can again adjust the target sound frequency range (such as 2a~3a), Until identifying the voice in target sound frequency range.
Third embodiment of the invention, in the audio file processing method identification target audio file in target sound frequency range Voice, and the voice in the target sound frequency range is converted into into the second word content;Determine that second word content is target Word content, or determine that the second keyword in second word content is in target text according to the second preset rules Hold;The target text content is added in the filename of the target audio file.So, user can be according to the mesh Mark word content searches the target audio file, and the audio file processing method that the present embodiment is provided facilitates user quick and precisely Audio file is searched on ground, improves the efficiency that user searches audio file, enhances Consumer's Experience.
Fourth embodiment
It is a kind of structure chart of mobile terminal that fourth embodiment of the invention is provided, as shown in figure 4, the shifting referring to Fig. 4 Dynamic terminal 400 includes:
Determining module 401, for recognizing target audio in voice, and determine corresponding with target audio file mesh Mark word content;
Add module 402, for the target text content to be added in the filename of the target audio file.
Alternatively, referring to Fig. 5, Fig. 5 is the structure chart of another kind of mobile terminal that fourth embodiment of the invention is provided, and is such as schemed Shown in 5, the determining module 401 includes:
First converting unit 4011, for the whole voices in the target audio file to be converted in the first word Hold;
First determining unit 4012, for determining that in first word content first is crucial according to the first preset rules Word is the target text content.
Alternatively, referring to Fig. 6, Fig. 6 is the structure chart of another kind of mobile terminal that fourth embodiment of the invention is provided, and is such as schemed Shown in 6, the determining module 401 includes:
Second converting unit 4013, for recognizing the target audio file in voice in target sound frequency range, and by institute State the voice in target sound frequency range and be converted into the second word content;
Second determining unit 4014, for determining that second word content is the target text content;Or
Referring to Fig. 7, Fig. 7 is the structure chart of another kind of mobile terminal that fourth embodiment of the invention is provided, as shown in fig. 7, The determining module 401 can include:3rd determining unit 4015, for determining second word according to the second preset rules The second keyword in content is the target text content.
Alternatively, referring to Fig. 8, Fig. 8 is a kind of structure of second converting unit 4013 that fourth embodiment of the invention is provided Figure, as shown in figure 8, second converting unit 4013 includes:
Judgment sub-unit 40131, for recognizing the target audio file in first object audio section, and judge described Whether include voice in first object audio section;
First conversion subunit 40132, if for including voice in the first object audio section, by the first object Voice in audio section is converted into the second word content;
Second conversion subunit 40133, if for not including voice in the first object audio section, recognizing the target The second target sound frequency range in audio frequency, and the voice in the second target sound frequency range is converted into into the second word content.
Alternatively, the add module 402 is used for:
Using the target text content as the target audio file new file name;Or
The target text content and old file name are combined the new file name as the target audio file.
Mobile terminal 400 can realize each process that mobile terminal is realized in the embodiment of the method for Fig. 1 to Fig. 3, to keep away Exempt to repeat, will not be described here.
5th embodiment
Referring to Fig. 9, Fig. 9 is the structural representation of another kind of mobile terminal 900 that the present invention is provided, as shown in figure 9, mobile Terminal 900 includes:At least one processor 901, memory 902, at least one user interface 903 and network interface 904.It is mobile Each component in terminal 900 is coupled by bus system 905, it is understood that bus system 905 is used to realize Connection communication between these components.Bus system 905 except including in addition to data wire, also including power bus, controlling bus and Status signal bus in addition.But for the sake of for clear explanation, in fig .9 various buses are all designated as into bus system 905.
Wherein, user interface 903 can include display, keyboard or pointing device, such as mouse, trace ball (trackball), touch-sensitive plate or touch-screen etc..
It is appreciated that the memory 902 in the embodiment of the present invention can be volatile memory or nonvolatile memory, Or may include both volatibility and nonvolatile memory.Wherein, nonvolatile memory can be read-only storage (Read- Only Memory, ROM), programmable read only memory (Programmable ROM, PROM), the read-only storage of erasable programmable Device (Erasable PROM, EPROM), Electrically Erasable Read Only Memory (Electrically EPROM, EEPROM) or Flash memory.Volatile memory can be random access memory (Random Access Memory, RAM), and it is used as outside height Speed caching.By exemplary but be not restricted explanation, the RAM of many forms can use, such as static RAM (Static RAM, SRAM), dynamic random access memory (Dynamic RAM, DRAM), Synchronous Dynamic Random Access Memory (Synchronous DRAM, SDRAM), double data speed synchronous dynamic RAM (Double Data Rate SDRAM, DDRSDRAM), enhancement mode Synchronous Dynamic Random Access Memory (Enhanced SDRAM, ESDRAM), synchronized links Dynamic random access memory (Synchlink DRAM, SLDRAM) and direct rambus random access memory (Direct Rambus RAM, DRRAM).The memory 902 of system and method described herein be intended to including but not limited to these and arbitrarily its It is adapted to the memory of type.
In some embodiments, memory 902 stores following element, can perform module or data structure, or Person their subset, or their superset:Operating system 9021 and application program 9022.
Wherein, operating system 9021, comprising various system programs, such as ccf layer, core library layer, driving layer etc., are used for Realize various basic businesses and process hardware based task.Application program 9022, comprising various application programs, such as media Player (Media Player), browser (Browser) etc., for realizing various applied business.Realize the embodiment of the present invention The program of method may be embodied in application program 9022.
In embodiments of the present invention, by call memory 902 store program or instruction, specifically, can be application The program stored in program 9022 or instruction, processor 901 is used for:
Voice in identification target audio file, and determine target text content corresponding with the target audio file;
The target text content is added in the filename of the target audio file.
The method that the embodiments of the present invention are disclosed can apply in processor 901, or be realized by processor 901. A kind of possibly IC chip of processor 901, the disposal ability with signal.During realization, said method it is each Step can be completed by the instruction of the integrated logic circuit of the hardware in processor 901 or software form.Above-mentioned process Device 901 can be general processor, digital signal processor (Digital Signal Processor, DSP), special integrated electricity Road (Application Specific Integrated Circuit, ASIC), ready-made programmable gate array (Field Programmable Gate Array, FPGA) either other PLDs, discrete gate or transistor logic, Discrete hardware components.Can realize or perform disclosed each method in the embodiment of the present invention, step and logic diagram.It is general Processor can be microprocessor or the processor can also be any conventional processor etc..With reference to embodiment of the present invention institute The step of disclosed method, can be embodied directly in hardware decoding processor and perform and complete, or with the hardware in decoding processor And software module combination execution is completed.Software module may be located at random access memory, and flash memory, read-only storage may be programmed read-only In the ripe storage medium in this area such as memory or electrically erasable programmable memory, register.The storage medium is located at Memory 902, processor 901 reads the information in memory 902, the step of complete said method with reference to its hardware.
It is understood that embodiments described herein can with hardware, software, firmware, middleware, microcode or its Combine to realize.For hardware is realized, processing unit can be realized in one or more special ICs (Application Specific Integrated Circuits, ASIC), digital signal processor (Digital Signal Processing, DSP), digital signal processing appts (DSP Device, DSPD), programmable logic device (Programmable Logic Device, PLD), field programmable gate array (Field-Programmable Gate Array, FPGA), general processor, In controller, microcontroller, microprocessor, other electronic units for performing herein described function or its combination.
For software is realized, can be realized herein by performing the module (such as process, function etc.) of function described herein Described technology.Software code is storable in memory and by computing device.Memory can within a processor or Realize processor outside.
Alternatively, the voice in the identification of the processor 901 target audio file, and determine and the target audio file Corresponding target text content, including:
Whole voices in the target audio file are converted into into the first word content;
Determine that the first keyword in first word content is the target text content according to the first preset rules.
Alternatively, the voice in the identification of the processor 901 target audio file, and determine and the target audio file Corresponding target text content, including:
The voice in target sound frequency range in the target audio file is recognized, and the voice in the target sound frequency range is turned Change the second word content into;
Determine that second word content is the target text content;Or
Determine that the second keyword in second word content is the target text content according to the second preset rules.
The processor 901 recognizes the voice in the target audio file in target sound frequency range, and by the target sound Voice in frequency range is converted into the second word content, including:
Recognize the first object audio section in the target audio file, and whether judge in the first object audio section Including voice;
If including voice in the first object audio section, the voice in the first object audio section is converted into into second Word content;
If not including voice in the first object audio section, the second target audio in the target audio file is recognized Section, and the voice in the second target sound frequency range is converted into into the second word content.
Alternatively, it is described that the target text content is added in the filename of the target audio file, including:
Using the target text content as the target audio file new file name;Or
The target text content and old file name are combined the new file name as the target audio file.
Mobile terminal 900 can realize each process that mobile terminal 900 is realized in previous embodiment, to avoid repeating, Here repeat no more.
The mobile terminal 900 of the embodiment of the present invention, recognizes the voice in target audio file, and determines and the target sound The corresponding target text content of frequency file;The target text content is associated with the target audio file.So, Yong Huke With the target audio file according to the target text content search, the audio file processing method that the present embodiment is provided is convenient User rapidly and accurately searches audio file, improves the efficiency that user searches audio file, enhances Consumer's Experience.
Those of ordinary skill in the art are it is to be appreciated that the list of each example with reference to the embodiments described herein description Unit and algorithm steps, being capable of being implemented in combination in electronic hardware or computer software and electronic hardware.These functions are actually Performed with hardware or software mode, depending on the application-specific and design constraint of technical scheme.Professional and technical personnel Each specific application can be used different methods to realize described function, but this realization it is not considered that exceeding The scope of the present invention.
Those skilled in the art can be understood that, for convenience and simplicity of description, the system of foregoing description, The specific work process of device and unit, may be referred to the corresponding process in preceding method embodiment, will not be described here.
In embodiment provided herein, it should be understood that disclosed apparatus and method, can pass through other Mode is realized.For example, device embodiment described above is only schematic, and for example, the division of the unit is only A kind of division of logic function, can there is an other dividing mode when actually realizing, such as multiple units or component can with reference to or Person is desirably integrated into another system, or some features can be ignored, or does not perform.Another, shown or discussed is mutual Between coupling or direct-coupling or communication connection can be INDIRECT COUPLING or communication link by some interfaces, device or unit Connect, can be electrical, mechanical or other forms.
The unit as separating component explanation can be or may not be it is physically separate, it is aobvious as unit The part for showing can be or may not be physical location, you can with positioned at a place, or can also be distributed to multiple On NE.Some or all of unit therein can be according to the actual needs selected to realize embodiment of the present invention scheme Purpose.
In addition, each functional unit in each embodiment of the invention can be integrated in a processing unit, it is also possible to It is that unit is individually physically present, it is also possible to which two or more units are integrated in a unit.
If the function is realized and as independent production marketing or when using using in the form of SFU software functional unit, can be with In being stored in a computer read/write memory medium.Based on such understanding, technical scheme is substantially in other words The part contributed to prior art or the part of the technical scheme can be embodied in the form of software product, the meter Calculation machine software product is stored in a storage medium, including some instructions are used so that a computer equipment (can be individual People's computer, server, or network equipment etc.) perform all or part of step of each embodiment methods described of the invention. And aforesaid storage medium includes:USB flash disk, portable hard drive, ROM, RAM, magnetic disc or CD etc. are various can be with store program codes Medium.
The above, the only specific embodiment of the present invention, but protection scope of the present invention is not limited thereto, any Those familiar with the art the invention discloses technical scope in, change or replacement can be readily occurred in, all should contain Cover within protection scope of the present invention.Therefore, protection scope of the present invention should be defined by scope of the claims.

Claims (10)

1. a kind of audio file processing method, it is characterised in that methods described includes:
Voice in identification target audio file, and determine target text content corresponding with the target audio file;
The target text content is added in the filename of the target audio file.
2. the method for claim 1, it is characterised in that the voice in the identification target audio file, and determine with The corresponding target text content of the target audio file, including:
Whole voices in the target audio file are converted into into the first word content;
Determine that the first keyword in first word content is the target text content according to the first preset rules.
3. the method for claim 1, it is characterised in that the voice in the identification target audio file, and determine with The corresponding target text content of the target audio file, including:
The voice in target sound frequency range in the target audio file is recognized, and the voice in the target sound frequency range is converted into Second word content;
Determine that second word content is the target text content;Or
Determine that the second keyword in second word content is the target text content according to the second preset rules.
4. method as claimed in claim 3, it is characterised in that in the identification target audio file in target sound frequency range Voice, and the voice in the target sound frequency range is converted into into the second word content, including:
The first object audio section in the target audio file is recognized, and judges whether include in the first object audio section Voice;
If including voice in the first object audio section, the voice in the first object audio section is converted into into the second word Content;
If not including voice in the first object audio section, the second target sound frequency range in the target audio file is recognized, And the voice in the second target sound frequency range is converted into into the second word content.
5. the method as described in any one of Claims 1 to 4, it is characterised in that described to be added to the target text content In the filename of the target audio file, including:
Using the target text content as the target audio file new file name;Or
The target text content and old file name are combined the new file name as the target audio file.
6. a kind of mobile terminal, it is characterised in that the mobile terminal includes:
Determining module, for recognizing target audio in voice, and determine corresponding with target audio file target text Content;
Add module, for the target text content to be added in the filename of the target audio file.
7. mobile terminal as claimed in claim 6, it is characterised in that the determining module includes:
First converting unit, for the whole voices in the target audio file to be converted into into the first word content;
First determining unit, for determining that the first keyword in first word content is described according to the first preset rules Target text content.
8. mobile terminal as claimed in claim 6, it is characterised in that the determining module includes:
Second converting unit, for recognizing the target audio file in voice in target sound frequency range, and by the target sound Voice in frequency range is converted into the second word content;
Second determining unit, for determining that second word content is the target text content;Or
3rd determining unit, for determining that the second keyword in second word content is described according to the second preset rules Target text content.
9. mobile terminal as claimed in claim 8, it is characterised in that second converting unit includes:
Judgment sub-unit, for recognizing the target audio file in first object audio section, and judge the first object Whether include voice in audio section;
First conversion subunit, if for including voice in the first object audio section, by the first object audio section Voice be converted into the second word content;
Second conversion subunit, if for not including voice in the first object audio section, in recognizing the target audio Second target sound frequency range, and the voice in the second target sound frequency range is converted into into the second word content.
10. the mobile terminal as described in any one of claim 6~9, it is characterised in that the add module is used for:
Using the target text content as the target audio file new file name;Or
The target text content and old file name are combined the new file name as the target audio file.
CN201611243118.2A 2016-12-29 2016-12-29 Audio file processing method and mobile terminal Pending CN106649807A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201611243118.2A CN106649807A (en) 2016-12-29 2016-12-29 Audio file processing method and mobile terminal

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201611243118.2A CN106649807A (en) 2016-12-29 2016-12-29 Audio file processing method and mobile terminal

Publications (1)

Publication Number Publication Date
CN106649807A true CN106649807A (en) 2017-05-10

Family

ID=58835898

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201611243118.2A Pending CN106649807A (en) 2016-12-29 2016-12-29 Audio file processing method and mobile terminal

Country Status (1)

Country Link
CN (1) CN106649807A (en)

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101437115A (en) * 2007-11-12 2009-05-20 鸿富锦精密工业(深圳)有限公司 Digital camera and method for setting image name
CN102074235A (en) * 2010-12-20 2011-05-25 上海华勤通讯技术有限公司 Method of video speech recognition and search
CN103390016A (en) * 2012-05-07 2013-11-13 Lg电子株式会社 Method for displaying text associated with audio file and electronic device

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101437115A (en) * 2007-11-12 2009-05-20 鸿富锦精密工业(深圳)有限公司 Digital camera and method for setting image name
CN102074235A (en) * 2010-12-20 2011-05-25 上海华勤通讯技术有限公司 Method of video speech recognition and search
CN103390016A (en) * 2012-05-07 2013-11-13 Lg电子株式会社 Method for displaying text associated with audio file and electronic device

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
王永琦: "《MATLAB与音视频技术》", 30 November 2013 *
赵力: "《语音信号处理》", 31 May 2016 *
陆虎敏: "《飞机座舱显示与控制技术》", 31 December 2015 *

Similar Documents

Publication Publication Date Title
US20220148594A1 (en) Using multiple modality input to feedback context for natural language understanding
US20220214775A1 (en) Method for extracting salient dialog usage from live data
US9299342B2 (en) User query history expansion for improving language model adaptation
US20230072352A1 (en) Speech Recognition Method and Apparatus, Terminal, and Storage Medium
JP5799621B2 (en) Information processing apparatus, information processing method, and program
CN103339623B (en) It is related to the method and apparatus of Internet search
US20160055251A1 (en) System and method for compending blogs
CN108090174A (en) A kind of robot answer method and device based on system function syntax
US20190196782A1 (en) Techniques to present a user interface for the visually impaired
CN1637741A (en) Annotation management in pen-based computing system
CN103902533B (en) It is a kind of to search for through method and apparatus
AU2017216520A1 (en) Common data repository for improving transactional efficiencies of user interactions with a computing device
US20150169676A1 (en) Generating a Table of Contents for Unformatted Text
JP2017146720A (en) Patent requirement adequacy prediction device and patent requirement adequacy prediction program
CN101689198A (en) Phonetic search using normalized string
CN111428030B (en) Corpus classifying method and system
CN107977420A (en) The abstract extraction method, apparatus and readable storage medium storing program for executing of a kind of evolved document
US20190317648A1 (en) System enabling audio-based navigation and presentation of a website
CN109710732A (en) Information query method, device, storage medium and electronic equipment
CN108197105A (en) Natural language processing method, apparatus, storage medium and electronic equipment
CN109657043B (en) Method, device and equipment for automatically generating article and storage medium
CN104933099B (en) Method and device for providing target search result for user
CN106650351A (en) running method of application program and mobile terminal
CN103514182B (en) Music searching method and device
CN106649807A (en) Audio file processing method and mobile terminal

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
TA01 Transfer of patent application right

Effective date of registration: 20171107

Address after: 283 No. 523860 Guangdong province Dongguan city Changan town usha BBK Avenue

Applicant after: VIVO MOBILE COMMUNICATION CO., LTD.

Applicant after: Wewo Mobile Communication Co. Ltd. Beijing branch

Address before: 283 No. 523860 Guangdong province Dongguan city Changan town usha BBK Avenue

Applicant before: VIVO MOBILE COMMUNICATION CO., LTD.

TA01 Transfer of patent application right
RJ01 Rejection of invention patent application after publication

Application publication date: 20170510

RJ01 Rejection of invention patent application after publication