CN110111816A - Method, the method for audio processing, electronic equipment and the server-side of recording audio - Google Patents

Method, the method for audio processing, electronic equipment and the server-side of recording audio Download PDF

Info

Publication number
CN110111816A
CN110111816A CN201910147012.XA CN201910147012A CN110111816A CN 110111816 A CN110111816 A CN 110111816A CN 201910147012 A CN201910147012 A CN 201910147012A CN 110111816 A CN110111816 A CN 110111816A
Authority
CN
China
Prior art keywords
audio
recording
fragment
target
audio fragment
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201910147012.XA
Other languages
Chinese (zh)
Other versions
CN110111816B (en
Inventor
岳振
孙刚
蔡单奇
陈鹤群
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
MIGU Digital Media Co Ltd
MIGU Culture Technology Co Ltd
Original Assignee
MIGU Digital Media Co Ltd
MIGU Culture Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by MIGU Digital Media Co Ltd, MIGU Culture Technology Co Ltd filed Critical MIGU Digital Media Co Ltd
Priority to CN201910147012.XA priority Critical patent/CN110111816B/en
Publication of CN110111816A publication Critical patent/CN110111816A/en
Application granted granted Critical
Publication of CN110111816B publication Critical patent/CN110111816B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/18Speech classification or search using natural language modelling
    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11BINFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B20/00Signal processing not specific to the method of recording or reproducing; Circuits therefor
    • G11B20/10Digital recording or reproducing
    • G11B20/10527Audio or video recording; Data buffering arrangements
    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11BINFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B20/00Signal processing not specific to the method of recording or reproducing; Circuits therefor
    • G11B20/10Digital recording or reproducing
    • G11B20/10527Audio or video recording; Data buffering arrangements
    • G11B2020/10537Audio or video recording
    • G11B2020/10546Audio or video recording specifically adapted for audio data

Abstract

The present embodiments relate to fields of communication technology, disclose method, the method for audio processing, electronic equipment and the server-side of a kind of recording audio.The method of recording audio in the present invention is applied to recording device, comprising: obtains the semantics recognition of the audio fragment currently acquired as a result, and determining the affiliated theme of audio fragment according to semantics recognition result;Judge whether the affiliated theme of audio fragment and target topic are identical, and according to judging result, control the recording process of audio, wherein target topic is the generic to recorded audio content.Present embodiment can automatically control the recording to audio signal, avoid the typing of invalid audio fragment, improve the quality of recording.

Description

Method, the method for audio processing, electronic equipment and the server-side of recording audio
Technical field
The present embodiments relate to field of communication technology, in particular to the side of a kind of method of recording audio, audio processing Method, electronic equipment and server-side.
Background technique
With the continuous development of science and technology, current electronic equipment usually all has sound-recording function, for example, smart phone has Sound-recording function, MP3 have sound-recording function etc..User generallys use the equipment with sound-recording function in conference content or interview Appearance is recorded.
When user is recorded using the sound pick-up outfit, user can click " start button " on the sound pick-up outfit, with Start the recording that sound pick-up outfit carries out audio, and at the end of the audio recording, then user needs to click conclusion button (or again Click start button) to terminate audio recording.
At least there are the following problems in the prior art for inventor's discovery: at present in recording audio, user being needed to open manually Audio recording is turned off manually in dynamic audio recording.But in actual use, since usage scenario is complicated and changeable, user is often not Recording (or even many times user often forgets that pause is recorded), the invalid audio of typing, to reduce can be suspended in time Recording quality, while the invalid audio of typing can also occupy the memory space of sound pick-up outfit.For example, user B starting recording is set It is standby, the interview content with A is recorded, A has been connected to a phone suddenly during recording, and A is directed to the conversation content of phone simultaneously It is not the audio content that this interview needs to record, needs user B to suspend recording manually at this time, if user B does not suspend in time Record or forget pause record, then can the invalid audio of typing reduced since message is uncorrelated to this recording substance Recording quality increases the time (when arranging interview data, needing to delete invalid audio) that subsequent user arranges recording substance, together Shi Wuxiao audio also takes up the memory space of sound pick-up outfit.
Summary of the invention
A kind of method for being designed to provide recording of embodiment of the present invention allows to automatically control to audio signal Recording, avoid the typing of invalid audio fragment, improve the quality of recording.
In order to solve the above technical problems, embodiments of the present invention provide a kind of method of recording audio, it is applied to record Mixer, comprising: obtain the semantics recognition of the audio fragment currently acquired as a result, and determining audio piece according to semantics recognition result Theme belonging to section;Judge whether the affiliated theme of audio fragment and target topic are identical, and according to judging result, controls the record of audio Process processed, wherein target topic is the generic to recorded audio content.
Embodiments of the present invention additionally provide a kind of method of audio processing, are applied to server-side, comprising: receive recording The audio data that device is sent, wherein audio data includes the target audio that recording device is recorded and the control for recording target audio Information processed, information at the time of control information includes pause recording audio;According to the control information, target audio is carried out at editing Reason;Wherein, recording device records the process of target audio are as follows: is determined according to the semantics recognition result of the audio fragment currently acquired The affiliated theme of audio fragment judges whether the affiliated theme of audio fragment and target topic are identical, according to judging result, controls sound The recording process of frequency.
Embodiments of the present invention additionally provide a kind of electronic equipment, comprising: at least one processor;And at least The memory of one processor communication connection;Wherein, memory is stored with the instruction that can be executed by least one processor, instruction It is executed by least one processor, so that the method that at least one processor is able to carry out above-mentioned recording audio.
Embodiments of the present invention additionally provide a kind of server-side, comprising: at least one processor;And at least one The memory of a processor communication connection;Wherein, memory is stored with the instruction that can be executed by least one processor, instructs quilt At least one processor executes, so that the method that at least one processor is able to carry out above-mentioned audio processing.
Embodiment of the present invention in terms of existing technologies, by by the affiliated theme of the audio fragment currently acquired and mesh Mark theme is compared, and the recording process of audio is controlled according to judging result, and audio recording process includes to audio recording Pause and restart audio recording;Due to during the entire process of recording audio, without suspending the record of audio manually System, the problem of typing audio fragment unrelated with target topic due to manually forgetting pause can be avoided the occurrence of, to improve record The quality of sound.And semantics recognition is used to the audio fragment of acquisition, it can determine the content that the audio fragment is recorded, and then can be with The affiliated theme of the audio fragment currently acquired is quickly and accurately determined according to the content of recording;Simultaneously as in entire audio In recording process, be by judge acquisition the affiliated theme of audio fragment and target topic whether unanimously control audio record System can also be avoided the occurrence of because manually forget to cause to miss due to cancelling pause recording relevant to target topic audio the problem of, The step of further improving the quality of recording audio, reducing processing of the later period to the audio of recording, improves the audio to recording Processing speed.
In addition, the method for recording audio further include: if receiving the instruction that instruction terminates the recording of audio, terminate to sound The recording of frequency, obtains target audio;Audio data is uploaded to server-side, audio data includes target audio and recording target The control information of audio, wherein information at the time of control information includes pause recording audio.By by target audio and recording The control information of target audio is uploaded to server-side, can be handled by controlling information target audio by server-side, e.g., Editing processing, can simplify the step of follow-up service end is to audio processing, accelerate the speed to audio processing, to pass through service The audio quality of target audio is improved again in end.
In addition, controlling the recording process of audio according to judging result, specifically including: if it is determined that the affiliated theme of audio fragment It is not identical as target topic, then suspend the recording of audio;In the recording process of pause audio, if detecting and target topic phase Same audio fragment, then recording of the restarting to audio.It include to the temporary of audio recording during controlling audio recording Stop and restart, suspends the recording of audio, it can be to avoid typing and the incoherent audio fragment of target topic;And in pause audio In recording process, if detecting, the affiliated theme of the audio fragment of acquisition is identical as target topic, restarts the recording of audio, The problem of missing recording identical with target topic audio fragment can be avoided the occurrence of.
In addition, obtaining the semantics recognition of the audio fragment currently acquired as a result, specifically including: the audio piece that will currently acquire Section is uploaded to server-side, and receives the semantics recognition result of the audio fragment fed back by server-side;Alternatively, to the sound currently acquired Frequency segment carries out semantics recognition, obtains the semantics recognition result of audio fragment.It provides two kinds and obtains the audio fragment currently acquired Semantics recognition result mode, convenient for flexibly obtaining the semantics recognition result of the audio fragment currently acquired.
In addition, before the semantics recognition result for obtaining the audio fragment currently acquired, the method for recording audio further include: The semantics recognition of the first section audio fragment acquired for the first time is obtained as a result, and according to the semantics recognition of first section audio fragment as a result, really Determine the affiliated theme of first section audio fragment;And using the affiliated theme of first section audio fragment as target topic.Due to first section audio fragment In generally comprise subject content to recording audio, can be with thus using the affiliated theme of first section audio fragment as target topic Quickly and accurately determine target topic, and implementation is simple.
In addition, in the recording process of pause audio, the method for recording audio further include: in the recording process of pause audio In, if detecting audio fragment identical with target topic, save audio fragment identical with target topic.In pause audio In recording process, save identical with target topic audio fragment, avoid restart record when, occur leakage preservation currently with The problem of target topic identical audio fragment, to improve the quality of audio recording.
Detailed description of the invention
One or more embodiments are illustrated by the picture in corresponding attached drawing, these exemplary theorys The bright restriction not constituted to embodiment, the element in attached drawing with same reference numbers label are expressed as similar element, remove Non- to have special statement, composition does not limit the figure in attached drawing.
Fig. 1 is a kind of method idiographic flow schematic diagram for recording audio that first embodiment provides according to the present invention;
Fig. 2 is a kind of method idiographic flow schematic diagram for recording audio that second embodiment provides according to the present invention;
Fig. 3 is a kind of method idiographic flow schematic diagram for audio processing that third embodiment provides according to the present invention;
Fig. 4 is a kind of method idiographic flow schematic diagram for audio processing that the 4th embodiment provides according to the present invention;
Fig. 5 is the concrete structure schematic diagram for a kind of electronic equipment that the 5th embodiment provides according to the present invention;
Fig. 6 is a kind of concrete structure schematic diagram for server-side that sixth embodiment provides according to the present invention.
Specific embodiment
In order to make the object, technical scheme and advantages of the embodiment of the invention clearer, below in conjunction with attached drawing to the present invention Each embodiment be explained in detail.However, it will be understood by those skilled in the art that in each embodiment party of the present invention In formula, in order to make the reader understand this application better, many technical details are proposed.But even if without these technical details And various changes and modifications based on the following respective embodiments, the application technical solution claimed also may be implemented.
The division of each embodiment is for convenience, should not to constitute to specific implementation of the invention any below It limits, each embodiment can be combined with each other mutual reference under the premise of reconcilable.
The first embodiment of the present invention is related to a kind of methods of recording audio.The method of the recording audio is applied to recording Device, the recording device can be the electronic equipment with sound-recording function, as: smart phone, recording pen, with sound-recording function MP3 etc..The detailed process of the method for the recording audio is as shown in Figure 1.
Step 101: obtaining the semantics recognition of the audio fragment currently acquired as a result, and determining sound according to semantics recognition result The affiliated theme of frequency segment.
Specifically, start the recording device, which acquires the audio letter of the recording device ambient enviroment in real time Number, such as: the talk audio signal of user.Know wherein it is possible to obtain the semantic of the audio fragment currently acquired according to predetermined period Not as a result, for example, can just obtain the semantics recognition result of an audio fragment every 30 seconds.It can not also be according to predetermined period The semantics recognition of the audio fragment currently acquired is obtained as a result, for example, obtaining primary if detecting in 2 seconds without audio input The semantics recognition of the audio fragment currently acquired as a result, at the time of the audio fragment currently acquired is last obtains to it is current when Audio fragment in quarter.
The corresponding keyword of each theme can be preset, for example, theme is that keyword corresponding to " film " can be with Including " box office ", " showing ", " attendance ", " screening ", " play ", " director " and " protagonist " etc., theme " game " is corresponding Keyword may include: " E3 ", " Sony ", " big method ", " Nintendo ", " 3A " and " Steam " etc.;Theme is " finance " institute Corresponding keyword may include: " currency ", " trade war ", " fund ", " security ", " price of gold " and " stock market " etc..Certainly, often The quantity of keyword corresponding to a theme is with no restrictions.
The keyword in semantics recognition result is got, keyword corresponding to each theme in the audio fragment is counted Quantity chooses the theme comprising most keywords as the affiliated theme of the audio fragment.For example, the audio fragment currently acquired Semantics recognition result are as follows: " the big method of this E3 Sony is severe!There are so more exclusive 3A your writings!", by the semantics recognition result In word be compared with keyword corresponding to each theme, can determine occur 4 and theme in the audio fragment " game " relevant keyword (i.e. " E3 ", " Sony ", " big method " and " 3A "), that is, can determine that the affiliated theme of the audio fragment is " game ".
It is noted that needing to obtain target before the semantics recognition result for obtaining the audio fragment currently acquired Theme, there are many modes for obtaining target topic, for example, suggestion voice can be exported, prompting user's input, (input mode can be with It is voice input, keyboard input text etc.) target topic;It can also be obtained automatically by way of the automatic semantics recognition of recording device Take target topic.The automatic mode for obtaining target topic is described below:
The semantics recognition of the first section audio fragment acquired for the first time is obtained as a result, and according to the semantics recognition of first section audio fragment As a result, determining the affiliated theme of first section audio fragment;And using the affiliated theme of first section audio fragment as target topic.
Specifically, after first section audio fragment can be recording device starting, the first segment audio fragment of acquisition, and obtain The semantics recognition of the first section audio fragment as a result, extract the keyword in the semantics recognition result of the first section audio fragment, and with Keyword corresponding to preset themes is compared, so that it is determined that going out the affiliated theme of first section audio fragment, for example, first section audio The semantics recognition result of segment is " me is next allowed to chat middle-east situation ", can extract keyword " middle-east situation ", the pass The corresponding theme of keyword is " middle-east situation ", that is, can determine that the affiliated theme of first section audio fragment is " middle-east situation ".
In one concrete implementation, there are many modes of the semantics recognition result for the audio fragment that acquisition currently acquires, this Embodiment is using the two ways being exemplified below.
Mode one: the audio fragment currently acquired is uploaded to server-side, and receives the audio fragment fed back by server-side Semantics recognition result.
Specifically, the audio fragment currently acquired is uploaded to server-side, audio of the server-side to upload by recording device Segment carries out semantics recognition, can be using automatic speech recognition method (Automatic Speech Recognition, abbreviation " ASR "), recording device receives the semantics recognition result of the audio fragment of server-side return.
Mode two: semantics recognition is carried out to the audio fragment currently acquired, obtains the semantics recognition result of audio fragment.
Recording device oneself directly can also carry out semantics recognition to the audio fragment currently acquired, to obtain the audio The semantics recognition result of segment.
Both the above acquisition modes can be selected according to actual needs, for example, networking in recording device and server-side In the case where, the semantics recognition of the audio fragment currently acquired can be obtained with pass-through mode one as a result, if recording device is in nothing In the case where net, employing mode two obtains the semantics recognition result of the audio fragment currently acquired.
Step 102: judging whether the affiliated theme of audio fragment and target topic are identical, and according to judging result, control sound The recording process of frequency, wherein target topic is the generic to recorded audio content.
In one concrete implementation, according to judging result, controlling audio recording process includes: if it is determined that belonging to audio fragment Theme is not identical as target topic, then suspends the recording of audio;In the recording process of pause audio, if detecting and target master Identical audio fragment is inscribed, then recording of the restarting to audio.
Specifically, however, it is determined that the affiliated theme of audio fragment is identical as target topic, then continues the recording of audio.If it is determined that The affiliated theme of audio fragment and target topic be not identical, then suspends the recording of audio, and carry out to the audio fragment acquired in real time Detection, whether the affiliated theme of audio fragment that detection acquires in real time is identical as target topic, if detecting the audio currently acquired The affiliated theme of segment is identical as target topic, then recording of the restarting to audio.
It is noted that step 101 and step 102 can be repeated during the entire process of recording audio, thus Control the recording to entire audio.
Embodiment of the present invention in terms of existing technologies, by by the affiliated theme of the audio fragment currently acquired and mesh Mark theme is compared, and the recording process of audio is controlled according to judging result, and audio recording process includes to audio recording Pause and restart audio recording;Due to during the entire process of recording audio, without suspending the record of audio manually System, the problem of typing audio fragment unrelated with target topic due to manually forgetting pause can be avoided the occurrence of, to improve record The quality of sound.And semantics recognition is used to the audio fragment of acquisition, it can determine the content that the audio fragment is recorded, and then can be with The affiliated theme of the audio fragment currently acquired is quickly and accurately determined according to the content of recording;Simultaneously as in entire audio In recording process, be by judge acquisition the affiliated theme of audio fragment and target topic whether unanimously control audio record System can also be avoided the occurrence of because manually forget to cause to miss due to cancelling pause recording relevant to target topic audio the problem of, The step of further improving the quality of recording audio, reducing processing of the later period to the audio of recording, improves the audio to recording Processing speed.
Second embodiment of the present invention is related to a kind of method of recording audio.Second embodiment is to the first embodiment party The further improvement of formula, mainly thes improvement is that: in second embodiment of the invention, if receiving instruction terminates audio The instruction of recording then terminates recording to audio, obtains target audio, and by the audio data upload service comprising target audio End.The detailed process of the method for the recording audio is as shown in Figure 2.
Step 201: obtaining the semantics recognition of the audio fragment currently acquired as a result, and determining sound according to semantics recognition result The affiliated theme of frequency segment.
Step 202: judging whether the affiliated theme of audio fragment and target topic are identical, and according to judging result, control sound The recording process of frequency, wherein target topic is the generic to recorded audio content.
One in the specific implementation, during suspending audio recording, if detecting audio fragment identical with target topic, Save audio fragment identical with target topic.
Specifically, during suspending audio recording, recording device enters listening mode, i.e., obtains the sound of acquisition in real time The semantics recognition of frequency segment is as a result, judge whether the affiliated theme of audio fragment of acquisition is identical as target topic;For the ease of obtaining Take the semantics recognition to the audio fragment currently acquired as a result, can be in the affiliated theme of audio fragment for judging currently to acquire every time With target topic it is whether consistent before, cache the audio fragment, however, it is determined that the affiliated theme of the audio fragment is identical as target topic, The recording to audio is restarted, meanwhile, save the audio fragment, it is ensured that recording audio identical with target topic will not be missed Segment;If it is determined that the affiliated theme of the audio fragment and target topic be not identical, then the audio fragment is not saved.
In the recording process of pause audio, audio fragment identical with target topic is saved, avoids recording in restarting When processed, there is the problem of leakage saves currently audio fragment identical with target topic, to improve the quality of audio recording.
Step 203: if receiving the instruction that instruction terminates the recording of audio, terminating the recording to audio, obtain target Audio.
Specifically, user can send the instruction for terminating audio recording to the recording device, which can be by pre- If operation input, for example, inputting instruction by the end key on recording device, or (such as: terminating by speech-input instructions It records).Recording device upon receipt of the instructions, terminates the recording to audio, and what is obtained is target audio.
Step 204: audio data being uploaded to server-side, audio data includes target audio and recording target audio Control information, wherein information at the time of control information includes pause recording audio.
Specifically, information at the time of control information includes pause recording audio, for example, control information includes pause audio Recording the t1 moment, and restarting is to the t3 moment of the recording of audio, wherein the t1 moment is earlier than the t3 moment.By target Audio and control information as audio data are uploaded to server-side, are cut according to the control information to target audio by server-side Processing is collected, for example, server-side can obtain and suspend audio fragment corresponding at the time of recording audio according to the control information, Corresponding audio fragment, determines the identity characteristic information of invalidated object at the time of according to pause recording audio;Based on invalid right The identity characteristic information of elephant determines the corresponding audio fragment of invalidated object;Using the corresponding audio fragment of invalidated object as invalid sound Frequency segment, and delete invalid audio fragment.
It should be noted that step 201, step 202 in present embodiment respectively with the step in first embodiment 101 and step 102 it is roughly the same, will no longer repeat herein.
The method of the recording audio provided in present embodiment, by by target audio and record target audio control Information is uploaded to server-side, can carry out editing processing to target audio by control information by server-side, can simplify subsequent The step of server-side is to audio processing accelerates the speed to audio processing, to improve target audio again by server-side Audio quality.
The step of various methods divide above, be intended merely to describe it is clear, when realization can be merged into a step or Certain steps are split, multiple steps are decomposed into, as long as including identical logical relation, all in the protection scope of this patent It is interior;To adding inessential modification in algorithm or in process or introducing inessential design, but its algorithm is not changed Core design with process is all in the protection scope of the patent.
Third embodiment of the invention is related to a kind of method of audio processing, and the method for the audio processing is applied to service End, server-side and recording device communicate to connect, and server-side can be in communication with each other with recording device, the server-side can be cloud, Server etc..The detailed process of the method for the audio processing is as shown in Figure 3.
Step 301: receiving the audio data that recording device is sent, wherein audio data includes the mesh that recording device is recorded Mark with phonetic symbols frequency and the control information for recording target audio, information at the time of control information includes pause recording audio.
In one concrete implementation, recording device records the process of target audio are as follows: according to the audio fragment currently acquired Semantics recognition result determine the affiliated theme of audio fragment, judge whether the affiliated theme of audio fragment and target topic identical, According to judging result, the recording process of audio is controlled.
Recording device controls the recording process of audio according to judging result, after recording device terminates to audio recording, i.e., Target audio can be obtained.Recording device is using obtained target audio and records the control information of the target audio as audio data Upload service end, server-side receive the audio data.
Step 302: according to the control information, editing processing being carried out to target audio.
In one concrete implementation, according to information at the time of pause recording audio, at the time of acquisition with pause recording audio Corresponding audio fragment;Corresponding audio fragment at the time of according to pause recording audio, determines that the identity of invalidated object is special Reference breath;The corresponding audio fragment of the invalidated object is determined based on the identity characteristic information of the invalidated object;By the invalidated object Corresponding audio fragment deletes invalid audio fragment as invalid audio fragment.
Specifically, for the ending with the different audio fragment of target topic at the time of suspending recording audio, one temporarily Stop having a corresponding audio fragment at the time of recording audio.Recording device is right always before pause is to the operation of audio recording Audio fragment is saved, thus, recording device can save audio fragment corresponding to the pause moment.
According to the pause moment can obtain with audio fragment corresponding at the time of suspending recording audio, suspend recording audio The moment affiliated theme of corresponding audio fragment is not identical as target topic, obtains corresponding audio at the time of the pause recording audio The identity characteristic information of invalidated object in segment, identity characteristic information can be tone color, tone etc., and then according to the invalidated object Identity characteristic information.The identity characteristic information of the invalidated object is compared in entire target audio, determines have The audio fragment of the identity characteristic information of the invalidated object, using the corresponding audio fragment of invalidated object as invalid audio fragment, The invalid audio fragment determined is deleted from the target audio.
It is noted that after server-side carries out editing processing to target audio, it can be by editing treated target sound Frequency feeds back to recording device.
The method of the audio processing provided in present embodiment is determined to need by the control information in audio data The audio fragment of editing, to realize that the automatic editing to target audio simplifies the step of audio processing without human intervention Suddenly, the speed to audio processing is improved.In addition, according to the control information, the audio fragment of invalidated object is determined, to the target sound Invalid audio fragment is deleted in frequency, further improves the audio quality of target audio.
Four embodiment of the invention is related to a kind of method of audio processing.4th embodiment is to third embodiment Further improvement, mainly the improvement is that: according to the control information, after carrying out editing processing to target audio, the sound Frequency processing method can also to editing, treated that target audio is handled according to target topic, specific process such as Fig. 4 institute Show.
Step 401: receiving the audio data that recording device is sent, wherein audio data includes the mesh that recording device is recorded Mark with phonetic symbols frequency and the control information for recording target audio, information at the time of control information includes pause recording audio.
Step 402: according to the control information, editing processing being carried out to target audio.
Step 403: obtaining the affiliated theme of first section audio fragment, and as target topic.
Specifically, semantics recognition directly is carried out to the first section audio of target audio, obtains the semantic of the first section audio and knows Not as a result, determining the affiliated theme of first section audio fragment according to the keyword in the semantics recognition result, and as target master Topic.
Step 404: the target audio in addition to first section audio fragment is split as N number of audio fragment, N is the integer greater than 1, And each audio fragment is handled.
Specifically, segment processing is carried out to target audio according to preset frequency, i.e., will removes first section in the target audio Audio except audio fragment splits into several audio fragments according to preset frequency, and preset frequency can be according to practical need It is configured.
In one concrete implementation, to the treatment process of each audio fragment progress are as follows: carry out semantic knowledge to audio fragment Not, the semantics recognition result of audio fragment is obtained;The affiliated theme of audio fragment is determined according to the semantics recognition result of audio fragment; The affiliated theme of audio fragment is compared with target topic, however, it is determined that the affiliated theme of audio fragment and target topic be not identical, Then delete audio fragment.
Semantics recognition is carried out to each audio fragment, obtains recognition result, and obtain the keyword in semantics recognition result; According to the corresponding relationship between theme and keyword, every affiliated theme of section audio segment is determined, by every section audio segment institute owner Topic is compared with target topic respectively, will delete from target audio with the different audio fragment of target topic.
The method of the audio processing provided in present embodiment carries out at editing target audio according to the control information Reason and then it is secondary according to target topic, to editing, treated that target audio is handled, delete different with target topic Audio fragment further increases the audio quality of target audio.
Fifth embodiment of the invention is related to a kind of electronic equipment, and the specific structure of the electronic equipment 50 is as shown in figure 5, packet It includes: at least one processor 501;And the memory 502 with the communication connection of at least one processor 501;Wherein, memory 502 are stored with the instruction that can be executed by least one processor 501, and instruction is executed by least one processor 501, so that at least The method that one processor 501 is able to carry out recording audio in first embodiment or second embodiment.
Present embodiment is entity device embodiment corresponding with first embodiment or second embodiment, this implementation Mode can work in coordination implementation with first embodiment or second embodiment.It is mentioned in first embodiment or second embodiment The relevant technical details arrived are still effective in the present embodiment, and in order to reduce repetition, which is not described herein again.
Sixth embodiment of the invention is related to a kind of server-side, and the specific structure of the server-side 60 is as shown in Figure 6, comprising: At least one processor 601;And the memory 602 with the communication connection of at least one processor 601;Wherein, memory 602 It is stored with the instruction that can be executed by least one processor 601, instruction is executed by least one processor 601, so that at least one The method that a processor 601 is able to carry out third embodiment or the processing of the 4th embodiment sound intermediate frequency.
Present embodiment is entity device embodiment corresponding with third embodiment or the 4th embodiment, this implementation Mode can work in coordination implementation with third embodiment or the 4th embodiment.It is mentioned in third embodiment or the 4th embodiment The relevant technical details arrived are still effective in the present embodiment, and in order to reduce repetition, which is not described herein again.
It is noted that depositing in the electronic equipment in the 5th embodiment and the server-side in sixth embodiment Reservoir is all made of bus mode with processor and connects, and bus may include the bus and bridge of any number of interconnection, and bus is by one The various circuits of a or multiple processors and memory link together.Bus can also will such as peripheral equipment, voltage-stablizer and Various other circuits of management circuit or the like link together, and these are all it is known in the art, therefore, herein not It is described further again.Bus interface provides interface between bus and transceiver.Transceiver can be an element, It is also possible to multiple element, such as multiple receivers and transmitter, provides for logical with various other devices over a transmission medium The unit of letter.The data handled through processor are transmitted on the radio medium by antenna, and further, antenna also receives data And transfer data to processor.
Processor is responsible for managing bus and common processing, can also provide various functions, including periodically, peripheral interface, Voltage adjusting, power management and other control functions.And memory can be used for storage processor and execute operation when institute The data used.
It will be appreciated by those skilled in the art that implementing the method for the above embodiments is that can pass through Program is completed to instruct relevant hardware, which is stored in a storage medium, including some instructions are used so that one A equipment (can be single-chip microcontroller, chip etc.) or processor (processor) execute each embodiment the method for the application All or part of the steps.And storage medium above-mentioned includes: USB flash disk, mobile hard disk, read-only memory (ROM, Read-Only Memory), random access memory (RAM, Random Access Memory), magnetic or disk etc. are various can store journey The medium of sequence code.
It will be understood by those skilled in the art that the respective embodiments described above are to realize specific embodiments of the present invention, And in practical applications, can to it, various changes can be made in the form and details, without departing from the spirit and scope of the present invention.

Claims (10)

1. a kind of method of recording audio, which is characterized in that be applied to recording device, comprising:
The semantics recognition of the audio fragment currently acquired is obtained as a result, and determining the audio piece according to the semantics recognition result Theme belonging to section;
Judge whether the affiliated theme of the audio fragment and target topic are identical, and according to judging result, controls the recording of audio Process, wherein the target topic is the generic to recorded audio content.
2. the method for recording audio according to claim 1, which is characterized in that the method for the recording audio further include:
If receiving instruction terminates the instruction of recording of audio, the recording to audio is terminated, target audio is obtained;
Audio data is uploaded to server-side, the audio data includes the target audio and the recording target audio Control information, wherein information at the time of the control information includes pause recording audio.
3. the method for recording audio according to claim 1 or 2, which is characterized in that according to the judging result, control sound The recording process of frequency, specifically includes:
If it is determined that the affiliated theme of audio fragment and target topic be not identical, then suspend the recording of audio;
In the recording process of pause audio, if detecting audio fragment identical with the target topic, restarting pair The recording of audio.
4. the method for recording audio according to claim 1, which is characterized in that in the audio piece for obtaining and currently acquiring Before the semantics recognition result of section, the method for the recording audio further include:
The semantics recognition of the first section audio fragment acquired for the first time is obtained as a result, and according to the semantics recognition of the first section audio fragment As a result, determining the affiliated theme of first section audio fragment;
And using the affiliated theme of first section audio fragment as the target topic.
5. the method for recording audio according to claim 3, which is characterized in that the method for the recording audio further include:
In the recording process of pause audio, if detecting audio fragment identical with the target topic, save and the mesh Mark the identical audio fragment of theme.
6. a kind of method of audio processing, which is characterized in that be applied to server-side, comprising:
Receive recording device send audio data, wherein the audio data include recording device record target audio and Record the control information of the target audio, information at the time of the control information includes pause recording audio;
According to the control information, editing processing is carried out to the target audio;
Wherein, the recording device records the process of target audio are as follows: according to the semantics recognition knot of the audio fragment currently acquired Fruit determines the affiliated theme of the audio fragment, judges whether the affiliated theme of the audio fragment and target topic are identical, according to Judging result controls the recording process of audio.
7. the method for audio processing according to claim 6, which is characterized in that according to the control information, to the mesh Mark with phonetic symbols frequency carries out editing processing, specifically includes:
According to information at the time of pause recording audio, obtain and audio fragment corresponding at the time of pause recording audio;
Corresponding audio fragment, determines the identity characteristic information of invalidated object at the time of according to pause recording audio;
The corresponding audio fragment of the invalidated object is determined based on the identity characteristic information of the invalidated object;
Using the corresponding audio fragment of the invalidated object as invalid audio fragment, and delete the invalid audio fragment.
8. the method for audio processing according to claim 6, which is characterized in that according to the control information, to described After target audio carries out editing processing, the method for the audio processing further include:
The affiliated theme of first section audio fragment is obtained, and as the target topic;
The target audio in addition to the first section audio fragment is split as N number of audio fragment, N is the integer greater than 1, and right Each audio fragment is handled as follows:
Semantics recognition is carried out to the audio fragment, obtains the semantics recognition result of the audio fragment;
The affiliated theme of the audio fragment is determined according to the semantics recognition result of the audio fragment;
The affiliated theme of the audio fragment is compared with the target topic, however, it is determined that the affiliated theme of audio fragment with The target topic is not identical, then deletes the audio fragment.
9. a kind of electronic equipment characterized by comprising
At least one processor;And
The memory being connect at least one described processor communication;Wherein,
The memory is stored with the instruction that can be executed by least one described processor, and described instruction is by described at least one It manages device to execute, so that at least one described processor is able to carry out the side of recording audio as described in any one in claim 1-5 Method.
10. a kind of server-side characterized by comprising
At least one processor;And
The memory being connect at least one described processor communication;Wherein,
The memory is stored with the instruction that can be executed by least one described processor, and described instruction is by described at least one It manages device to execute, so that at least one described processor is able to carry out the side such as the described in any item audio processings of claim 6-8 Method.
CN201910147012.XA 2019-02-27 2019-02-27 Audio recording method, audio processing method, electronic equipment and server Active CN110111816B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910147012.XA CN110111816B (en) 2019-02-27 2019-02-27 Audio recording method, audio processing method, electronic equipment and server

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910147012.XA CN110111816B (en) 2019-02-27 2019-02-27 Audio recording method, audio processing method, electronic equipment and server

Publications (2)

Publication Number Publication Date
CN110111816A true CN110111816A (en) 2019-08-09
CN110111816B CN110111816B (en) 2021-03-05

Family

ID=67484251

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910147012.XA Active CN110111816B (en) 2019-02-27 2019-02-27 Audio recording method, audio processing method, electronic equipment and server

Country Status (1)

Country Link
CN (1) CN110111816B (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113613068A (en) * 2021-08-03 2021-11-05 北京字跳网络技术有限公司 Video processing method and device, electronic equipment and storage medium

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102568473A (en) * 2011-12-30 2012-07-11 深圳市车音网科技有限公司 Method and device for recording voice signals
CN104038630A (en) * 2014-05-28 2014-09-10 小米科技有限责任公司 Speech processing method and device
CN104869233A (en) * 2015-04-27 2015-08-26 深圳市金立通信设备有限公司 Recording method
CN104952451A (en) * 2015-06-08 2015-09-30 广东欧珀移动通信有限公司 Sound recording processing method and sound recording processing device
CN107071575A (en) * 2016-06-13 2017-08-18 腾讯科技(北京)有限公司 Paster media file playing method and device
CN107066229A (en) * 2017-01-24 2017-08-18 广东欧珀移动通信有限公司 The method and terminal of recording
CN107464557A (en) * 2017-09-11 2017-12-12 广东欧珀移动通信有限公司 Call recording method, device, mobile terminal and storage medium

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102568473A (en) * 2011-12-30 2012-07-11 深圳市车音网科技有限公司 Method and device for recording voice signals
CN104038630A (en) * 2014-05-28 2014-09-10 小米科技有限责任公司 Speech processing method and device
CN104869233A (en) * 2015-04-27 2015-08-26 深圳市金立通信设备有限公司 Recording method
CN104952451A (en) * 2015-06-08 2015-09-30 广东欧珀移动通信有限公司 Sound recording processing method and sound recording processing device
CN107071575A (en) * 2016-06-13 2017-08-18 腾讯科技(北京)有限公司 Paster media file playing method and device
CN107066229A (en) * 2017-01-24 2017-08-18 广东欧珀移动通信有限公司 The method and terminal of recording
CN107464557A (en) * 2017-09-11 2017-12-12 广东欧珀移动通信有限公司 Call recording method, device, mobile terminal and storage medium

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113613068A (en) * 2021-08-03 2021-11-05 北京字跳网络技术有限公司 Video processing method and device, electronic equipment and storage medium

Also Published As

Publication number Publication date
CN110111816B (en) 2021-03-05

Similar Documents

Publication Publication Date Title
CN109309751B (en) Voice recording method, electronic device and storage medium
CN104394126B (en) Information recommendation method, server, client and system
CN107147618A (en) A kind of user registering method, device and electronic equipment
CN105554027A (en) Resource sharing method and device
JP5271703B2 (en) Context sensitive data processing methods
CN109284142A (en) File preloads method, apparatus, electronic equipment and computer readable storage medium
CN105187733A (en) Video processing method, device and terminal
CN107609047A (en) Using recommendation method, apparatus, mobile device and storage medium
CN108271096A (en) A kind of task executing method, device, intelligent sound box and storage medium
CN106507184A (en) Media file shares terminal, receiving terminal, transmission method and electronic equipment
CN109599115A (en) Minutes method and apparatus for audio collecting device and user terminal
CN107831886A (en) Association starts management-control method, device, storage medium and the intelligent terminal of application
CN111813900A (en) Multi-turn conversation processing method and device, electronic equipment and storage medium
CN106603649A (en) Terminal equipment, booking event prompt method and apparatus thereof
CN107862203A (en) Control method, device, storage medium and the terminal of application program
WO2001059607A3 (en) Entertainment file and related information integration method, apparatus and system
CN110111816A (en) Method, the method for audio processing, electronic equipment and the server-side of recording audio
CN110278273A (en) Multimedia file method for uploading, device, terminal, server and storage medium
CN110472033A (en) Answering method, device and server based on NLP model
CN111833857A (en) Voice processing method and device and distributed system
CN113949739B (en) Cross-device playing method and device, electronic device and storage medium
CN113163255B (en) Video playing method, device, terminal and storage medium
CN106708582A (en) Data storage method and device
CN108848472A (en) The method and device of change of voice call
CN108762633A (en) Picture adding method, device, terminal device and storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant